money_ready_verdict.json, the 32.6M-row bt_backtest_trades table, and all audit pages, zero asset classes pass Tier-2 statistical gates. The "Forex is winning" narrative collapses under scrutiny (24% WR, 0.077 PF on n=25). This roadmap is the only defensible path from current state to ROI-positive edge.
Imagine you have a giant box of 32 million old lottery tickets. Some look like winners on paper, but you don't know which ones were bought with real money vs. ones someone just made up to look good.
Right now: We can't tell the real winners from the fake ones. Every "winning strategy" we have fails basic math tests (too few real trades, loses more money than it makes, or blows up in bad markets).
The plan: First, figure out which tickets were actually bought with real money. Then, only trust strategies that win in multiple different ways (not just one lucky trick). Then, test them with pretend money for months. Only after they pass all the hard tests do we risk real money. If after 6 months we still have nothing, we stop spending thousands on AI and try something else.
Problem: The 32.6 million rows in bt_backtest_trades (ejaguiar1_backtests) are mostly synthetic backtests, not live fills. The 15-row eagle2_consensus_picks table (also ejaguiar1_backtests) is a tiny production-consensus signal. Verified 2026-06-09: row counts and the Forex 24% WR / 0.077 PF / n=25 figures all reconcile to the live DB.
Actions:
source_db and source_table provenance on the 32M rowsDeliverable: production_edge_audit.md — verified live metrics, not backtest artifacts.
Defense: Without this step, every downstream conclusion is built on sand. The "Forex winner" claim is a perfect example of mistaking backtest overfitting for live edge.
Problem: No strategy currently passes all Tier-2 gates (n≥100, WR≥50%, PF≥1.5, MDD≤20%, DSR≥1.0, PBO/SPA/FDR, single-source, recency).
Actions:
ai_leaderboard.html with only gate-passing entries (currently empty)Deliverable: Updated leaderboard showing only strategies that survive statistical scrutiny.
Defense: The current leaderboard shows AI model accuracy (e.g., 75% WR on 60 picks), not trading P&L after slippage and drawdowns. Gate-passing is the minimum bar for real-money consideration.
Problem: The single_source_ok gate fails everywhere because profitable sleeves are single-source artifacts (e.g., stocks_rsi2_pullback alone).
Actions:
n_profitable_multi_source ≥ 2 for any production candidatesingle_source_rejects.mdDeliverable: Shortlist of multi-source validated strategies only.
Defense: Single-source profitability is the classic signature of overfitting or data snooping. The money_ready_verdict.json explicitly flags this as a failure mode across EQUITY, CRYPTO, and others.
Problem: Even gate-passing backtests can fail in live markets due to regime shifts, execution friction, or hidden lookahead bias.
Actions:
live_trades table with real slippage and latencyDeliverable: forward_test_log.md with live vs. backtest comparison tables.
Defense: This is the only honest way to validate edge. The 32M backtest rows are cheap; live fills are expensive. We must prove the edge survives the transition.
Problem: We have invested substantial recurring spend across a large fleet of AI API providers with zero validated edge to show for it.
Actions:
/audit/roi_dashboard.html showing: total AI spend (from provider usage logs), cumulative live P&L, cost per validated edge, break-even timelineDeliverable: Public ROI dashboard + documented kill switch policy.
Defense: Sunk-cost fallacy is the enemy of ROI. If the 5-phase protocol yields nothing, continuing to burn API credits is irrational. The kill switch protects capital.
| Criticism | Response |
|---|---|
| "We already have 32 million trades — why not just mine them?" | Because 99%+ are backtests. Mining synthetic data produces synthetic edges. Phase 1 explicitly separates live from backtest. |
| "Forex is winning according to another analyst." | 24% win rate and 0.077 profit factor on n=25 is not winning — it's statistical noise. The plan demands n≥100 + all gates passed. |
| "This will take months. We need edge now." | Speed without statistical rigor is how you lose real money. The current state (all classes INSUFFICIENT_DATA or NOT_READY) proves the "now" approach has failed. |
| "What if we never find edge?" | That's why Phase 5 includes an explicit kill switch and pivot plan. The goal is ROI, not perpetual AI spend. |