The question being tested
Can rules-based, AI-managed investing produce risk-adjusted returns above a broad index after trading costs and taxes? The primary metric is Sharpe ratio, net of all costs and after-tax where applicable. The target is above 1.0 — meaningfully better than SPY's long-run Sharpe around 0.5.
Supporting metrics: Sortino (downside volatility), maximum drawdown, Calmar ratio, hit rate, and alpha against a factor model. Benchmarks are chosen per signal — small-cap signals benchmark against Russell 2000, factor tilts against the corresponding factor ETF.
An honest answer requires: at least 40–100 closed trades per signal before drawing conclusions, out-of-sample evidence (not backtest fits), and explicit awareness that with multiple signals running, one is likely to look "significant" by noise alone.