Back to strategy
Backtest replay · 500 closed theses
Tune the desk gate, see history flip
Pure deterministic re-classification over the last 500 closed theses. Adjust R:R thresholds per market regime and the minimum target distance in ATR multiples — every change re-runs the same logic the desk's Risk Manager applies in production. No LLM calls, no inference; your params + historical price levels = WR you would have realized.
New WR (your params)
70.0%
35W · 15L · 79N
Approval rate
25.8%
129 / 500 pass
Old flat 1.5 WR
62.0%
81 / 500 pass
Δ WR vs flat
+8.0pp
pop WR: 58.0%
Approved theses · 129
| Asset | Side | Conv | R:R | Tgt × ATR | Regime | Outcome | PnL |
|---|---|---|---|---|---|---|---|
| SOL | short | 62 | 1.75:1 | 3.0× | trending | neutral | +0.01% |
| DYDX | long | 52 | 2.62:1 | 2.7× | trending | neutral | -2.30% |
| ETH | short | 52 | 2.00:1 | 2.8× | trending | loss | -1.96% |
| PYTH | short | 74 | 1.28:1 | — | trending | loss | -3.76% |
| FARTCOIN | short | 68 | 1.24:1 | — | trending | neutral | +0.32% |
| VIRTUAL | short | 62 | 2.50:1 | 2.5× | trending | win | +3.09% |
| ZEREBRO | long | 64 | 1.21:1 | — | trending | loss | -7.41% |
| INIT | long | 55 | 1.91:1 | — | trending | loss | -3.03% |
| ZEREBRO | long | 55 | 1.91:1 | — | trending | win | +32.17% |
| ZEREBRO | long | 55 | 1.91:1 | — | trending | neutral | -1.88% |
| INIT | long | 55 | 1.75:1 | — | trending | loss | -4.45% |
| INIT | long | 55 | 1.75:1 | — | trending | loss | -4.45% |
| FIL | long | 58 | 2.33:1 | 3.5× | chop | neutral | -2.09% |
| INJ | long | 62 | 1.53:1 | 2.6× | trending | neutral | -0.67% |
| TRX | long | 58 | 1.47:1 | — | trending | neutral | +1.10% |
| TRX | long | 55 | 1.83:1 | — | trending | neutral | +0.63% |
| TRX | long | 62 | 1.67:1 | — | trending | neutral | +0.90% |
| TRX | long | 58 | 1.24:1 | — | trending | neutral | +0.85% |
| GRASS | long | 55 | 1.67:1 | 2.5× | chop | loss | -4.24% |
| TRX | long | 55 | 1.33:1 | — | trending | neutral | +0.86% |
| TRX | long | 55 | 1.33:1 | — | trending | neutral | +0.42% |
| TRX | long | 55 | 2.27:1 | — | trending | neutral | +0.37% |
| JUP | long | 55 | 2.06:1 | — | chop | win | +9.24% |
| AERO | long | 58 | 2.00:1 | 3.0× | trending | win | +12.80% |
| CELO | long | 55 | 1.44:1 | — | trending | loss | -5.19% |
| POL | long | 58 | 1.63:1 | — | trending | neutral | +0.48% |
| INJ | long | 55 | 2.18:1 | 3.4× | trending | win | +11.44% |
| POL | long | 55 | 1.59:1 | — | trending | win | +1.63% |
| POL | long | 55 | 2.09:1 | — | trending | neutral | -0.13% |
| PYTH | long | 68 | 1.23:1 | — | trending | neutral | -3.41% |
| PYTH | long | 62 | 1.60:1 | — | trending | win | +2.48% |
| MASK | long | 62 | 1.60:1 | 4.0× | trending | neutral | +6.75% |
| MASK | long | 65 | 2.25:1 | 4.5× | trending | win | +2.29% |
| IMX | long | 58 | 1.50:1 | — | trending | win | +4.83% |
| MASK | long | 65 | 1.75:1 | 3.5× | trending | win | +4.36% |
| TIA | short | 55 | 1.67:1 | 2.5× | chop | win | +6.50% |
| GRASS | long | 58 | 2.00:1 | 3.0× | trending | win | +8.54% |
| CELO | long | 55 | 1.77:1 | — | chop | neutral | -0.80% |
| MASK | long | 62 | 1.40:1 | 3.5× | trending | win | +4.36% |
| MASK | long | 68 | 1.40:1 | 3.5× | trending | win | +6.12% |
| MASK | long | 58 | 1.75:1 | 3.5× | trending | win | +4.04% |
| BLUR | long | 74 | 1.24:1 | — | trending | neutral | -0.43% |
| MASK | long | 58 | 1.80:1 | 4.5× | trending | win | +4.10% |
| MASK | long | 62 | 1.67:1 | 5.0× | trending | win | +2.59% |
| AERO | long | 55 | 1.40:1 | 3.5× | trending | neutral | -0.13% |
| MASK | long | 62 | 2.33:1 | 3.5× | trending | neutral | +0.61% |
| SEI | long | 55 | 1.55:1 | — | trending | loss | -4.85% |
| GRASS | long | 62 | 1.40:1 | 3.5× | trending | win | +14.40% |
| AERO | long | 58 | 1.46:1 | 4.7× | trending | win | +2.03% |
| MASK | long | 58 | 1.50:1 | 3.0× | trending | win | +5.10% |
This replay applies only the gate logic retroactively. It does NOT re-run the LLM with new prompts. So if a thesis was generated with target too close (less than your ATR floor), it gets rejected here even though with a better prompt the desk would have aimed further. Use this tool to evaluate gate sensitivity, not to predict future LLM behavior. Static analysis · ~500 closed theses · ISR cache 30s.