Backtest replay · 500 closed theses

Tune the desk gate, see history flip

Pure deterministic re-classification over the last 500 closed theses. Adjust R:R thresholds per market regime and the minimum target distance in ATR multiples — every change re-runs the same logic the desk's Risk Manager applies in production. No LLM calls, no inference; your params + historical price levels = WR you would have realized.

New WR (your params)

70.0%

35W · 15L · 79N

Approval rate

25.8%

129 / 500 pass

Old flat 1.5 WR

62.0%

81 / 500 pass

Δ WR vs flat

+8.0pp

pop WR: 58.0%

Tune parameters

Min R:R · trending (ADX≥25)

1.0 1.2 1.3 1.5 1.8

Min R:R · chop (ADX 15-25)

1.2 1.5 1.8 2.0

Min R:R · high-vol (ADX<15)

1.5 1.8 2.0 2.5

Min target × ATR

off 1.5×2.0×2.5×3.0×

Approved by regime: trending 123 · chop 6 · high-vol 0Reset to desk defaults

Approved theses · 129

Asset	Side	Conv	R:R	Tgt × ATR	Regime	Outcome	PnL
SOL	short	62	1.75:1	3.0×	trending	neutral	+0.01%
DYDX	long	52	2.62:1	2.7×	trending	neutral	-2.30%
ETH	short	52	2.00:1	2.8×	trending	loss	-1.96%
PYTH	short	74	1.28:1	—	trending	loss	-3.76%
FARTCOIN	short	68	1.24:1	—	trending	neutral	+0.32%
VIRTUAL	short	62	2.50:1	2.5×	trending	win	+3.09%
ZEREBRO	long	64	1.21:1	—	trending	loss	-7.41%
INIT	long	55	1.91:1	—	trending	loss	-3.03%
ZEREBRO	long	55	1.91:1	—	trending	win	+32.17%
ZEREBRO	long	55	1.91:1	—	trending	neutral	-1.88%
INIT	long	55	1.75:1	—	trending	loss	-4.45%
INIT	long	55	1.75:1	—	trending	loss	-4.45%
FIL	long	58	2.33:1	3.5×	chop	neutral	-2.09%
INJ	long	62	1.53:1	2.6×	trending	neutral	-0.67%
TRX	long	58	1.47:1	—	trending	neutral	+1.10%
TRX	long	55	1.83:1	—	trending	neutral	+0.63%
TRX	long	62	1.67:1	—	trending	neutral	+0.90%
TRX	long	58	1.24:1	—	trending	neutral	+0.85%
GRASS	long	55	1.67:1	2.5×	chop	loss	-4.24%
TRX	long	55	1.33:1	—	trending	neutral	+0.86%
TRX	long	55	1.33:1	—	trending	neutral	+0.42%
TRX	long	55	2.27:1	—	trending	neutral	+0.37%
JUP	long	55	2.06:1	—	chop	win	+9.24%
AERO	long	58	2.00:1	3.0×	trending	win	+12.80%
CELO	long	55	1.44:1	—	trending	loss	-5.19%
POL	long	58	1.63:1	—	trending	neutral	+0.48%
INJ	long	55	2.18:1	3.4×	trending	win	+11.44%
POL	long	55	1.59:1	—	trending	win	+1.63%
POL	long	55	2.09:1	—	trending	neutral	-0.13%
PYTH	long	68	1.23:1	—	trending	neutral	-3.41%
PYTH	long	62	1.60:1	—	trending	win	+2.48%
MASK	long	62	1.60:1	4.0×	trending	neutral	+6.75%
MASK	long	65	2.25:1	4.5×	trending	win	+2.29%
IMX	long	58	1.50:1	—	trending	win	+4.83%
MASK	long	65	1.75:1	3.5×	trending	win	+4.36%
TIA	short	55	1.67:1	2.5×	chop	win	+6.50%
GRASS	long	58	2.00:1	3.0×	trending	win	+8.54%
CELO	long	55	1.77:1	—	chop	neutral	-0.80%
MASK	long	62	1.40:1	3.5×	trending	win	+4.36%
MASK	long	68	1.40:1	3.5×	trending	win	+6.12%
MASK	long	58	1.75:1	3.5×	trending	win	+4.04%
BLUR	long	74	1.24:1	—	trending	neutral	-0.43%
MASK	long	58	1.80:1	4.5×	trending	win	+4.10%
MASK	long	62	1.67:1	5.0×	trending	win	+2.59%
AERO	long	55	1.40:1	3.5×	trending	neutral	-0.13%
MASK	long	62	2.33:1	3.5×	trending	neutral	+0.61%
SEI	long	55	1.55:1	—	trending	loss	-4.85%
GRASS	long	62	1.40:1	3.5×	trending	win	+14.40%
AERO	long	58	1.46:1	4.7×	trending	win	+2.03%
MASK	long	58	1.50:1	3.0×	trending	win	+5.10%

This replay applies only the gate logic retroactively. It does NOT re-run the LLM with new prompts. So if a thesis was generated with target too close (less than your ATR floor), it gets rejected here even though with a better prompt the desk would have aimed further. Use this tool to evaluate gate sensitivity, not to predict future LLM behavior. Static analysis · ~500 closed theses · ISR cache 30s.