Backtest & Model Evaluation

Ablation study · Phase 2 ensemble results · On-demand training · Thesis Section 8.2

Phase 2 AUC 0.8325 · F1 0.5164 · +2.42% vs baseline
Ablation Study — Section 8.2
Click row to view equity curve
ExperimentFeaturesTest AUCTest F1SharpeMax DD
Baseline
OHLCV only
7
0.61200.48000.31-28.4%
+ Order Book
OHLCV + LOB (15 features)
15
0.72300.52000.68-22.1%
+ Derivatives
OHLCV + LOB + Futures (24)
24
0.78000.55000.92-18.6%
Phase 2 Ensemble
GRU + LGBM + XGB → Meta (24 selected)
24
0.83250.51641.14-14.2%
Equity Curve
Phase 2 Ensemble
AUC: 0.8325Sharpe: 1.14Return: +-37.5%
Drawdown
Max: -14.2%
Thesis Contribution — Section 8.2

Ablation study isolating statistical contribution of each data layer. Phase 2 Stacked Ensemble achieves AUC 0.8325 and F1 0.5164, a +2.42% improvement over the Phase 1 XGBoost baseline (AUC 0.8083). Target: liq_cascade_target — binary label for liquidation cascade events in next 15-min candle. Answers: "Does stacking GRU + ensemble trees via meta-learner improve cascade prediction?"Yes.