Blog

Writing on systematic trading

Precise, opinionated writing on backtesting methodology, statistical validity, and the craft of building durable systematic strategies.

Methodology17 May 2026
Why most backtests overstate edge — and what to do about it
The predictable reasons that backtested performance doesn't translate to live trading, and the statistical tools that help separate real edge from noise.
Practice17 May 2026
A checklist for evaluating whether a backtest result is worth trusting
Six criteria — from result hash to parameter sensitivity — for distinguishing credible backtests from sophisticated noise.
Methodology17 May 2026
What Backtesting Actually Measures and What It Does Not
A backtest is a precise measurement of one historical path, not an estimate of future returns, and the distinction governs every downstream decision.
Practice17 May 2026
The Difference Between Gross and Net Returns
A precise breakdown of the cost stack separating paper backtest performance from realized account returns in systematic strategies.
Methodology17 May 2026
Why You Need More Data Than You Think
Backtest row counts mislead; the statistical sample size that governs strategy validation is far smaller than most researchers assume.
Practice17 May 2026
Reading an Equity Curve: What Smooth Actually Means
Equity-curve smoothness is at least five distinct properties; conflating them is how overfit systems pass review and fail in production.
Practice17 May 2026
How Transaction Costs Silently Destroy Strategy Edge
A precise look at how spreads, impact, and turnover compound against backtested alpha and why most cost models understate the damage.
Practice17 May 2026
Position Sizing Is Not Optional — It Changes Everything
Why position sizing determines whether a positive-expectancy system survives, and the three-layer sizing stack worth implementing.
Methodology17 May 2026
The Multiple Testing Problem, Explained Without Statistics
Why testing many strategies guarantees you will find a fake winner, and what the inflation of false discoveries actually looks like in practice.
Practice17 May 2026
Walk-Forward Analysis in Practice: A Worked Example
A concrete implementation of walk-forward analysis on a momentum strategy, with diagnostics that matter more than the aggregate Sharpe.
Methodology17 May 2026
Why Optimising for Sharpe Ratio Produces Fragile Strategies
Selecting strategies on Sharpe ratio systematically favours overfit, negatively-skewed configurations whose apparent quality is an artefact of the metric.
Practice17 May 2026
How to Build a Parameter Sensitivity Heatmap
A practical method for projecting strategy performance across two parameters to distinguish robust edges from overfit point estimates.
Methodology17 May 2026
In-Sample Performance Is Not Evidence of Edge
Backtests fit on the data used to design them measure optimization artifacts, not strategy edge, and require multiplicity-adjusted out-of-sample validation.
Methodology17 May 2026
DSR in Practice: How to Count Your Trials Honestly
A practical guide to applying the Deflated Sharpe Ratio by counting trials honestly, including correlated trials and pre-registered budgets.
Methodology17 May 2026
CPCV vs Walk-Forward: When to Use Each
A practical comparison of combinatorial purged cross-validation and walk-forward analysis, with guidance on sequencing them in a research workflow.
Statistics17 May 2026
The Mathematics of Overfitting: Degrees of Freedom Explained
A formal accounting of how parameters, trials, and implicit choices consume statistical power and inflate backtested performance metrics.
Methodology17 May 2026
Regime-Conditional Strategy Evaluation
Pooled performance metrics hide the conditional distributions that matter; regime-conditional evaluation separates strategy quality from historical regime-mix luck.
Statistics17 May 2026
Why Mean-Reversion Sharpe Ratios Are Almost Always Overstated
Negative autocorrelation and bid-ask bounce systematically inflate the naive Sharpe ratio of mean-reversion strategies, often by a factor of two.
Practice17 May 2026
Kelly Criterion in Practice: Why Everyone Uses Fractional Kelly
Full Kelly is mathematically optimal under perfect information; fractional Kelly is what survives the gap between backtested edge and forward reality.
Practice17 May 2026
Building a Strategy That Survives the Deflated Sharpe Test
A practical workflow for designing systematic strategies whose backtested Sharpe ratios survive correction for selection bias and higher moments.