Why Your Best Backtest Might Be the Most Dangerous One
Most investors think the job of a backtest is to find the strategy with the highest return. That is not the real job. The real job is to show you how fragile your conviction is before real money is involved.
A strategy that looks brilliant over one chosen period can fall apart when you move the start date by a year, a month, or even a few weeks. If that happens, you did not discover an edge. You discovered a lucky entry point.
The Mistake Almost Everyone Makes
Most people run a single backtest. They pick a period that feels relevant, recent, or memorable. Maybe it starts after a crash. Maybe it covers a strong bull market. Maybe it starts on the exact date they first became interested in leveraged ETFs.
Then they look at the final number and ask: Would I have beaten buy and hold?
That question feels sensible, but it is incomplete. A single backtest mostly tells you what would have happened if you had been lucky enough to start on that exact date. It does not tell you whether the strategy is durable.
Why A Great Chart Can Create False Confidence
Backtests are highly sensitive to market regime and sequence. Leveraged ETFs amplify both gains and pain, so small changes in the starting point can create very different outcomes.
A strategy that looks amazing from one entry date may simply be benefiting from:
- a recovery beginning shortly after you invest
- a long uninterrupted trend
- an unusually favorable valuation starting point
- a period with fewer whipsaws than normal
None of that means the strategy is bad. It means your evidence may be weak. The danger is psychological: one impressive chart can make a fragile strategy look inevitable.
The Better Question
Instead of asking How much did this strategy return?, ask questions that reveal robustness:
- How often did it beat the benchmark?
- What did a typical result look like, not just the best one?
- How bad were the worst starting dates?
- Did the strategy still work when I shifted the entry date forward or backward?
- Was the outcome stable across crashes, sideways markets, and strong bull runs?
Those questions create a need many users do not realize they have: the need to test a strategy across many possible starting dates, not just one.
What Robustness Looks Like In Practice
Imagine two strategies. Strategy A posts a spectacular result over one ten-year window. Strategy B looks slightly less exciting in its best run, but produces solid outcomes across many ten-year windows.
Most people choose Strategy A because the upside is easier to see. In practice, Strategy B is often the better strategy because it gives you a more realistic idea of what kind of experience an actual investor might have had.
This is especially important for leveraged ETFs. The biggest risk is not only drawdown. It is entering a strategy with the wrong expectations and abandoning it when the next period looks nothing like the chart you fell in love with.
What To Do Before You Trust Any Backtest
Use a single-period backtest to explore. Then use multi-period analysis to pressure-test your conclusions.
A simple workflow
- Run a normal backtest for the exact period you care about.
- Change the start date and end date slightly to see how fragile the result is.
- Run a broader analysis across many periods.
- Compare the median, the worst cases, and the win rate against the benchmark.
- Only then decide whether the strategy deserves your confidence.
On this site, that means pairing the Backtesting Tool with the Parallel Backtests page. The first helps you inspect a specific story. The second helps you see whether that story repeats often enough to matter.
The Real Use Of Backtesting
The best backtest is not the one that makes a strategy look unbeatable. It is the one that makes your assumptions harder to defend unless they are actually true.
If moving the start date slightly destroys the result, that is not a small technical detail. That is the result.
Once you start thinking this way, you stop searching for the prettiest chart and start searching for strategies that can survive contact with reality.
Try This Next
If you have a strategy that already looks good in a normal backtest, the next step is not to trust it more. The next step is to stress it.