Every back-test is overly optimistic.
"Every" is a strong word. -- All one has observed, and all one is likely to observe, however is probably accurate.Every back-test is overly optimistic.
I have seen and ran a fair number of strategies that have done better in real life than in the backtest (for various reasons, such as execution assumptions or market improvements).Every back-test is overly optimistic.
%%Every back-test is overly optimistic.







I've seen many new traders backtest the day trading with minute level data. It simply doesn't work. You need tick level data and a backend that fills orders at bid/ask prices.
UPDATE:
The unrealistic performance I saw was due to a form of look-ahead bias that arose from subtle differences between my backtesting and forward-testing procedures. After fixing the error, my signal dropped to near noise levels. Now I actually have to develop a working strategy.
Thank you so much to everyone who contributed to the thread.
Unless you wanna play order book games or hold positions for periods comparable to the order book refresh cycles, you don’t need tick level data. Minutely or even 5-min TaQ bars would be plenty for most strategies.

Define "scalp"? If you are trying to find a very short-term mean-reversion strategy where your expected alpha is comparable to the bid/ask, then yes, you have to do your research on tick data. However, unless you are colocated and have made serious technology investments, you are most probably not gonna make money anyway, even if you find a viable effect.There is no way you can backtest a Scalp-like strategy by using minutes. There are literally thousands price events in just a minute and, depending on the stock, a minute bar can be also >1% of delta, so deciding at what millisecond to enter and exit the trade is absolutely important.
Actually, I literally never backtest on tick level data despite running a fair number of latency sensitive strategies. It's very hard to simulate your queue priority, impact and realized latency - so either you are too conservative or too optimistic. For medium-frequency trading I use 5-min TaQ bars with 1 second delay between the observed prices and assumed execution. For intraday holding periods I use 1-second TaQ bars with a 100ms delay. I build both from the actual TaQ tick data, so I can vary stuff like delays or type of fields I look at. Finally, for execution-level alphas, I do research on the tick data but I test them in live trading on limited scale.But really, nobody should go live with his algorithm without have backtest at tick level!