Quote from heech:
I don't think the paper explained that very well at all, but I certainly agree with you on this point.
I disagree. I think the author explains that very well. You should read again the conclusion:
"Discussion of the results
...The backtesting flaws inherent in the specific versions of the two programs may have caused many technical traders to quickly discard certain systems that actually were promising and use others that appeared to satisfy certain performance characteristics but in reality that was not the case. This type of flaws can also have a severe impact on certain trend-following systems, as it was shown with an example. Traders should have been made aware of this sort of backtesting flaws resulting essentially in hidden conditions becoming part of the analysis. I cannot speculate about the reasons for these flaws since any speculation of this sort is beyond the nature and reach of the empirical study of this paper.
In the presence of backtesting flaws, the following are possible:
(1) Systems that show satisfactory performance during backtest may be adopted for actual trading where in reality their performance may be significantly degraded after corrections for skipped signals and wrong exit dates are made. In this case a trader may select a system believing it has satisfactory hypothetical historical performance when it actually does not.
(2) Systems that show an unsatisfactory performance during a backtest and are discarded may turn out to have acceptable actual performance if corrections to the backtest results are made to compensate for the impact of the flaws. In this case, a trader may discard a system that actually has some true potential.
Under actual trading conditions the situation may develop as follows. A trader, unaware of the flaws, backtests her idea and finds out that it produced acceptable performance according to her criteria. Then she programs the system in an API of a brokerage firm for sending the trades automatically. The API program will not share the same reservations about placing orders as the backtesting program, and it will generate orders at the close of a bar where an order in the same direction (long or short) was previously closed. The trader believes that the actual performance her system will statistically match the results of the backtest. However, one reason that may not happen and the system may turn out to lose money in actual trading is that the backtested system and the API system are two different systems, although the trader believes they are the same. This again may happen due to the hidden conditions that were introduced during the backtest, like the ones identified in this paper. Of course, it may also happen due to other factors that are not a subject of this study, such as slippage, partial order fills and other random effects.
I do not know whether any previous versions of the programs considered in this paper had the same backtesting flaws and when those were corrected."
ref:
http://www.tradingpatterns.com/About_Us/articles/backtesting/Backtesting.pdf