Quote from logic_man:
I was reading a review of Aronson's book on evidence-based TA and the reviewer stated that one sign that data mining bias was not present in an optimized backtest would be that there were few outliers in the backtest results...
I would understand this as follows:
For the good, ânon curve-fitâ system, good performance results from uncovering some underlying truth relating to market microstructure.
The performance of a curve-fit system on the other hand only appears good because of a serendipitous synchronization between entries/exits and price action.
The key point here about a curve-fit system is that the outcome is random, and just happens to look like good performance. Any such random outcome will be the result of multiple trades. What permutation of multiple random trades (that appear to reflect âgood performanceâ) is more likely? (a) A series of multiple random trades where all are good trades? Or (b) a series of multiple random trades where most are distributed equally about small wins and losses but a few âoutliersâ substantially lift the overall result?
IMO (b) is more likely than (a) to be the profile of a curve-fit system.