Quote from nephos:
Of course you can curve-fit and over-optimize that way like crazy. Chance the identifiers OHLC, change how many bars ago, change the combination of the (un)equations.
I am not saying that data mining like that inevitably leads to over-optimization, but it certainly is a possibility.
You confuse curve-fitting with selection bias. You can not curve fit C > O but you can select C > H instead.
I think he makes that clear in that link. Trading systems can be modeled as discrete sequences of entry and exit points generated by some process. Any curve fitting must eventually be reflected on how these sequences distribute over time. Things like C > O AND O > H, etc. cannot be redistributed over a given time series. There are no parameters available for doing that.
Selection bias can be dealt with up to a certain point but curve fitting cannot. I hope you understand the difference. They are both too bad. There is a fundamental difference though. With curve fitting you can essentially specify posteriori who won the lottery and it is absurd as it sounds. With selection bias your only claim is that because someone won you can also win.
I don't think many people understand the difference. Those who do not understand it have no chance to go ahead in this business. Take out you probability book and review the concepts. try to understand the difference between absurd claims (like curve fitting) from probabilistic claims like selection bias. They have no connection whatsoever and whoever told they do lied to you.
but does anyone know what are some of the key barometers one should use to measure the system. Is there a website or past discussion that list them out along with some explanation. I think most of it is just common sense, like % winner/loss vs positive expectancy etc.. but just want the complete list so i can measure my system.