Quote from TriPack:
Hello Eric,
Following your instructions I implemented this confidence testing in all my systems. Since doing so I have tested possibly hundreds of systems. Of those systems I have tested hundreds and hundreds of different permutations on several markets. I found out something interesting about this.
This is the basic assumption that this type of analysis makes: that a smoother P&L curve will lead to future profits with more certainty than a bumpy P&L curve. One thing I stumbled upon was that I took an always in system, and tested it. It got like 60-70% confidence depending on the particular market it was applied to. Then I took the same system and only added a very tight profit target and stop loss (turning it into a scalping system) and did the tests again. Overall profitability went down significantly but the confidence level shot way up above 90% on virtually every market tested. So the conclusion I came to is that the confidence interval really measures the smoothness of returns, rather than whether that particular system may or may not be profitable over the long haul. This is from forward testing and observing these and other systems side by side. And specifically the confidence interval measures the short term smoothness of returns, which can be affected by a string of wins or losses in the short term, which may or may not continue. There is no guarantee that the system will stay hot or will stay cold, and I haven't found a way to test whether these hot/cold streaks tend to persist or not so I'm not certain that filtering out systems based on a low % is necessarily the best course.
Don't get me wrong, I appreciate your work and continue to seek out systems that as a general rule have smoother equity curves. I think having a systematic measurable approach is far superior to what I was doing previously. It is a good tool and I use it regularly.
But the thing that gives me the most confidence in finding an edge is if I apply the same system to multiple markets without changing the system at all, and see if it achieves similar results in many different markets. If it does, my confidence goes up that there might be something there. If the system is a one market anomaly, then it is probably curve fit to the data.