Quote from logic_man:
But, to take a relatively simple example, if a strategy says to go long when the odds the Fed will cut rates are over 50%, stay flat if they are from 49% to 10% and go short if they are under 10% (let's say you've found that, historically, there is a 10% chance the bond market is reading the Fed wrong and that opens up a shorting opportunity). That strategy takes a range as its parameter values.
Is the risk of "bad" curve-fitting equally present in both approaches?
What you're saying there is the same thing in the earlier post, I see what you're asking. That's why I said - how you determine what is a fundamental market mechanism is an entirely different issue. It's an issue of statistics.
If you take one instrument, one bar size, one parameter of an indicator or whatever, and you test it, let's say that over all data you have you get 1000 sampling points upon which to base your conclusion. However, in the range of selections you had millions of values in-between. If you get a "perfect number" within that range, there's (to simplify things to the max) a 1000 in a few million chances that it's a truly meaningful value (even if it's a range of values). So if you wanted to really test your parameter you'd need a few more tonnes of data to make it valid. Then you'll realise that based on descriptive statistics of the instrument the value ranges seem to change. For example, the sustainability of positive trends is correlated to slightly higher values, etc. Then you'd have to create correlations within that huge pile of data and try to draw conclusions out of that - so that's basically combinations within data, increasing the amount of data by several factors. So for example you see a link between two descriptive statistics of the market and one parameter value, 3 values put on a 3d graph, you can see planes forming where the "meaningful" value ranges might statistically emerge. These planes in data signify a possible robustness, which can itself later be computed into single values and correlated to whatever else. You can also then move into n-dimensional search for planes, and if you're into math, that's a great hobby.
What i'm trying to say is that in order to find truly meaningful non-curve-fit parameters, it takes 1- about 100 GB of data, 2- a great statistical platform 3- understanding of statistics and quantitative analysis.
I saw a website recently that was kind of interesting (this discussion reminded me of it), though i didn't really read into the details, that might help you out:
http://meyersanalytics.com/
The "walk forward surface explorer" looks interesting to me, if it does what i think it does - read the "Data Mining and Curve Fitting. " on it's page.
Ultimately, it's nowhere near as easy as reading into a good range of parameter values for a single parameter on a single market. The probabilities are very large that with a 1000 samples you will get curve-fits all over the place. You need millions at least - in which case you'll probably get no meaningful data because of the differences in each market - then you're back to square one to link those markets somehow and find what could work with all of that in mind.
Further, statistics if applied incorrectly, will you give you garbage. It takes quite a bit more math and fundamental understanding of how everything links together to draw such conclusions. Even then, you have to take into consideration the framework (model) you are trying to put your output values into. Trying to predict price is often a futile exercise, but finding good parameters for one component of a model (ie detection of probability for mean reversion) can pay off.