>Katz/McCormick focused me again on the t-test.
>Scenario:
>The in-sample test of a strategy in this book resulted in
> - #trades:
> 118
The t-test is normally used for small sample although it's ok for big sample also of course since student law converges towards normal law. But each (individual) variable of the sample should follow normal law of same variance and must be independant. In some phenomenas the hypothesis are rather reasonable (if you mesure a table with the same instrument and the same protocol it is reasonable to assume that error is due to randomness) but not in unknown phenomenas. As I use to say checking the validity of premisces is much more important than making calculus of the significance of the test since this significance has no significance at all if the basic hypothesis is not checked.
>The statistical significance is adjusted: 1 - power(1-0.00184, 20) >= 0.3103 = 31.03%
>(Detail: Can anybody explain the concept between the last >step? It's somewhat unclear to me.)
Supposing that the premisces of the student test (more generally of a parametric test) are reasonable (which is not evident as said above but let's do as if they were) this says that if "the (true but unknown) mean was equal to 0" (called null hypothesis H0) the probability would be alpha = 0.00184. The contrary of H0 is "the (true but unknown) mean wasn't equal to 0". Then the basic axiom of probability says that Prob(H0) + Prob (non H0) = 1 (since 1 is the probability of certainty

) so that Prob (non H0) = 1 - Prob(H0) = 1-0.00184. This is called the significance of the test.
After that if an experience E is repeated with independancy, etc., the probability of P(E1*E2*E3*..*En)=P(E1)*P(E2)... this explains power(1-0.00184, 20). Substracted from 1 give the significance.
>Now my idea:
>Improving statistical significance could be target of an >optimization process. I could make sense to not only focus on
>good past performance but also (or only!!) look for a maximum >statistical significance. This might be a way to overcome
>(or make at least less hurting) the problem of over-optimization >and curve-fitting.
"your" idea is already the foundation of statistical decision theory : The theory of statistical estimator is based on the concept of efficiency like in stock market. Nevertheless efficiency here is not as fuzzy

. In statiscal theory optimality of estimation of a parameter means 3 things: consistancy (when the size of the sample grows, the estimator must converge towards the true parameter), no bias (for example the empirical standard deviation (of the sample) is a biased estimation of the true standard deviation (of the population) so one must correct by square-root of (n-1)), and efficiency which the smallest variance (if you vary the parameter this variance will also vary so that there is an optimum). The best accepted method to find this optimal is the maximum likelyhood which uses the formula above of multiplication of independant probability to find the roots of partial differential equations which will give the optimal value parameters. THE PROBLEM IS THE PROBABILITY LAW MUST BE KNOWN. This is common sense: you can't create knowledge from thin air
Quote from droth:
Hi,
I'm currently reading the book Katz/McCormick "The Encyclopedia of Trading Strategies" which brought me back
to one of my main concerns:
Trying to find a strongly systematic approach to trading I have to deal which adjusting strategies by
applying specific parameter sets. Selecting specific parameter sets may improve to strategy performance which
is called optimization. My main concern - as of many traders - is:
"How far do I have to optimize to make a strategy profitable, and when does over-optimization or
curve-fitting start?"
There are many approaches out there to solve this problem, but reading this book an idea came to my mind:
Katz/McCormick focused me again on the t-test.
Scenario:
The in-sample test of a strategy in this book resulted in
- #trades: 118
- Avg profit per trade: 740.9664
- StdDev of profit: 3811.355
(comment: I prefer using avgprofit as percent value, but that's personal style ...)
The StdError ("Expected SD of Mean") is 3811.355/sqrt(118) = 350.8637. Executing a t-test yields a probability of 1.84% that
the results are just by luck. This does sound really good, but has to be adjusted by the number of optimization steps
which have to be performed to achieve this result. The results in this examples have been achieved by executing 20
optimization steps.
The statistical significance is adjusted: 1 - power(1-0.00184, 20) = 0.3103 = 31.03%
(Detail: Can anybody explain the concept between the last step? It's somewhat unclear to me.)
This number is far worse than 1,84%. Anyhow: The book states, that due to serial dependencies, the real error may be less.
So far, all this stuff can be found in this book on page 63ff, unfortunately I didn't give birth to those ideas
.
Now my idea:
Improving statistical significance could be target of an optimization process. I could make sense to not only focus on
good past performance but also (or only!!) look for a maximum statistical significance. This might be a way to overcome
(or make at least less hurting) the problem of over-optimization and curve-fitting.
Is there any edge in it? Has anybody tried to evaluate past performance based on this kind of t-test and tried to execute the results on unseen data ?
Has anybody executed (and probably executed) optimizers based on this principle ?
Any comments are highly appreciated.
Dierk Droth