Quote from opmtrader:
I noticed that correlations are used in much of your work. I was wondering if you took sample size into account when performing these correlation calculations. I now use a "correlation rank" in my work after reading the following in Victor Niederhoffer's Practical Speculation:
"In practice, we have found any number above 0.10 or below -0.10 based on 100 or more observations to be useful. The general formula we follow for usefulness is that the correlation coefficient times the number should be greater than 10. Thus, for 50 observation, a correlation of 0.20 would be useful and for 20 observations, a correlation of 0.50 would be useful." (pg. 196 to 197)
I don't really think of sample size when viewing a correlation. All I'm trying to do is see if there is something that could require further followup.
I think Victor is estimating standard error and throwing out some ballpark numbers. For a better estimate:
Error estimate
E = z * std. dev. of sample
_________________________
sqrt of number of samples in test
z = number of std. dev. of normal distribution for the confidence level needed.
z = 3.08 = 99.8% confidence level
z= 2.58 = 99.0% confidence level
z=1.96 = 95.0% confidence level
z=1.645 = 90.0% confidence level
Ex. 20 trades in test
sample correlation = .50
sample std. dev. = .25
If we want to know the estimate of the mean to the 99% level then:
E = 2.58 * .25
_______
sqrt(20)
E = .1442
so with 99% certainty, we know the correlation is .50 +- .1442 (you can expect to see the correlation between .3558 and .6442 in the future)