Quote from acrary:
If you want to create a program here's how I calculated the correlation between two systems.
The correlation calculation here is straight forward. With correlation tho, I presume that the objective would be that as more systems were added, ideally the N systems would have ZERO pairwise correlation. Otherwise, for highly correlated systems, it would simply be better to opt for the highest performing system.
Given the sample size of returns (several months), is it introducing any sample errors? What I mean is that if I were to assess the correlation matrix of N systems, should I be taking a random sample from the population, as opposed to a biased sample of the most recent months, of the system's returns and then evaluating the correlation or does it all just collapse to this reduced form?... To elaborate a bit, what is being sampled is returns, across the board of all systems, at a specific point in time. Systems with high correlation are possibly variants of a fundamental form, I admit this is an iffy statement that I just made... A non-zero correlation value would indicate that a larger proportion of N systems have performance that are materially related to one of the other systems. Optimally, no one system should have performance correlated to another so as to limit the variance of the sum of the returns of all N systems. Actually, I have to think about this some more...
Very interesting stuff... For sure! I do have a moderate quant background but have kept much of this stuff separated from trading (well actually all aspects separate from trading). It is a knowledge and application weakness that I am attempting to overcome in order to integrate the best of both worlds...
Kindest Regards,
MAK!