Another set of test results which was interesting:
Drilling down from what I did with the previous test, I have 12 models running tests the same way, except... this time around, I set kept the test period static year-by-year confirming the validity by keeping track of the performance 6 months after that.
The average performance of each measure was around the range of the previous test but if you look at the performance year-to-year, you can realize that the efficiency of the measures decaying as passes. (I should have outputed the results from old to new but we all love to see an upward curve... I hope).
Anyways... as much as the markets change and models' performance decays with time. The tools we use have a cleaner curve of the decay.
Again, this is just a test of a single instance. But it's something to consider and a good chance for people to start thinking about the validity of "how" they develop models as much as the models themselves.
So... a question arises... if the significance of a validation tool decays with time, how would you adjust your tool?
I TEST EVERYTHING.
Drilling down from what I did with the previous test, I have 12 models running tests the same way, except... this time around, I set kept the test period static year-by-year confirming the validity by keeping track of the performance 6 months after that.
The average performance of each measure was around the range of the previous test but if you look at the performance year-to-year, you can realize that the efficiency of the measures decaying as passes. (I should have outputed the results from old to new but we all love to see an upward curve... I hope).
Anyways... as much as the markets change and models' performance decays with time. The tools we use have a cleaner curve of the decay.
Again, this is just a test of a single instance. But it's something to consider and a good chance for people to start thinking about the validity of "how" they develop models as much as the models themselves.
So... a question arises... if the significance of a validation tool decays with time, how would you adjust your tool?
I TEST EVERYTHING.