Hi guys, quite new here.
I am just curious to know; in the process of trading signal development, how do you ultimately decide which signals to include in your repertoire.
I am currently working on some models, where I ultimately decide to include the signal where I find that it replicates in 2 separate forward datasets. But the original process of developing the model included multi-fold cross validation on the training data, before the check on 2 data sets.
So I have to ask myself why the signals didn't work completely on the first dataset (in spite of multi-fold validation) and in some case why those that did went on to fail on the second dataset. That is; are the successes on test data set 1 and then test data set simply incidental?
I am just curious to know; in the process of trading signal development, how do you ultimately decide which signals to include in your repertoire.
I am currently working on some models, where I ultimately decide to include the signal where I find that it replicates in 2 separate forward datasets. But the original process of developing the model included multi-fold cross validation on the training data, before the check on 2 data sets.
So I have to ask myself why the signals didn't work completely on the first dataset (in spite of multi-fold validation) and in some case why those that did went on to fail on the second dataset. That is; are the successes on test data set 1 and then test data set simply incidental?