interesting post. I use Nvidia 980Ti cards as they get you the most cuda cores for the money. K40 and K80 cards are way overpriced. I installed cuda support for Theano and it does speed up computations by quite a bit.
I use the term data augmentation in a slightly different way but I get what you mean. I did test with inputting time series, for example, or other input data in a reverse order and the output and prediction rate turned out to be almost identical which confirmed that the learning algorithm is capable of building associations and assigning weights regardless of the order of the input data series. Obviously a complete re-ordering of input data is an entirely different story.
I use the term data augmentation in a slightly different way but I get what you mean. I did test with inputting time series, for example, or other input data in a reverse order and the output and prediction rate turned out to be almost identical which confirmed that the learning algorithm is capable of building associations and assigning weights regardless of the order of the input data series. Obviously a complete re-ordering of input data is an entirely different story.
I have recently started doing some experiments with convolution networks using theano for trading.
Do you do any data augmentation, like they do for images, where for each source one they also feed in rotated/translated/flipped versions? It's not very obvious to me how you could do this for financial data.
Can you tell what kind of hardware are you using for learning?
You can feed chart images, but then you will waste valuable filter space learning candle border pixels and other irrelevant data. Of course, if you have the time and resources...
The networks architecture is well covered for image classification. But even there, each year a new one appears and outperforms the older ones.
For financial applications, maybe you want to feed in multiple timeframes, or in general data sampled at different frequencies. In this case you need to think a bit, because if you combine in a single convolution filter data which changes at different rate you'll have problems, because the filter needs to be static.