I've been scraping the cme futures options settlements for a while now and finally have the free time to compile the data with the eventual goal of building some trading models. I was wondering if anyone here has any suggestions on best practices with regards to the type and structure of an options specific database, as my knowledge of DBMSs is pretty limited.
I work mostly in python and have been happy using SQLite for years (albeit only for linear instruments) and am hesitant to use SQL approach for options when so much of my data will be repeated (underlying, expiry, fwds, etc.).
The strategies I have in mind are pretty low frequency so I'm happy with daily data for now and, while I'll probably decide to buy intraday data in the future, I'll never need to store tick data. I care more about query speed because I plan on eventually trading many products on many strikes/expirations and also don't want the database size to explode.
I've heard good things about Arctic/MongoDB and also have received the suggestion of just saving to local hdf5 files rather than use a db.
All suggestions greatly appreciated.
I work mostly in python and have been happy using SQLite for years (albeit only for linear instruments) and am hesitant to use SQL approach for options when so much of my data will be repeated (underlying, expiry, fwds, etc.).
The strategies I have in mind are pretty low frequency so I'm happy with daily data for now and, while I'll probably decide to buy intraday data in the future, I'll never need to store tick data. I care more about query speed because I plan on eventually trading many products on many strikes/expirations and also don't want the database size to explode.
I've heard good things about Arctic/MongoDB and also have received the suggestion of just saving to local hdf5 files rather than use a db.
All suggestions greatly appreciated.