it would really help to quantify your data structures and base set.
what markets are you using and applying your data gathering and sampling to?
futures?
options?
options on futures?
equities?
bonds?
otc bonds? (derivatives, credit swaps, etc.)
emini futures?
pit traded contracts only?
equity - preferreds?
equity - (high) dividend yields
equity - (no dividend yields)
equity - etf's
equity - etf sub-category (underlying heavily basket traded)
if you knew enough about trading these markets, each being and having their own distinct volume signatures and market participants
what markets are you using and applying your data gathering and sampling to?
futures?
options?
options on futures?
equities?
bonds?
otc bonds? (derivatives, credit swaps, etc.)
emini futures?
pit traded contracts only?
equity - preferreds?
equity - (high) dividend yields
equity - (no dividend yields)
equity - etf's
equity - etf sub-category (underlying heavily basket traded)
if you knew enough about trading these markets, each being and having their own distinct volume signatures and market participants