Quote from intradaybill:
Get free daily data from yahoo! and then write an algo to generate random intraday data based on HLC constraints of daily data. Cost you nothing and if your system passes this test then you can spend money to test on a small subset of stocks that will cost you much less.
Bill, this is a complete waste of time. I don't advise anyone to pursue this. Data is made by the rational decisions of all market participants. There is no randomizer that will ever be able to produce the data that the market has. It is only the fact that there is data that you can say about doing so. This is utterly useless. You should not advise people to do anything so stupid. There's no value in acquiring actual data, and there's certainly no value in producing randomized data. Zero. Period.
Do not listen to this. I've had luck when it comes to my data, but my understanding is E-signal or Barchart is the cleanest, and the vendors who have such datasets only work in certain programs like Multicharts or Ninjatrader. What you (the op) really need is a tick file for your symbols. Your charting package will be able to make any interval bar with tick files. Don't waste your time on any time interval per se, but only the complete list of trades and from that you can do any interval you want, and I don't advise any timed interval.
As far as value, the best that it gets for me is all the daily data on any NDX derivative, and a futures library of over 80 symbols, with continuous and individual contracts. The value is more in the theory than in the data, but you have to have both, and there is some doubt whether there is actually an edge. Be sure when you do get your data that it has bid/ask series. This will make your backtesting a lot more accurate and just cements what I've said about getting tick files. Tick files will usually have the bid and ask in their datasets, at least in the ones I have.