I've stayed out of this debate, since I learnt to my cost that debating with VP is a painful and pointless exercise so I generally keep him on ignore. However I feel I should correct some misconceptions - since its my thread I actually know what its supposed to be about.
To quote from the original blog post:
I am trading futures, in a fully automated system, which is relatively slow and where latency is not an issue, using only price data."
"Hardly any strategy which targets long term holdings is fully systematized for good reason."
"No billion dollar fund trades fully systematized strategies. "
This clearly isn't the case. Systematic CTA's that I am aware of in the $1bn plus range include Winton, AHL, Bluetrend (now systematica), Cantab, Transtrend, Aspect. There are many more but you can google them yourself I'm sure.
Then outside of the CTA space you have the systematic global macro type guys; there is Bridgewater, nearly $100bn nowadays and FX concepts to name just a couple. Then you have the large systematic equity neutral funds. AHL ran an internal fund like this running at $2bn; and there are quite a few more.
I could go on, and on, and on, but I think I've made the point. And importantly all these places are trading relatively slowly.
Most high frequency shops tend to be small (virtu and a couple of others being some high profile exceptions). The reasons are obvious - you don't
need large capital to hold, and many High Frequency strategies don't scale well to much larger positions so
can't trade with larger capital. So there
probably aren't any multibillion dollar High Frequency funds (although you could argue that DE shaw and Rennisance are exceptions; although I don't really understand eithier business so I don't feel qualified to say); all the big systematic funds I've mentioned above trade slowly.
Its a common misconception amongst the general public that "high frequency trading" / "systematic strategies" / "algorithimic strategies" are all the same thing. High frequency is just one part of the larger systematic trading universe.
"Algorithmic" seems to be have been highjacked by the high frequency crowd, and I strongly disagree with that appropriation.
There is in fact a huge universe of difference between the guy scalping or spoofing with microsecond latency; and the stuff these large systematic funds and myself am doing; so for example on the fastest market I trade my
average holding period is 4 weeks. On the slowest its 2.5 months.
The techniques involved in high frequency trading are completely different from the longer latency stuff. I wouldn't presuppose to tell a High Frequency trader who to capture data, since I know nothing about that world. Equally I'm surprised when someone from that other world gives me advice that isn't relevant to me. For example these comments show a clear misunderstanding of the slow trading world:
"algorithmic futures trading is anything but slow"
I disagree, it can be very fast, or it can be incredibly slow.
"Tick based historical data are nowadays very easy to come by so I am not sure why you make the latency of your system a function of the availability of your captured data. You can cheaply purchase high precision futures contract exchange data."
I don't. I trade slowly because that's what I know how to do. Every time I do the maths on the costs of high frequency trading I can't work out how it can be profitable. Most retail punters who try to trade fast lose money. Why should I compete in a game I don't know how to play where the odds are stacked against me?
As a minor point the history of available tick data doesn't go back far enough for my kind of trading. Again the rules are different with slow trading. You need decades of low frequency data to test it, not a few years of tick data.
"Spikes and cleaning -> There should not be different options. When you receive a price of zero or one that lies x standard deviations away from the previously traded prices then that is an erroneous quote, period. You filter it out and are done. "
No, it could be a real price - depending on the value of x there is still a probability (unless you've set x too large so it never triggers). When you are trading relatively slowly you have the luxury of checking to see if its real or not.
Side issue:
Although I generally agree with the comments about fundamental data (I've used factset myself), the post wasn't about that, but about technical data (that would be another post; which I may well do actually). I'm fairly sure that the universe of people deploying relatively slow systematic trading on the back of earnings announcements (a large chunk of the equity neutral world) is much larger than those trying to react to them with low latency feed analysers.