The reason why the "best predictor" of the sign of x(i+1) - x(i) must be opposite to x(i) - 0.5 is due to anti-correlation and that x(i) is a UDRV in [0,1].Quote from phattails:
I took a look at that paper and I could be wrong in my analysis, but the results seem trivial. The E[X|x>.5]= .75 assuming a Continuous uniform dist. from 0 to 1. So if we are given a value of x, s.t. X>.5 or x<.5. Then our cond. expected values will be .75 and .25 respectively. So how often will any x be less
than .75? of course it's 75% of the time. Does this seem like prediction? If I referenced any value in the sequence that was < or > .5 I could say that 75% of the time I can predict the increment.
Regarding the synthetic data I say start with a simple random walk and start modelling and developing and then work up different stylized facts in order to isolate the relevant features.
Equation (3) is wrong, anti-correlation is not -1/4 in that example. Clearly the authors screwed up the calculation of E[x(i)^2] - 3rd term in expansion. I can assure you it is not 1/2 for a UDRV in [0,1].
But the derivation for general distributions looked correct last time I went through this.


