BlueTrebd has return about 12%annualized since inception. Interesting references to comments by famous Leda Braga who runs the fund about system design and data-mining. Anyone with knowledge of the details of bootstrap tests please comment on relevance of p-values calculated for the fund. Author makes some interesting comments about hypothesis testing and relation to data-mining.
Blog link
Bootstrapping provides a means of estimating the Sampling Distribution.
Statistical inference has three distributions:
- Data distribution in the Population (e.g. distribution of all conceivable daily returns that would be earned by the rule's signal “over the immediate practical future”)
- Data distribution in the Sample of size N (e.g. distribution of daily returns in the backtested sample)
- The Sampling Distribution, i.e. the Data distribution of the Sample Statistic, the attribute of the sample that is of interest (e.g. average daily return, or average monthly return, or Sharpe Ratio, or Profit Factor, etc.).
Bootstrapping is one method of estimating the Sampling Distribution (of say the average monthly return) by resampling with replacement from an original sample, and under the assumption that the rule being tested has no predictive value (e.g. so that the generated Sampling Distribution of average monthly returns will be centred on zero). http://www.amazon.com/Computer-Intensive-Methods-Testing-Hypotheses-Introduction/dp/0471611360
Once the Sampling Distribution has been generated, hypothesis testing can be done along the lines of:
H0: Rule generates an average monthly return <= 0
H1: Rule generates an average monthly return > 0
The observed average monthly return of X% per month corresponds to a p-value of Y if we use the Sampling Distribution generated by Bootstrapping.
So we accept/reject H0 at the Z% level, etc.