If you do many trials and tests you will possibly find a system with PF > 2 that is random. The question is not the metric value but how you test it for randomness. Some good reads:
http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2326253
http://www.priceactionlab.com/Blog/2012/06/fooled-by-randomness-through-selection-bias/
People need to understand that only metrics based on real performance count. One can easily cheat both IS and OOS performance with repeated backtests. Those links were pretty good info.