You will get far more dramatic speedup by rewriting your strategy in a compiled language like C++, than by trying to wring more performance out of your hardware.
It is a lot of work, but if you are serious about accelerating your calculations, it's the only way to go. By tweaking hardware...