You have an incomplete system.
Perhaps you got a bit over anxious to trade after the back test alluded to profits.
Incomplete is incomplete, regardless of PL behavior. The path that is happening is the same as a manual trader going off half baked because they see profit potential and just starts trading.
No ChatGPT, band-aid fixes or even surgical fixes will make the system complete. This is why you write your own code. So you know when the code is collapsing under its own weight and needs a new architecture. Problem is you cannot accurately make this assessment because you did not write the code or cannot code review it enough to add the incomplete parts
Ultimately, only highly experience software and or traders with algo experience should have others write code. You simply did not spec out the code sufficient enough.
Fun fact, 70% of the code in MS word is error handling. How much of this trading code is error handling?
Same mistake as manual trading. Chasing $$$ instead of fundamentals.
Sorry to be a downer, but just trying to keep it real.
BTW: since June, I have been running an ES algo. Trades from open +2.5 hours. Single contract, single position. Up 3.5K on 20K start. PL and risk aside, only one error in order handing, which was minor and fixed. Hit over a dozen "error" conditions and handled all but the one mentioned above. The code was to test the waters of live trading, and not lose money. Now I can start-continue work on the "Make money" - "risk adjusted" version.