FWIW here's the results of my experiments, take it with a HUGE grain of salt, because there could be bugs, missing\wrong data, anything.. Also, I'm running the whole backtest with the latest parameters that I'm using in real trading right now, as opposed to recalculating instrument and forecast weights as I go, optimizing them on the available data at the time(more correct). Also, as my weights are constant and many instruments only have recent data, a lot of risk-capital isn't allocated in the beginning..
Again, the general methodology here is to take the normal system with smaller number of instruments and a larger system with higher capital and larger number of instruments but apply the normal Markowitz-style risk overlay to the larger system such that it's realized risk (and therefore capital requirements) become the same as of the smaller system. This results in path-dependence, as when-ever the risk-capital is exhausted the new high-forecast instruments will be denied positions until some existing positions are exited.
I had to play with the "normal" risk overlay parameter quite a bit to make realized risk of the larger system to be as close as possible to the realized risk of the small\normal system to be able to compare apples to apples. The system's PnL takes trading costs into account, but I wouldn't vouch for it's accuracy

: the backtests assumes fills on the worst side of the spread (the bid-ask spread for each instrument is loosely calculated as the average daily spread for this contract, might be incorrect in many places in my DB

) + 5$ commission per trade.
View attachment 262393
View attachment 262394
So what I can see apart from the 1.2 increase in P&L is that the realized risk of the larger system is more even, less jumpy, that makes sense(more instruments, better diversification), also there are fewer extremes.
The number of trades increased mildly, only by 1.14, so that doesn't look too bad.
Margin usage improved, i.e. it's slightly higher on average and also seems less jumpy on the larger system and less often goes to almost zero.
What is tempting from these results, but is most-likely wrong, is to assume that I can slightly increase the target risk of the larger system simply because I'm adding more instruments. I.e. I can almost trade a little more instruments for free because the greater diversification reduces my risk and makes it more nicely-behaved. But of course this measure of risk is not perfect, and increasing system risk simply because I trade more instruments is probably dangerous because of idiosyncratic and unseen risk (that's essentially we're capping IDM to 2.5 as I understand, because we don't want to believe in the better diversification completely)..
So I dunno, as I mentioned, don't consider these results very accurate, as maybe I'm just seeing what I want to see and my system and data, especially far back might not be accurate..