Two ways you can do this:
1- Scrap 1 minute data and get tick data direct from the exchange. With good data you will see every quote (ticks are irrelevant). Then you can see the spread 100% and do your stats.
2- Do a statistical run and compare bar close with next bar open. If it's 50% of the time higher or lower by one tick, that means the spread is 1 tick 100% of the time. If the difference is 2 ticks 10% of the time, that means the spread is 2 ticks 20% of the time. This is stochastic sampling and you need a lot of 1-minute data. The more data you have obviously, the lower margin of error (confidence interval). This assumes you are talking about the usual 1-minute trade data (not midpoint or something).