Track blind-read accuracy, not just confidence
A good measurement loop starts before the outcome is visible: the read is recorded first, and only then is the result revealed. If the outcome is already leaking into the session, the score reflects hindsight more than the quality of the read.
Blind accuracy is not the only metric, but it is still the cleanest way to see whether the read itself is improving.
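
One way to keep the two numbers apart is to log each read with a flag for whether it was made blind, then score only the blind ones. The sketch below is a minimal illustration, not a prescribed method: the `Read` record, its field names, the "A"/"B" labels, and the sample log are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Read:
    # All field names are illustrative assumptions, not from the original text.
    call: str          # what was predicted, e.g. "A" or "B"
    confidence: float  # self-reported confidence, 0.0 to 1.0
    outcome: str       # what actually happened
    blind: bool        # True only if the call was logged before the outcome was visible


def blind_accuracy(reads: List[Read]) -> Optional[float]:
    """Fraction of blind reads whose call matched the outcome (None if no blind reads)."""
    blind = [r for r in reads if r.blind]
    if not blind:
        return None
    return sum(r.call == r.outcome for r in blind) / len(blind)


def mean_blind_confidence(reads: List[Read]) -> Optional[float]:
    """Average self-reported confidence over the same blind reads (None if no blind reads)."""
    blind = [r for r in reads if r.blind]
    if not blind:
        return None
    return sum(r.confidence for r in blind) / len(blind)


# Made-up sample log for illustration only.
log = [
    Read("A", 0.9, "A", True),
    Read("B", 0.8, "A", True),
    Read("A", 0.7, "A", False),  # outcome was already visible, so it is excluded
    Read("B", 0.6, "B", True),
    Read("A", 0.9, "B", True),
]

print("blind accuracy:", blind_accuracy(log))
print("mean confidence on blind reads:", mean_blind_confidence(log))
```

In this made-up log, half of the blind reads are correct while the average confidence on those same reads is roughly 0.8. A gap like that is what confidence alone would hide.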