Raw backtest_session_* point counts for this api key in the window, by
outcome — not session-deduplicated, so the three need not balance (a
pre-start failure has no started; a cross-manager handoff re-emits
started). Billing uses the compute totals, which derive only from the
terminal completed/failed points.