Live scoreboard · api.statefulai.tech/public/metrics

Real metrics for an operational cognition layer.

Two A/B demos run continuously against the alpha API. Same prompt, same agent, same model — one arm uses ContextOS as operational memory, the other doesn't. Tokens, time, tool calls, and completion rates are captured for every run. Numbers below come from — measured demo comparisons. The dogfood timeline below mirrors this repo's own evolution.

Open dashboard Raw JSON

last refresh · 5/28/2026, 9:27:19 AM