Live scoreboard · api.statefulai.tech/public/metrics
Real metrics for an operational cognition layer.
Two A/B demos run continuously against the alpha API. Same prompt, same agent, same model — one arm uses ContextOS as operational memory, the other doesn't. Tokens, time, tool calls, and completion rates are captured for every run. Numbers below come from — measured demo comparisons. The dogfood timeline below mirrors this repo's own evolution.