Commentary

Grok Code Fast dominates OpenRouter but the data flatters xAI — here's what it actually means

Sep 15, 2025

Key Points

  • Grok Code Fast generates 1.06 trillion tokens on OpenRouter, roughly three times Claude Sonnet 4's volume, but OpenRouter captures only about 5% of Anthropic's actual API revenue.
  • Grok's dominance reflects a specific win in price-sensitive, high-volume workloads where cheap-enough inference beats frontier reasoning models on total cost.
  • The ranking is a marketing snapshot: it masks the fact that most developers using Claude, GPT-4o, and Gemini call those models through direct APIs rather than OpenRouter's commodity inference market.

Summary

Grok Code Fast dominates OpenRouter, but the ranking is misleading

Grok Code Fast is generating 1.06 trillion tokens on OpenRouter, compared to 343 billion for Claude Sonnet 4 and 72 billion for GPT-4o. Taken at face value, that gap suggests xAI has built something roughly three times larger than Anthropic's offering and nearly fifteen times larger than OpenAI's. The numbers are deceptive.
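The ratios implied by these token counts can be checked directly. The sketch below uses only the three volumes quoted above; the dictionary and function names are illustrative, not taken from any OpenRouter API.

```python
# Token volumes on OpenRouter as quoted in this commentary.
TOKENS = {
    "Grok Code Fast": 1.06e12,   # 1.06 trillion
    "Claude Sonnet 4": 343e9,    # 343 billion
    "GPT-4o": 72e9,              # 72 billion
}


def volume_ratio(model: str, baseline: str) -> float:
    """Return how many times more tokens `model` generated than `baseline`."""
    return TOKENS[model] / TOKENS[baseline]


for baseline in ("Claude Sonnet 4", "GPT-4o"):
    ratio = volume_ratio("Grok Code Fast", baseline)
    print(f"Grok Code Fast vs {baseline}: {ratio:.1f}x")
```

On the quoted figures, that works out to roughly 3x Claude Sonnet 4 and close to 15x GPT-4o.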

OpenRouter is a price-sensitive market. Users route inference to whatever offers the best price-to-intelligence ratio. Most developers using Claude, Gemini, or OpenAI's flagship models hit those platforms' official APIs directly because those APIs expose parameters and features OpenRouter doesn't. The volume numbers skew toward cheap, commodity inference.

Anthropic's leaked revenue is roughly $4 billion annually, or about $250 million in monthly business. Claude Sonnet 4 on OpenRouter accounts for roughly $12 million of that monthly figure, or about 5% of actual revenue. The other 95% flows through Anthropic's direct API. The same pattern holds for OpenAI.
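The revenue-share arithmetic is easy to reproduce. The sketch below uses only the dollar figures quoted above, all of them approximate; the variable and function names are illustrative. It also computes the share against the annual figure divided by twelve (about $333 million a month), since that is a somewhat larger base than the quoted monthly figure.

```python
# Approximate figures quoted in this commentary, in US dollars.
ANNUAL_REVENUE = 4_000_000_000          # "roughly $4 billion annually"
MONTHLY_REVENUE_QUOTED = 250_000_000    # "about $250 million" monthly
OPENROUTER_MONTHLY = 12_000_000         # Claude Sonnet 4 via OpenRouter


def share(part: float, whole: float) -> float:
    """Return `part` as a percentage of `whole`."""
    return 100.0 * part / whole


share_quoted = share(OPENROUTER_MONTHLY, MONTHLY_REVENUE_QUOTED)
share_from_annual = share(OPENROUTER_MONTHLY, ANNUAL_REVENUE / 12)

print(f"Share of quoted monthly revenue: {share_quoted:.1f}%")
print(f"Share of annual revenue / 12:    {share_from_annual:.1f}%")
```

Either base leaves OpenRouter in the low single digits of Anthropic's revenue, so the conclusion does not hinge on which monthly figure is used.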

Grok Code Fast's dominance reflects a specific market win: it delivers affordable, usable intelligence at scale. The model competes not against frontier reasoning systems like Claude and GPT-4o but against open-source alternatives like Llama. It's cheap enough that even requiring 10 inference runs to match a single frontier model run still costs less while delivering comparable results. For high-volume, price-sensitive workloads, that's a genuine advantage.
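The "10 runs still costs less" claim comes down to a price ratio: the cheap model wins on total cost as long as the number of runs it needs stays below the frontier price divided by the cheap price. The sketch below illustrates that break-even logic. The prices and token count are hypothetical placeholders chosen for round numbers, not actual xAI, Anthropic, or OpenAI pricing, and it assumes every run consumes the same number of tokens.

```python
def total_cost(price_per_million: float, tokens_per_run: int, runs: int) -> float:
    """Total dollar cost of `runs` inference runs at a per-million-token price."""
    return price_per_million * tokens_per_run / 1_000_000 * runs


def break_even_runs(cheap_price: float, frontier_price: float) -> float:
    """Number of cheap-model runs that cost the same as one frontier run."""
    return frontier_price / cheap_price


# Hypothetical placeholder values, NOT real vendor pricing.
CHEAP_PRICE = 1.0        # dollars per million tokens (assumed)
FRONTIER_PRICE = 15.0    # dollars per million tokens (assumed)
TOKENS_PER_RUN = 50_000  # tokens per run, same for both models (assumed)

cheap_total = total_cost(CHEAP_PRICE, TOKENS_PER_RUN, runs=10)
frontier_total = total_cost(FRONTIER_PRICE, TOKENS_PER_RUN, runs=1)
limit = break_even_runs(CHEAP_PRICE, FRONTIER_PRICE)

print(f"10 cheap runs:  ${cheap_total:.2f}")
print(f"1 frontier run: ${frontier_total:.2f}")
print(f"Cheap model stays cheaper below {limit:.0f} runs")
```

Under these assumed prices, ten cheap runs still undercut one frontier run. With a narrower price gap the break-even point drops and the advantage can disappear, so the claim depends on the real price ratio.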

One confounding factor: Grok is still free in Cursor and other routing layers, so some of those tokens may flow through OpenRouter without users actively choosing the model.

The story is mildly bullish for xAI, which has built something people want at a specific price point. But the ranking is a marketing snapshot, not evidence that Grok has overtaken frontier models in overall demand.