Skip to content
AI Primer
breaking

Ramp reports business AI token spend at 13x January 2025 levels

Ramp data and operator reports said enterprise AI token spending is rising far faster than budget controls and procurement cycles. Teams should plan for routing, cheaper defaults, and spend caps to become core engineering infrastructure.

5 min read
Ramp reports business AI token spend at 13x January 2025 levels
Ramp reports business AI token spend at 13x January 2025 levels

TL;DR

You can read Ramp's AI Index, Simon Willison's product-market-fit post, and the sprawling Hacker News thread. The interesting split is that the top-line adoption story came with a cost-discipline warning inside Ramp's own framing, while operators on X and HN were already talking about routing layers, spend caps, and entire AI budgets getting torched early.

Ramp's index turned adoption into a cost story

Ramp's May 2026 index gave Anthropic 34.4% of business adoption versus OpenAI's 32.3%, but the more useful number was the spend curve: average business token spend was up 13x from January 2025, according to arakharazian's post linking back to Ramp's report.

That same summary said tokens are still less than 2% of total business spend even for the highest spenders. Ramp's framing, as relayed by arakharazian and kimmonismus, was that this share is small today but the growth rate is the part that looks hard to sustain.

kimmonismus also highlighted the contradiction inside the report: Anthropic was winning the business-adoption snapshot at the same moment inference platforms offering cheaper open models were among the fastest growers on Ramp's platform.

Budgeting broke before governance existed

Ethan Mollick's most concrete point was temporal. Tokens went from something companies did not budget for a year ago to something many teams now treat as required for coding, per his first post.

His follow-up claimed some large organizations had already blown through their entire token budget in the first couple months of the year, according to Mollick's follow-up. A third Mollick post pushed the next problem down to the manager level: who gets the expensive model, by skill level, by project, or by some other rule.

That same budget-cap logic showed up in TheEthanDing's CIO and CFO anecdote, which described enterprises waiting for someone to normalize explicit LLM spend ceilings in the $1,000 to $3,000 per user per month range. The point was not that one cap has won, but that token allocation is turning into a policy problem rather than a quiet API line item.

Routing and gateways are becoming the control plane

The operator playbook in the evidence pool was unusually consistent:

  1. Route simpler workloads to cheaper providers, as GergelyOrosz put it.
  2. Use cheaper open models through vendors like Fireworks and Baseten, again per that thread.
  3. Make the default model the cheap one, also from GergelyOrosz.
  4. Add a gateway layer so cost controls are enforced centrally, which Hacubu's retweet of Harrison Chase said is one of the main benefits customers get from LangSmith's LLM gateway.

Simon Willison's post explains why that infrastructure category is suddenly hot. He estimated that his own last 30 days of Claude Code and Codex use would have cost about $2,180 at API prices, while flat-rate plans cost him $200. In the accompanying Hacker News thread, commenters split between people saying they would gladly pay full price for the value and people arguing the economics invite open-weight substitution and smarter routing.

The HN discussion summary captured both sides cleanly: one branch focused on real value from heavy token use, while another focused on open-source competition and skepticism that premium model vendors keep those margins once cost discipline arrives.

AI deployment is creating a new implementation layer

The last useful reveal came from levie's thread, which shifted the story from token bills to org design. He argued that once enterprises move from chat-plus-search to agents wired into production systems, the work multiplies into data protection, access controls, observability, workflow redesign, human review points, and repeated upgrades whenever model capabilities change.

That thread also sketched who does the work. Some companies are repurposing internal IT talent, according to levie, but he also described demand for internal FDE-like roles, vendor-side applied AI architecture teams, and new services firms built around AI implementation. Rising token spend is only one line on the invoice, the bigger story is how much new operating machinery enterprise AI is dragging in behind it.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 3 threads
TL;DR1 post
Ramp's index turned adoption into a cost story1 post
Budgeting broke before governance existed2 posts
·
Other sources· 1 post

I think Anthropic and OpenAI have found product-market fit

Anthropic are strongly rumored to be about to have their first profitable quarter. Stories are circulating of companies surprised at how expensive their LLM bills are becoming from usage by their staff. I think this is because OpenAI and Anthropic have both found product-market fit. Enterprise customers are now paying API prices I think they've found product-market fit And they're ramping up The AI-failure stories around this are pretty thin We also know the labs are spending a lot API revenue is becoming less important April is a new inflection point Enterprise customers are now paying API prices I currently subscribe to the $100/month Max plan from Anthropic and the $100/month Pro plan from OpenAI. If you are a heavy user of coding agents these plans are a fantastic deal. I just ran the ccusage tool on my laptop to get an estimate of how much I would have spent if I were to pay for API tokens in the past 30 days and got: $1,199.79 for Anthropic Claude Code $980.37 for OpenAI Codex That's $2,180.16 worth of tokens for $200 - not bad at all! I'm a moderately heavy user of these tools, but I'm certainly not running agents every hour of the day and night. I had assumed that companies making extensive use of agents were getting similar discounts. It turns out I could not have been more wrong about that. I haven't been able to track down the exact date, but at some point in the last six months Anthropic switched their Enterprise plan (originally "Claude seats include enough usage

Share on X