Users report stricter Claude Code request caps, weeklong cooldowns, and desktop threads disappearing after restarts. Watch quotas closely and shift to lighter models or token-cutting workflows around /context and /clear if the limits hit your workflow.

You can trace the official cost story through Anthropic's docs on cost management, best practices, status lines, and experimental agent teams. The weirder part is how closely that advice sits next to same-day bug reports on GitHub, including aggressive limit complaints, a new-session usage-limit report, and a Windows desktop bug where user messages vanished from history after the assistant finished responding weeklong cooldown screenshot.
The March 29 complaints describe something harsher than the usual "you used a lot of tokens" story. One user posted a reset date on April 3, almost a week away, while another said the problem was requests per minute, not just aggregate usage, and that concurrent agents were what got punished first.
The GitHub issue queue showed the same pattern that day. In one report, a user said they hit a five-hour daily limit repeatedly despite light recent usage, with a quarter of their weekly limit gone after roughly four hours of work. Another said a new Claude Pro account hit a usage limit after seven minutes in a fresh session.
Anthropic's own docs make the concurrency angle hard to ignore. The agent teams page describes "multiple Claude Code instances working together as a team," with a lead session coordinating teammates across separate contexts.
That is almost the exact workflow users say got hit. One March 29 complaint described Claude Code as effectively unusable for a swarm setup that checked in on multiple agents every few minutes, and contrasted it with features the user missed elsewhere:
Those missing features matter here because they are the mechanics that make a multi-agent terminal workflow cheap to supervise.
The strongest user-side mitigation thread started with /context, which breaks down what is already occupying the prompt before any coding starts. One user said unused MCP servers and extra skills consumed 35 percent of context before work began, then fell to 10 percent after they trimmed them.
That lines up with Anthropic's own documentation. The best practices page warns that performance degrades as the context window fills, while the status line docs explicitly pitch a bottom-bar script for tracking context usage and session cost in real time. The cost guide says average usage stays below $12 per developer per day for 90 percent of users, but also notes wide variance depending on how many instances are running.
The thread's concrete token cuts were straightforward:
CLAUDE.md short, because it loads into every session context management tipsThe other same-day failure mode was the desktop client itself. One user said Code-mode threads on Windows kept disappearing after app restarts even though the threads still seemed to exist underneath.
A matching GitHub bug from March 29 reported that user messages in the Claude Code desktop app intermittently vanished from chat history after the assistant response finished rendering. During generation the message was visible, then it disappeared, leaving two assistant replies stacked together. That issue was closed as a duplicate, which suggests the report was not isolated.
Between the cooldown complaints and the history bugs, March 29 looked less like one quota edge case and more like a rough day for the people using Claude Code as a persistent, multi-session workstation.
Okay, these are getting more severe. I'll have to wait nearly a week. Also, my threads in the Code Windows desktop app keep disappearing; apparently, they still exist, but I cannot see them after I restart the app. This has happened over and over.
Since Claude Code is nearly useless to me until these new draconian rate limits go away (note: I’m not talking about usage limits; these are limits on the number of requests per minute, basically penalizing the use of concurrent agents), I thought I’d list the 3 biggest features Show more
Since Claude Code is nearly useless to me until these new draconian rate limits go away (note: I’m not talking about usage limits; these are limits on the number of requests per minute, basically penalizing the use of concurrent agents), I thought I’d list the 3 biggest features Show more
2/ First, run /context for a quick self-check. This command shows you exactly what's eating your tokens: System Prompt, MCP Tools, Memory Files. The first time I ran it, I found that 35% of my context was already gone before I had even started coding. The main reason was too Show more
5/ Don't send raw logs straight to Claude. Raw command output can easily run into thousands of lines. Send them as-is, and a single conversation can blow up your token usage. Use the open-source tool RTK (Rust Token Killer), which filters and compresses command output before Show more
Okay, these are getting more severe. I'll have to wait nearly a week. Also, my threads in the Code Windows desktop app keep disappearing; apparently, they still exist, but I cannot see them after I restart the app. This has happened over and over.