Gemini’s new usage limits are live, and users already seem frustrated about it
Source: 9to5Google
Google Announces Compute‑Based Usage Limits for Gemini
[Image: Gemini app redesign])
As part of a massive slate of new AI announcements, Google revealed new compute‑based usage limits for Gemini during I/O 2026. Existing users aren’t thrilled.
What’s changing?
- Limits now consider chat length, features used, and prompt complexity – the more compute a prompt consumes, the faster you burn through your quota.
- There is an overall weekly limit plus 5‑hour refresh windows.
- Google says the change “is a better way to allocate limits, because a simple text prompt uses far less compute than a complex video or coding prompt.”
Tier comparison (post‑change)
| Plan | Monthly price | Relative usage limit* |
|---|---|---|
| Free | $0 | 1× |
| AI Plus | $7.99 | 2× |
| AI Pro | $19.99 | 4× |
| AI Ultra (100) | $100 | 5 × Pro (≈ 20× Free) |
| AI Ultra (200) | $200 | 20 × Pro (≈ 80× Free) |
*Limits are expressed as multiples of the free‑tier quota.
Previously Google only described the plans with vague terms like “more” or “higher” without publishing exact figures.
Email excerpt (effective May 20)
Usage limits in the Gemini app:
For the Gemini app, we’re introducing compute‑based usage limits that factor in the complexity of your prompt, the features you use, and the length of your chat. Your limit refreshes every 5 hours until you reach your weekly limit. As an AI Pro subscriber, you’ll enjoy a 4× higher usage limit than non‑subscribers.
Community reaction
- Reddit – many users criticize the wording that compares Pro to the free tier rather than to the previous Pro limits.
- Some argue the value proposition is worse: AI Pro now offers less usage per dollar than AI Plus, even though Pro includes additional perks (e.g., YouTube Premium Lite).
- Others simply dislike the stricter caps, noting that the industry as a whole is moving toward tighter resource controls.
Relevant threads:
- “Fun new changes were heavily tanking your usage”
- “New usage limit just destroying Gemini’s biggest strength”
Google’s support page shows a dramatic drop in limits compared with the pre‑change numbers:
Why the shift?
AI services are scaling rapidly, and the underlying hardware is under supply strain—a shortage that was itself driven by the AI boom. Google’s move reflects a broader industry trend to manage compute resources more granularly.
Quick ways to check your quota
- Visit gemini.google.com/usage to see real‑time consumption.
- In the Gemini app, the usage bar updates after each 5‑hour refresh.
Additional notes
- Google has partially eased the tighter limits by raising Antigravity rate limits 3× permanently (Mohan Solo’s tweet).
- The new Gemini 3.5 Flash model is much more compute‑intensive. Early data shows a single prompt can consume several percentage points of a 5‑hour block, prompting complaints about rapid quota depletion.
[Image: Gemini usage screenshot])
What do you think?
Are the new compute‑based limits a reasonable way to keep the service sustainable, or do they make Gemini less attractive compared with competing AI platforms? Share your thoughts below!
More on Gemini
- Gemini Omni, the ‘create anything’ model, starts today with lifelike video
- Google is adding AI detection for photos, videos, and audio to Search and Chrome
- Gemini app rolling out ‘Neural Expressive’ redesign, 3.5 Flash, 24/7 Spark agent, & Daily Brief
Follow Ben: Twitter/X • Threads • Bluesky • Instagram
Updated to include a support page on limits, clarify how Google previously advertised AI plan upgrades.

FTC: We use income‑earning auto affiliate links. More…