Gemini’s new usage limits are live, and users already seem frustrated about it

Published: (May 21, 2026 at 10:30 AM EDT)
4 min read
Source: 9to5Google

Source: 9to5Google

Google Announces Compute‑Based Usage Limits for Gemini

[Image: Gemini app redesign])

As part of a massive slate of new AI announcements, Google revealed new compute‑based usage limits for Gemini during I/O 2026. Existing users aren’t thrilled.


What’s changing?

  • Limits now consider chat length, features used, and prompt complexity – the more compute a prompt consumes, the faster you burn through your quota.
  • There is an overall weekly limit plus 5‑hour refresh windows.
  • Google says the change “is a better way to allocate limits, because a simple text prompt uses far less compute than a complex video or coding prompt.”

Tier comparison (post‑change)

PlanMonthly priceRelative usage limit*
Free$0
AI Plus$7.99
AI Pro$19.99
AI Ultra (100)$1005 × Pro (≈ 20× Free)
AI Ultra (200)$20020 × Pro (≈ 80× Free)

*Limits are expressed as multiples of the free‑tier quota.
Previously Google only described the plans with vague terms like “more” or “higher” without publishing exact figures.


Email excerpt (effective May 20)

Usage limits in the Gemini app:
For the Gemini app, we’re introducing compute‑based usage limits that factor in the complexity of your prompt, the features you use, and the length of your chat. Your limit refreshes every 5 hours until you reach your weekly limit. As an AI Pro subscriber, you’ll enjoy a 4× higher usage limit than non‑subscribers.


Community reaction

  • Reddit – many users criticize the wording that compares Pro to the free tier rather than to the previous Pro limits.
  • Some argue the value proposition is worse: AI Pro now offers less usage per dollar than AI Plus, even though Pro includes additional perks (e.g., YouTube Premium Lite).
  • Others simply dislike the stricter caps, noting that the industry as a whole is moving toward tighter resource controls.

Relevant threads:

Google’s support page shows a dramatic drop in limits compared with the pre‑change numbers:


Why the shift?

AI services are scaling rapidly, and the underlying hardware is under supply strain—a shortage that was itself driven by the AI boom. Google’s move reflects a broader industry trend to manage compute resources more granularly.


Quick ways to check your quota

  • Visit gemini.google.com/usage to see real‑time consumption.
  • In the Gemini app, the usage bar updates after each 5‑hour refresh.

Additional notes

  • Google has partially eased the tighter limits by raising Antigravity rate limits 3× permanently (Mohan Solo’s tweet).
  • The new Gemini 3.5 Flash model is much more compute‑intensive. Early data shows a single prompt can consume several percentage points of a 5‑hour block, prompting complaints about rapid quota depletion.

[Image: Gemini usage screenshot])


What do you think?

Are the new compute‑based limits a reasonable way to keep the service sustainable, or do they make Gemini less attractive compared with competing AI platforms? Share your thoughts below!

More on Gemini

Follow Ben: Twitter/XThreadsBlueskyInstagram

Updated to include a support page on limits, clarify how Google previously advertised AI plan upgrades.

9to5Google preferred source badge (dark) 9to5Google preferred source badge (light)

FTC: We use income‑earning auto affiliate links. More…

0 views
Back to Blog

Related posts

Read more »