Enforcing new limits and retiring Opus 4.6 Fast from Copilot Pro+
Source: GitHub Changelog
GitHub Copilot’s rapid growth has led to increased patterns of high concurrency and intense usage. While many of these workloads are legitimate, they place significant strain on our shared infrastructure and operating resources. To ensure every user gets a fast, reliable Copilot experience, we’re updating limits to better balance capacity. These changes will roll out over the next few weeks and include:
- Limits for overall service reliability
- Limits for specific models or model‑family capacity
What this means for you
- When you hit a service reliability limit, you will need to wait until your current session resets. This will be visible in the error experience when you are rate‑limited.
- When you hit a usage limit for specific models or a model family, you can switch to an alternative model or use Auto mode.
We recommend distributing requests more evenly over time when possible, rather than sending them in large, concentrated waves. You can also upgrade your plan for higher limits.
We know limits can be frustrating and are actively exploring new ways to offer increased capacity for all users. We will share updates as we identify durable solutions. Learn more in our docs about rate limiting.
To further improve service reliability, we are streamlining our model offerings and focusing resources on the models our users use the most. As a first step, we’ll be retiring Opus 4.6 Fast for Copilot Pro+ users, beginning today. We recommend using Opus 4.6 as an alternative model with similar capabilities.