Google launches Gemini 3.1 Pro, retaking AI crown with 2X+ reasoning performance boost
Source: VentureBeat
Google’s Gemini 3.1 Pro Reclaims the AI Crown
Late last year, Google briefly took the crown for the most powerful AI model in the world with the launch of Gemini 3 Pro, only to be surpassed within weeks by newer models from OpenAI and Anthropic, a common occurrence in the fiercely competitive AI race.
Now Google is back to retake the throne with an updated version of that flagship model: Gemini 3.1 Pro, positioned as a smarter baseline for tasks where a simple response is insufficient. It targets science, research, and engineering workflows that demand deep planning and synthesis.
Early evaluations by the third‑party firm Artificial Analysis show that Gemini 3.1 Pro has leapt to the front of the pack and is once again the most powerful and performant AI model in the world.
A Big Leap in Core Reasoning
The most significant advancement in Gemini 3.1 Pro lies in its performance on rigorous logic benchmarks. Most notably, the model achieved a verified score of 77.1 % on ARC‑AGI‑2, a benchmark designed to evaluate a model’s ability to solve entirely new logic patterns it has not encountered during training.
This result represents more than double the reasoning performance of the previous Gemini 3 Pro model.

Beyond Abstract Logic
Internal benchmarks indicate that Gemini 3.1 Pro is highly competitive across specialized domains:
| Domain | Metric |
|---|---|
| Scientific Knowledge | 94.3 % on GPQA Diamond |
| Coding | Elo 2887 on LiveCodeBench Pro; 80.6 % on SWE‑Bench Verified |
| Multimodal Understanding | 92.6 % on MMMLU |
These technical gains are not just incremental; they reflect a refinement in how the model handles “thinking” tokens and long‑horizon tasks, providing a more reliable foundation for developers building autonomous agents.
Improved Vibe Coding and 3D Synthesis
Google is demonstrating the model’s utility through “intelligence applied”—shifting the focus from chat interfaces to functional outputs.
One of the most prominent features is the model’s ability to generate vibe‑coded animated SVGs directly from text prompts. Because these are code‑based rather than pixel‑based, they scale to any resolution and stay far smaller than traditional video files, while delivering detailed, presentation‑ready visuals for websites, presentations, and other enterprise applications.
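To make the code‑based versus pixel‑based distinction concrete, here is a minimal hand‑written sketch in Python. It is not output from Gemini 3.1 Pro; the function name `animated_circle_svg` and its parameters are hypothetical, chosen only for illustration. It builds a small animated SVG as plain text, checks that it is well‑formed XML, and reports its size in bytes.

```python
import xml.etree.ElementTree as ET


def animated_circle_svg(size: int = 200, duration_s: float = 2.0) -> str:
    """Return a tiny animated SVG: a circle whose radius pulses in a loop.

    Hypothetical helper for illustration only; not produced by any model.
    """
    half = size // 2
    r_min, r_max = size // 10, size // 3
    return (
        # viewBox (not fixed pixel dimensions) is what lets the image scale.
        f'<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 {size} {size}">'
        f'<circle cx="{half}" cy="{half}" r="{r_min}" fill="steelblue">'
        # The <animate> element describes the motion declaratively,
        # so no video frames are stored.
        f'<animate attributeName="r" values="{r_min};{r_max};{r_min}" '
        f'dur="{duration_s}s" repeatCount="indefinite"/>'
        "</circle></svg>"
    )


svg = animated_circle_svg()
root = ET.fromstring(svg)  # raises ParseError if the markup is malformed
size_bytes = len(svg.encode("utf-8"))
print(f"{size_bytes} bytes of markup, root element: {root.tag}")
```

The whole animation is a few hundred bytes of text, and because the geometry is described by coordinates inside a `viewBox`, a browser can render it at any size without loss of sharpness.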
Showcased Applications
- Complex System Synthesis – The model configured a public telemetry stream to build a live aerospace dashboard visualizing the International Space Station’s orbit.
- Interactive Design – In a demo, 3.1 Pro coded a complex 3‑D starling murmuration that users can manipulate via hand‑tracking, accompanied by a generative audio score.
- Creative Coding – The model translated the atmospheric themes of Emily Brontë’s Wuthering Heights into a functional, modern web design, demonstrating an ability to reason through tone and style rather than just literal text.
Business Impact and Community Reactions
Enterprise partners have already begun integrating the preview version of 3.1 Pro, reporting noticeable improvements in reliability and efficiency.
Vladislav Tankov, Director of AI at JetBrains, noted a 15 % quality improvement over previous versions, stating the model is “stronger, faster… and more efficient, requiring fewer output tokens.”
Industry Reactions
- Databricks – CTO Hanlin Tang reported that the model achieved “best‑in‑class results” on OfficeQA, a benchmark for grounded reasoning across tabular and unstructured data.
- Cartwheel – Co‑founder Andrew Carr highlighted the model’s “substantially improved understanding of 3D transformations,” noting it resolved long‑standing rotation‑order bugs in 3D animation pipelines.
- Hostinger Horizons – Head of Product Dainius Kavoliunas observed that the model understands the “vibe” behind a prompt, translating intent into style‑accurate code for non‑developers.
Pricing, Licensing, and Availability
For Developers
| Feature | Pricing |
|---|---|
| Input (prompt) tokens | $2.00 / 1M tokens for prompts ≤ 200k; $4.00 / 1M tokens for prompts > 200k |
| Output (completion) tokens | $12.00 / 1M tokens for prompts ≤ 200k; $18.00 / 1M tokens for prompts > 200k |
| Context caching | $0.20 – $0.40 / 1M tokens (depends on prompt size); storage fee: $4.50 / 1M tokens per hour |
| Search grounding | First 5,000 prompts/month free; beyond that, $14 / 1,000 search queries |
Note: Gemini 3.1 Pro keeps API costs at the level of the original Gemini 3 Pro launch, so the substantial performance boost comes at no extra cost and improves the model’s “reasoning‑to‑dollar” ratio.
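For a rough sense of what these rates mean per request, the following Python sketch applies the listed prices. It rests on assumptions that should be checked against Google’s official pricing page: that the prompt size alone selects the tier for both input and output tokens, and that the free search‑grounding allowance and the paid rate count the same unit. The function names are hypothetical, and context caching is left out because the table gives only a range.

```python
from dataclasses import dataclass

LONG_PROMPT_THRESHOLD = 200_000  # tokens; tier boundary taken from the table
FREE_GROUNDED_QUERIES = 5_000    # per month, per the table
GROUNDING_PRICE_PER_1K = 14.00   # USD per 1,000 queries beyond the free tier


@dataclass(frozen=True)
class Tier:
    input_per_million: float   # USD per 1M prompt tokens
    output_per_million: float  # USD per 1M completion tokens


STANDARD = Tier(input_per_million=2.00, output_per_million=12.00)
LONG = Tier(input_per_million=4.00, output_per_million=18.00)


def request_cost(prompt_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: the prompt size picks the tier for BOTH input and output.
    """
    tier = STANDARD if prompt_tokens <= LONG_PROMPT_THRESHOLD else LONG
    total = (prompt_tokens * tier.input_per_million
             + output_tokens * tier.output_per_million)
    return total / 1_000_000


def search_grounding_cost(queries_this_month: int) -> float:
    """Estimate the monthly USD cost of search grounding.

    Assumption: free prompts and billed queries are counted in the same unit.
    """
    billable = max(0, queries_this_month - FREE_GROUNDED_QUERIES)
    return billable * GROUNDING_PRICE_PER_1K / 1_000


short_request = request_cost(prompt_tokens=100_000, output_tokens=10_000)
long_request = request_cost(prompt_tokens=300_000, output_tokens=10_000)
print(f"short prompt: ${short_request:.2f}, long prompt: ${long_request:.2f}")
print(f"6,000 grounded queries: ${search_grounding_cost(6_000):.2f}")
```

Under these assumptions, crossing the 200k‑token threshold raises the price of every token in the request, so trimming a prompt just below the boundary can matter more than trimming the output.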
For Consumers
- The model is being rolled out in the Gemini app and NotebookLM.
- Google AI Pro and Ultra subscribers receive higher usage limits.
All prices are listed in U.S. dollars and are subject to change. Refer to the official Google AI pricing page for the most up‑to‑date information.
Licensing Implications
3.1 Pro is a proprietary model delivered through:
- Vertex Studio – Google Cloud console
- Gemini API – API documentation
It follows a standard commercial SaaS (Software‑as‑a‑Service) licensing model rather than an open‑source license.
What This Means for Enterprise Users
- Grounded reasoning within the security perimeter of Vertex AI, allowing businesses to run the model on their own data with confidence.
- The “Preview” status lets Google refine safety and performance before a full general‑availability release—a common practice for high‑stakes AI deployments.
Strategic Outlook
By emphasizing core reasoning capabilities and specialized benchmarks such as ARC‑AGI‑2, Google signals that the next phase of the AI race will be won by models that can think through problems, not merely predict the next token.