NVIDIA Vera CPU Is ‘Packing a Heavy-Hitting Punch’ Against Competition

Published: (May 26, 2026 at 05:15 PM EDT)
4 min read

Source: NVIDIA AI Blog

The shift to agentic AI creates a new CPU requirement for the AI factory: fast cores, massive memory bandwidth, and the ability to sustain high performance when all cores are active.

Initial benchmark results published by Phoronix show that the NVIDIA Vera CPU meets this need. For this first public look, the benchmark scope was centered on the agentic workloads Vera was designed for in the modern data center.

The Vera CPU delivers the throughput AI factories need while optimizing platform power. Eighty‑eight NVIDIA custom Olympus cores, 1.2 TB/s of memory bandwidth, and a high‑speed, on‑chip fabric result in a CPU platform that combines core performance and memory bandwidth in an efficient power envelope.

NVIDIA Olympus Delivers Aggressive Performance

At the heart of Vera are custom NVIDIA Olympus CPU cores. Fully compatible with the Armv9.2 instruction set architecture, Olympus is designed for the sequential CPU work underpinning agentic AI: branch‑heavy runtimes, sandboxed code, data processing, and orchestration.

Vera’s monolithic die, wide cores, advanced branch prediction, and the second‑generation NVIDIA Scalable Coherency Fabric help keep data moving across all 88 cores.

Phoronix’s testing of a single‑socket Vera CPU — rated at 450 W TDP with less than 30 W of memory power — showed outstanding performance within that power profile, with generational gains across a broad array of workloads spanning code compilation, file compression, video transcoding, Python, Java, and database management. These are the same CPU‑heavy tasks that agents and AI factories run every day: compiling code, executing runtimes, compressing data, querying databases, and coordinating large software stacks.

“Going into this, I didn’t really know what to expect of NVIDIA’s Vera with the new Olympus cores,” wrote Michael Larabel, founder and principal author of Phoronix. “But in the end I was left realizing this is the most formidable competition to Intel and AMD x86_64 processors ever realized.”

‘Incredible Advantage’ in Memory Performance

Agentic workloads are not limited by core count alone. They need high core utilization and sustained memory bandwidth, making memory performance per watt a critical part of overall CPU efficiency.

Vera incorporates a second‑generation LPDDR5X memory subsystem, enabling dramatically lower energy per bit compared with DDR5. This allows Vera to offer up to 1.2 TB/s of bandwidth — up to 2× the peak memory bandwidth of traditional CPUs — while consuming less than 30 W of memory power (versus >100 W for DDR5‑based systems).

In Phoronix STREAM TRIAD testing, Vera sustained 90 % of its peak memory bandwidth, achieving the highest percentage of rated peak bandwidth of any CPU tested by Phoronix, and delivered over 4× the memory bandwidth per core compared with traditional x86 CPUs.

“NVIDIA Vera with its LPDDR5X memory was showing its incredible advantage in memory performance over current Intel Xeon and AMD EPYC processors,” Larabel wrote.

Separate testing by Prime Intellect found that Vera maintained high bandwidth and low, consistent memory latency as more workloads ran in parallel — the predictable performance needed for agentic AI.

A Large Generational Leap — and Leadership in Phoronix Testing

Compared with the prior‑generation NVIDIA Grace CPU, Vera delivered a 1.6× geometric‑mean increase in Phoronix’s testing — an impressive generation‑over‑generation gain.

“The difference from Grace to Vera was consistently exceeding my expectations for gen‑on‑gen performance we typically see for processors,” Larabel wrote. “NVIDIA’s Vera CPU with its in‑house‑designed Olympus cores ends up packing a heavy‑hitting punch with competitiveness to Intel/AMD x86_64 CPUs that I have never seen out of any other ARM or non‑x86_64 processors.”

Vera led the tested CPU field, delivering a 1.5× overall performance advantage compared with a latest‑generation 128‑core x86 processor. The gains showed up in practical developer workloads: single‑socket Vera compiled a default Linux kernel in just 20 seconds, the fastest result Phoronix measured in that test, and delivered 2× faster Linux kernel compilation on a per‑core basis versus the 128‑core processor.

“On a geometric‑mean basis, the NVIDIA Vera delivered 10 % better performance than the AMD EPYC 9575F 5.0 GHz high‑frequency processor,” Larabel noted.

Vera in Customer Testing, Coming Soon From Partners

At NVIDIA GTC, the company announced widespread ecosystem support for Vera, spanning AI natives, supercomputing centers, cloud service providers, and infrastructure providers.

NVIDIA has also delivered the first Vera CPUs to leading AI companies and cloud providers, marking an important milestone as Vera moves toward partner availability in the second half of the year.

Vera will be available from partners in dual‑ and single‑socket systems, with air‑cooled and liquid‑cooled options to support AI factory deployments, from standard enterprise data centers to high‑density agentic AI infrastructure.

Learn more about NVIDIA Vera.

0 views
Back to Blog

Related posts

Read more »