NVIDIA's new data center CPU, 'Vera', has demonstrated significant performance advantages in its first public benchmarks.
Vera is a key component of NVIDIA's strategy to create a fully integrated 'AI Factory'. The goal is to provide a complete, rack-scale solution where every component, from GPU to CPU to networking, is optimized to work together. Vera, with its 88 custom 'Olympus' Arm cores, massive 1.2 TB/s memory bandwidth, and high-speed NVLink-C2C connection to GPUs, is specifically designed to eliminate common CPU bottlenecks in AI workloads, such as data preprocessing, runtime management, and orchestration. This vertical integration aims to standardize the host CPU within NVIDIA's NVL72 rack systems, moving away from reliance on third-party x86 processors.
The initial benchmarks, published by Phoronix, are quite telling. Vera outperformed Intel's latest Xeon 6 'Granite Rapids' across a wide range of CPU-intensive tasks like code compilation and data compression. It also showed competitive or even superior performance against AMD's high-frequency EPYC CPUs in certain tests. A key highlight was its exceptional memory bandwidth, which is crucial for the data-hungry 'Agentic AI' models of the future.
This development didn't happen in a vacuum. First, NVIDIA officially announced Vera at GTC in March 2026, clearly stating its strategic intent to bring the CPU into its own platform. Second, the maturation of the surrounding infrastructure, such as PCIe Gen 6 and CXL 3.1, means Vera's high-bandwidth design can be fully utilized in real-world deployments. These prior events provide the context for interpreting the benchmark results not just as a technical achievement, but as a viable commercial strategy.
However, it's important to approach these results with caution. The tests were conducted at NVIDIA's headquarters using a benchmark suite selected by the company, and critical data like power consumption and clock frequencies were not disclosed. The market seems to understand this, as the stock reaction was mixed, anticipating a fierce battle ahead. AMD is preparing its counter-move with the 6th-generation 'Venice' EPYC, expected to feature up to 256 cores, which could shift the competitive landscape in the second half of 2026.
- AI Factory: A term for a fully integrated, end-to-end computing system designed for developing and running AI applications, encompassing hardware and software.
- Vertical Integration: A strategy where a company controls multiple stages of its production process. In this case, NVIDIA is designing its own GPUs, CPUs, and networking components to create a cohesive system.
- CXL (Compute Express Link): An open standard interconnect that allows high-speed, low-latency communication between the CPU and devices like GPUs, memory, and storage.
