On stage at Nvidia’s GTC 2025 conference in San Jose on Tuesday, CEO Jensen Huang announced a slew of new GPUs coming down the company’s product pipeline over the next few months.
Perhaps the most significant was Vera Rubin. Vera Rubin, which is set to be released in the second half of 2026, will feature tens of terabytes of memory and a custom Nvidia-designed CPU called Vera. Vera Rubin delivers substantial performance uplifts compared to its predecessor, Grace Blackwell, Nvidia claims, particularly on AI inferencing and training workloads.
When paired with Vera, Rubin can manage up to 50 petaflops while doing inference, more than double the 20 petaflops for the company’s current Blackwell chips, Nvidia says. And Vera is about twice as fast at the CPU used in Grace Blackwell.
Rubin will be followed by Rubin Ultra in the second half of 2027, a collection of 576 Vera Rubin GPUs, Huang said.
In the nearer term — the second half of 2025 — Nvidia will release Blackwell Ultra. A single Ultra chip will offer the same 20 petaflops of AI performance as Blackwell, but with 288GB of memory — up from 192GB in vanilla Blackwell.
On the far horizon is Feynman. Huang gave few details about Feynman’s architecture, named after American theoretical physicist Richard Feynman, save that it features a Vera CPU. It’s also unclear when the first Feynman GPUs will arrive beyond sometime in 2028.