Winbond Reportedly Secures NVIDIA Rubin NOR Flash Spot, Targeting 50% Share by 2027

Taiwan's Winbond is rumored to supply critical NOR Flash memory for NVIDIA's next-gen AI platform, Vera Rubin, starting in late 2026, strengthening NVIDIA's non-China supply chain.

The AI boom is causing a shortage of advanced memory like HBM, which in turn squeezes the production capacity for mature-node memory like NOR Flash, leading to scarcity and price hikes.

NVIDIA's new rack-scale architecture multiplies the number of components per system, and since each component needs its own NOR Flash to boot, overall demand for the memory has surged.

NVDA.KSMU.KSMCHP.KSAVGO.KSANET.KS

A recent report suggests that Taiwanese memory maker Winbond is set to become a key supplier for NVIDIA's next-generation AI platform, Vera Rubin, starting in late 2026.

This is significant for several interconnected reasons. First, it addresses a major geopolitical concern: supply chain resilience. Major tech companies and the U.S. government have been actively working to reduce their reliance on China for critical components. By bringing in Winbond, a non-Chinese supplier, NVIDIA can de-risk its supply chain for a mission-critical part, ensuring its new AI systems aren't vulnerable to trade tensions. This move aligns perfectly with the broader industry trend of diversifying away from China.

Second, the timing is driven by a massive squeeze in the global memory market. The AI boom has created unprecedented demand for high-performance memory like HBM and DDR5 DRAM. Chipmakers are dedicating most of their production capacity to these high-margin products. This creates a "crowding-out" effect, where the production of other essential, but less advanced, memory types like NOR Flash gets sidelined. The result is a shortage and soaring prices for NOR Flash, making it crucial for a giant like NVIDIA to secure a stable supply well in advance.

Finally, the demand is being multiplied by NVIDIA's own technology. The Vera Rubin platform uses a rack-scale architecture. Instead of a single server, an entire rack acts as one giant, interconnected computer. This rack is filled with hundreds of individual components—dozens of GPUs, CPUs, networking cards (SuperNICs), and management processors (DPUs). Each of these components needs its own small NOR Flash chip to store its basic startup software, or firmware. As a result, a single Rubin rack could contain hundreds of NOR Flash chips, dramatically increasing the total volume needed compared to previous generations.

In conclusion, the rumored partnership between Winbond and NVIDIA isn't just a simple supplier deal. It's a strategic move that sits at the intersection of geopolitics, intense market dynamics, and architectural innovation, highlighting how every single component, no matter how small, has become critical in the AI era.

NOR Flash: A type of non-volatile memory used to store firmware—the essential software that allows a device's hardware to boot up and function. It is known for its reliability and speed in reading code.
Hyperscaler: A large-scale cloud computing provider that operates massive data centers, such as Amazon Web Services (AWS), Microsoft Azure, and Google Cloud.
Rack-scale architecture: A data center design where an entire server rack—including its computing, networking, and storage components—is engineered to work as a single, integrated system.