Korean startup FuriosaAI has officially unveiled its second-generation AI chip, 'Renegade,' signaling a major push into the power-efficient inference market.
The timing of this launch is critical. The AI industry is at an inflection point where the primary workload is shifting from training massive models to inference—the process of using those trained models to generate answers, images, or predictions. McKinsey projects that by 2030, inference will consume more data center resources than training. This shift changes the definition of a 'good' chip from raw computing power (FLOPS) to sustained, real-world efficiency, often measured in 'tokens-per-watt'.
This brings us to the biggest challenge facing the AI boom: electricity. The International Energy Agency (IEA) forecasts that data center electricity usage could more than double by 2030. High-performance GPUs from market leader Nvidia can consume up to 700W each, creating immense strain on power grids and data center cooling systems. FuriosaAI's Renegade directly targets this problem with a Thermal Design Power (TDP) of just 180W. This isn't just about saving on electricity bills; it's about making large-scale AI deployments feasible within existing power constraints.
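To make the power gap concrete, here is a minimal sketch of how many accelerators fit under a fixed power envelope. The TDP figures (700 W vs. 180 W) and the 15 kW rack limit are the article's headline numbers; real deployments also budget power for CPUs, networking, and cooling overhead, so these counts are illustrative upper bounds.

```python
# Rough accelerator count under a fixed power budget.
# TDP figures are the article's headline numbers; real racks also
# reserve power for host CPUs, networking, and cooling overhead.

def accelerators_per_budget(budget_w: float, tdp_w: float) -> int:
    """How many accelerators fit within a power budget (watts)."""
    return int(budget_w // tdp_w)

BUDGET_W = 15_000  # typical rack power limit cited in the article

gpus = accelerators_per_budget(BUDGET_W, 700)  # high-end GPU TDP
npus = accelerators_per_budget(BUDGET_W, 180)  # Renegade TDP

print(gpus)  # 21 accelerators at 700 W each
print(npus)  # 83 accelerators at 180 W each
```

The raw chip count is only half the story, of course; what matters is how much useful work (tokens) each configuration produces per watt, which is where the rack-level comparison below comes in.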
Furthermore, the economic benefits extend to the physical data center. A key metric is 'tokens-per-rack,' which measures performance within a fixed space and power budget. A standard server rack has a power limit, typically around 15 kW. While you might only fit one Nvidia DGX H100 server (which draws over 10 kW) in that rack, you could fit about five of FuriosaAI's NXT RNGD servers (at ~3 kW each). According to partner LG AI Research, this translates to a 3.75x improvement in performance per rack under a fixed power limit, leading to an estimated 40% reduction in Total Cost of Ownership (TCO).
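The rack-level arithmetic above can be sanity-checked with a short sketch. The power draws and the 3.75x figure come from the article; the implied per-server performance ratio is a back-of-the-envelope inference, not a vendor specification.

```python
# Back-of-the-envelope check of the tokens-per-rack claim.
# Power figures and the 3.75x rack-level gain come from the article;
# the per-server throughput ratio below is inferred, not measured.

RACK_LIMIT_KW = 15.0
DGX_H100_KW = 10.2    # approximate draw of one DGX H100 server
RNGD_SERVER_KW = 3.0  # approximate draw of one NXT RNGD server

dgx_per_rack = int(RACK_LIMIT_KW // DGX_H100_KW)
rngd_per_rack = int(RACK_LIMIT_KW // RNGD_SERVER_KW)
print(dgx_per_rack, rngd_per_rack)  # 1 5

# LG AI Research's cited rack-level gain:
rack_gain = 3.75

# If 5 RNGD servers deliver 3.75x the throughput of 1 DGX server,
# each RNGD server runs at roughly 0.75x a DGX's throughput --
# while drawing less than a third of the power.
per_server_ratio = rack_gain * dgx_per_rack / rngd_per_rack
print(per_server_ratio)  # 0.75
```

In other words, the claimed advantage does not require each server to outperform a DGX; it only requires that the density gained from the lower power draw outweighs the per-server throughput gap.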
This launch is the result of a long-term strategy. It began with an early design focus on low-power inference, followed by securing $125 million in funding to scale production, and building key partnerships with Korean giants like LG and Samsung SDS. By rejecting a reported acquisition offer from Meta, FuriosaAI also maintained its independence, making it an attractive partner for national and regulated AI projects that require 'sovereign AI' capabilities.
In essence, Renegade represents a targeted solution for a specific but rapidly growing problem. As the AI world grapples with unsustainable power demands, specialized, efficient hardware like Renegade offers a compelling path forward for building sustainable and cost-effective AI infrastructure.
Key terms:

- Inference: The process of using a trained AI model to make predictions or generate outputs based on new data. This is what happens when you ask a chatbot a question.
- Total Cost of Ownership (TCO): A financial estimate that includes not just the purchase price of an asset, but also all direct and indirect costs of operating it, such as electricity, cooling, and maintenance.
- Neural Processing Unit (NPU): A specialized processor designed specifically to accelerate the machine learning algorithms used in AI applications, often with a focus on power efficiency.
