China’s DeepSeek trains Nvidia-powered R1 AI for just $294K: here’s what it means
0
0

China’s DeepSeek just shook up the AI world by revealing it trained its new R1 model for a jaw-dropping $294,000, a fraction of the eye-watering costs seen in the West.
This move isn’t just about dollars; it puts DeepSeek at the heart of a generative AI revolution, where affordability meets cutting-edge performance and the global pecking order is suddenly in flux.
DeepSeek’s cost-crushing AI project: What it means?
The story behind DeepSeek’s latest leap is remarkable.
Released in a peer-reviewed Nature paper this week, DeepSeek disclosed that training its “reasoning-centric” R1 model cost just $294,000, a stunning contrast to OpenAI and Google, whose efforts can run into hundreds of millions.
The Hangzhou-based team relied on 512 of Nvidia’s H800 GPUs, a China-compliant chip designed for export during US tech restrictions, and used techniques like distillation, where the model learns from outputs of existing AIs to squeeze maximum value from every GPU hour.
Open-source at its core and designed for efficiency, R1’s training regime included elements gleaned from models like Meta’s Llama, amplifying both its performance and the controversy surrounding its methods.
The result? A model that rivals much pricier American LLMs, all produced under the cloud of strict chip restrictions and ever-watchful competition.
DeepSeek’s secret sauce seems to be a blend of technical agility and a readiness to push boundaries, even as critics and global rivals question the approach and debate the ethics of “distilling” knowledge from the AI giants.
With founder Liang Wenfeng still mostly behind the scenes, DeepSeek’s disclosure is as much a market signal as a press release: the value equation in AI is changing, fast.
China’s AI race in a changing world
The implications from DeepSeek’s feat run much deeper than tech circles.
It arrives just as China tightens the screws on Nvidia, issuing a ban on leading domestic tech firms using the American chipmaker’s advanced AI hardware and fast-tracking support for local champions like Huawei and Cambricon.
Losing access to Nvidia’s best chips, China’s AI sector is shifting to homegrown processors, an immense challenge, but one that is starting to deliver real results as Cambricon posts record profits and Huawei’s Ascend chips move into top-tier data centers.
For US and global investors, the shock is real: DeepSeek’s low-cost breakthrough wiped hundreds of billions from Nvidia’s value in a single day and reignited debates over intellectual property, national advantage, and the future of open-source innovation.
As Beijing’s ambitions collide with US trade walls, DeepSeek’s $294,000 model is a loud reminder, the next leap in AI may be as much about efficiency and ingenuity as it is about raw muscle or spending power.
The post China’s DeepSeek trains Nvidia-powered R1 AI for just $294K: here's what it means appeared first on Invezz
0
0
Securely connect the portfolio you’re using to start.