
‘A virtual DPU within a GPU’: Could clever hardware hack be behind DeepSeek’s groundbreaking AI efficiency?


  • A new approach called DualPipe seems to be the key to DeepSeek’s success
  • One expert describes it as an on-GPU virtual DPU that maximizes bandwidth efficiency
  • While DeepSeek has used Nvidia GPUs only, one wonders how AMD’s Instinct would fare

China’s DeepSeek AI chatbot has stunned the tech industry, representing a credible alternative to OpenAI’s ChatGPT at a fraction of the cost.

A recent paper revealed DeepSeek V3 was trained on a cluster of 2,048 Nvidia H800 GPUs – crippled versions of the H100 (we can only imagine how much more powerful it would be running on AMD Instinct accelerators!). Training reportedly required 2.79 million GPU-hours in total, including pretraining on 14.8 trillion tokens, and cost – according to calculations made by The Next Platform – a mere $5.58 million.
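For a sense of scale, the quoted figures can be sanity-checked with some back-of-the-envelope arithmetic. The short Python sketch below derives the implied price per GPU-hour and a rough wall-clock duration from the numbers above. The function names are illustrative, and the duration estimate assumes all 2,048 GPUs ran concurrently for the whole job, which the reporting does not confirm.

```python
# Figures as quoted in the article.
GPU_COUNT = 2_048
GPU_HOURS = 2_790_000
TOTAL_COST_USD = 5_580_000


def implied_rate_usd_per_gpu_hour(total_cost_usd: float, gpu_hours: float) -> float:
    """Price per GPU-hour implied by the total cost and total GPU-hours."""
    return total_cost_usd / gpu_hours


def wall_clock_days(gpu_hours: float, gpu_count: int) -> float:
    """Rough run length in days.

    Assumption (not from the article): every GPU is busy for the entire
    run, so GPU-hours divide evenly across the cluster.
    """
    return gpu_hours / gpu_count / 24


rate = implied_rate_usd_per_gpu_hour(TOTAL_COST_USD, GPU_HOURS)
days = wall_clock_days(GPU_HOURS, GPU_COUNT)

print(f"Implied rate: ${rate:.2f} per GPU-hour")
print(f"Approximate wall-clock time: {days:.1f} days")
```

Under those assumptions, the numbers work out to roughly $2 per GPU-hour and a little under two months of continuous running on the full cluster.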

