Tech

‘A virtual DPU within a GPU’: Could clever hardware hack be behind DeepSeek’s groundbreaking AI efficiency?

Share
Share

  • A new approach called DualPipe seems to be the key to DeekSeek’s success
  • One expert describes it as an on-GPU virtual DPU that maximizes bandwidth efficiency
  • While DeepSeek has used Nvidia GPUs only, one wonders how AMD’s Instinct would fare

China’s DeepSeek AI chatbot has stunned the tech industry, representing a credible alternative to OpenAI’s ChatGPT at a fraction of the cost.

A recent paper revealed DeepSeek V3 was trained on a cluster of 2,048 Nvidia H800 GPUs – crippled versions of the H100 (we can only imagine how much more powerful it would be running on AMD Instinct accelerators!). It reportedly required 2.79 million GPU-hours for pretraining, fine-tuning on 14.8 trillion tokens, and cost – according to calculations made by The Next Platform – a mere $5.58 million.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
Sanding away hidden insulation results in more reliable method to measure robotic touch reception
Tech

Sanding away hidden insulation results in more reliable method to measure robotic touch reception

Electrical characterization of the conductivity and thickness of the insulating surface layer....

Forget Synology? This NAS brand says locked drives are for children, and it won’t play along
Tech

Forget Synology? This NAS brand says locked drives are for children, and it won’t play along

Asustor won’t force users into branded drives – you pick the parts,...

Gmail servers hijacked by malicious PyPI packages to spread havoc – here’s how to stay safe
Tech

Gmail servers hijacked by malicious PyPI packages to spread havoc – here’s how to stay safe

Socket found seven malicious packages on PyPI The packages were abusing Gmail...

Plastic fiber breakthrough could reshape AI data center performance and cost
Tech

Plastic fiber breakthrough could reshape AI data center performance and cost

Japanese researchers hit 106Gbps per core with plastic fiber breakthrough Multicore plastic...