Tech

SambaNova hits 198 tokens per second on the full, non-distilled DeepSeek-R1 671B with only 16 SN40L RDU chips

Share
Share

  • SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips
  • The SN40L RDU chip is reportedly 3X faster, 5X more efficient than GPUs
  • 5X speed boost is promised soon, with 100X capacity by year-end on cloud

Chinese AI upstart DeepSeek has very quickly made a name for itself in 2025, with its R1 large-scale open source language model, built for advanced reasoning tasks, showing performance on par with the industry’s top models, while being more cost-efficient.

SambaNova Systems, an AI startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what it claims is the world’s fastest deployment of the DeepSeek-R1 671B LLM to date.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
Intel’s Core Ultra 9 and RTX 5060 Ti in one box? Lenovo’s wild mini PC pulls it off
Tech

Intel’s Core Ultra 9 and RTX 5060 Ti in one box? Lenovo’s wild mini PC pulls it off

Lenovo ThinkCentre neo Ultra 2025 squeezes high-end AI hardware into a tiny,...

10 Lego cars just raced the F1 Miami Grand Prix track – here’s how they were built
Tech

10 Lego cars just raced the F1 Miami Grand Prix track – here’s how they were built

10 Lego cars just drove around Miami’s F1 track They’re each built...

AI is booming, but most CFOs say they still can’t make money from it
Tech

AI is booming, but most CFOs say they still can’t make money from it

Most CFOs say they still can’t make money from AI yet Traditional...