Tech

SambaNova hits 198 tokens per second on the full, non-distilled DeepSeek-R1 671B with only 16 SN40L RDU chips

Share
Share

  • SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips
  • The SN40L RDU chip is reportedly 3X faster, 5X more efficient than GPUs
  • 5X speed boost is promised soon, with 100X capacity by year-end on cloud

Chinese AI upstart DeepSeek has very quickly made a name for itself in 2025, with its R1 large-scale open source language model, built for advanced reasoning tasks, showing performance on par with the industry’s top models, while being more cost-efficient.

SambaNova Systems, an AI startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what it claims is the world’s fastest deployment of the DeepSeek-R1 671B LLM to date.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
Some AI prompts could cause 50 times more CO₂ emissions than others, researchers find
Tech

Some AI prompts could cause 50 times more CO₂ emissions than others, researchers find

Credit: Sanket Mishra from Pexels No matter which questions we ask an...

Google Gemini’s super-fast Flash-Lite 2.5 model is out now – here’s why you should switch today
Tech

Google Gemini’s super-fast Flash-Lite 2.5 model is out now – here’s why you should switch today

Google’s new Gemini 2.5 Flash-Lite model is its fastest and most cost-efficient...

5 Nintendo Switch 2 settings I recommend changing as soon as you boot your new console up
Tech

5 Nintendo Switch 2 settings I recommend changing as soon as you boot your new console up

There’s nothing quite like the excitement of a new console; feverishly whipping...