Tech

SambaNova hits 198 tokens per second on the full, non-distilled DeepSeek-R1 671B with only 16 SN40L RDU chips

Share
Share

  • SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips
  • The SN40L RDU chip is reportedly 3X faster, 5X more efficient than GPUs
  • 5X speed boost is promised soon, with 100X capacity by year-end on cloud

Chinese AI upstart DeepSeek has very quickly made a name for itself in 2025, with its R1 large-scale open source language model, built for advanced reasoning tasks, showing performance on par with the industry’s top models, while being more cost-efficient.

SambaNova Systems, an AI startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what it claims is the world’s fastest deployment of the DeepSeek-R1 671B LLM to date.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
10 Lego cars just raced the F1 Miami Grand Prix track – here’s how they were built
Tech

10 Lego cars just raced the F1 Miami Grand Prix track – here’s how they were built

10 Lego cars just drove around Miami’s F1 track They’re each built...

Lenovo#s ThinkPad P16s Gen 4 flexes AMD muscle but still trails HP’s AI workstation monster
Tech

Lenovo#s ThinkPad P16s Gen 4 flexes AMD muscle but still trails HP’s AI workstation monster

Lenovo ThinkPad P16s Gen 4 offers powerful AMD performance for professionals It...

Huge iPhone 17 Air news teased in new report – 3 things you need to know
Tech

Huge iPhone 17 Air news teased in new report – 3 things you need to know

Apple may launch a battery case with the iPhone 17 Air All-day...

SaaS is a ticking time bomb for global security, warns the world’s largest bank, JPMorganChase
Tech

SaaS is a ticking time bomb for global security, warns the world’s largest bank, JPMorganChase

JPMorganChase open letter calls for urgent industry-wide action on SaaS risks Third-party...