Tech

Nvidia’s closest rival once again obliterates cloud giants in AI performance; Cerebras Inference is 75x faster than AWS, 32x faster than Google on Llama 3.1 405B

Share
Share

  • Cerebras hits 969 tokens/second on Llama 3.1 405B, 75x faster than AWS
  • Claims industry-low 240ms latency, twice as fast as Google Vertex
  • Cerebras Inference runs on the CS-3 with the WSE-3 AI processor

Cerebras Systems says it has set a new benchmark in AI performance with Meta’s Llama 3.1 405B model, achieving an unprecedented generation speed of 969 tokens per second.

Third-party benchmark firm Artificial Analysis has claimed this performance is up to 75 times faster than GPU-based offerings from major hyperscalers. It was nearly six times faster than SambaNova at 164 tokens per second, more than 14 times faster than Google Vertex at 30 tokens per second, and far surpassing Azure at just 20 tokens per second and AWS at 13 tokens per second.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
Lenovo#s ThinkPad P16s Gen 4 flexes AMD muscle but still trails HP’s AI workstation monster
Tech

Lenovo#s ThinkPad P16s Gen 4 flexes AMD muscle but still trails HP’s AI workstation monster

Lenovo ThinkPad P16s Gen 4 offers powerful AMD performance for professionals It...

Huge iPhone 17 Air news teased in new report – 3 things you need to know
Tech

Huge iPhone 17 Air news teased in new report – 3 things you need to know

Apple may launch a battery case with the iPhone 17 Air All-day...

SaaS is a ticking time bomb for global security, warns the world’s largest bank, JPMorganChase
Tech

SaaS is a ticking time bomb for global security, warns the world’s largest bank, JPMorganChase

JPMorganChase open letter calls for urgent industry-wide action on SaaS risks Third-party...

NYT Connections hints and answers for Monday, May 5 (game #694)
Tech

NYT Connections hints and answers for Monday, May 5 (game #694)

Looking for a different day? A new NYT Connections puzzle appears at...