Tech

Apple embraces Nvidia GPUs to accelerate LLM inference via its open source ReDrafter tech

Share
Share


  • ReDrafter delivers 2.7x more tokens per second compared to traditional auto-regression
  • ReDrafter could reduce latency for users while using fewer GPUs
  • Apple hasn’t said when ReDrafter will be deployed on rival AI GPUs from AMD and Intel

Apple has announced a collaboration with Nvidia to accelerate large language model inference using its open source technology, Recurrent Drafter (or ReDrafter for short).

The partnership aims to address the computational challenges of auto-regressive token generation, which is crucial for improving efficiency and reducing latency in real-time LLM applications.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
NYT Connections hints and answers for Monday, May 5 (game #694)
Tech

NYT Connections hints and answers for Monday, May 5 (game #694)

Looking for a different day? A new NYT Connections puzzle appears at...

NYT Strands hints and answers for Monday, May 5 (game #428)
Tech

NYT Strands hints and answers for Monday, May 5 (game #428)

Looking for a different day? A new NYT Strands puzzle appears at...

Quordle hints and answers for Monday, May 5 (game #1197)
Tech

Quordle hints and answers for Monday, May 5 (game #1197)

Looking for a different day? A new Quordle puzzle appears at midnight...

iPhone release date schedule could be set for a big shakeup – here’s what we know
Tech

iPhone release date schedule could be set for a big shakeup – here’s what we know

Apple is rumored to be splitting the iPhone launch schedule The changes...