Apple embraces Nvidia GPUs to accelerate LLM inference via its open source ReDrafter tech

  • ReDrafter delivers 2.7x more tokens per second compared to traditional auto-regression
  • ReDrafter could reduce latency for users while using fewer GPUs
  • Apple hasn’t said when ReDrafter will be deployed on rival AI GPUs from AMD and Intel

Apple has announced a collaboration with Nvidia to accelerate large language model inference using its open source technology, Recurrent Drafter (or ReDrafter for short).

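The partnership aims to address the computational challenges of auto-regressive token generation, in which a model produces its output one token at a time, each conditioned on the tokens before it. Speeding up this step is crucial for improving efficiency and reducing latency in real-time LLM applications.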
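To make the bottleneck concrete, here is a minimal Python sketch that contrasts plain auto-regressive decoding with a generic draft-and-verify loop, the broad family of techniques that a "drafter" belongs to. It is a toy under stated assumptions, not Apple's or Nvidia's implementation: `target_next`, `draft_next`, `autoregressive` and `draft_and_verify` are hypothetical names, the "models" are simple arithmetic rules, and the sketch assumes the large model can check a whole run of drafted tokens in about the time it takes to generate one.

```python
from typing import List, Tuple

# Toy stand-ins for real models (assumptions for illustration only).

def target_next(context: List[int]) -> int:
    """Toy 'large model': the next token is the sum of the last two, mod 10."""
    return (context[-1] + context[-2]) % 10


def draft_next(context: List[int]) -> int:
    """Toy 'small draft model': cheap, and wrong whenever the last token is 7
    (unless the true answer happens to be 0)."""
    if context[-1] == 7:
        return 0
    return (context[-1] + context[-2]) % 10


def autoregressive(prompt: List[int], n_new: int) -> Tuple[List[int], int]:
    """Plain decoding: one large-model step per generated token."""
    tokens = list(prompt)
    steps = 0
    for _ in range(n_new):
        tokens.append(target_next(tokens))
        steps += 1
    return tokens, steps


def draft_and_verify(prompt: List[int], n_new: int, k: int = 4) -> Tuple[List[int], int]:
    """Draft k tokens cheaply, then let the large model check them.

    Each verification is counted as ONE large-model step, on the assumption
    that a real system scores all k positions in a single batched pass.
    """
    tokens = list(prompt)
    goal = len(prompt) + n_new
    steps = 0
    while len(tokens) < goal:
        # 1. The draft model proposes k tokens, feeding on its own guesses.
        draft: List[int] = []
        for _ in range(k):
            draft.append(draft_next(tokens + draft))

        # 2. The large model verifies the proposal (emulated with a loop here).
        steps += 1
        for i in range(k):
            expected = target_next(tokens + draft[:i])
            if draft[i] != expected:
                # Keep the agreed prefix, then take the large model's token.
                tokens.extend(draft[:i])
                tokens.append(expected)
                break
        else:
            # Every drafted token was accepted.
            tokens.extend(draft)
    return tokens[:goal], steps


if __name__ == "__main__":
    base_tokens, base_steps = autoregressive([1, 1], 20)
    spec_tokens, spec_steps = draft_and_verify([1, 1], 20, k=4)
    print("same output:", spec_tokens == base_tokens)
    print("large-model steps:", base_steps, "vs", spec_steps)
```

Because every accepted token is one the large model would have produced anyway, the output is unchanged; the saving comes only from needing fewer large-model steps. How much a real system gains depends on how often the drafter's guesses are accepted and on the hardware, which this toy does not model.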
