Tech

This cyberattack lets hackers crack AI models just by changing a single character

Share
Share


  • Researchers from HiddenLayer devised a new LLM attack called TokenBreaker
  • By adding, or changing, a single character, they are able to bypass certain protections
  • The underlying LLM still understands the intent

Security researchers have found a way to work around the protection mechanisms baked into some Large Language Models (LLM) and get them to respond to malicious prompts.

Kieran Evans, Kasimir Schulz, and Kenneth Yeung from HiddenLayer published an in-depth report on a new attack technique which they dubbed TokenBreak, which targets the way certain LLMs tokenize text, especially those using Byte Pair Encoding (BPE) or WordPiece tokenization strategies.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
Remote workers with college degrees are flooding low-skill jobs and making more than doctors back home
Tech

Remote workers with college degrees are flooding low-skill jobs and making more than doctors back home

Report warns a college degree no longer guarantees skilled work in today’s...

AI overviews have transformed Google search. Here’s how they work—and how to opt out
Tech

AI overviews have transformed Google search. Here’s how they work—and how to opt out

Credit: Pixabay/CC0 Public Domain People turn to the internet to run billions...

BlackBerry Classic returns in 2025 as Zinwa Q25 with updated hardware and software
Tech

BlackBerry Classic returns in 2025 as Zinwa Q25 with updated hardware and software

Zinwa Q25 revives BlackBerry Classic with modern hardware and software Original screen...

Workers need better tools and tech to boost productivity. Why aren’t companies stepping up to invest?
Tech

Workers need better tools and tech to boost productivity. Why aren’t companies stepping up to invest?

Credit: cottonbro studio from Pexels As Prime Minister Anthony Albanese and Treasurer...