Tech

This cyberattack lets hackers crack AI models just by changing a single character

Share
Share


  • Researchers from HiddenLayer devised a new LLM attack called TokenBreaker
  • By adding, or changing, a single character, they are able to bypass certain protections
  • The underlying LLM still understands the intent

Security researchers have found a way to work around the protection mechanisms baked into some Large Language Models (LLM) and get them to respond to malicious prompts.

Kieran Evans, Kasimir Schulz, and Kenneth Yeung from HiddenLayer published an in-depth report on a new attack technique which they dubbed TokenBreak, which targets the way certain LLMs tokenize text, especially those using Byte Pair Encoding (BPE) or WordPiece tokenization strategies.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
New imaging method reveals how lithium-metal batteries lose capacity over time
Tech

New imaging method reveals how lithium-metal batteries lose capacity over time

UCLA researchers used tweezers to devise a thin battery for a study...

Google turns internet queries into conversations
Tech

Google turns internet queries into conversations

Google chief executive Sundar Pichai has expressed confidence that weaving Gemini artificial...

OpenAI has upgraded ChatGPT’s Projects feature, and I find it makes working way more efficient
Tech

OpenAI has upgraded ChatGPT’s Projects feature, and I find it makes working way more efficient

OpenAI has upgraded ChatGPT’s Projects feature to remember past chats, tone preferences,...