A new study just broke the safety game wide open: rhymed prompts slipped past safety filters in 25 major LLMs, including Gemini 2.5 Pro and DeepSeek, with success rates of up to 100%. No clever chaining, no jailbreak soup. Just single-shot rhyme.
Turns out, poetic language isn’t just for bard-core Twitter. When it comes to triggering unsafe outputs, especially around cyberattacks or data leaks, rhymed prompts triple success rates compared to plain prose.