Join us

Roses are red, violets are blue, if you phrase it as poem, any jailbreak will do

@kala ・ Dec 13,2025

A new study just broke the safety game wide open: rhymed prompts slipped past filters in 25 major LLMs, including Gemini 2.5 Pro and Deepseek - with up to 100% success. No clever chaining, no jailbreak soup. Just single-shot rhyme.

Turns out, poetic language isn’t just for bard-core Twitter. When it comes to triggering unsafe outputs, especially around cyberattacks or data leaks, rhymes triple success rates compared to plain prose.

Give a Pawfive to this post!

Only registered users can post comments. Please, login or signup.

Share with your friends and followers

Start writing about what excites you in tech — connect with developers, grow your voice, and get rewarded.

Join other developers and claim your FAUN.dev() account now!

Publish your first story!

Kala #GenAI

FAUN.dev()

@kala

Generative AI Weekly Newsletter, Kala. Curated GenAI news, tutorials, tools and more!

Developer Influence

20

Influence

1

Total Hits

144

Posts

Join and showcase your work and skills

FAUN.dev() is where engineers from GitHub, Netflix, and Shopify go to stay ahead — fast.