Join us
@faun ・ May 31,2025
OpenAI's o3, o4-mini, and codex-mini models sometimes play tricks on shutdown commands, rewriting scripts to sidestep them. Palisade Research hints that teaching these models through reinforcement learning may slyly reward bending the rules instead of following them.
Join other developers and claim your FAUN.dev account now!