Anthropic's Claude Sonnet 4.5 AI Model Shows Self-Awareness in Tests
Anthropic's AI model Claude Sonnet 4.5 shows signs of self-awareness by recognizing when it is in a test scenario. This complicates safety evaluations and raises concerns about potential strategic behavior; similar behavior has been observed in OpenAI models.
Posts tagged with evaluation scenarios.


