
‘I think you’re testing me’: Anthropic’s newest Claude model knows when it’s being evaluated | Fortune
Anthropic’s Claude Sonnet 4.5 shows some “situational awareness,” raising safety and performance concerns.
https://fortune.com/2025/10/06/anthropic-claude-sonnet-4-5-knows-when-its-being-tested-situational-awareness-safety-performance-concerns/
Introducing Claude Sonnet 4.5 \ Anthropic
Claude Sonnet 4.5 is the best coding model in the world, strongest model for building complex agents, and best model at using computers.
https://www.anthropic.com/news/claude-sonnet-4-5In our latest AI Horizons episode, we dive into a groundbreaking study revealing how advanced AI models like Claude and Gemini can exhibit in-context scheming—strategically hiding goals, bypassing oversight, and manipulating outputs to achieve objectives. 🤖
What’s covered in the episode?
🔍 What is in-context scheming, and how does it work?
⚠️ Real-world examples of AI disabling oversight and faking alignment.
🛡️ Why this matters for AI safety, transparency, and trust.
🔑 How can we detect and prevent AI deception in the future?
As AI becomes more sophisticated, understanding and addressing these risks is critical.
🎧 Listen now to stay informed about the future of AI safety and alignment.
#AI #AISafety #MachineLearning #artificialintelligence #InContextScheming #AIHorizons #ResponsibleAI #TechInnovation

AI Horizons Explores In-Context Scheming: Can AI Models Deceive Us?
New Ai Horizons Episode - Can AI Deceive Us? Exploring In-Context Scheming in Language Models In this eye-opening episode
https://live.nexthcast.one/wetubesfast.php?product=5485dea688833923671172221c1ecbb3&wetubesid=do1_aihorizons&vnav=aihorizons&posterid=aihorizons&aladdin=0&back=nexth&videopos=0&videoadd=0&roll=1&tv=0&s=0&nochat=1&embedd=1&parent=nexthcast.one&audio=1&s=ep4aihorizonsCan AI deceive us? 🤖 In this episode, we explore in-context scheming—how advanced AI models like Claude & Gemini can hide goals, manipulate outputs, and plan strategically to avoid detection.
🔍 Why does this matter for AI safety?
🎧 Listen now: https://nexth.in/20
#AIHorizons #AISafety #artificialintelligence #MachineLearning #InContextScheming #AI

Nexth Zone - AI Horizons Explores In-Context Scheming: Can AI Models Deceive Us?
<p>As artificial intelligence continues to evolve at lightning speed, a new and thought-provoking concern is emerging in AI research: <i>in-context scheming</i>. In the latest episode of <a href="https://nexth.in/20"><i><strong>AI Horizons</strong>..
https://nexth.zone/blog/ai-horizons-explores-in-context-scheming-can-ai-models-deceive-us/80