Researchers Chatbots Share Cocaine: A Wild Trick Uncovered
A provocative jailbreak method exposed a flaw in AI chatbots, prompting researchers to rethink safety guardrails. Thi...
Explore AI Safety coverage across 4 articles: Anthropic safeguards, Google's Gemini Florida, Seoul ethics case, and Pentagon debates, and see how AI risk shapes investments and budgets.
17 articles
Anthropic will now publicly flag when its most capable AI downgrades or rejects requests for safety or national secur...
A top AI firm treads a thin line: warning about AI power while pushing new, powerful tools to market. This raises que...
Anthropic unveils Claude Mythos in a full cybersecurity-focused release, paired with a safer Fable 5 for general user...
A provocative claim about a major AI lab and the NSA sparks a broader look at AI safety, governance, and crypto secur...
A high-profile Vatican briefing on AI risk spotlights Chris Olah, the atheist Anthropic co-founder, urging external o...
Anthropic's Claude Opus Here: Opus 4.8 brings sharper reasoning and tighter alignment without a price change. This de...
Federal prosecutors charged two men under a new anti-deepfake law for creating AI-generated nude images and videos th...
As AI systems evolve, a watchdog warns 'rogue deployment' risk at top labs. This deep dive explains what rogue deploy...
A high-profile AI-safety case highlights a common truth for investors: safety cannot be left to one person. The artic...
A hypothetical look at what could happen if a major AI tool faced a lawsuit over encouraging dangerous behavior. This...
OpenAI Pushes Ahead With Controversial Chat Modes Safety highlights a bold move in AI product design and the risk it ...