
Model Security

Every story we’ve tagged Model Security.

Security vulnerability reports have exploded since AI models started hunting for bugs
Security


Anthropic's Claude Mythos Preview model has led to a surge in reported security vulnerabilities, with over 1,500 high-severity and critical vulnerabilities reported in June 2026. This follows Anthropic's announcement in April 2026 that its model can find software vulnerabilities on its own.

The Decoder · 3 min read · 12h ago

Jul 2, 2026 · Announcements · More details on Fable 5’s cyber safeguards and our jailbreak framework
Safety & Policy


Anthropic has provided more information on Fable 5's cybersecurity safeguards and proposed a framework for evaluating the severity of AI jailbreaks. The company aims to balance preventing misuse with allowing defensive uses of the technology. This move is part of a broader effort to establish industry standards for AI safety.

Anthropic News · 26 min read · 1d ago