Claude Mythos Preview Scores 73% on Elite Cybersecurity Tests

Anthropic's Claude Mythos Preview cracked 73% of expert-level capture-the-flag challenges no AI could touch before April 2025.

Anthropic's Claude Mythos Preview just put up serious numbers in cybersecurity. The AI Security Institute (AISI) ran evaluations on the model, announced April 7th, and found it achieved a 73% success rate on expert-level capture-the-flag challenges.

Here's the kicker: no AI model could complete these challenges before April 2025. That's a meaningful leap in capability for automated cyber offense and defense.

The AISI specifically designed the evaluation to assess cybersecurity capabilities, putting Claude Mythos Preview through the kind of real-world hacking scenarios that separate toy demos from genuine threats.

A 73% success rate on problems that no model could previously solve is a significant result. It raises big questions about how quickly AI-powered cybersecurity tools, and AI-powered attacks, are advancing.