Anthropic ships «more honest» Opus 4.8, teases Mythos release

Anthropic’s latest tops the benchmarks and makes fewer mistakes. (Picture: Anthropic)
The new model comes just over a month after the last one. It is four times less likely to allow flaws in code or make unsupported claims, and is «more likely» to say so when it is uncertain about a reply.

Releasing today at the same price as 4.7, it can tackle problems at a larger scale and has an upgrade to the «fast mode» — which is now three times cheaper.

There is also a new setting for Claude where users can set the «effort»-level on a given task. More effort costs more tokens, but gives a more precise answer, while low effort won’t bust rate limits.

The new Opus is of course on top of all the benchmarks so far in the cycle, and it is supposedly also sharper and more reliable in its judgement.

At the same time, Anthropic has some news about the code-busting Mythos model, previously deemed too dangerous to release.

The company says it is making headway on the safeguards needed to make it safe enough for public use, and plans to bring «Mythos-class» models to their customers «in the coming weeks.»

Read more: Anthropic’s announcement, TechCrunch, Reuters, The Verge.

Mythos finds 10K severe bugs in a month, as Anthropic widens release

Anthropic is now expanding availability for government and qualifying security teams. (Picture: Shutterstock)
Officially dubbed the Claude Mythos Preview, Anthropic’s code-busting agent has worked with about 50 partners to find ten thousand high-severity bugs in the month since release:

— Progress on software security used to be limited by how quickly we could find new vulnerabilities. Now it’s limited by how quickly we can verify, disclose, and patch the large numbers of vulnerabilities found by AI, Anthropic says.

Of those vulnerabilities, 6,202 were serious finds in open source software, where Anthropic has partnered with «more than 1,000» projects. Mythos actually found 23,019 bugs, but most were estimated at medium or low severity.

— Models with similar cybersecurity skills to Mythos Preview will soon be more broadly available, says Anthropic.

— There is a clear need for a larger effort across the software industry to manage the volume of findings that these models will generate.

Therefore, Anthropic is widening the release of Mythos, making it available to «qualifying» security teams «on request.» In the future, they say they hope to develop safeguards strong enough to make it generally available, but no such safety features exist as of today.

Read more: Anthropic’s findings and Engadget. Discussion on r/cybersecurity

GPT-5.5 -Cyber is out in «limited preview,» available to vetted defenders


Two models models are launching today; GPT-5.5 with «Trusted Access for Cyber» that requires some vetting to get into. It can handle defensive security, do code review, malware analysis and patch validation.

GPT-.5.5-Cyber requires stronger verification and does specialized workflows, red teaming, penetration testing and controlled validation.

GPT-5.5 Cyber was earlier found on par with Anthropic’s Mythos model, that has been spooking the establishment lately.

The vetting approach for the Cyber model has been «informed by conversations with cybersecurity and national security leaders across federal and state government and major commercial entities,» OpenAI says.

Read more: OpenAI’s blog, CNBC, Axios.

The White House reportedly discussing vetting AI models ahead of release

The White House says any Executive Order will come from the President himself. (Picture: Adobe)
The Trump administration has appartently been spooked by the cyber capabilities of Anthropic’s Mythos model and OpenAI’s GPT-5.5 — and is considering an Executive Order to vet new models ahead of release, Axios and the NYT reports.

These models have both been limited for their ability in cybersecurity, and point to a not-so-distant future where such capabilities might be widely available.

To that end, the White House’s Office of the National Cyber Director held all of two meetings last week, with tech and cyber companies on the one hand and with trade groups in tech on the other, according to Axios.

The ONCD has also been discussing safety testing for federal AI deployments, by assessing the security exposure of AI models before rolling out to the public sector.

The NYT reported on this first, and is saying that there might be a safety review for new models, while giving the Pentagon the first shot at eventual «useful» cyber capabilities, but would not block their release.

Any discussion on «potential executive orders is speculation,» a White House official told Axios.

Read more: Axios and the New York Times.

GPT-5.5 found on par with Mythos, with OpenAI to limit access to Cyber version

It took about three weeks for a competing model to hit parity with Mythos. (Picture: Adobe)
After a major research paper by the UK’s AI Security Institute found GPT-5.5 a little better than Mythos, Sam Altman moved to limit access to the Cyber version of the model.

The paper probes «vulnerability research and exploitation against realistic targets and modern mitigations» through rigorous tests, and found GPT-5.5 had a pass rate of 71.4%, compared to Mythos’ 68.6% on the most advanced evaluations.

According to the AISI, their test suite proves that Mythos is not a one-off act of brilliance, but part of a wider trend for frontier models. They say «we should expect further increases in cyber capability from models in the near future, potentially in quick succession.»

At the same time, Sam Altman posted on x.com that OpenAI will indeed follow Anthropic’s lead on limiting access to GPT-5.5-Cyber to «critical cyber defenders:»

— We will work with the entire ecosystem and the government to figure out trusted access for cyber; we want to rapidly help secure companies/infrastructure, Altman wrote.

Read more: The AI Security Institute, Altman’s x post, TechCrunch. Discussion on r/Singularity.

Private Discord group gains access to Mythos — to «play around» with it

The unnamed group has not run any cybersecurity prompts for fear of losing access. (Picture: adobe)
Bloomberg (paywalled) is reporting that a «private online forum» has managed to get access to Anthropic’s heralded Mythos model — said to be so advanced, it would be too dangerous to release.

— We’re investigating a report claiming unauthorized access to Claude Mythos Preview through one of our third-party vendor environments, Anthropic tells TechCrunch.

The group is part of a Discord channel focused on finding information on unreleased models, and made some educated guesses as to where the model would be located. They also had some help from a member whose job gave him access.

As for the warnings of dangerous fallout from public access to the model, the group says they are only interested in «playing around with new models,» not «wreaking havoc,» Gizmodo says, but the «hack» itself will raise concern in the security sector.

Read more: Bloomberg (paywalled), TechCrunch, and Gizmodo.

Mozilla uses Mythos to fix 271 bugs in latest Firefox, claims «vertigo»

The Mythos model is only available to select organizations for defensive cybersecurity. (Picture: generated)
The browser developer has been working with Anthropic since February, and got their hands on an early version of Claude Mythos Preview to scan for vulnerabilities.

— For a hardened target, just one such bug would have been red-alert in 2025, and so many at once makes you stop to wonder whether it’s even possible to keep up, Mozilla writes in their blog.

The upshot is that the 271 bugs mean that the company can approach security «much better than just keeping up», and that «defenders finally have a chance to win, decisively.»

— We have many years of experience picking apart the work of the world’s best security researchers, and Mythos Preview is every bit as capable, Mozilla continues.

They used Claude Opus 4.6 to find 22 bugs back in March, but this Mythos-powered bug hunt was so large it left them with a feeling akin to vertigo, they say.

Read more: Mozilla’s blog, interview on Wired, writeups on Ars Technica and Engadget.

In spite of government ban, the US NSA is actively using Anthropic’s Mythos

With compelling technology from Mythos, other agencies might not be far behind. (Picture: Shutterstock)
Sources in contact with Axios claim the National Security Agency, the premier digital spying agency, is widely using Anthropic’s Mythos

The model was deemed too dangerous to be released, but is available to about 40 select organizations through Project Glasswing, which uses its advanced cyber capabilities to scan for exploits and vulnerabilities — before the rest of the world catches up.

This is in spite of a Trump government ban on Anthropic and it being labeled a national security threat by the DoD this February after refusing to comply with Pentagon demands.

Anthropic held a meeting with the White House this Friday, said to be «productive and constructive,» Reuters reports.

Read more: Axios, Engadget, and Reuters

Anthropic launches Project Glasswing, greatly advancing cybersecurity

Project Glasswing pokes holes in almost any software, and if it isn’t used defensively now — attackers might soon. (Picture: Anthropic)
Anthropic has been cooking up the Mythos model lately, that internal documents had put as «a sea-change in capabilities,» and was too dangerous to release publicly.

Instead, they are releasing Project Glasswing, having found it to be especially suited for «an effort to secure the world’s most critical software.» It won’t take long for others to catch up, Anthropic says, and this is «an urgent attempt to put these capabilities to work» defensively.

Launch partners include a who’s who of Silicon Valley giants, and Anthropic claims it has already found thousands of vulnerabilities across every major operating system and browser.

The Mythos Preview model scores 93.9% on SWE-bench Verified compared to Opus 4.6 with 80.8%, and on SWE-bench Pro it is 77.8% vs 53.4%.

Smaller actors maintaining critical software will enjoy $100 million in usage credit donations from Anthropic — but for others it will cost a whopping $25/125 per million input/output tokens.

Read more: Anthropic’s announcement, Venturebeat, TechCrunch, CNBC