You just need a description of the task to get going with ChatGPT’s new agents. (Picture: OpenAI)Dubbing them «workspace agents,» OpenAI is rolling out Codex-powered agents to Business, Enterprise, Edu, and Teachers tiers.
In a bid to popularize agents beyond enthusiasts and IT departments, these agents are for everyone, and are sharebale between teams.
They are also very easy to create; simply add a description of the job you want done, and ChatGPT walks you through the process of turning it into an agent.
The unnamed group has not run any cybersecurity prompts for fear of losing access. (Picture: adobe)Bloomberg (paywalled) is reporting that a «private online forum» has managed to get access to Anthropic’s heralded Mythos model — said to be so advanced, it would be too dangerous to release.
— We’re investigating a report claiming unauthorized access to Claude Mythos Preview through one of our third-party vendor environments, Anthropic tells TechCrunch.
The group is part of a Discord channel focused on finding information on unreleased models, and made some educated guesses as to where the model would be located. They also had some help from a member whose job gave him access.
As for the warnings of dangerous fallout from public access to the model, the group says they are only interested in «playing around with new models,» not «wreaking havoc,» Gizmodo says, but the «hack» itself will raise concern in the security sector.
In the future, agents will do most work at Meta. Current employees will be training them. (Picture: Shutterstock)US workers for Meta will have to contend with most of their daily work being monitored for the training of future AI agents — in an initiative called Model Capability Initiative, according to internal memos seen by Reuters.
They will now get software installed on their computers to track basically everything they do, and take the occasional screenshot, to help train Meta’s agents, which are currently «struggling to replicate how humans interact with computers.»
This includes the use of keyboard shortcuts, clicking buttons, and using pull-down menus, Reuters says.
The idea is definitely to train their successors, according to Meta CTO Andrew Bosworth, who sent a memo to employees on Monday saying that «the vision we are building towards is one where our agents primarily do the work and our role is to direct, review and help them improve.»
Just yesterday, OpenAI debuted a tool to take constant screenshots of users’ computers in order to customize Codex to their usage.
The Mythos model is only available to select organizations for defensive cybersecurity. (Picture: generated)The browser developer has been working with Anthropic since February, and got their hands on an early version of Claude Mythos Preview to scan for vulnerabilities.
— For a hardened target, just one such bug would have been red-alert in 2025, and so many at once makes you stop to wonder whether it’s even possible to keep up, Mozilla writes in their blog.
The upshot is that the 271 bugs mean that the company can approach security «much better than just keeping up», and that «defenders finally have a chance to win, decisively.»
— We have many years of experience picking apart the work of the world’s best security researchers, and Mythos Preview is every bit as capable, Mozilla continues.
They used Claude Opus 4.6 to find 22 bugs back in March, but this Mythos-powered bug hunt was so large it left them with a feeling akin to vertigo, they say.
A stoat on a goat on a boat in a moat, approaching photorealism. (Picture: generated)The new image model for ChatGPT does not only offer higher fidelity, but more precision in how it renders pictures. According to OpenAI, it’s a sea change:
— If we think of Dall-e as cave drawings, and Images 1.0 as ancient art, then Images 2.0 is the Renaissance, OpenAI claims, according to Gizmodo.
It is a little bit faster to generate, and offers better instruction following, more accurate object placement — and reasoning with web search.
That means it can render multiple images from a single prompt, and «double-check» its outputs, as well as offering the latest information in the picture.
As for the benchmarks, GPT Image 2.0 has entered LMArena’s leaderboards at #1 for Text-to-Image with a whopping 242 point lead, and is 125 points ahead on Single-Image-Edit and up 90 on Multi-Image Edit.
The model is available today across all of ChatGPT’s tiers. For the thinking mode, you’ll need a paid Plus, Pro or Business subscription.
The new feature is tailored to high-output work environments, or it would be a privacy disaster. (Picture: Adobe)The new feature is an agent observing your screen all the time you work, storing screenshots as «memories» to better help with context for your Codex tasks.
— Over time, it helps Codex learn how you work: the tools you use, the projects you return to, and the workflows you rely on, OpenAI croons on x.com.
The point is to learn even more detail about you, from how you prefer your code to the tools and apps you use to perform. This can then later be recalled by Codex.
Notwithstanding the privacy concerns from Windows Recall, which also uses AI to take and store screenshots of your desktop, OpenAI is warning that the screenshots are even stored unencrypted on your computer.
They also warn that it eats up rate limits quickly, is very prone to prompt injection attacks and is only available on the $200 Pro subscription, as a research preview on macOS. Once enabled, it can be paused at any time in a menu item.
It’s a deal where it appears both sides win. (Picture: Amazon)Anthropic admits to taxing its servers lately, saying that their recent growth «places an inevitable strain on our infrastructure; our unprecedented consumer growth, in particular, has impacted reliability and performance […] especially during peak hours»
The AI lab says 1 gigawatt of the new capacity on Amazon’s custom silicon will come online in late 2026, and is committing to spending $100 billion on Amazon over the next decade to «train and run Claude.»
The deal also includes an initial investment of some $5 billion from Amazon which will scale to $25 billion «in the future,» tied to «commercial milestones.»
Andy Jassy, CEO of Amazon, has taken some flak for their gigantic $200 billion spend on AI infrastructure, but with this deal, they are recouping half of that:
—Anthropic’s commitment to run its large language models on AWS Trainium for the next decade reflects the progress we’ve made together on custom silicon, Jassy says.
With compelling technology from Mythos, other agencies might not be far behind. (Picture: Shutterstock)Sources in contact with Axios claim the National Security Agency, the premier digital spying agency, is widely using Anthropic’s Mythos
The model was deemed too dangerous to be released, but is available to about 40 select organizations through Project Glasswing, which uses its advanced cyber capabilities to scan for exploits and vulnerabilities — before the rest of the world catches up.
OpenAI wants to aid in research and discovery of new drugs, but hallucinations linger. (Picture: Adobe)Aiming to help discovery and create therapies from vast databases and cutting edge research, access to the new model will be tightly restricted.
For obvious reasons, biohacking can be a serious issue even for general AI implementations, but when it comes to building a model strictly for biology, only a select few researchers will get access.
The Rosalind model is based on the latest internal research from OpenAI, and outperforms GPT-5.4, sometimes massively, on chemistry, biochemistry, genetics and experimental design — the datasets it was trained on.
Codex is getting one step closer to a super app. (Picture: OpenAI)Getting one step closer to their super app, OpenAI’s latest Codex app can operate every app on your computer by «seeing, clicking and typing» with its own cursor — in the background (so you don’t have to wait for it to finish).
The app can also now generate images, remember preferences, and learn from previous workflows. It even comes with its own in-app browser — so you can check your web work instantly.
It can also open PDFs, spreadsheets, slides and docs natively, and gets a new summary pane to track agents, sources and «artifacts,» in addition to alpha support for SSH connections and multiple terminal tabs.
As a «preview,» it should be able to reuse older threads for context and instructions, and schedule its own work over days and weeks.
The app is available here. Computer use only works on the Mac version.
Anthropic’s latest model tops the benchmarks, but is not based on Mythos. (Picture: Anthropic)Keeping their focus on advanced software engineering, Anthropic says the new model especially shows gains on «the most difficult tasks.»
The new Opus should also be better at reading images for designs on interfaces, slides and documents.
Benchmarks posted by Anthropic tells a story of a significantly improved model over Opus 4.6, and jumping ahead of Gemini 3.1 and GPT-5.4 in most cases.
Opus 4.7 is not as powerful as the Mythos model used in «Project Glasswing», being much less capable at cyber skills, having been «differentially reduced» in training. It also automatically detects and blocks «prohibited or high-risk cybersecurity uses.»
Anthropic says they will use what they learn from the 4.7 release to inform a broader release of Mythos.
Only some rare instances will get verified, says Anthropic. (Picture: Adobe)There is a strict 18-year age limit for using Claude, and the company can ban accounts that are underage, as well as for usage or service violations.
Anthropic is therefore launching identity controls across Claude, in line with stricter industry standards elsewhere.
Their support page suggests age verification will only happen for «a few use cases» and for «certain capabilities» — but opens the door for simply checking your age.
In order to confirm your identity, Anthropic’s partner Persona Identities only accepts government-issued IDs or driver’s licenses — and none of the data is stored by Anthropic.
They also say they will not train their models on identity data, share them with anyone else, or collect more than they need.
ChatGPT has had age checks since January, but allows for teen use.
Fully featured Gemini, including nano banana and screen sharing — now for the Mac. (Picture: Google)The Gemini app for macOS took just a few days to prototype and was fully developed in less than a hundred days, Ars Technica writes.
Once installed, it can be launched from the menu bar or by pressing Option+Space on the keyboard.
The app goes a bit further than ChatGPT on the Mac, letting you share your entire screen with Gemini, or just select apps, and otherwise has everything the web interface offers, 9to5Google reports.
The app/window sharing lets Gemini answer questions about spreadsheets, reports, web pages or code bases, Google says.
Catching up to OpenAI? Sources claim Anthropic doubling in value in latest bids. (Picture: Shutterstock)With explosive growth and a claimed yearly run-rate of $30 billion, Anthropic is turning out to be a hot stock in Silicon Valley and beyond.
Business Insider is now reporting that the AI lab has received «multiple offers» at valuations of $800 billion, citing anonymous sources.
That would put it closer to OpenAI’s already stellar valuation at $852 billion — signifying peak interest in the AI sector at large and Anthropic specifically.
Anthropic just finished a $30 billion funding round in February at a valuation of $380 billion, meaning the latest offers would more than double the company’s value.
Anthropic then said that their revenue had grown 10x each year since inception, and said in April that customers spending $1 million or more had doubled since that.
It is common for «buzzy startups» to be on the receiving end of «preemptive offers,» Business Insider notes.
Only the most trusted cybersecurity pros will get access to the advanced model. (Picture: OpenAI)The new Cyber model has fewer restraints than other available bots to let cybersecurity professionals game out and test for vulnerabilities.
These kinds of tasks would normally get refusals for security reasons, but with Cyber access, developers can go as far as reverse engineering entire apps to poke for bugs.
The model is based on ChatGPT-5.4, but OpenAI says they are expanding the entire Cyber program now «in preparation for increasingly capable models over the next few months […] whose capabilities will rapidly exceed even the best purpose-built models of today.»
The release comes hot on the heels of Anthropic’s Mythos model and «Project Glasswing,» said to be so advanced they won’t release the full model.
To get access to GPT-5.4-Cyber, you have to first verify that you are a cybersecurity professional with OpenAI, and even then you might get «limited» access based on a tier system.