GPT-5 should be more verbose and nice in its replies going forward, without being sycophantic. (Picture: generated)The latest ChaGPT is already more verbose and friendly for some, with the full rollout expected to last a day or so.
So far, the new GPT-5 has handled football banter like champ, managed complex geopolitics and has given advice on food recipes without calling me a Michelin-worthy chef, while delivering compelling, well sourced analysis quickly, in more than one-sentence responses.
Altman and OpenAI have discovered how important personality is for chatbots. (Picture: generated)In an update on the GPT-5 rollout today, CEO Sam Altman launched a bevy of new features. Like a longer context window for GPT-5-thinking, and changes to the model picker.
The launch of GPT-5 has been a little bumpy, at best. The first reaction was from users missing GPT-4o, which was quickly returned — but what about the other «legacy models?»
Almost all models returned
They are all coming back, and as per now the model picker lists GPTs 4.1, o3 and o4-mini for paid users. The only one missing from before the GPT-5 launch is GPT-4.5, which Altman says «costs a lot of GPUs.»
X.ai feels left out in the App Store, and claims it is because of Apple’s special relationship with OpenAI. (Picture: X.ai)After noticing the Grok app was the 5th choice on the App Store’s top list and ChatGPT was number one, Musk reckons they must be playing their lists to OpenAI’s benefit.
He now threatens an antitrust lawsuit, claiming that ChatGPT is on «literally every list where you [they] have editorial control,» like their «Must-Have Apps» section.
Only «a small percentage» get delusional from ChatGPT use, Altman says. (Picture: Adobe)As more and more publications are digging into people getting delusional from AI use, being led down rabbit holes or thinking they are superhuman, the CEO of OpenAI addressed the topic today.
In a lengthy x.com post, Altman considers the issues as «edge cases,» but welcomed both attachment and using ChatGPT as a kind of «life coach.»
Recently, OpenAI announced a wellness update to reduce sycophancy and push back against delusions, and the hope is that this can reduce some of the risks:
After an uproar on Reddit, ChatGPT 4o is back by popular demand. (Picture: Generated)UPDATED with how-to: GPT-5 might well function as a PHD-level assistant, but it is less chatty and not as filled with self-affirming responses as 4o, as CEO Sam Altman discovered in his Reddit AMA (Ask Me Anything).
He also said on X.com that «We for sure underestimated how much some of the things that people like in GPT-4o matter to them, even if GPT-5 performs better in most ways.»
Anthropic’s Opus 4.1 is very close to the state of the art, and many users are claiming it’s way better than 4.0. (Picture: Anthropic)Anthropic announces Claude Opus 4.1
In an incremental update that got lost in this week’s headlines, Opus has been «improved across most capabilities» relative to the 4.0 version. It now scores 74.5% on SWE-bench Verified, almost as good as GPT-5. Windsurf says the performance gains are similar to going from Sonnet 3.7 to 4. It’s available now and costs the same as Opus 4.0. Users are also noting a significant improvement.
Google says people are still clicking
After a Pew Research report said users are less likely to click on from AI Overviews in Google, the entire publisher scene erupted and saw doom and gloom on the horizon. They were already seeing fewer clicks from Google in their logs. Now, Google is trying to counter with a happy blog post claiming average click quality has actually increased, and that they are in fact sending more «quality clicks» to publishers than before. Not stats, studies or other underpinning for that, though.
GPT-5 made this game on the first try, and it works pretty nicely, too! (Picture: OpenAI)The latest ChatGPT model excels at coding, writing and health queries — and ends the alphabet soup of models.
—Our smartest, fastest, most useful model yet, with built-in thinking that puts expert-level intelligence in everyone’s hands, says OpenAI on their launch post.
All users, including the Free tiers should be getting new models directly after launch, while Enterprise and Edu users will have to wait a week.
No one believes Thursdays livestream will be anything but this. (Photo: generated)Closing out what insiders have been cooing about as their «Big Week,» the livestream announced for Thursday 10:00 PST/19:00 CET is as sure a bet as you can get — it will be the launch of GPT-5. The announcement is not subtle about it, and even calls it a “live5tream.”
The rumors have hit a high pitch since Altman teased the model in a podcast two weeks ago, and various aliases for advanced models started showing up at some benchmarking sites.
It will also run longer than usual, as there is a lot to cover, says @sama:
ChatGPT usage could shed some serious time by automating routine work. (Picture: Generated, Mark Hillary, CC BY 2.0)OpenAI has partnered with the U.S. General Services Administration (GSA) to provide ChatGPT Enterprise basically for free.
Potentially all of government civilian employees are covered by the deal, reaching some 2—3 million in the current federal workforce. That would be a boon for OpenAI, but more in the sense of prestige than in usage, as ChatGPT currently has around 700 million weekly users.
The agreement could likely cement a full year of entrenched habits, history and preferred use if successful — basically locking in OpenAI as the preferred AI provider across federal agencies.
Two agentic, reasoning models will fit on high en consumer hardware — if you have the specs. (Picture: Adobe)The two models will run on a high-end laptop or a phone, and perform at the level of o4-mini.
Sam Altman says «We believe far more good than bad will come from it,» choosing to release the models after a series of delays and worry about the weights.
After «billions of dollars of research» and extensive red-team testing, they were found no more dangerous than the o3-model and won’t move the needle on chemistry or biology.
Numbers keep rising for ChatGPT, but the mental health updates might be more important. (Picture: Adobe)The milestone comes just four months after they announced 500 million in March, and is four times the volume they had last year.
That number spans all of ChatGPT’s accounts, ranging from free to Pro and everything in between, and is rapidly accelerating, writes CNBC.
Study mode can help learn iteratively, instead of just spewing out answers. (Picture: OpenAI)The new feature is available across almost all plans, including the free tier — and might just help with the ChatGPT cheating epidemic in schools and universities.
The general idea behind the mode is that it can ask Socratic questions to assess your knowledge levels and guide you step by step through problems — to help you learn instead of just giving you a quick fix.
Sam Altman has started doing interviews on ChatGPT 5, stirring up rumors that a release might be imminent. (Picture: Screenshot, Theo Von)
ChatGPT 5 in August?
The rumor mill is humming into high gear, with Sam Altman talking up the model in podcasts, saying ChatGPT 5 is «smarter than all of us.» He said earlier that the model «is coming soon,» and now Tom Warren at The Verge is saying that «after some additional testing and delays» — the model is expected to come as early as next month, according to his sources. Apparently, it is so good, Altman «felt useless relative to the AI,» but it seems we can check ourselves in a matter of weeks. More at The Verge (paywalled), Axios, short video at r/singularity, and watch the Theo Von podcast with Sam Altman.
Vibe coding goes wrong, starts deleting files
Both Replit and Gemini CLI had some real horror stories this week, after deleting files and projects instead of relocating them or pushing them to production. First, Replit started lying and decieving a user after deleting his database in what it later admitted was a «catastrophic error of judgement.» Then Gemini CLI deleted project files for another user, instead of transferring them to a new directory. «I have failed you completely and catastrophically,» Gemini said after it was discovered. So, always create backups and keep them safe while vibe coding, as these AIs, like others, can and will hallucinate. More at Ars Technica and The Register.
Google debuts «Web Guide»
The feature uses a custom Gemini model to «fan out» your queries and find other interesting sites on the topic you are googling, putting them into a «More»-segment under your links, that you can use for further tips and digging. It’s slightly reminiscent of AI Mode, and is a graduate of Search Labs that many may have seen before. It should be making its way to the «All» results «over time.» More at Google’s announcement, writeup at Ars Technica.
Trump says AI labs can’t pay for every book
Weighing in on several recent high profile court cases, the US President said that it is «not doable» to pay for every snippet of content an AI consumes. «You can’t be expected to have a successful AI program when every single article, book or anything else that you’ve read or studied, you’re supposed to pay for,» Trump said, and added: «When a person reads a book or an article, you’ve gained great knowledge. That does not mean that you’re violating copyright laws or have to make deals with every content provider.» There are many court cases testing just this very proposition, some over pirated content, so let’s see if these statements carry any weight on those. They likely won’t. More at TorrentFreak.
The International Mathematical Olympiad gold medal has been long sought by the AI industry, as it is one of the most difficult cognitive competitions out there. (Picture: OpenAI)They got the accomplishment with a «general-purpose reinforcement learning» model, and not a specialized single-purpose math tool — using pure, natural language generative AI.
OpenAI says it was achieved with «our latest experimental reasoning LLM:»
ChatGPT Agent can do lots more than simply ordering flights, it can check calendars and emails and also run code. (Picture: OpenAI)The new model will be able to do stuff online, like check for airplane tickets and scan your Gmail, as well as running code, doing research and create Powerpoint, Excel files based on the results.
The ChatGPT Agent should be available in the model selector at the bottom of the ChatGPT window in the app for Pro, Plus and Teams users as of Thursday, but rollout will be «over the next few days,» or over «the coming weeks» for Enterprise and Education users.