OpenAI brings web search, thinking to ChatGPT Images 2.0

A stoat on a goat on a boat in a moat, approaching photorealism. (Picture: generated)
The new image model for ChatGPT does not only offer higher fidelity, but more precision in how it renders pictures. According to OpenAI, it’s a sea change:

— If we think of Dall-e as cave drawings, and Images 1.0 as ancient art, then Images 2.0 is the Renaissance, OpenAI claims, according to Gizmodo.

It is a little bit faster to generate, and offers better instruction following, more accurate object placement — and reasoning with web search.

That means it can render multiple images from a single prompt, and «double-check» its outputs, as well as offering the latest information in the picture.

As for the benchmarks, GPT Image 2.0 has entered LMArena’s leaderboards at #1 for Text-to-Image with a whopping 242 point lead, and is 125 points ahead on Single-Image-Edit and up 90 on Multi-Image Edit.

The model is available today across all of ChatGPT’s tiers. For the thinking mode, you’ll need a paid Plus, Pro or Business subscription.

Read more: OpenAI’s announcement, Launch thread, Gizmodo and The Verge.