Google launches Gemini 3 Flash: faster, cheaper and the new default

Gemini 3 Flash is a whole lot faster than Pro, while retaining much of its prowess. (Picture: Google)
The new Flash model performs just below Gemini 3 Pro and outperforms 2.5 Pro by wide margins, but its major selling points are price and speed.

The model is especially good at agentic workflows and fast reasoning, Google says.

On benchmarks it performs stunningly well for the price, even beating Gemini 3 Pro on MMMU-Pro, which tests multimodal understanding. It also scores 33.7% on Humanity’s Last Exam, just a little below GPT-5.2.

Google has priced the model for efficiency, too, at $0.50 per 1 million input tokens and $3.00 per 1 million output tokens. This is substantially lower than 3 Pro and is undercut only by Grok 4.1 Fast and Gemini 2.5 Flash.
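To put those rates in perspective, here is a minimal sketch of the arithmetic for a single workload. The function name and the token counts are hypothetical and chosen only for illustration; the two per-million prices are the ones quoted above, and the sketch ignores anything the article does not mention, such as caching or batch discounts.

```python
# Rough cost estimate using the per-million-token prices quoted in the article.
# The function name and the example workload are illustrative assumptions.

def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float = 0.50,   # USD per 1M input tokens (from the article)
    output_price_per_million: float = 3.00,  # USD per 1M output tokens (from the article)
) -> float:
    """Return the estimated cost in USD for a given number of tokens."""
    input_cost = input_tokens / 1_000_000 * input_price_per_million
    output_cost = output_tokens / 1_000_000 * output_price_per_million
    return input_cost + output_cost


# Hypothetical workload: 2 million input tokens and 500,000 output tokens.
cost = estimate_cost_usd(2_000_000, 500_000)
print(f"${cost:.2f}")  # $2.50
```

In this made-up example the output tokens account for $1.50 of the $2.50 total despite being a quarter of the input volume, since output is priced at six times the input rate.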

The model is rolling out in all channels as of today and will become the default in the Gemini app and on the web, as well as in AI Mode, where it will greatly improve reasoning.

Read more: Google: launch page, for developers, and in AI Mode. Writeups at The Verge, 9to5Google and Engadget.