Google debuts Gemini 3.1-Flash Lite, for developers needing speed and scale

Not for everyone; Flash Lite is built for high volume cost efficiency. (Picture: Google)

Positioning the model as a purely developer-focused one, Google is touting the price, latency and the sheer amount of work it can do.

Costing $0.25 for 1M input tokens and $1.50 for 1M output tokens, it is one of the cheapest models out there.

Compared to Gemini 2.5 Flash, it is 2.5x faster to the first answer, and 45% quicker in output speed, while maintaining quality.

This benefits high-frequency workloads, such as mass translations and content moderation where price is a priority, Google says.

Users of AI Studio and Vertex AI can also adjust its thinking levels, making it possible to balance speed and complexity.

Read more: Google’s announcement, Android Central, Tom’s Guide.