teknotum
Skip to content

Teknotum

xAI releases Grok 4 Fast, focusing on cost and efficiency

Grok 4 Fast is cheaper and uses fewer tokens, xAI says.
Almost as good as Grok 4, xAI claims — and plenty cheaper. Meet Grok 4 Fast. (Picture: xAI)
Touted as a state-of-the-art model for less, they claim to be «pushing the boundaries for smaller and faster AI.»

They say the model achieves performance comparable to Grok 4 proper with 40% less token use, and even pushes the price of those tokens down by 98%.

This makes it one of the cheapest and most efficient models out there, xAI claims, but many say they still prefer GPT-5 mini, which is also plenty cheap.

The model has a 2 million token window, and can switch between reasoning and non-reasoning on the fly.

Travels well in benchmarks
In benchmarks posted by xAI, it appears to score slightly below the grown up Grok 4 model and is a little behind GPT 5-high.

On LMarena it reaches joint eight place on text tasks, and is just ahead in first place on search tasks, but is nowhere to be seen on the other leaderboards.

Available for all
It’s the cost per token that is the showstopper here, for a top notch reasoning model — at $0.20 per input and $0.50 for output, its cost-efficiency is unmatched, xAI claims.

It is so cheap and competent, xAI say, that they are making it available for free in the app on the web, so you can check it out yourselves.

— For the first time, all users, including free users, will have access to our latest model without restrictions, marking a step toward democratizing advanced AI, xAI writes.

Read more: xAI’s announcement, writeup by Engadget, discussion on r/Singularity.

Author Tor FosheimPosted on 22. September 202522. September 2025Tags grok, xai

Post navigation

Previous Previous post: Zuckerberg would rather «misspend a couple of hundred billion» than lose out on AI race
Next Next post: Nvidia and OpenAI reach «strategic partnership» worth $100 billion

You might also like

AI use to become mandatory at Microsoft division

Google rolls out Veo 3 for Gemini Pro users globally

With help from top AI labs, American teachers to get better, free training

Grok’s new «companions:» sex crazed lovebot and a profane firestarter

OpenAI launches ChatGPT Agent mode, for tasks both easy and tough

In a first, judge rules training AI on copyrighted works is fair use

From the front page

Sundar Pichai: Gemini 3.0 is going to be released «this year»

08:49 18 Oct 2025

Weekend roundup: Copilot everywhere, Veo 3.1 and Altman on morality

05:59 17 Oct 2025

Anthropic launches Haiku 4.5; at twice the speed and a third of the cost

05:38 16 Oct 2025

Sam Altman says GPT-5 will be more friendly, allow age-verified erotica

04:30 15 Oct 2025

Broadcom to supply OpenAI with 10 GW’s worth of custom chip capacity

07:24 14 Oct 2025

AI airplanes anthropic apple bard cancer chatgpt climate coding copilot copyright defense drones education energy facebook film game gemini google grok hardware images instagram internet iphone law llama meta Microsoft military netflix nvidia openai research science search sosiale medier stargate streaming veo video work xai zuckerberg

  • About teknotum
  • Newsletter

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org
Teknotum Proudly powered by WordPress