Citing strain on servers, Anthropic secures 5 GW of compute from Amazon

It’s a deal where both sides appear to win. (Picture: Amazon)
Anthropic admits its servers have been under strain lately, saying that its recent growth «places an inevitable strain on our infrastructure; our unprecedented consumer growth, in particular, has impacted reliability and performance […] especially during peak hours.»

The AI lab says 1 gigawatt of the new capacity on Amazon’s custom silicon will come online in late 2026, and it is committing to spend $100 billion with Amazon over the next decade to «train and run Claude.»

The deal also includes an initial investment of some $5 billion from Amazon, which will scale to $25 billion «in the future,» tied to «commercial milestones.»

Andy Jassy, CEO of Amazon, has taken some flak for the company’s gigantic $200 billion spend on AI infrastructure, but with this deal, Amazon is recouping half of that:

— Anthropic’s commitment to run its large language models on AWS Trainium for the next decade reflects the progress we’ve made together on custom silicon, Jassy says.

Read more: Anthropic’s announcement, Amazon’s release. Writeups on CNBC, TechCrunch, and Engadget.

Amazon CEO explains $200 billion AI spend, has over $15 billion in AI revenue

AI bets are already starting to pay off for Amazon, Jassy says. (Picture: Shutterstock)
Andy Jassy’s annual shareholder letter this year was all about defending Amazon’s massive 60% increase in capital spending on AI infrastructure.

— We’re not investing approximately $200 billion in capex in 2026 on a hunch, he writes, according to CNBC. — We’re not going to be conservative in how we play this — we’re investing to be the meaningful leader, and our future business, operating income, and [cash flow] will be much larger because of it, he continues.

Deals are already coming in to support this claim, he writes, noting a $100 billion commitment from OpenAI for AI compute on Amazon’s server farms.

AI server use at Amazon’s cloud unit has reached more than $15 billion in annualized revenue, Jassy writes, and now represents about 10% of the total for AWS, Reuters reports.

Read more: The shareholder letter (long), writeups on CNBC and Reuters.

Anthropic reaches $30B revenue, gets compute from Google and Broadcom

Anthropic continues to diversify its compute needs. (Picture: Anthropic)
Anthropic now says it has a run-rate revenue of $30 billion, up from $14 billion at its last fundraising in February.

They are also announcing that they are bringing in new compute capacity, based on next-generation Google TPUs that will start coming online in 2027.

The companies offer no detail on the cost of the «partnership» or how much compute they are actually buying, but Broadcom is hinting it’s around 3.5 GW, according to CNBC.

Anthropic also says it has doubled the number of customers spending more than $1 million per year to 1,000, in just two months.

Claude now runs on Amazon’s Trainium chips, Google TPUs and Nvidia GPUs. The latter are the most heavily used, and Amazon remains Anthropic’s primary cloud provider, the company says.

Read more: Anthropic’s announcement, CNBC adds numbers.

Mistral raises $830 million in debt to build data center just outside Paris

It’s a big investment for European AI, but significantly lower than what US AI labs are spending. (Picture: Mistral)
The new 44 MW data center will be powered by 13,800 Nvidia chips, and should be online by the second quarter of 2026.
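A rough back-of-envelope sketch, using only the figures above, shows the implied power budget per chip — total facility power divided by chip count, so it includes cooling and networking overhead, not just the chips themselves:

```python
# Back-of-envelope: facility power per chip at Mistral's planned site.
# Figures from the report: 44 MW of capacity, 13,800 Nvidia chips.
facility_mw = 44
chip_count = 13_800

kw_per_chip = facility_mw * 1000 / chip_count  # MW -> kW, then per chip
print(f"~{kw_per_chip:.1f} kW of facility power per chip")  # ~3.2 kW
```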

The French AI lab is hoping to secure 200 megawatts of capacity by the end of 2027, Reuters reports.

— Scaling our infrastructure in Europe is critical to empower our customers and to ensure AI innovation and autonomy remain at the heart of Europe, says Mistral CEO Arthur Mensch.

The news comes hot on the heels of Mistral’s February startup of a €1.2 billion data center in Sweden, according to CNBC.

Mistral is the largest European AI lab, has contracts with the French armed forces, and has secured $3.1 billion in funding so far, TechCrunch writes.

Read more: Reuters, CNBC and TechCrunch.

Google’s next data center in Minnesota will have the world’s largest battery

Google’s energy use in Minnesota won’t lead to higher electricity costs for consumers, the company says. (Picture: Google)
The data center in Pine Island will have 1.9 gigawatts of capacity, sourced from new wind and solar power from Xcel Energy.

The data center will be paired with a 300 megawatt iron-air battery from Form Energy, ensuring uninterrupted operations.

That will be the world’s largest commercially deployed battery, and can provide power to the data center for a whopping 100 hours, TechCrunch notes.
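The headline figures imply the battery’s energy capacity: 300 MW of power sustained for 100 hours is simply power times duration. A quick check:

```python
# Energy capacity implied by the reported figures:
# a 300 MW battery that can carry the load for 100 hours.
power_mw = 300
duration_hours = 100

energy_gwh = power_mw * duration_hours / 1000  # MWh -> GWh
print(f"{energy_gwh:.0f} GWh of stored energy")  # 30 GWh
```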

As part of their buildout, Google is announcing that they will pay for their electricity in full, and will also invest $50 million in Xcel Energy’s green energy program to place batteries across their grid.

— Google’s partnership with Xcel Energy reimagines how data centers can be served, Google writes.

Read more: Google’s announcement, Xcel’s presser, CNBC and TechCrunch.

Meta to purchase 6GW of custom AMD AI chips, take up to 10% ownership

Joining forces with AMD, Meta is set to receive hundreds of thousands of custom inference chips. (Picture: Meta)
According to Reuters, the deal is worth $60 billion, runs over five years, and comes hot on the heels of another Meta deal with Nvidia earlier this month.

The agreement will see the first gigawatt of GPUs and CPUs delivered in the second half of 2026, and includes several rackable chips that have yet to debut.

AMD and Meta have long been partners in developing custom chips, but these are specifically built for inference, the process of creating answers for user queries.

— This is an important step for Meta as we diversify our compute, says Mark Zuckerberg on the deal.

AMD entered into a similar deal with OpenAI in October 2025, meaning it might soon be 20% owned by AI labs.

Read more: AMD press release, Meta’s release, and Reuters report.

The hundreds of billions in AI investments flowing into India

The Indian AI Summit was a great success for the country, securing hundreds of billions in investments. (Picture: AI Impact Summit)
After the conclusion of the Indian AI Impact Summit, pledges of massive investment in the country’s data centers are rolling in.

It’s OpenAI’s second-biggest market and looks set for a massive buildout of AI capacity in the years leading up to 2035.

Adani dishing out the dollars
First out is the Adani Group, pledging $100 billion to build data centers by 2035, which they expect to trigger «an additional $150 billion in secondary investments.» This should scale their data center portfolio from 2 gigawatts to 5 GW.

Continue reading “The hundreds of billions in AI investments flowing into India”

Nvidia strikes «multi-year strategic partnership» with Meta for AI chips

Likely consuming a significant share of Meta’s capital expenditures, the deal is expected to be in the tens of billions of dollars or more.
Both Meta and Nvidia are announcing a long-term, multi-generational strategic partnership today — without mentioning the price.

Meta, already a top customer for Nvidia, will use their chips in a «large-scale deployment» to build out data centers «optimized for AI training and inference,» they say.

The cost of the deal will likely run into the tens of billions of dollars or more, CNBC reckons, and includes access to future chips as well as the current Blackwell and Vera Rubin generations.

— We do expect a good portion of Meta’s capex to go toward this Nvidia build-out, chip analyst Ben Bajarin of Creative Strategies tells CNBC.

Reuters notes that Meta is likely one of the top three customers accounting for more than half of Nvidia’s sales.

Read more: Meta announcement, Nvidia announcement. Writeups on CNBC, Reuters and The Verge.

ByteDance is developing in-house AI chips, to be manufactured by Samsung

Nvidia chips are available in China, but users need permission to buy them. (Picture: Adobe)
Not much is known about the AI inference chips, or how they compare to Nvidia’s offerings, but ByteDance is going to be making about 100,000 of them «this year,» and then scaling up to 350,000 units, according to Reuters.

ByteDance has been known to work with US chip producer Broadcom, and started seriously hiring chip specialists in 2022.

The new chips are set to be produced with Samsung in an arrangement that also includes memory chips, which sweetens the deal.

Production is advanced enough that Reuters’ sources say engineering samples are due by late March, which is the last stage before production.

A spokesperson for the company does not deny the report outright, but claims the information is «inaccurate,» Reuters writes.

Most US frontier labs are developing their own chips, as are Alibaba and Baidu.

Read the scoop at Reuters.

OpenAI commits to power buildouts for Stargate centers

OpenAI is now addressing local concerns for their massive data center buildouts.
The company shares how it plans to deal with «Stargate Communities» across the USA, especially on water and power consumption — and commits to being «good neighbors.»

This means for instance that they will cooperate with power utility companies at each site to ensure they are «paying our own way on energy.»

That could entail building out the infrastructure for each site where needed, or simply strengthening the grid.

They also address water usage, and note that Stargate Abilene will use as much water annually as the community uses in a day, thanks to «innovations in the cooling water systems design.»

OpenAI is even committing to slowing down workloads on days with adverse conditions, to lighten the load on the grid.

There have been at least 25 data center cancellations in the USA due to opposition from local communities, Gizmodo reports.

They are typically worried about rising electricity and water costs, which OpenAI is now directly addressing.

Read more: OpenAI’s blog, Reuters, Bloomberg.

OpenAI’s latest numbers; 3x yearly financial/compute growth since 2023

Impressive tally; OpenAI shows unprecedented growth in their new numbers. (Picture: OpenAI)
OpenAI is scaling like never before, according to CFO Sarah Friar, who is out with some hard numbers.

Friar says revenue and compute grow in tandem with the advent of more powerful models — that they «scale with intelligence,» so to speak.

Looking at the numbers, compute has gone from 0.2 gigawatts in 2023 to 1.9 GW in 2025, growing about 3x every year since ChatGPT’s debut.

Financially, OpenAI now has $20 billion in revenue, following the same curve as the compute scale and growing about 3x per year from $2 billion in 2024.
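The roughly 3x-per-year figure for compute can be checked directly from the endpoints given above, taking the annualized growth rate between 2023 and 2025:

```python
# Compute capacity grew from 0.2 GW (2023) to 1.9 GW (2025), per OpenAI.
start_gw, end_gw = 0.2, 1.9
years = 2025 - 2023

growth_per_year = (end_gw / start_gw) ** (1 / years)  # annualized rate
print(f"~{growth_per_year:.2f}x per year")  # ~3.08x
```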

All this is of course before ads arrive on the free tier, and before OpenAI sees any revenue impact from the global rollout of its Go subscription.

On the compute side, OpenAI is chasing infrastructure like there is no tomorrow, closing in on more than 30 GW before 2030 in what would be truly explosive growth — and we can only wonder as to how advanced frontier AI models will get by then.

Read more: OpenAI’s announcement, writeup on Reuters.

Musk runs into a snag on data center gas turbines — they are now illegal

xAI fits gas turbines on flatbed trucks and calls them «temporary» to avoid regulation. No more, says the EPA. (Picture: generated)
Apparently, a new EPA rule on stationary and «temporary» gas turbines for energy generation has made them illegal, according to CNBC.

They produce much too high levels of nitrogen oxides, and must be regulated as combustion engines, the EPA says.

That means Musk’s xAI will have to rethink energy use at its Colossus facilities, which make widespread use of natural gas turbines to generate electricity.

They will basically have to get Clean Air Act permits, and prove they aren’t harmful.

The local population has long complained of a rotten-egg-like stench in the air, CNBC writes, and smog is supposedly prevalent.

xAI uses 15 turbines for Colossus 1 and at one point had 59 turbines for Colossus 2, The Guardian writes.

Read more: The EPA Rule. Writeups on CNBC, Gizmodo and The Guardian.

OpenAI to spend $10 billion on compute from AI chip startup Cerebras

Cerebras makes powerful inference chips, for when an AI needs to think a little deeper. (Picture: generated)
The deal will give OpenAI an added power capacity of 750 megawatts — but it’s not just any kind of compute.

Cerebras makes wafer-scale inference chips (for generating the response an AI gives after a query), looping in networking and high-bandwidth memory on the same die.

This makes for much faster processing of complex tasks and should enable real-time reasoning; OpenAI could conceivably route such queries to this kind of compute.
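As a purely hypothetical illustration of that kind of routing — the backend names and the complexity threshold below are invented for the sketch, not anything OpenAI has described — a scheduler might look like this:

```python
# Hypothetical sketch: route queries by reasoning depth.
# Backend names and the token-budget cutoff are invented for illustration.

def route_query(query: str, reasoning_budget: int) -> str:
    """Pick a compute backend for a query.

    Deep-reasoning queries go to fast wafer-scale inference hardware;
    everything else goes to general-purpose GPU capacity.
    """
    if reasoning_budget > 10_000:  # arbitrary cutoff for "complex" tasks
        return "wafer-scale-inference"
    return "general-gpu"

print(route_query("Prove this theorem step by step.", reasoning_budget=50_000))
# -> wafer-scale-inference
print(route_query("What's the capital of France?", reasoning_budget=200))
# -> general-gpu
```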

The deal is worth over $10 billion, Reuters writes, and the capacity should come online in multiple tranches to be fully delivered in 2028, OpenAI says.

Sam Altman is an early investor in the company, and OpenAI once considered buying it, TechCrunch reports.

Read more: OpenAI’s announcement, writeups on Reuters, CNBC and TechCrunch.

Zuckerberg unveils «Meta Compute» initiative to secure infrastructure

Meta is serious about investing in data centers in the coming years. (Picture: generated)
The Meta CEO announced his new «top level» compute team on Threads, saying «Meta is planning to build tens of gigawatts this decade.»

He also said they will, «over time,» expand to «hundreds of gigawatts.»

— How we engineer, invest, and partner to build this infrastructure will become a strategic advantage, Zuckerberg continues.

There are lots of obstacles to building global infrastructure like this, and challenges in «operating our global datacenter fleet and network,» Zuckerberg says.

The team will be headed up by Meta’s head of global infrastructure Santosh Janardhan and Daniel Gross, in close collaboration with newly hired Dina Powell McCormick.

Meta hasn’t released a frontier model since the Llama series in April 2025, which drew plenty of controversy. They have since formed a Superintelligence Lab with talent from across the industry to develop the next generation of models. There is no timeline for when they’ll be ready, but the ambition is real, Zuckerberg says:

— [we want to] deliver personal superintelligence to billions of people around the world, he closes his message.

Read more: Zuck’s threads message, Reuters, TechCrunch, Business Insider.

Meta buys into 6 GW of nuclear to power its Prometheus data center

Meta is betting on nuclear for its AI power needs in Ohio, US. (Picture: Adobe)
Three nuclear power companies have been selected by Meta to supply its «supercluster» being built in New Albany, Ohio.

Together, they are offering up to six gigawatts of power, ranging from established nuclear reactors to more experimental small modular reactors.

The deals are a result of a request for proposals from Meta in December 2024, TechCrunch writes, and they have now selected Vistra (2.1 GW) for their existing nuclear plants, and Oklo (1.2 GW) and TerraPower (up to 2.1 GW) for their modular reactors.

TerraPower was co-founded by Bill Gates, while Sam Altman is the biggest investor in Oklo.

TerraPower and Oklo are still in the startup phase, with capacity for TerraPower expected to come online in 2032, while Oklo’s «nuclear technology campus» won’t be coming online until 2030, CNBC writes.

No financial details of the deals have been made public, but Meta has committed to spending $600 billion on infrastructure over the next three years.

Read more: Writeups on TechCrunch, CNBC.