Baseten is in talks to raise US$1 billion at an US$11 billion valuation, according to reports on 26 May. The round is not closed, so the figure is a target, not a done deal. If it lands, it more than doubles the US$5 billion valuation Baseten set in January.

What Baseten does

Inference is the part of AI that happens after training — the model running in production, answering a query, generating a response. Baseten rents Nvidia servers to application developers and helps them deploy, customise and run mostly open-source models on that hardware. It sells the unglamorous layer between a model and a working product, which is exactly the layer demand has rushed into this year.

Why investors are circling

The revenue is the draw. Baseten's annualised revenue reached roughly US$600 million by the end of the first quarter, up from about US$200 million at its start, per PYMNTS. A tripling in three months is the kind of curve that pulls a valuation up with it. Its January round was led by IVP and CapitalG, with Nvidia putting in US$150 million — a sign the chipmaker wants its hardware booked through fast-growing inference resellers.

US$1BRound being raised
US$11BTarget valuation
~US$600MAnnualised revenue, end Q1
US$5BValuation in January

The read

Two AI-infrastructure rounds in a week — Baseten here, the data layer repricing at Snowflake — point at the same thing. The market has decided that the money in this cycle is in the picks and shovels: the compute, the serving, the plumbing that every AI product needs. Whether US$11 billion is the right price depends on margins holding as Baseten rents hardware it does not own and resells at a markup the cloud giants would happily undercut. The revenue is real. The durability of the spread is the question.