Baseten is in talks to raise US$1 billion at an US$11 billion valuation, according to reports on 26 May. The round is not closed, so the figure is a target, not a done deal. If it lands, it more than doubles the US$5 billion valuation Baseten set in January.
What Baseten does
Inference is the part of AI that happens after training — the model running in production, answering a query, generating a response. Baseten rents Nvidia servers to application developers and helps them deploy, customise and run mostly open-source models on that hardware. It sells the unglamorous layer between a model and a working product, which is exactly the layer demand has rushed into this year.
Why investors are circling
The revenue is the draw. Baseten's annualised revenue reached roughly US$600 million by the end of the first quarter, up from about US$200 million at its start, per PYMNTS. A tripling in three months is the kind of curve that pulls a valuation up with it. Its January round was led by IVP and CapitalG, with Nvidia putting in US$150 million — a sign the chipmaker wants its hardware booked through fast-growing inference resellers.
The read
Two AI-infrastructure rounds in a week — Baseten here, the data layer repricing at Snowflake — point at the same thing. The market has decided that the money in this cycle is in the picks and shovels: the compute, the serving, the plumbing that every AI product needs. Whether US$11 billion is the right price depends on margins holding as Baseten rents hardware it does not own and resells at a markup the cloud giants would happily undercut. The revenue is real. The durability of the spread is the question.