Lepton AI

Acquired by NVIDIA in 2025, now DGX Cloud Lepton.

API Cloud Inference Mlops

LLMs & Chat Paid Has API

Researched 3 Jun 2026, 23:48 SGT · Published 4 Jun 2026, 08:27 SGT · Reviewed 13 Jul 2026

Visit Lepton AI Compare alternatives

RECATOOLS Score

7.5 / 10

Capability

Value for money

Ease of use

ASEAN readiness

API quality

Founded

—

Users

—

Launched

—

Developer

—

Overview

A high-performance model-serving startup founded by Caffe creator Yangqing Jia, acquired by NVIDIA in April 2025 and relaunched as DGX Cloud Lepton, a multi-cloud GPU marketplace.

What you can produce with Lepton AI

Relaunched as NVIDIA DGX Cloud Lepton (June 2025)
Multi-cloud GPU marketplace aggregating 20+ providers
Capacity from CoreWeave, Crusoe, Lambda, Nebius, SoftBank and others
High-performance LLM and multimodal inference serving
Developer API and SDK access
Founded by Caffe creator Yangqing Jia

ASEAN Perspective

Lepton AI in Southeast Asia

ASEAN-region availability and pricing notes coming soon. Drop the editorial team a note via /contact/ if you can supply local context (Singapore/Malaysia/Indonesia/Thailand/Vietnam).

RECATOOLS Verdict

The standalone product effectively no longer exists as an independent buy. Lepton AI built fast, cost-efficient LLM and multimodal serving to compete with Together and Fireworks, with a clean developer experience and the credibility of a founder — Yangqing Jia — who created Caffe and ran technology for Alibaba's cloud. NVIDIA closed its acquisition on 8 April 2025 (reported in the $300-500M range) and relaunched the platform that June as DGX Cloud Lepton, a marketplace that aggregates GPU capacity from providers like CoreWeave, Crusoe, Lambda, Nebius, SoftBank and others. So anyone evaluating "Lepton" today is really evaluating an NVIDIA product with NVIDIA branding, terms and pricing — the original independent serving business and its economics are gone. Worth understanding as context; for a live purchase decision, assess DGX Cloud Lepton on its current marketplace terms, not the old standalone offering.

Independent AI-assisted assessment by RECATOOLS.

What people say

The most important fact about Lepton AI is that it was acquired. NVIDIA closed the deal on 8 April 2025, reportedly worth several hundred million dollars (analyst estimates in the $300-500M range, exact figure undisclosed), and brought founders Yangqing Jia and Junjie Bai into its cloud org. Jia is well known in ML circles as the creator of the Caffe deep learning framework and a former VP of Technology at Alibaba.

Before the acquisition, Lepton was a Cupertino startup founded in 2023 offering GPU cloud services and developer tooling — including its FastGPU compute product and a Lepton Search conversational engine — and it competed on fast, cost-efficient inference against Together AI and Fireworks, with a developer experience reviewers described as clean.

NVIDIA rebranded the platform as DGX Cloud Lepton and relaunched it in June 2025. The repositioning is significant: rather than a single serving provider, DGX Cloud Lepton is now a compute marketplace that connects GPU capacity from a roster of providers including CoreWeave, Crusoe, Firmus, Foxconn, GMI Cloud, Lambda, Nebius, Nscale, SoftBank and Yotta. Forbes framed NVIDIA's ambition as a "planet-scale AI factory," and industry coverage read the move as NVIDIA vertically integrating — buying the startup that had been renting out NVIDIA's own chips.

What this means for a buyer: independent reviews of the old Lepton product (uptime, pricing, support quality as a standalone vendor) are now largely historical. Pricing under DGX Cloud Lepton varies by the underlying marketplace provider you select rather than a single published rate card, so there's no stable per-token or per-hour figure to quote. There isn't a substantial body of end-user reviews for the marketplace in its NVIDIA form yet. The practical takeaway is directional: the pedigree and performance focus are real, but this is now an NVIDIA offering, and evaluation should be against DGX Cloud Lepton's current terms — not the acquired startup's original standalone service.

Summary of public user & expert reviews, compiled by RECATOOLS.

About this listing

Researched on Wednesday, 3 June 2026 at 23:48 SGT (UTC+8)

Published on Thursday, 4 June 2026 at 08:27 SGT (UTC+8)

Last reviewed Monday, 13 July 2026 (1 week ago)

This entry was compiled from publicly available data including Lepton AI's official website, press releases, documentation, and reputable third-party publications. RECATOOLS is not affiliated with Lepton AI unless explicitly stated.

Data accuracy

Third-party AI tools update their pricing, features, availability, and policies frequently. Information here may be outdated by the time you read this — we make reasonable efforts to keep listings current, but cannot guarantee absolute accuracy.

For the latest details, please refer to Lepton AI directly →

Spotted something out of date? Suggest an update →