Chinese LLM comparison: DeepSeek, Qwen, Doubao, Kimi, ERNIE, GLM — price, context window, type. Estimates only. In your browser.
Chinese LLM Comparison
Compare the major Chinese chat LLMs side by side — DeepSeek, Qwen (通义千问), Doubao, Kimi, ERNIE (文心), Zhipu GLM and MiniMax — on input price, output price, context window and type (reasoning / general). Click any column header to sort. Everything renders in your browser; no model is ever called.
Heads-up: prices and context windows are provisional estimates — the official provider page prevails (价格与上下文为估算,以厂商官方为准). This tool only compares; it never calls a model.
| Model | Provider | Input $/1M | Output $/1M | Context | Type |
|---|
How the Chinese LLM comparison works
Scan the comparison table
The table lists the major Chinese chat models — DeepSeek, Qwen (通义千问), Doubao, Kimi, ERNIE, GLM and MiniMax — with each model's provider, input price, output price, context window and type (reasoning / general). The data comes from a shared data module loaded same-origin; everything renders locally in your browser.
Read the price columns
Input and output prices are both in USD per one million tokens. For almost every Chinese model the output price is higher than the input price, so the cost of a long answer is driven mainly by output tokens. Models whose price is not yet confirmed show "— / 待核实" until verified against the provider's own page.
Sort to suit your question
Click any column header to sort: sort input price low-to-high to find the cheapest model, click again to reverse it; or sort by context window to pick a model that fits a long document. Rows with a missing price always sink to the bottom, so they never get in the way.
Shortlist here, confirm on the official page
Use the table to shortlist two or three candidates, then check each provider's official pricing page for the current price and context limit — these figures are estimates for side-by-side comparison only; the provider's official numbers prevail. This tool only compares; it never calls a model.
How the Chinese LLM comparison works
One table for a fast-moving, fragmented market
China's large-language-model market moves fast and is unusually fragmented: DeepSeek, Qwen (通义千问) from Alibaba, Doubao from ByteDance, Kimi from Moonshot, ERNIE (文心) from Baidu, Zhipu's GLM line and MiniMax each ship frequent updates at different price points. Comparing them one provider page at a time is slow and error-prone, because every vendor formats its pricing differently and quotes tokens in its own way. This tool collapses all of that into a single, sortable table: each row shows the model, its provider, the input price, the output price, the context window, and whether it is a reasoning or a general model. The numbers come from a shared data module that the whole RECATOOLS AI-tools family reads, loaded same-origin into your browser, so the comparison renders instantly and stays consistent with our other cost tools.
The most useful first move is to sort. Click the input-price header to rank models cheapest-first and immediately see where DeepSeek and the lighter Qwen tiers sit against the rest; click again to reverse it. Sort by context window instead when your real constraint is fitting a long document or a long conversation into a single request. Models whose price we have not yet confirmed against the provider's own page show "— / 待核实" and always sort to the bottom, so a pending figure never distorts the ranking. Because the table is just a view over structured data, it is honest about what it does and does not know — and it never pretends a missing price is zero.
"Pick a Chinese model the way you'd pick anything else with a price tag: line them up, compare the numbers, then confirm on the official page before you commit."
Input price, output price and context — read all three
Reading the price columns correctly matters. Input price applies to the prompt tokens you send; output price applies to the answer tokens the model writes back. For almost every Chinese model the output price is the larger of the two — often by several times — so a long, generated answer costs far more than a long prompt. Reasoning models such as DeepSeek R1 amplify this: they emit a hidden "thinking" trace before the final answer, which means more output tokens and a higher effective bill, in exchange for stronger performance on hard reasoning, maths and code. For routine chat, summarisation and drafting, a cheaper general model is usually the better value. The context-window column is the third number to weigh: it sets how much text — input and output together — a single call can hold, and these models span roughly 130K to 260K tokens.
Treat every figure here as a provisional estimate, not a quote. The prices and context windows are gathered from public aggregators and are intended for fast side-by-side comparison; they can lag a provider's latest pricing, regional list price, or promotional tier. The right workflow is to use this table to shortlist two or three candidates on price, context and type, and then open each provider's official pricing page to confirm the current number before you build against it. The tool deliberately does nothing else: it reads our own same-origin data, renders the table locally, and never calls a model or uploads anything you do. That keeps it fast, private and dependable — a starting point for a decision, with the official page always the final word.
About Chinese LLM Pricing — 10 Key Points
The Chinese LLM ecosystem is highly diverse: DeepSeek, Qwen (Alibaba), Doubao (ByteDance), Kimi (Moonshot), ERNIE (Baidu), Zhipu GLM and MiniMax each have a distinct niche — this table lines them up for side-by-side comparison.
Input price and output price are two separate numbers: the input price applies to the prompt tokens you send, the output price to the answer tokens the model generates, each billed at its own rate.
For almost every Chinese model the output price is clearly higher than the input price, so the total cost of a long answer is usually driven mainly by output tokens.
DeepSeek is known for highly competitive pricing and is often among the lowest cost-per-million-tokens options in its capability tier.
Reasoning models (such as DeepSeek R1) generate a "thinking" trace before answering; they usually cost more per token and emit more output tokens, suiting tasks that need careful reasoning.
The context window sets how many tokens one request can hold (input and output share it): these models range from roughly 130K to 260K, and a larger window suits long documents and long conversations.
Token counting for Chinese models differs from English models — one Chinese character is not necessarily one token; the prices shown are per million tokens by each provider's own definition.
Some models (e.g. Doubao, ERNIE) still show "— / 待核实", meaning the price has not yet been confirmed against the provider's official page and will be filled in once verified.
Within one tier the price gap can be several-fold: a side-by-side comparison can cut your API cost markedly while keeping the capability you need.
This tool only reads our own same-origin shared data and renders the table locally in your browser — it never calls DeepSeek, Qwen or any model, and uploads none of your data.
Frequently Asked Questions
- No. It only reads our own same-origin shared model data and renders a comparison table locally in your browser. It does not call DeepSeek, Qwen or any model, and uploads none of your data to any server. It compares information only — no reasoning, no generation.
- It currently covers DeepSeek (including the reasoning model R1), Qwen (通义千问, Alibaba), Kimi (Moonshot), Zhipu GLM, MiniMax, plus Doubao (ByteDance) and ERNIE (Baidu). As the shared data is updated, added or changed models are reflected automatically.
- All prices are in USD per one million (1M) tokens, with input and output listed separately. For most models the output price is higher than the input price, so the cost of a long answer is driven mainly by output tokens.
- That means the price for that model (e.g. Doubao, ERNIE) has not yet been confirmed against the provider's official pricing page, so no figure is shown. Such rows sort to the bottom automatically and never interfere with comparing the rest.
- The prices and context windows are estimates (sourced from public aggregators) for side-by-side comparison, and may differ from each provider's latest pricing. Always treat the provider's official pricing page as authoritative — use this table to shortlist, then confirm on the official site.
- The context window is the maximum number of tokens a single request can hold, shared between input and output. A larger window fits longer documents, longer conversations or more reference material. Values are shown rounded (e.g. 131K) for easy comparison.
- A reasoning model (such as DeepSeek R1) generates a thinking trace before answering and is stronger at complex reasoning, maths and code, but costs more per token and emits longer output. A general model responds faster and cheaper, suiting most everyday chat and writing tasks.
- Yes. Click any column header to sort by that column, and click again to toggle ascending / descending. Common uses are sorting input price to find the cheapest model, or sorting context window to pick one that fits a long document; rows with a missing price always sink to the bottom.
- No. The tool needs no input from you, and the whole table renders locally in your browser. It collects, uploads and stores nothing, and calls no external endpoint (it only loads our own same-origin data file).
- Completely free, with no account or sign-up and no usage limit. It runs in your browser and collects no data.
Related News
You may be interested in these recent stories from our newsroom.
No related news yet for this tool. Our editorial team publishes new pieces every week.
Browse all news →75 more free tools
Calculators, converters, security tools — no signup.