Chinese AI pricing cheatsheet: chat token prices, image and video model access in compact tables. Estimates only. In your browser.
Chinese AI Pricing Cheatsheet
A quick-reference price cheatsheet across the major Chinese AI models — chat (input / output USD per 1M tokens, and context window), text-to-image and text-to-video. Compact tables for DeepSeek, Qwen (通义千问), Doubao, Kimi, GLM, Kling and more, read straight from our shared model data. Everything renders in your browser; no model is called.
Prices are provisional estimates — official pricing prevails (价格为估算,以各厂商官方定价为准).
Chat models · 对话模型
USD per 1M tokens| Model | Input / 1M | Output / 1M | Context |
|---|
Text-to-image · 文生图
capability & access| Model | Strength | Access |
|---|
Text-to-video · 文生视频
duration · resolution · access| Model | Max duration | Max res | Access |
|---|
Reference only — prices are aggregator estimates for a quick side-by-side. Confirm the exact rate on each provider's official pricing page before you budget or buy.
How the Chinese AI pricing cheatsheet works
Start with the chat-model table
The first table lists the major Chinese chat LLMs — DeepSeek, Qwen, Doubao, Kimi, Zhipu (GLM), MiniMax — with input price, output price (both USD per 1M tokens) and context window. Use it to see which price band your workload falls into.
Read input and output prices separately
A model usually charges different rates for input (your prompt) and output (its generation), and reasoning or long-output models tend to charge far more for output. To budget, multiply your expected input and output tokens by their own rates, then add — otherwise you will undercount the real cost.
Then check the image / video tables
Image and video models are billed per image, per credit or by subscription, and prices move fast — so these tables surface the provider, capability focus and access tier (free / paid / credits) to help you shortlist, then verify the current per-unit price on the provider page.
Treat the provider page as the source of truth
Every figure here is an aggregator estimate, stamped with an "as of" date. For any real selection, quote or budget, confirm against each provider's own pricing page — exchange rates, promotions, tiers and regional differences can all change the actual price.
How the Chinese AI pricing cheatsheet works
One table to compare Chinese-model prices at a glance
Pricing for Chinese large language models is scattered across provider consoles, English aggregators and forum threads, often in different units and currencies. This cheatsheet pulls the figures into one place and one format: chat models in USD per 1,000,000 tokens, split into input (your prompt) and output (the model's generation), alongside the context window each one offers. The chat table covers the families most teams actually shortlist — DeepSeek, Qwen (通义千问), Doubao (豆包), Kimi, GLM (智谱) and MiniMax — so you can see in seconds where a model sits relative to its peers, whether you are chasing the lowest per-token cost or weighing a larger context window against a higher rate. Because input and output are billed separately, the table keeps them side by side: a model that looks cheap on input can be expensive once a long, verbose answer is generated, and the two columns make that trade-off obvious before you commit.
Image and video generation work differently. Text-to-image services such as 即梦 Jimeng, 通义万相 Wan, 混元 Hunyuan, 文心一格 ERNIE-ViLG and 可图 Kolors, and text-to-video services such as 可灵 Kling, 即梦 Seedance, 海螺 Hailuo, Vidu and Wan, are billed per image, per credit or by subscription — rates that swing with resolution, duration, frame rate and speed. A single dollar figure would be misleading there, so those tables surface what actually helps you shortlist: the provider, the capability the model is known for, and the access tier (free, paid, or credits). You narrow the field here, then open the provider's page to confirm the current per-unit price for the exact spec you need.
"Treat this cheatsheet as a map, not a meter: it shows you where each model sits, but the provider's own page is the only price you should quote."
Estimates, not quotes — always confirm with the provider
Every number here is a provisional estimate, drawn from public aggregators and stamped with an "as of" date you can see above the tables. Provider prices move — with model launches, promotions, exchange rates and tiered or batch discounts — so the figure that was right last quarter may not be right today. That is why the cheatsheet leads with the date and labels unverified entries as "—/待核实" (pending) rather than guessing: a missing number is more honest than a wrong one. When a price matters to a decision, the workflow is simple — shortlist here, then open the provider's own pricing page and confirm the exact rate for your region, currency and usage tier.
Used that way, the cheatsheet is genuinely fast. Scan the chat table to find the price band that fits your volume, glance at the context column to rule out models too small for your documents, then jump to the image or video table to shortlist by capability and access tier. It runs entirely in your browser, reading from the same shared model-data file that powers our other AI cost tools, so the numbers stay consistent across the suite and update in one place. Nothing you do is uploaded, no model is called, and there is no account or limit — it is simply a clean, current snapshot to start a pricing decision from, not the final word on it. One more habit worth keeping: when two models land in the same price band, the cheaper sticker price is rarely the whole story. Check the context window against the length of the documents you actually send, then weigh the output rate against how verbose each model tends to be, because a chattier model at a lower output price can still cost more per finished task than a terse one that charges slightly more. The tables are arranged to make exactly those comparisons quick, so the few seconds you spend here can save a far larger surprise on the monthly invoice later.
About Chinese AI Pricing — 10 Key Points
Chinese LLM API prices are generally quoted per 1,000,000 tokens, with input (prompt) and output (generation) priced separately — and this cheatsheet lists them the same way.
The DeepSeek family is known for low prices: its base chat model often has one of the lowest per-1M-token rates in its tier, suiting high-volume, cost-sensitive work.
Reasoning models such as DeepSeek R1 usually charge much more for output than ordinary chat models, because they generate a longer "thinking" trace.
The context window (ctx) sets how many tokens fit in one call; most mainstream Chinese models now reach 128K, and some long-context variants stretch to 200K–260K.
Text-to-image models (Jimeng, Wan, Hunyuan, ERNIE-ViLG, Kolors) are mostly billed per image or per credit, with the rate floating by resolution and speed — hard to capture in one number.
Text-to-video models (Kling, Seedance, Hailuo, Vidu, Wan) generally use credits or subscriptions; longer duration, higher resolution and higher frame rate all burn more.
A single provider often offers both a free and a paid tier: free tiers usually carry watermarks, lower clarity or quota caps, while paid / credit tiers unlock higher specs.
USD-quoted Chinese-model prices drift with the renminbi exchange rate and provider promotions, so always check the "as of" date when comparing across time.
Price should not be the only metric: context length, Chinese-language understanding, reasoning, image / video quality and integration effort all shape real value for money.
The figures in this cheatsheet are aggregator estimates for a quick side-by-side; for any real budget or selection, defer to each provider's own pricing page.
Frequently Asked Questions
- They are estimates drawn from English aggregators (with an "as of" date) for a quick side-by-side, not official quotes. Providers can change prices at any time, and tiers, promotions and regional differences exist. For any real budget or selection, defer to each provider's own pricing page.
- Chat models are priced in USD per 1,000,000 (1M) tokens, with separate rates for input (prompt) and output (generation). To budget, multiply your expected input and output tokens by their own rates and add the two together.
- For a few models (such as the latest Doubao and ERNIE releases) the official per-token price is not yet confirmed in our data source, so we show a placeholder rather than a potentially misleading number until it is verified.
- No. It only renders built-in pricing data into tables, entirely in your browser. It never calls DeepSeek, Qwen or any model, and never uploads anything you do.
- Generating (output) is usually more compute-intensive than reading (input), so most providers charge more for output — especially reasoning and long-output models. Never budget on the input price alone.
- Image and video models are mostly billed per image, per credit or by subscription, and the price shifts the moment a spec changes (resolution, duration, frame rate, speed). So these tables surface the provider, capability focus and access tier; check the provider page for the exact rate.
- It is the maximum number of tokens the model can handle in one call (input and output share it). A larger window fits more material, but very long contexts can be slower and pricier. The table marks it in K (thousand) / M (million) tokens.
- They are generally list prices excluding tax, and usually in USD. Your actual bill can differ with local taxes, settlement currency and provider promotions — defer to the official invoice and pricing page.
- For a rough estimate, yes — multiply your input and output tokens by their rates and add them. For precise budgeting, combine your real call volume, caching, batch discounts and the provider's current price; this table is a quick reference only.
- Completely free, with no account or sign-up and no usage limit. It runs in your browser and collects no data.
Related News
You may be interested in these recent stories from our newsroom.
No related news yet for this tool. Our editorial team publishes new pieces every week.
Browse all news →75 more free tools
Calculators, converters, security tools — no signup.