Chinese LLM Token & Cost Calculator

AI CHINESE REFERENCE

简繁 EN

About this tool

Estimate tokens and per-model API cost for DeepSeek, Qwen, Kimi and more

Chinese LLM token & cost calculator for DeepSeek/Qwen/Kimi: estimate tokens and per-model API cost. Estimates only. In your browser.

RT-AI-066 · AI Tools

Estimate how many tokens your text uses, then compute the API cost across Chinese large language models — DeepSeek, Qwen (通义千问), Kimi, Zhipu GLM, MiniMax, Doubao and ERNIE. Paste text to estimate tokens, or type input/output token counts directly, and compare per-request and total cost side by side. Everything runs in your browser; nothing is sent to a server and no model is called.

Heads-up: token counts use the OpenAI tokenizer and are estimates (估算) for Chinese models; prices are provisional — official provider prices prevail (价格为估算，以各厂商官方为准). Models without a verified price show "—／待核实". Nothing is sent anywhere.

Paste text Enter token counts

Your text (input)

Estimated input tokens 0 估算 · OpenAI o200k

Output tokens

Requests

Model	In / 1M	Out / 1M	Per request	Total

How the Chinese LLM token & cost calculator works

Pick how you want to count

The tool takes input two ways: paste text and let it estimate the token count, or switch to "Enter token counts" mode and type the input / output token numbers you already know. Pasting text is ideal when you have not run the model yet and just want a rough figure first.

Paste text or enter token counts

In paste-text mode, drop in your prompt, document or conversation; the tool counts tokens with the browser-local OpenAI tokenizer (o200k_base) and treats that as the input tokens. Note: Chinese models do not use the OpenAI tokenizer, so the figure is an estimate (估算).

Set output tokens and request count

Enter the expected output tokens (the length of the model's reply) and how many requests you make per month or per batch. Cost = input tokens × input rate + output tokens × output rate, multiplied by requests — the table converts it live.

Compare cost across models

The table ranks DeepSeek, Qwen, Kimi, Zhipu, MiniMax and more from cheapest to priciest, showing per-request and total cost. Prices are estimates — official provider prices prevail; models with no verified price show "—／待核实".

How the Chinese LLM token & cost calculator works

Tokens, not words, are how LLMs bill — and Chinese counts differently

Every major large language model bills by the token, and a token is not the same as a word or a character. Roughly, a token is a short chunk of text — in English about four characters on average, but Chinese is far denser, where a single character can be close to one or two tokens on its own. That is why you cannot price an LLM call by counting words: the only honest unit is the token. This calculator gives you two ways to get there. Paste your prompt, document or conversation and it estimates the token count using a tokenizer that runs entirely in your browser; or, if you already have exact figures from your API usage logs, switch to manual mode and type the input and output token counts straight in. Either way, it then multiplies those tokens by each model's published input and output rates and ranks the results from cheapest to most expensive, so the trade-offs are visible at a glance.

The reason input and output are entered separately is that almost every provider prices them separately — and output is usually the more expensive of the two, often by several times. The text you paste represents only the input you send; the length of the model's reply, the output tokens, is something you have to estimate yourself before the request is made. Reasoning models complicate this further: a model like DeepSeek R1 generates a large volume of intermediate "thinking" tokens that are typically billed as output, so even a short final answer can carry a surprisingly high per-call cost. When you are budgeting for one of those, set the output token field higher than the visible answer length would suggest, or your estimate will run low.

"The cheapest and priciest Chinese model can differ by more than ten times on the same prompt — which is exactly why estimating before you commit pays for itself."

Estimates, not invoices: read the numbers as planning figures

It is important to be clear about what this tool can and cannot tell you. The token counts are estimates, not exact figures. They are produced with OpenAI's o200k_base tokenizer because it is the one that can run locally and openly in a browser — but DeepSeek, Qwen, Kimi, Zhipu and the rest each use their own tokenizer, and the same text will segment into a slightly different number of tokens on each. Treat the count as a sound planning figure, accurate enough to compare models and size a budget, but not as the exact number that will appear on your invoice. The same caution applies to the prices: the rates here are provisional, gathered from public aggregators, and the official price from each provider always prevails. Models whose rate has not yet been verified are listed by name with a "—／待核实" marker and left out of the arithmetic rather than guessed at.

Used with that understanding, the calculator is genuinely useful for the decisions that matter before you spend money. Run the same workload across the models and the cost gap is frequently several to more than ten times, which can change which model you choose, how long you let replies run, and whether a feature is viable at scale. Multiply a single request by your expected monthly volume and you have a defensible budget line before a single API key is issued. And because the whole thing runs locally — no upload, no model call, no stored input — you can iterate as much as you like: paste a real prompt, read the spread, tighten the output length, switch models, and re-check, all without anything you type ever leaving your device. When the decision is final, confirm the current rate on each provider's own pricing page and treat the figure here as the starting estimate it is meant to be.

About LLM Tokens & Pricing — 10 Key Points

LLM APIs bill by token, not by character: one token is roughly a short chunk of text, and input and output are usually priced separately.

In English about 4 characters ≈ 1 token; Chinese is denser, where a single character is often close to 1–2 tokens, so the same character count is not always cheaper.

This tool estimates tokens with OpenAI's o200k_base tokenizer, while DeepSeek, Qwen, Kimi and other Chinese models each have their own tokenizer — so the numbers here are estimates (估算), not an exact bill.

Output tokens usually cost far more than input tokens — many models charge several times more for output, so keeping replies short is often the biggest saving.

Reasoning models (e.g. DeepSeek R1) generate a large volume of "thinking" tokens, usually billed as output, which can sharply raise the per-call cost.

Run the same prompt across different models and the cost gap between the cheapest and priciest is frequently several to more than ten times.

The context window (ctx) sets how many tokens fit in one call; a big window is not cheap — long input is metered, and more tokens cost more.

Monthly cost for a batch job = per-request cost × number of requests; estimate one call, then multiply by scale to avoid a runaway bill at launch.

Prices in this calculator are estimates (from public aggregators); official provider prices prevail. Some models have no verified price and show "—／待核实".

The whole calculation runs locally in your browser — pasted text is never uploaded, never sent to a model, and never stored.

Frequently Asked Questions

No. This tool estimates tokens locally in your browser with OpenAI's o200k_base tokenizer, while DeepSeek, Qwen, Kimi, Zhipu and other Chinese models each use a different tokenizer, so the segmentation differs. The token counts here are estimates (估算), good for budgeting but not an exact bill.
Prices are estimates — official provider prices prevail. The rates here come from public aggregators and may differ from each provider's latest official, promotional or regional pricing. Check the provider's own page before any real purchase or launch. Some models have no verified price and show "—／待核实".
No. It does just two things in your browser: estimate tokens with a local tokenizer, and multiply numbers to compute cost in plain JavaScript. It never calls DeepSeek, Qwen or any model, and makes no network requests beyond loading our own same-origin tokenizer and pricing-data files.
If you have not run the model yet and only have some text, use paste-text mode and let the tool estimate the input tokens. If you already have exact input / output token counts from your API usage or logs, switch to "Enter token counts" and type them in for a result closer to the real bill.
Because input and output are usually priced separately, and output often costs much more than input. The length of the model's reply (output tokens) is something you cannot know before the request, so you estimate it; pasted text only represents the input you send.
Yes, noticeably. In English about 4 characters ≈ 1 token, while Chinese is denser, where a single character is often close to 1–2 tokens. That is why you cannot estimate cost by character count and should compute it per token.
It means we have not yet verified that model's official rate, so it is excluded from the cost calculation and only listed by name in the table. Once the price is verified, it will show a concrete cost like the other models.
Reasoning models generate a large volume of "thinking" tokens before the answer, usually billed as output tokens. Even if the final answer is short, the intermediate reasoning consumes tokens and raises the per-call cost — set output tokens higher when estimating these.
No. All calculation happens locally in your browser with plain JavaScript. Nothing you paste or type is sent to any model, server or third party, and nothing is stored.
Completely free, with no account or sign-up and no usage limit. It runs in your browser and collects no data.

Chinese LLM Token & Cost Calculator

How the Chinese LLM token & cost calculator works

Pick how you want to count

Paste text or enter token counts

Set output tokens and request count

Compare cost across models

How the Chinese LLM token & cost calculator works

Tokens, not words, are how LLMs bill — and Chinese counts differently

Estimates, not invoices: read the numbers as planning figures

About LLM Tokens & Pricing — 10 Key Points

Frequently Asked Questions

Related News

75 more free tools

Chinese LLM Token & Cost Calculator