Tokens ↔ Words ↔ Characters Converter
Convert between tokens, words and characters for LLM prompts. Type any one and get the other two instantly. Free, runs in your browser.
Tokens, Words and Characters Converter
Type any one value — the other two update instantly. Based on the English average of 1 token ≈ 0.75 words ≈ 4 characters. For an exact GPT count, use the Token Counter.
How to Use the Converter
Type into any box
Enter a number of tokens, words, or characters. You only ever fill one — the tool works out the other two for you.
Read the conversion
The other two boxes update instantly using the English average of about 0.75 words and 4 characters per token.
Use it to size a prompt
Heading into a model with a token budget or a context window? Convert your word count to tokens to see roughly how much room you'll use.
Switch to exact when it matters
These are averages. For billing or a tight context window, count the real text with the Token Counter, which runs the actual GPT tokenizer.
Tokens, Words and Characters: A Quick Mental Model
Why a rough conversion is so useful
You usually think in words — "a 500-word email", "a 2,000-word article" — but language models think in tokens, and that's the unit of both their context limit and their price. A fast conversion between the two lets you answer everyday questions without pasting anything into a tokenizer: will this article fit in the prompt? Roughly how much will this batch cost? Is my system prompt eating too much of the window? The widely-used rule of thumb is that English runs about 0.75 words per token, or equivalently about 4 characters per token — so 1,000 tokens is roughly 750 words.
That ratio is an average, not a law. Tokenizers split common words into single tokens but break rare or long words into pieces, and they count spaces and punctuation. Code, URLs, numbers and non-English text all tokenise less efficiently — often noticeably more tokens per word than prose. So treat this converter as a planning instrument: perfect for a quick sense of scale, but reach for an exact tokenizer when a few percent matters for your bill or your context budget.
"Think in words, budget in tokens. A good estimate bridges the two in your head — an exact count settles the invoice."
Where the estimate drifts
The 4-characters-per-token figure holds up well for ordinary English writing. It starts to drift in predictable directions: dense code and JSON push the token count up relative to characters because of symbols and indentation; long technical terms and identifiers split into multiple tokens; and languages with non-Latin scripts can use dramatically more tokens per character. If your text is mostly one of those, nudge your mental ratio accordingly — or just measure it. The point of this tool is to get you 90% of the way there in a second, so you only spend the effort of an exact count when the last few percent actually changes a decision.
10 Facts About Tokens, Words & Characters
English averages about 0.75 words per token — so 1,000 tokens ≈ 750 words.
Equivalently, English runs about 4 characters per token on average.
A token is often a whole common word, but rare or long words split into several tokens.
Spaces and punctuation are tokens too — and a leading space changes a word's token.
Code and JSON use more tokens per character than prose because of symbols and indentation.
Non-Latin scripts often use far more tokens per character than English.
Context windows and API prices are both measured in tokens, not words.
These conversions are estimates — the only exact count comes from running a tokenizer.
A typical page of text (~500 words) is roughly 650–700 tokens.
This tool runs entirely in your browser — nothing is sent anywhere.
Frequently Asked Questions
- About 750 words of English, since the average is roughly 0.75 words per token (or about 4 characters per token). It varies with punctuation, formatting and language, so treat it as a close estimate rather than an exact figure.
- Roughly 650–700 tokens for typical English prose (500 ÷ 0.75 ≈ 667). Dense code, long technical terms or non-English text will push that number higher.
- No — it's an average-based estimate. The only way to get an exact count is to run the model's tokenizer on the actual text. For OpenAI models you can do that with our Token Counter, which uses the real tiktoken encodings.
- Tokenizers are trained mostly on English text, so common English words map to single tokens. Symbols, indentation, long identifiers and non-Latin scripts don't fit those patterns and get split into more tokens, raising the token-per-character ratio.
- Slightly. Different model families use different tokenizers, so the exact token count for the same text varies a little. The ~4-characters-per-token rule is a good cross-model average, but for precise per-model figures you need that model's tokenizer.
- Because you often only know a word count — "a 2,000-word article" — and want a quick sense of whether it fits a context window or how much it might cost, without pasting the text anywhere. This converter gives that instant ballpark; the Token Counter gives the exact figure when you have the text.
- Yes. Type into tokens, words or characters and the other two update. It's fully bidirectional, so you can start from whichever number you already have.
- No. It's just arithmetic running in your browser. The numbers you type are never sent to any server or third party, and nothing is stored.
- This tool only converts counts. To turn token counts into a dollar figure, use our LLM Cost Calculator, which applies each model's input and output rates to your token numbers.
- Completely free, with no account or sign-up, and no limit on use. It runs in your browser and collects no data.
Related News
You may be interested in these recent stories from our newsroom.
No related news yet for this tool. Our editorial team publishes new pieces every week.
Browse all news →75 more free tools
Calculators, converters, security tools — no signup.