Rare Chinese character reader. Enter one or more Hanzi to instantly see pinyin (tone marks), Zhuyin, Cantonese jyutping, stroke count, and dictionary definitions (trad/simp forms + English glosses). Browser-only, zero tracking.
Rare Chinese Character Reader
How to use
Type or paste the rare character
Drop one or more Hanzi into the box — paste any character you cannot read. Rare characters from the CJK extension blocks are supported.
Click "Look Up"
The tool parses character by character and builds one result card each. You can also press Ctrl/⌘ + Enter as a shortcut.
Read four readings + stroke count
Each card shows pinyin (with tone marks), Zhuyin, Cantonese jyutping, and a stroke count — Mandarin, Taiwan bopomofo and Cantonese side by side.
Read the dictionary definition
If the character is in the built-in dictionary you get its traditional/simplified forms plus English glosses (every reading for polyphones). If not, the readings still show with a "no dictionary entry" note.
Rare Chinese Character Reader: Pinyin, Zhuyin and Jyutping in One Lookup
The Chinese writing system is vast. The Kangxi Dictionary catalogues around 47,000 characters, and the latest Unicode standard encodes well over 90,000 Han ideographs. Yet everyday modern reading relies on barely 3,000 high-frequency characters — which means the overwhelming majority of Hanzi are, for the average reader, rare characters (生僻字): dense in strokes, unfamiliar in structure, and impossible to guess the sound of. You hit one in a name, a place name, a classical text, a recipe, a herbal-medicine label, a surname, or an internet meme — say 燚 (yì), 龘 (dá), or 靐 (bìng) — and you simply do not know how to pronounce it. This tool exists for exactly that moment: paste the character, get the reading instantly.
Unlike a generic translator that returns a single romanisation, this reader outputs four readings at once, serving the different habits of mainland, Taiwan, Hong Kong and overseas Chinese readers. Pinyin with tone marks is the standard phonetic notation in mainland China and Chinese-medium schools across Singapore, Malaysia and Indonesia. Zhuyin (注音 ㄅㄆㄇㄈ) is the bopomofo system still used in Taiwan and parts of Hong Kong education. Cantonese jyutping gives Hong Kong and Guangdong readers the Cantonese pronunciation they actually care about. A stroke count rounds it out, handy when you are cross-checking a dictionary, an input method, or a form. If the character is in the built-in dictionary you also get its traditional and simplified forms plus English glosses — and for polyphones (one character with several readings) every reading and meaning is listed separately.
Where the readings come from
Nothing here is invented. Mandarin pinyin and Zhuyin are computed from the widely-used open-source pinyin-pro dictionary; Cantonese jyutping comes from the to-jyutping Cantonese lexicon; stroke counts come from hanzi-writer's stroke-order data; and the English glosses come from a CC-CEDICT-derived character dictionary bundled with the tool. The whole process is fully deterministic: the same character always returns the same result, with no random generation, no guessing, and no online model calls. Rare characters often carry variant readings (archaic, dialectal, or proper-noun pronunciations); this tool gives the most common standard reading, so for proper nouns (specific place names or surnames) treat an authoritative dictionary as the final arbiter.
"The character you can't read — paste it in and the answer's there." One puzzle, four readings.
Who uses it
The use cases are broader than you'd expect. Parents and teachers check tricky characters in textbooks and classical poems. News editors and subtitlers confirm the correct reading of personal names, place names and brand names. People filling in forms verify the romanisation of a rare surname on an ID card or passport. TCM practitioners, cooks and calligraphers look up herb names, archaic dish names and variant glyphs from stone rubbings. Chinese learners resolve an unknown character mid-read in a classical text or an internet meme. Everything runs locally in your browser — whatever you type is never uploaded, stored, or tracked, so even name lookups stay completely private. This is a pure utility, with no metaphysics or fortune-telling component — it exists only to help you pronounce the characters you can't.
10 Facts about Rare Chinese Characters
The latest Unicode standard encodes 90,000+ Han characters, yet everyday reading uses barely 3,000 — the rest are rare characters to most readers.
One famous high-stroke rare character is 龘 (dá) — three 龍 (dragon) stacked together, 48 strokes, meaning "dragons in flight".
Many rare characters are triplications: three identical components, e.g. 焱 (yàn, flames), 淼 (miǎo, vast water), 矗 (chù, towering) — their logic is guessable.
Rare characters most often appear in personal names. People in China have been unable to get ID cards or bank cards because system fonts lacked their name character — a real "font gap" problem.
The internet has revived rare characters: 囧 (jiǒng, originally "bright", reused as an awkward-face emoji), 巭 (gū), and 嫑 (biáo, a fusion of 不要) all went viral online.
A character's Mandarin and Cantonese readings can differ sharply: 我 is wǒ vs ngo5. Rare characters diverge even more across dialects, which is why this tool lists pinyin and jyutping together.
Rare characters are often variant glyphs — alternate written forms of the same character. 㑺 is an old form of 俊; a dictionary lookup flags the traditional/simplified and variant mappings.
Zhuyin (ㄅㄆㄇㄈ) and pinyin are parallel systems: Taiwan teaches Zhuyin, the mainland teaches pinyin. Students on each side see entirely different "phonetic spellings" for the same rare character.
Stroke count is the classic dictionary index. Rare characters in the Kangxi and Hanyu Da Zidian dictionaries are found by radical + residual strokes, so knowing the total stroke count is genuinely useful for cross-checking.
Rare doesn't mean useless: herb names (茈, 薤), old place names (邗, 郪) and surnames (逯, 仝) use uncommon characters heavily — they stay alive in professional and legal documents.
Frequently Asked Questions
-
Loosely, any Hanzi that rarely appears in everyday modern reading and that an average reader cannot recognise or pronounce. They may be high-stroke, variant or archaic forms, or used only in names, place names, or specialist fields (TCM, classical texts). This tool gives readings for any CJK character, including extension-block rarities.
-
Yes. Paste multiple characters into the box and the tool builds one card per character. Duplicate characters are automatically de-duplicated and shown once.
-
(1) Pinyin with tone marks (e.g. nǐ) — mainland + overseas-school standard; (2) Zhuyin (ㄋㄧˇ) — Taiwan standard; (3) Cantonese jyutping (e.g. ngo5) — HK/Guangdong reading; (4) stroke count — for dictionary cross-checking. The first three are pronunciations; stroke count is an auxiliary index.
-
Mandarin and Cantonese readings come from validated open-source dictionaries (pinyin-pro, to-jyutping) and are reliable for common and many rare characters. But rare characters often have variant readings (archaic, dialectal, proper-noun); the tool gives the most common standard one. For specific names or places, verify against an authoritative dictionary.
-
The built-in dictionary covers a great many — but not all — characters. Extremely rare or newly coined characters may lack a definition. In that case the readings (pinyin, Zhuyin, jyutping) and stroke count still display; only the English gloss is missing.
-
If the dictionary records multiple readings for a character, the definition area lists each reading (pinyin) with its meanings, so you can pick the right one by context. The pinyin/Zhuyin/jyutping at the top of the card show the most common primary reading.
-
Stroke counts come from hanzi-writer's stroke-order database (loaded locally). It covers most common and many rare characters; if an extremely rare character has no stroke data the count shows "—". Counts follow the modern standard glyph, so variant forms may differ slightly.
-
No. All reading, stroke and definition lookups run locally in your browser; the dictionary file is downloaded once and cached locally, with no server calls. Even name lookups stay completely private.
-
As long as you can copy the character — select and copy it from a web page, PDF or document, then paste it in. The tool never asks you to spell or type the character yourself, which is exactly what makes it useful for rare characters.
-
No. This is a pure linguistic utility — it looks up readings, stroke counts and meanings, with no metaphysics, character-divination or fortune component whatsoever. Its only goal is to help you pronounce a character you couldn't.
Related News
You may be interested in these recent stories from our newsroom.
-
A Bipartisan US Bill Wants One Federal Rulebook for Frontier AI — and the Fight Is Already About State Laws
Two House lawmakers have released a discussion draft of the Great American AI Act, a 269-page bipartisan attempt to set a single federal fra...
-
US Export-Control Directive Leads Anthropic to Pull Fable 5 and Mythos 5 Worldwide
Anthropic disabled Fable 5 and Mythos 5 worldwide after a US Commerce Department directive restricted foreign access. Axios and other outlet...
-
Brussels Disputes Apple's 'Due to DMA' Framing: EU Says the Siri AI Delay Is Apple's Choice, Not the Law's
A day after Apple said its new Siri AI would skip EU iPhones and iPads 'due to the DMA,' the European Commission pushed back: the delay is A...