Normalise email addresses to find duplicates. Handles Gmail dot variants, +tag aliases, capitalisation, and disposable-email detection.

RT-DEV-066 · Developer Tools

Email Normalizer Tool

0
Inputs
0
Unique
0
Duplicates
0
Invalid
0
Disposable
Unique normalised emails will appear here.
Duplicate entries (same normalised email) will appear here.
Inputs that don't look like valid emails will appear here.
Every input with its normalised form and any flags.
Advertisement
After results · AD-W1 Responsive · Post-tool — peak engagement

How to use the email normalizer

Paste your list

Drop your email addresses into the input box. One per line, comma-separated, or semicolon-separated all work. Mixed formatting is fine — the tool splits on any whitespace, comma, or semicolon.

Pick your normalisation rules

Defaults strip Gmail dots, remove +tag aliases, lowercase the domain, and treat googlemail.com as gmail.com. Toggle these off if you're cleaning lists for a context that needs the variants preserved.

Review the results

Four tabs: unique normalised list (the clean output), duplicates (what got merged), invalid (anything that doesn't look like an email), and annotated (every input with its normalised form and any flags).

Copy or download

Copy the current view to your clipboard or download the full annotated result as a CSV file ready for Excel, Google Sheets, or your CRM import.

Advertisement
After how-to · AD-W2 Responsive

Email normalisation — why "duplicates" hide in plain sight

If you've ever imported a customer list and watched it report "1,247 unique addresses" when the source had 1,300 entries, you've already met the email-normalisation problem. The same human is often represented by half a dozen syntactic variants of the same address. Lowercasing helps. Stripping plus-aliases helps more. Knowing the Gmail-specific dot rule — which is the single biggest source of duplicate counts in any consumer mailing list — closes most of the remaining gap.

The Gmail dot rule, explained

Gmail (and Google Workspace addresses on googlemail.com) treats dots in the local part as cosmetic. The addresses [email protected], [email protected], and [email protected] all deliver to the same inbox. This is documented Gmail behaviour, not a bug — it was originally designed so users could "format" their email for readability without losing reach. The side effect is that signup forms and CRMs see the same person as different people. Stripping all dots from the local part of a @gmail.com or @googlemail.com address is the single most-effective deduplication rule you can apply.

The +tag alias

Sub-addressing — the trick of writing [email protected] to get mail still delivered to [email protected] — was specified by RFC 5233 in 2008. Gmail, iCloud, Outlook.com, FastMail, Proton Mail, and most modern providers support it. Users do this deliberately, often to filter incoming mail ([email protected] for RECATOOLS signups, [email protected] for Amazon receipts). For deduplication, you almost always want to strip the +suffix and treat all variants as the same person. The exception is fraud detection — repeated +1, +2, +3 variants on a free-trial endpoint are usually one person abusing the limit, and the +tag is the giveaway.

Disposable email detection

Mailinator, 10minutemail, guerrillamail, tempmail, and their friends are throwaway-mail services — they hand out an email address that auto-expires in minutes. They're great for users who don't want to give their real address to a one-time signup; they're a problem for any business model where re-engaging a user matters. The tool here ships with a curated list of around 80 known disposable-email domains. The list is not exhaustive (new domains spin up weekly), but it catches the common offenders. For production systems with real fraud exposure, integrate a maintained API service (Mailcheck, Kickbox, NeverBounce) on top of this client-side check.

Role accounts and ASEAN gotchas

Role accounts (info@, admin@, support@, billing@, noreply@) are not individual humans — they're inboxes shared by a team or function. They are valid email addresses, but for marketing-list use they typically should be filtered out separately. The tool flags them so you can decide. Another regional gotcha: many Southeast Asian companies use addresses on country-specific subdomains ([email protected], [email protected]) — these are not "weird," they're standard, and the normaliser treats them identically to .com addresses. Don't strip the .sg or .my; it's part of the canonical address.

What this is not

Normalisation is not verification. The output is "this is the canonical spelling of an email that looks valid" — not "this email exists and accepts mail." Verifying that requires actually delivering to the address (or talking to the receiving SMTP server), which can't happen client-side and isn't this tool's job. Pair normalisation with a real-time verification API at the moment of signup if your business model depends on deliverability.

10 email-normalisation facts you should know

01

Gmail's "dots don't matter" rule has been confirmed by Google since 2008 and is documented in their help centre — it is intentional behaviour, not a bug, and applies to both gmail.com and googlemail.com addresses.

02

Sub-addressing with the + character was formally specified in RFC 5233 in 2008. Gmail, Outlook.com, iCloud, FastMail, Proton Mail and Microsoft 365 all support it; some legacy providers do not.

03

The local part of an email (everything before the @) is case-sensitive per RFC 5321, but in practice every major mail provider treats it as case-insensitive. Lowercasing for deduplication is safe.

04

The domain part of an email is case-insensitive per RFC and per DNS. Example.COM and example.com are the same domain, so always lowercase domains before comparing.

05

The longest valid email address per RFC 5321 is 320 characters: 64 for the local part + @ + 255 for the domain. Most form fields validate 254 max, which is the historically-recommended cap.

06

Disposable-email domains rotate constantly — by some industry estimates 20-50 new throwaway-mail domains spin up every week. Any blocklist needs ongoing maintenance to stay useful.

07

About 1.8 billion Gmail users exist worldwide as of 2025, making Gmail's dot rule the single highest-impact normalisation for any global consumer email list.

08

Microsoft 365 addresses on Exchange Online support +tag sub-addressing only if the tenant admin enables it explicitly — it's off by default, unlike Gmail.

09

Internationalised email addresses with non-ASCII characters (用户@例え.jp) are valid per RFC 6531 but mail-server support remains spotty. Many systems still reject them outright.

10

Apple's "Hide My Email" feature generates random aliases like [email protected] that forward to a user's real address — these are unique per signup and impossible to deduplicate without Apple's lookup.

Frequently asked questions

Gmail treats dots in the local part of @gmail.com and @googlemail.com addresses as cosmetic only — they don't affect delivery. All variants land in the same inbox. This is documented Gmail behaviour, not a bug.
No. [email protected] and [email protected] are different mailboxes. The dot-stripping rule is Gmail-specific. The tool only applies it to @gmail.com and @googlemail.com.
Sub-addressing (RFC 5233) lets users add +anything after their local part. [email protected] still delivers to [email protected]. Users do this to filter incoming mail; for deduplication, stripping the +suffix is usually correct.
Almost always for deduplication. The exception is fraud detection — repeated +1, +2 variants on free trials are usually one person abusing your limits. For that use-case, leave +tags intact and flag the pattern.
Mailinator, 10minutemail, guerrillamail, tempmail, throwawaymail and similar services that hand out temporary inboxes that auto-expire. The tool ships with ~80 known disposable domains; new ones appear constantly so the list is not exhaustive.
Addresses like info@, admin@, support@, noreply@ that go to a function or team rather than a person. Valid email addresses, but typically not what you want on a marketing list. The tool flags them.
Technically yes per RFC 5321, but every major provider treats it as case-insensitive. The tool lowercases the domain by default and offers a separate toggle for the local part — leave that off unless you know your data source is case-significant.
Domains like recasys.com.sg or example.com.my are standard and the tool treats them identically to .com addresses. Lowercasing applies; nothing is stripped from the domain.
No. All parsing, normalisation, and disposable-domain matching runs locally in your browser using vanilla JavaScript. The email list, the output, and the downloaded CSV never touch any server.
Three columns: original, normalised, and flags. The flags column is a semicolon-separated list of any of: invalid, duplicate, disposable, role. Direct import into Excel, Google Sheets, or any CRM.

Related News

You may be interested in these recent stories from our newsroom.

View all news →
Advertisement
Pre-footer · AD-W3 728 × 90

75 more free tools

Calculators, converters, security tools — no signup.