ELSA Speak
AI pronunciation coach that analyses your English at the phoneme level and corrects it in real time
Overview
ELSA Speak (English Language Speech Assistant) is an AI-powered mobile app that uses patented speech recognition to analyse English pronunciation at the phoneme level, giving learners instant, colour-coded feedback on individual sounds, word stress, intonation, and fluency. Founded in 2015 by Vietnamese-born CEO Vu Van and speech scientist Dr. Xavier Anguera, the platform has accumulated over 90 million downloads across 195 countries and delivers more than 25,000 exercises spanning pronunciation drills, AI role-play conversations, vocabulary, and exam preparation for IELTS, TOEFL, and TOEIC.
Beyond the consumer app, ELSA operates a B2B platform and a licensed Speech Recognition API used by schools and enterprises to embed pronunciation assessment into their own learning environments. The company has raised $60 million in total, most recently a $23 million Series C led by UOB Venture Management in September 2023, and was named a World Economic Forum Technology Pioneer in 2024. Vietnam and Indonesia rank among its largest user markets, and the app explicitly targets Asian accent patterns, making it particularly relevant for ASEAN learners navigating English in professional contexts.
Pricing
Pricing shown for reference only. These figures reflect RECATOOLS research as of 16 Jun 2026 and may be out of date or incomplete. This is not financial or purchasing advice — always confirm the current price on the provider’s official website before making any decision.
Use cases
What you can produce with ELSA Speak
- Phoneme-level pronunciation score with colour-coded feedback per sound after every spoken response
- Personalised daily lesson plan adapting to identified weak sounds and fluency gaps
- AI role-play conversations simulating workplace and IELTS/TOEFL speaking scenarios
- Native-like Score metric tracking measurable progress over time
- IPA phonetic transcription and audio comparison for every exercise
- IELTS/TOEFL speaking score prediction (plus ELSA proprietary proficiency score)
- B2B API delivering pronunciation and fluency assessment scores embeddable in third-party LMS or edtech platforms
ASEAN Perspective
ELSA Speak in Southeast Asia
ELSA Speak was co-built by a Vietnamese founder who experienced firsthand the accent barriers faced by ASEAN professionals in global workplaces, and Vietnam and Indonesia together account for roughly half of its documented customer geography. The app's AI is explicitly trained on Asian-accented English patterns, making its error detection far more relevant to SEA learners than tools built on predominantly native-speaker corpora. B2B partnerships with Vietnamese schools and corporates (including Intel and Kimberly-Clark Vietnam) and a 2026 Hong Kong market launch underline its sustained APAC expansion trajectory. For ASEAN professionals targeting IELTS, TOEFL, or multinational career advancement, ELSA Speak remains the most purpose-fit pronunciation tool available at this price.
ELSA Speak is the strongest dedicated pronunciation tool for ASEAN learners of English. Its phoneme-level AI — trained on millions of non-native speakers across 100+ accent profiles — surfaces the exact sounds that trip up Vietnamese, Indonesian, and Filipino learners in a way that generic chatbot tutors cannot match. The WEF Technology Pioneer recognition, $60 million in institutional backing, and a dual San Francisco/Ho Chi Minh City footprint signal a company with serious APAC commitment rather than an afterthought pivot. The B2B speech API also opens the door to institutional buyers like universities and corporate L&D teams across the region.
The caveats are real, though. The free tier is too thin to be useful for sustained practice, and the Trustpilot score (2.7/5) reflects persistent user complaints about aggressive upselling, unclear cancellation flows, and occasional voice-recognition misfires. The pronunciation engine is calibrated exclusively to American English, meaning British or Australian accent variants score artificially low. Competing apps such as BoldVoice add human coach video overlays that many learners find more motivating than algorithm scores alone. At its annual price point it delivers measurable improvement — multiple academic studies in Indonesian and Vietnamese school contexts confirm statistically significant pronunciation gains — but users must push past a cluttered onboarding experience to realise that value.
What people say
ELSA Speak earns strong marks for pronunciation granularity: its phoneme-level AI — trained on one of the world's largest non-native accented audio datasets (over 200 million hours of speech from learners in 195 countries) — genuinely detects accent-specific errors that generic AI tutors miss. Academic studies in Indonesia and Vietnam consistently show statistically significant pronunciation gains. ASEAN relevance is exceptional given its Vietnamese co-founding and explicit training on Asian accent patterns. The friction points are meaningful: the free tier is too thin to sustain learners, the American-English-only benchmark penalises other varieties, and a low Trustpilot score (~2.7/5 at review time) reflects real complaints about billing and inconsistent recognition. Best for disciplined ASEAN learners committing to the annual plan.
Summary of public user & expert reviews, compiled by RECATOOLS.
Notable facts
- U2 guitarist The Edge (David Evans) participated in ELSA's Series B funding round in 2021
- ELSA's AI is trained on accented English data from over 20 million non-native speakers — making it one of the largest non-native accent corpora in existence
- Early in ELSA's history, 80–90% of its users came from Vietnam; that share has since dropped to around 20% as the app went global
- ELSA CEO Vu Van attended the World Economic Forum in Davos in January 2025 as part of the 2024 Technology Pioneer cohort
Frequently asked questions
About this listing
This entry was compiled from publicly available data including ELSA Speak's official website, press releases, documentation, and reputable third-party publications. RECATOOLS is not affiliated with ELSA Speak unless explicitly stated.
Third-party AI tools update their pricing, features, availability, and policies frequently. Information here may be outdated by the time you read this — we make reasonable efforts to keep listings current, but cannot guarantee absolute accuracy.
For the latest details, please refer to ELSA Speak directly →
Spotted something out of date? Suggest an update →
More in Other