8000 GitHub - nuloperrito/PronunciationHelper: A fast, GUI-based speech assistant for language learners, offering real-time pronunciation help with IPA phoneme visualization and neural voice playback in dozens of languages.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

A fast, GUI-based speech assistant for language learners, offering real-time pronunciation help with IPA phoneme visualization and neural voice playback in dozens of languages.

License

Notifications You must be signed in to change notification settings

nuloperrito/PronunciationHelper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Word Pronunciation Assistant

A fast, local pronunciation and speech synthesis assistant powered by piper-phonemize, optimized for language learners and anyone looking to improve their spoken fluency across dozens of languages. This GUI application supports IPA phoneme visualization and real-time audio playback using neural voice models.

screenshot

Features

  • Text-to-IPA phonemization using eSpeak-ng
  • Real-time audio synthesis with neural voice models (ONNX format)
  • Adjustable length scale (speaking speed) and volume
  • Multi-language support with auto-discovered voices
  • Simple and responsive GUI built with wxWidgets
  • Runs fully offline on Windows and Linux

Quick Demo

  1. Download a voice model (see Voices below)
  2. Place both .onnx and .onnx.json files into the models/ directory
  3. Run the app, choose a model, type a phrase, and click Pronounce

Example

Welcome to the world of speech synthesis!
→ wˈɛlkʌm tə ðə wˈɜːld ʌv spˈiːtʃ sˈɪnθəsˌɪs

Then listen to the synthesized audio with realistic pronunciation.

screenshot


Voices

This application uses the same voices as the Piper TTS project, trained with VITS and exported for ONNX Runtime.

Supported Languages

  • Arabic (ar_JO)
  • Catalan (ca_ES)
  • Czech (cs_CZ)
  • Welsh (cy_GB)
  • Danish (da_DK)
  • German (de_DE)
  • Greek (el_GR)
  • English (en_GB, en_US)
  • Spanish (es_ES, es_MX)
  • Finnish (fi_FI)
  • French (fr_FR)
  • Hungarian (hu_HU)
  • Icelandic (is_IS)
  • Italian (it_IT)
  • Georgian (ka_GE)
  • Kazakh (kk_KZ)
  • Luxembourgish (lb_LU)
  • Nepali (ne_NP)
  • Dutch (nl_BE, nl_NL)
  • Norwegian (no_NO)
  • Polish (pl_PL)
  • Portuguese (pt_BR, pt_PT)
  • Romanian (ro_RO)
  • Russian (ru_RU)
  • Serbian (sr_RS)
  • Swedish (sv_SE)
  • Swahili (sw_CD)
  • Turkish (tr_TR)
  • Ukrainian (uk_UA)
  • Vietnamese (vi_VN)
  • Chinese (zh_CN)

📁 Each voice requires:

  • A .onnx file (neural model)
  • A corresponding .onnx.json configuration file

You can download voices from the Piper Voices repository or see the VOICES.md list.

⚠️ Note: Some voices may have restrictive licenses. Always check the MODEL_CARD file before use.


Installation

Binary Releases

Download prebuilt releases from the Releases page, or build from source:

Build From Source (Linux/Windows)

  1. Clone this repo
  2. Run cmake first, and then make or open in your IDE of choice

Your voice models should be placed in a models/ folder next to the binary.


Usage

After launching the app:

  1. Select a language model from the dropdown
  2. Enter any word or phrase in the text box
  3. Click Look up to see the IPA symbols
  4. Click Pronounce to hear the voice synthesis
  5. Adjust speed and volume using the sliders

All synthesis and audio playback happen locally and offline.


Technical Stack

  • GUI: wxWidgets
  • TTS Engine: Piper (ONNX Runtime, VITS)
  • Phonemizer: espeak-ng via piper-phonemize
  • Audio Output: PortAudio
  • Language Support: Multilingual, via ONNX+JSON model pairs

License

This app is open source and intended for educational, research, and personal language learning use. See LICENSE for more information.


Acknowledgements

About

A fast, GUI-based speech assistant for language learners, offering real-time pronunciation help with IPA phoneme visualization and neural voice playback in dozens of languages.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

0