NepaliTTS is a Text-to-Speech (TTS) and Optical Character Recognition (OCR) system that gives visually impaired people an easier way to access written information. It was developed as part of the Final Year Major Project at IOE, Pulchowk Campus.
The system is built with the following components:
- React Native - Frontend
- TesseractOCR - OCR Engine
- Tacotron2 - TTS Model
- HifiGAN - Vocoder
- FastAPI - TTS API
- ExpressJS - OCR API
Additionally, Google Colab was used to train the model.
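
The components above suggest a two-stage flow: an image goes to the OCR service, and the recognized text goes to the TTS service, which returns audio. The following is only a minimal sketch of that flow under this assumption; this README does not describe the actual request flow or endpoints. `ReadAloudPipeline`, `OcrFn`, `TtsFn`, and the stub functions are hypothetical names used for illustration, and the stubs stand in for real calls to the OCR and TTS APIs.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures (not taken from the project's code).
OcrFn = Callable[[bytes], str]   # image bytes -> recognized text
TtsFn = Callable[[str], bytes]   # text -> synthesized audio bytes


@dataclass
class ReadAloudPipeline:
    """Chains an OCR stage and a TTS stage: image in, audio out."""

    ocr: OcrFn
    tts: TtsFn

    def run(self, image: bytes) -> bytes:
        # Stage 1: extract text from the image and trim stray whitespace.
        text = self.ocr(image).strip()
        if not text:
            # Nothing to read aloud, so fail early instead of calling TTS.
            raise ValueError("OCR returned no text")
        # Stage 2: synthesize speech for the recognized text.
        return self.tts(text)


# Stub stages for illustration only; a real client would make HTTP
# requests to the OCR and TTS services here.
def fake_ocr(image: bytes) -> str:
    return "  नमस्ते\n"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


audio = ReadAloudPipeline(ocr=fake_ocr, tts=fake_tts).run(b"fake-image-bytes")
```

Passing the two stages in as plain callables keeps the sketch independent of any particular HTTP client and makes each stage easy to replace or test on its own.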