Ujj1225/NepaliTTS

Introduction

NepaliTTS is a combined Optical Character Recognition (OCR) and Text-to-Speech (TTS) system that gives visually impaired people easier access to written information. It was developed as a Final Year Major Project at IOE, Pulchowk Campus.

Technologies Used

  • React Native - Frontend
  • Tesseract OCR - OCR Engine
  • Tacotron 2 - TTS Model
  • HiFi-GAN - Vocoder
  • FastAPI - TTS API
  • Express.js - OCR API

Google Colab was used to train the model.
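
The components listed above suggest a two-stage flow: the OCR API turns a photographed page into text, and the TTS API turns that text into audio. The Python sketch below illustrates that flow only. The names `ReadAloudPipeline`, `fake_ocr`, and `fake_tts` are hypothetical, and the stand-in stages replace the real Tesseract and Tacotron 2 / HiFi-GAN services, whose routes and payloads this README does not document.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures, assumed for illustration only.
OcrFn = Callable[[bytes], str]   # image bytes -> recognized text
TtsFn = Callable[[str], bytes]   # text -> synthesized audio bytes


@dataclass
class ReadAloudPipeline:
    """Chains an OCR stage and a TTS stage, as the technology list implies."""

    ocr: OcrFn
    tts: TtsFn

    def run(self, image: bytes) -> bytes:
        # Stage 1: extract text from the image.
        text = self.ocr(image).strip()
        if not text:
            # Nothing to read aloud; let the caller decide how to report it.
            raise ValueError("OCR returned no text")
        # Stage 2: synthesize speech for the recognized text.
        return self.tts(text)


# Stand-in stages so the sketch runs without any server or model.
def fake_ocr(image: bytes) -> str:
    return "नमस्ते"


def fake_tts(text: str) -> bytes:
    # A real TTS stage would return audio data; this only echoes the text.
    return text.encode("utf-8")


pipeline = ReadAloudPipeline(ocr=fake_ocr, tts=fake_tts)
audio = pipeline.run(b"\x89PNG")
```

In a real client, `ocr` and `tts` would presumably wrap HTTP calls to the Express.js and FastAPI services. Keeping them as injected callables lets each stage be swapped or tested on its own.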

Demo

25-03-06-09-49-21.1.mp4
