8000 GitHub - gheyret/uyghur-asr-ctc: Speech Recognition for Uyghur using deep learning
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

gheyret/uyghur-asr-ctc

Repository files navigation

Speech Recognition for Uyghur using deep learning

Training:

this model using CTC loss for training.

Download pretrained model and dataset.

unzip results.7z and thuyg20_data.7z to the same folder where python source files located. then run:

python train.py

Recognition:

for recognition download only pretrained model(results.7z). then run:

python tonu.py test1.wav 

result will be:

        Model loaded: results/UModel_last.pth
            Best CER: 7.21%
             Trained: 473 epochs
The model has 26,389,282 trainable parameters

======================
Recognizing file .\test2.wav
test2.wav -> bu öy eslide xotunining xush tebessumi oghlining omaq külküsi bilen güzel idi

This project using

A free Uyghur speech database Released by CSLT@Tsinghua University & Xinjiang University

About

Speech Recognition for Uyghur using deep learning

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

0