Stars
Rethinking end-to-end evaluation for spoken language understanding
A multi-voice TTS system trained with an emphasis on quality
A Python module for controlling interactive programs in a pseudo-terminal
56 language, 1 model Multilingual ASR
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Command-line tools for speech and intent recognition on Linux
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
⚡ A Fast, Extensible Progress Bar for Python and CLI
A simple app for recording speech datasets.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Tools and Python libraries for manipulating Pico-8 game files. http://www.lexaloffle.com/pico-8.php
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Track emissions from Compute and recommend ways to reduce their impact on the environment.
VCTK multi-speaker tacotron for ICASSP 2020
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
⏰ AI conference deadline countdowns
CUDA kernels for generalized matrix-multiplication in PyTorch