-
IIIT Hyderabad
- Hyderabad
Highlights
- Pro
Stars
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
A repository of links with advice related to grad school applications, research, phd etc
Basically listing out how to get some basic logistics out of the way
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Keep track of opportunities and never miss a deadline again!
Reference implementation of real-time autoregressive wavenet inference
A Flow-based Generative Network for Speech Synthesis
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
This repository contains Accepted and Rejected proposals for various Google Summer of Code organizations.
Resources for "Natural Language Processing" Coursera course.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
my 9th grade project for NASA AMES SPACE SETTLEMENT CONTEST
The hands-on NLTK tutorial for NLP in Python
curated collection of papers for the nlp practitioner ππ©βπ¬
EPFL Course - Optimization for Machine Learning - CS-439