8000 pavs315 (Pavani Chowdary) / Starred Β· GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View pavs315's full-sized avatar
  • IIIT Hyderabad
  • Hyderabad

Highlights

  • Pro

Organizations

@ERC-IIITH

Block or report pavs315

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Python 377 82 Updated Oct 23, 2023

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 866 351 Updated May 28, 2025

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Python 194 26 Updated Nov 9, 2022

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 301 47 Updated Aug 25, 2021

A repository of links with advice related to grad school applications, research, phd etc

2,212 215 Updated Nov 12, 2023

Basically listing out how to get some basic logistics out of the way

Shell 265 20 Updated Jan 26, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 14,627 2,899 Updated May 28, 2025

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 14,288 3,340 Updated Aug 12, 2024

g2p: English Grapheme To Phoneme Conversion

Python 852 129 Updated Jan 5, 2023

End-to-End Speech Processing Toolkit

Python 9,150 2,270 Updated May 22, 2025

Keep track of opportunities and never miss a deadline again!

TypeScript 432 76 Updated Jan 31, 2024

Reference implementation of real-time autoregressive wavenet inference

Cuda 737 126 Updated Jan 19, 2021

A Flow-based Generative Network for Speech Synthesis

Python 2,324 534 Updated Oct 19, 2023

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,227 1,418 Updated Jun 12, 2024

This repository contains Accepted and Rejected proposals for various Google Summer of Code organizations.

52 10 Updated Sep 5, 2023

Resources for "Natural Language Processing" Coursera course.

Jupyter Notebook 1,187 1,951 Updated Dec 21, 2022

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Python 2,973 955 Updated Jul 6, 2023

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

535 84 Updated Feb 9, 2022

my 9th grade project for NASA AMES SPACE SETTLEMENT CONTEST

2 Updated Sep 24, 2023

The hands-on NLTK tutorial for NLP in Python

Jupyter Notebook 554 240 Updated May 28, 2024

curated collection of papers for the nlp practitioner πŸ“–πŸ‘©β€πŸ”¬

1,072 89 Updated Aug 5, 2020

EPFL Course - Optimization for Machine Learning - CS-439

Jupyter Notebook 1,276 331 Updated May 23, 2025
0