Kundan Kumar kundan2510

Founder, Lyrebird-AI (YC S17). Head of AI, Descript. Previously a PhD student at MILA, UdeM.

166 followers · 23 following

Montreal
kundan2510.github.io

Achievements

x2 x2

Stars

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

678 42 Updated Aug 3, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,405 709 Updated Jun 9, 2025

hubertsiuzdak / snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 609 32 Updated Nov 19, 2024

lyuchenyang / Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

Python 1,574 125 Updated Jan 1, 2025

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 6,397 929 Updated Jun 10, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 26,824 3,083 Updated May 10, 2025

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,554 276 Updated Jan 12, 2025

kuprel / min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,489 252 Updated Apr 28, 2025

tomwetjens / boardgamefiesta

Project to play board games like Great Western Trail and Dominant Species online. Backend code for Quarkus, AWS Lambda, DynamoDB. Front end code: https://github.com/tomwetjens/boardgamefiesta-app

Java 4 1 Updated Sep 22, 2022

nussl / nussl

A flexible source separation library in Python

Python 631 96 Updated Dec 9, 2024

infiloop2 / personal-stock-ticker

Scripts powering https://infiloop.io/personalstockticker

JavaScript 4 1 Updated Jan 23, 2021

pseeth / torch-stft

An STFT/iSTFT for PyTorch.

Python 359 52 Updated Oct 31, 2023

descriptinc / melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 1,012 218 Updated Aug 28, 2023

swechhasingh / Handwriting-synthesis

Implementation of "Generating Sequences With Recurrent Neural Networks" https://arxiv.org/abs/1308.0850

Jupyter Notebook 243 35 Updated May 1, 2023

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,445 8,995 Updated May 30, 2025

strob / gentle

gentle forced aligner

Python 1,586 302 Updated May 19, 2025

vickianand / kaggle_cats_vs_dogs

Using Convnet to classify images of cats from those of dogs. :)

Python 1 Updated Feb 17, 2019

mravanelli / pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,387 445 Updated Mar 14, 2022