-
National University of Singapore
- Singapore
Lists (3)
Sort Name ascending (A-Z)
Stars
BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification
An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Train transformer language models with reinforcement learning.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Residual Quantization with Implicit Neural Codebooks
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Implementation of SoundStorm built upon SpeechTokenizer.
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
An implement of STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency of Zhong-Qiu Wang et al.
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
Python implementation for audio time-frequency automatic gain control
Automatically control volume of songs in playlist to make a better experience.
digital signal processing library for software-defined radios
Official repository for the paper "Topological Neural Discrete Representation Learning à la Kohonen" (ICML 2023 Workshop on Sampling and Optimization in Discrete Space)
first base model for full-duplex conversational audio