dongwon00kim

dongwon00kim

1 follower · 2 following

Achievements

Stars

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 11,776 1,661 Updated May 5, 2025

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,127 91 Updated Mar 2, 2025

GrandaddyShmax / audiocraft_plus

Forked from facebookresearch/audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 603 67 Updated Aug 15, 2024

cpm0722 / transformer_pytorch

Transformer(Attention Is All You Need) Implementation in Pytorch

Python 71 16 Updated Dec 2, 2022

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 13,690 1,392 Updated May 6, 2025

SUNGBEOMCHOI / Korean-Streaming-ASR

Korean Streaming ASR(with Denoiser and Conformer CTC)

Python 26 6 Updated Apr 28, 2024

0417keito / VALL-E-X-Trainer-by-CustomData

Forked from Plachtaa/VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 67 6 Updated Sep 21, 2023

meta-llama / llama

Inference code for Llama models

Python 58,207 9,761 Updated Jan 26, 2025

HeliosVirtualCockpit / Helios

Forked from BlueFinBima/Helios14

Helios Distribution

C# 217 37 Updated Apr 21, 2025

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,865 787 Updated Feb 11, 2024

facebookresearch / audiocraft

Jupyter Notebook 21,955 2,323 Updated Mar 13, 2025

wiedehopf / tar1090

Provides an improved webinterface for use with ADS-B decoders readsb / dump1090-fa

JavaScript 1,421 260 Updated May 8, 2025

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,282 227 Updated May 21, 2023

SOL1archive / KoGrammar

Korean Grammar Correction Model based on LLM

Jupyter Notebook 4 3 Updated Jun 7, 2023

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 70,583 10,425 Updated Jun 18, 2024

jianfch / stable-ts

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 1,864 202 Updated Mar 26, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 15,621 1,675 Updated May 3, 2025

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,124 324 Updated Nov 14, 2023

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,905 1,184 Updated Apr 24, 2024

Edresson / YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 969 84 Updated Nov 4, 2024

SMART-TTS / SMART-G2P

Python 99 38 Updated Mar 24, 2023

scarletcho / KoLM

Korean text normalization and language preparation package for LM in Kaldi-based ASR system

Python 60 20 Updated Apr 23, 2020

Kyubyong / g2p

g2p: English Grapheme To Phoneme Conversion

Python 849 129 Updated Jan 5, 2023

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,990 416 Updated May 10, 2023

sevagh / audio-degradation-toolbox

easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Python 49 10 Updated Nov 19, 2019

ruizhecao96 / CMGAN

Conformer-based Metric GAN for speech enhancement

Python 354 63 Updated May 3, 2024

brentspell / torch-yin

Yin pitch estimator in PyTorch

Python 114 7 Updated Nov 7, 2022

brentspell / hifi-gan-bwe

Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.

Python 214 26 Updated Oct 20, 2023

brandokoch / attention-is-all-you-need-paper

Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.

Jupyter Notebook 237 50 Updated Apr 29, 2024

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 5,746 549 Updated Mar 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dongwon00kim

Achievements

Achievements

Block or report dongwon00kim

Stars

SWivid / F5-TTS

jishengpeng / WavTokenizer

GrandaddyShmax / audiocraft_plus

cpm0722 / transformer_pytorch

FunAudioLLM / CosyVoice

SUNGBEOMCHOI / Korean-Streaming-ASR

0417keito / VALL-E-X-Trainer-by-CustomData

meta-llama / llama

HeliosVirtualCockpit / Helios

Plachtaa / VALL-E-X

facebookresearch / audiocraft

wiedehopf / tar1090

YuanGongND / ast

SOL1archive / KoGrammar

CompVis / stable-diffusion

jianfch / stable-ts

m-bain / whisperX

lifeiteng / vall-e

facebookresearch / demucs

Edresson / YourTTS

SMART-TTS / SMART-G2P

scarletcho / KoLM

Kyubyong / g2p

enhuiz / vall-e

sevagh / audio-degradation-toolbox

ruizhecao96 / CMGAN

brentspell / torch-yin

brentspell / hifi-gan-bwe

brandokoch / attention-is-all-you-need-paper

snakers4 / silero-vad