A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstra…

Jupyter Notebook 53 10 Updated May 15, 2025

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 942 110 Updated Aug 7, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 11,972 1,115 Updated Jun 10, 2025

paplhjak / Facial-Age-Estimation-Benchmark

Comparative Analysis of Deep Learning Approaches for Facial Age Estimation. Accepted to CVPR 2024

Python 58 3 Updated Oct 22, 2024

MAZiqing / FEDformer

Python 726 140 Updated Aug 16, 2023

theolepage / ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

Jupyter Notebook 19 2 Updated Jan 18, 2022

ShannonAI / ChineseBert

Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"

Python 558 93 Updated Jul 26, 2023

wangwang110 / CSC

ChineseBert for Chinese spelling correction

Python 41 2 Updated Mar 14, 2023

yochaiye / LipVoicer

Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"

Python 68 8 Updated Sep 19, 2024

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,517 9,007 Updated May 30, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 47,781 5,258 Updated Jun 18, 2025

HarlanCheung / MATH6005-2023

Matrix theory homework

TeX 3 Updated Dec 21, 2023

Goodman PussyCat0700

Lists (1)

🚀 My stack

Stars

MoonshotAI / Kimi-Audio-Evalkit

PussyCat0700 / DiVISe

TsinghuaC3I / Awesome-RL-Reasoning-Recipes

anitaweng / SP-FaceVC

dhimasryan / STOI-Net

basiclab / FaceVC-Pytorch

Levent9 / Zero-shot-FaceVC

LqNoob / Neural-Codec-and-Speech-Language-Models

facebookresearch / large_concept_model

lucidrains / vector-quantize-pytorch

BigdogManLuo / HEFTcom24

SWivid / F5-TTS

ranchlai / awesome-speaker-embedding

leisongju / unidubbing

yangdongchao / AcademiCodec

jishengpeng / WavTokenizer

kale4eat / nisqalib

xixi219 / MOS

stefantaubert / mel-cepstral-distance