8000 Moore-Tian (Tian Muzhao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Moore-Tian's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Moore-Tian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion

Python 65 8 Updated May 8, 2025

zero-shot voice conversion & singing voice conversion, with real-time support

Python 2,517 289 Updated Apr 20, 2025

LaTeX thesis template for Fudan University

TeX 923 220 Updated Dec 8, 2024
Python 300 29 Updated Apr 13, 2023

liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project

Python 499 85 Updated Dec 12, 2023

in preparation...

Python 393 70 Updated Oct 14, 2024

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 11 Updated Jul 15, 2023

[IPMI'23] Diffusion Model based Semi-supervised Learning on Brain Hemorrhage Images for Efficient Midline Shift Quantification

Python 14 1 Updated Apr 12, 2023

Full code for the paper "Incorporating Task-Specific Structural Knowledge into CNNs for Brain Midline Shift Detection"

Python 13 1 Updated Aug 19, 2019

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Python 367 65 Updated Jul 21, 2024

[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Python 227 20 Updated Mar 18, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,804 664 Updated May 24, 2025

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 728 102 Updated Feb 15, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 82,186 9,914 Updated May 13, 2025

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Python 316 64 Updated Nov 11, 2020

A PyTorch-based Speech Toolkit

Python 9,857 1,497 Updated May 23, 2025

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 968 165 Updated Jul 5, 2023

ASVtorch Toolkit: Speaker Verification with Deep-Neural Networks. To cite this software publication: https://www.sciencedirect.com/science/article/pii/S235271102100042X

Python 6 1 Updated Apr 27, 2021

So-VITS-SVC 本地部署使用帮助文档,提供Colab笔记本 So-VITS-SVC Local Deployment Document and provide Colab notebook

Jupyter Notebook 714 107 Updated Mar 31, 2025

SoftVC VITS Singing Voice Conversion

Python 27,113 4,989 Updated Nov 11, 2023

[CSUR] A Survey on Video Diffusion Models

2,096 108 Updated Mar 31, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,442 261 Updated May 17, 2025

🐸 collection of TTS papers

689 72 Updated Jul 4, 2024

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,307 914 Updated Jul 6, 2023

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,448 139 Updated Jul 11, 2024

Foundational model for human-like, expressive TTS

Python 4,120 687 Updated Jul 30, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,263 789 Updated Mar 15, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,988 688 Updated Aug 13, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 46,728 5,142 Updated Apr 25, 2025

SOTA Open Source TTS

Python 21,195 1,696 Updated Apr 12, 2025
Next
0