8000 agangzz (melodyless) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View agangzz's full-sized avatar

Block or report agangzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A repo that builds text to music datasets from scratch

Python 10 Updated May 20, 2025

国家中小学智慧教育平台 电子课本下载工具,帮助您从智慧教育平台中获取电子课本的 PDF 文件网址并进行下载,让您更方便地获取课本内容。

Python 2,048 215 Updated May 18, 2025

所有小初高、大学PDF教材。

Roff 27,903 5,986 Updated May 18, 2025

Python implementation for audio time-frequency automatic gain control

Python 81 22 Updated Feb 24, 2013

Hybrid Demucs model for drum separation

Shell 109 8 Updated Oct 9, 2024

Toward Deep Drum Source Separation

Python 63 4 Updated Sep 10, 2024

Deep Learning for Person Re-identification: A Survey and Outlook

Python 702 94 Updated Feb 17, 2025

The official repository for ICLR2025 paper "HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts"

Python 12 1 Updated Apr 11, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,149 203 Updated May 20, 2025

Pytorch implementation of SoundCTM

Python 94 8 Updated Mar 31, 2025

Pytorch implementation of SoundCTM-DiT

Jupyter Notebook 3 1 Updated Mar 31, 2025

This repository aims to collect Transformer-based sound event detection (SED) algorithms.

Python 58 3 Updated Apr 15, 2025
Python 11 Updated Mar 19, 2025
Python 50 5 Updated Apr 1, 2025

Music Genre Transfer and Prediction

Python 8000 2 Updated Mar 27, 2025

Source code for Consistent ensemble distillation for audio tagging

Python 31 5 Updated Jul 16, 2024

Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.

Python 490 29 Updated May 19, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 487 31 Updated May 1, 2025

🎧 Hybrid music recommendation with graph neural networks.

Python 2 Updated Jul 24, 2023

(WWW'24 + LinkedIn) The first RS that tightly combines LLM with ID-based RS

Python 144 16 Updated Aug 7, 2024

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 708 35 Updated Dec 19, 2024

Utilizes ONNX Runtime for audio denoising.

Python 49 8 Updated May 9, 2025

NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms

Python 994 104 Updated Apr 21, 2025

automatic audio labelling with laion-clap

Python 18 1 Updated Jun 20, 2024

It includes papers on speech&audio field. Now update: ICLR2025-2023, ICML2025-2023, NeurIPS2024-2023, ACMMM2024, AAAI2025-2024, ACL2024, EMNLP2024, NAACL2025, IJCAI2024

59 1 Updated May 16, 2025

Sylber: Syllabic Embedding Representation of Speech from Raw Audio

Jupyter Notebook 55 2 Updated Mar 17, 2025

PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.

Python 484 27 Updated Apr 29, 2025

Code for the paper "Songs Across Borders: Singable and Controllable Neural Lyric Translation"

Python 18 4 Updated Jul 19, 2023

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Python 1,630 172 Updated May 10, 2025
Next
0