tuteng0915 (Teng Tu) / Starred · GitHub
Python 1 1 Updated May 16, 2025

CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]

Python 156 7 Updated May 11, 2025

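Xiaohongshu notes | comment crawler, Douyin videos | comment crawler, Kuaishou videos | comment crawler, Bilibili videos | comment crawler, Weibo posts | comment crawler, Baidu Tieba posts | Baidu Tieba comment-reply crawler | Zhihu Q&A articles | comment crawler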

Python 23,009 6,429 Updated May 26, 2025

MT3: Multi-Task Multitrack Music Transcription

Python 1,535 205 Updated Mar 14, 2025
Python 2 Updated Jan 19, 2025

Towards Modality Generalization: A Benchmark and Prospective Analysis

Python 24 1 Updated May 22, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 4,413 295 Updated May 31, 2025

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Python 326 38 Updated Apr 8, 2024

[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

Python 158 11 Updated Apr 5, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,055 2,343 Updated Mar 13, 2025

ImageBind: One Embedding Space to Bind Them All

Python 8,663 812 Updated Jul 31, 2024

MU-LLaMA: Music Understanding Large Language Model

Python 275 22 Updated Mar 25, 2024

Evaluation functions for music/audio information retrieval/signal processing algorithms.

Python 646 117 Updated Feb 25, 2025

A curated list of Video to Audio Generation

43 2 Updated Apr 15, 2025

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Python 171 22 Updated Jul 30, 2024

Manually annotated chord data set of US pop songs and Popular Music Collection of RWC Music Database

Python 88 13 Updated Apr 9, 2013

SD-Trainer: LoRA & Dreambooth training scripts and GUI for diffusion models, using kohya-ss's trainer.

Python 5,357 639 Updated May 29, 2025

A large-scale dataset of caption-annotated MIDI files.

Python 65 3 Updated Jul 23, 2024
Jupyter Notebook 194 12 Updated Jul 5, 2024

This repository contains metadata of the WavCaps dataset and code for downstream tasks.

Python 231 12 Updated Jul 25, 2024

The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.

Jupyter Notebook 150 3 Updated Dec 22, 2023

Stable Diffusion web UI

Python 153,034 28,469 Updated May 3, 2025
Python 23 2 Updated Jan 16, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,253 6,194 Updated May 31, 2025

A curated list of awesome 3d generation papers

1,152 57 Updated Mar 9, 2023
Python 2 Updated Nov 24, 2023

Responsive Resume CV Website Using HTML, CSS, and JavaScript

HTML 301 173 Updated Mar 31, 2024