ZeyueT / Starred · GitHub

[CVPR2024] Make Your Dream A Vlog

Python 426 46 Updated May 19, 2025

Video-GPT via Next Clip Diffusion.

Python 36 1 Updated Jun 2, 2025

Repository of AudioX

Python 1,005 105 Updated Apr 30, 2025

Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention

Python 39 2 Updated Apr 19, 2025

Latest Advances on System-2 Reasoning

Python 1,122 55 Updated Jun 8, 2025

Audio-FLAN

157 4 Updated Mar 6, 2025

Let's finetune video generation models!

Python 478 25 Updated May 12, 2025

TRO 2022 - QPEP: A C++/MATLAB library for solving generalized quadratic pose estimation problems and related uncertainty description

MATLAB 179 18 Updated Mar 9, 2024
Python 77 2 Updated Jun 7, 2025

Official implementation of the Law of Vision Representation in MLLMs

Python 155 8 Updated Nov 17, 2024
Jupyter Notebook 1 Updated Jul 11, 2024

TorchCFM: a Conditional Flow Matching library

Python 1,778 143 Updated Mar 11, 2025

Generative models for conditional audio generation

Python 3,338 353 Updated Jun 2, 2025

[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Python 31 Updated Feb 6, 2025

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

TypeScript 2,131 528 Updated Mar 17, 2025

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Python 330 38 Updated Apr 8, 2024

[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

Python 160 11 Updated Apr 30, 2024

Community list of startups working with AI in audio and music technology

1,663 154 Updated Jan 22, 2025
Python 259 32 Updated Apr 24, 2024

[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

Python 148 7 Updated Jul 6, 2024

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 479 28 Updated Apr 4, 2025

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

Jupyter Notebook 2,457 340 Updated Jun 19, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,061 4,521 Updated Aug 19, 2024

Some Conferences' accepted paper lists (including AI, ML, Robotic)

Python 1,231 78 Updated Jan 23, 2025

ChatGPT, GenerativeAI and LLMs Timeline

950 57 Updated May 19, 2024

Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.

Python 547 61 Updated Jun 3, 2023

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 9,516 1,145 Updated Oct 9, 2024

[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"

Python 1,324 93 Updated Mar 20, 2024