Stars
Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Community list of startups working with AI in audio and music technology
Awesome speech/audio LLMs, representation learning, and codec models
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
[NAACL 2025 Findings] Code for "Perception Compressor: A Training-Free Prompt Compression Framework in Long Context Scenarios"
手写实现李航《统计学习方法》书中全部算法
Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
Code examples in pyTorch and Tensorflow for CS230
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
小火箭 shadowrocket 配置文件 模块 脚本 module sgmodule 图文教程 规则 分流 破解 解锁
提供多款 Shadowrocket 规则,带广告过滤功能。用于 iOS 未越狱设备选择性地自动翻墙。
清华大学计算机系考研攻略 Guidance for postgraduate entrance examination in Department of Computer Science and Technology, Tsinghua University