-
codec-bpe Public
Forked from AbrahamSanders/codec-bpe
Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs
Python MIT License Updated Sep 26, 2024 -
overseas-website-note Public
Forked from princehuang/overseas-website-note
"Overseas tool websites" have become the main work of my life. I am glad I still made it in time, and grateful for this great AI era.
Updated Sep 5, 2024 -
llm-datasets Public
Forked from mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
Updated Aug 11, 2024 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Python Apache License 2.0 Updated Jun 24, 2024 -
Awesome-LLMs-meet-Multimodal-Generation Public
Forked from YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
HTML Updated Jun 15, 2024 -
Mantis Public
Forked from TIGER-AI-Lab/Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
Python Apache License 2.0 Updated Jun 4, 2024 -
MoneyPrinterTurbo Public
Forked from harry0703/MoneyPrinterTurbo
Generate high-definition short videos with one click using large AI models.
Python MIT License Updated May 5, 2024 -
Bunny Public
Forked from BAAI-DCAI/Bunny
A family of lightweight multimodal models.
Python Apache License 2.0 Updated Apr 24, 2024 -
lina-speech Public
Forked from theodorblackbird/lina-speech
lina-speech: linear attention based text-to-speech
Jupyter Notebook Other Updated Apr 24, 2024 -
pyvideotrans Public
Forked from jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing.
Python GNU General Public License v3.0 Updated Apr 11, 2024 -
audio-pipeline Public
Forked from pengzhendong/audio-pipeline
Python Apache License 2.0 Updated Apr 6, 2024 -
Awesome-LLMs-Datasets Public
Forked from lmmlzn/Awesome-LLMs-Datasets
Summarize existing representative LLMs text datasets.
Apache License 2.0 Updated Apr 6, 2024 -
FRESCO Public
Forked from williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Jupyter Notebook Other Updated Apr 4, 2024 -
pytorch-speech-features Public
Forked from apple/pytorch-speech-features
Python Other Updated Apr 2, 2024 -
VoiceCraft Public
Forked from jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Python Other Updated Mar 22, 2024 -
awesome-audio-plaza Public
Forked from metame-ai/awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
MIT License Updated Mar 11, 2024 -
ConsistI2V Public
Forked from TIGER-AI-Lab/ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Python MIT License Updated Mar 9, 2024 -
SoraReview Public
Forked from lichao-sun/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
Updated Mar 8, 2024 -
Open-Sora Public
Forked from hpcaitech/Open-Sora
Building your own video generation model like OpenAI's Sora
Python Apache License 2.0 Updated Mar 8, 2024 -
EVA Public
Forked from baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Python MIT License Updated Mar 8, 2024 -
M2UGen Public
Forked from shansongliu/MuMu-LLaMA
This is the official repository for M2UGen
Jupyter Notebook MIT License Updated Mar 7, 2024 -
ai-audio-startups Public
Forked from csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
Apache License 2.0 Updated Mar 6, 2024 -
snac Public
Forked from hubertsiuzdak/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Python MIT License Updated Mar 5, 2024 -
AudioEditingCode Public
Forked from HilaManor/AudioEditingCode
Python Creative Commons Attribution Share Alike 4.0 International Updated Mar 4, 2024 -
metavoice-src Public
Forked from metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Python Apache License 2.0 Updated Mar 1, 2024