-
Huazhong University of Science and Technology
Stars
MAGI-1: Autoregressive Video Generation at Scale
📚A curated list of Awesome LLM Inference Papers with Codes.
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Universal LLM Deployment Engine with ML Compilation
Awesome LLMs on Device: A Comprehensive Survey
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Mora: More like Sora for Generalist Video Generation
Open-Sora: Democratizing Efficient Video Production for All
SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
a state-of-the-art-level open visual language model | 多模态预训练模型
A simple and open-source analogue of the HeyGen system
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
✨✨Latest Advances on Multimodal Large Language Models
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.