Starred repositories
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
No fortress, purely open ground. OpenManus is Coming.
OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.
Official Pytorch implementations of CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition(IJCV)
CMMLU: Measuring massive multitask language understanding in Chinese
A framework for few-shot evaluation of language models.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Toolkit for linearizing PDFs for LLM datasets/training
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Sky-T1: Train your own O1 preview model within $450
The off AA3F icial repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
DeepFaceLab is the leading software for creating deepfakes.
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
永久免费开源的 AIGC 课程, 目前已支持Prompt Engineering, ChatGPT, Midjourney, Runway, Stable Diffusion, AI数字人,AI声音&音乐,开源大模型
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
DeepSeek-VL: Towards Real-World Vision-Language Understanding
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)