wen2cheng (cheng wen) / Starred · GitHub

cheng wen wen2cheng

5 followers · 16 following

https://scholar.google.com/citations?user=9MLB3s8AAAAJ&hl=zh-CN

Stars

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 54,154 6,633 Updated Jul 11, 2025

HITsz-TMG / Awesome-Large-Multimodal-Reasoning-Models

The development and future prospects of multimodal reasoning models.

434 18 Updated Jul 6, 2025

LLMBook-zh / LLMBook-zh.github.io

"Large Language Models" (book). Authors: Xin Zhao, Junyi Li, Kun Zhou, Tianyi Tang, Ji-Rong Wen

Python 3,786 277 Updated Mar 31, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 3,964 266 Updated Jun 21, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

Python 15,129 1,605 Updated Jul 7, 2025

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

305 16 Updated Nov 28, 2024

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,354 172 Updated Mar 28, 2025

GAIR-NLP / O1-Journey

O1 Replication Journey

1,992 66 Updated Jan 14, 2025

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

HTML 8,179 767 Updated Oct 16, 2024

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,520 847 Updated May 15, 2025

LLaVA-VL / LLaVA-NeXT

Python 4,021 378 Updated Jun 13, 2025

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,762 245 Updated Jul 11, 2025

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.

Python 39,016 3,210 Updated Jul 11, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,923 130 Updated Oct 30, 2024

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 9,998 873 Updated Jun 18, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,529 1,517 Updated Jun 26, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal conversational model approaching GPT-4o performance.

Python 8,543 659 Updated May 29, 2025

X-PLUG / MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 4,447 455 Updated Jul 3, 2025

X2FD / LVIS-INSTRUCT4V

133 Updated Dec 22, 2023

THUDM / CogVLM

A state-of-the-art-level open visual language model | multimodal pretrained model

Python 6,611 430 Updated May 29, 2024

icoz69 / StableLLAVA

Official repo for StableLLAVA

Python 95 9 Updated Dec 22, 2023

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,798 1,028 Updated Jul 11, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 52,107 8,669 Updated Jul 13, 2025

esbatmop / MNBVC

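MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.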

3,914 280 Updated Jul 9, 2025

HillZhang1999 / llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

1,026 53 Updated Nov 21, 2024

thu-coai / Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs.

1,047 84 Updated Feb 27, 2024

X-PLUG / CValues

Research on evaluating and aligning the values of Chinese large language models.

Python 525 20 Updated Jul 20, 2023

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,502 121 Updated Jun 13, 2024

lifeiteng / vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,155 327 Updated Jun 23, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 146,870 29,625 Updated Jul 12, 2025
