Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,332 718 Updated Jun 27, 2025

Osilly / Vision-R1

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 622 13 Updated Jun 26, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,072 1,661 Updated Jun 28, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,806 289 Updated May 19, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,364 160 Updated Mar 20, 2025

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,757 380 Updated Jun 18, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,220 812 Updated May 15, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,410 2,237 Updated Feb 1, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,919 1,758 Updated Feb 26, 2025

deepseek-ai / DeepSeek-V3

Python 97,889 15,924 Updated Jun 27, 2025

deepseek-ai / DeepSeek-R1

90,291 11,652 Updated Jun 27, 2025

MoonshotAI / Kimi-k1.5

3,380 220 Updated Mar 7, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,659 1,016 Updated Jun 26, 2025

Mzying2001 / CefFlashBrowser

Flash Browser

C# 3,736 195 Updated Jun 17, 2025

tencent-ailab / MuQ

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 214 9 Updated Jan 9, 2025

WarmCongee / SDUMC

[ICASSP 2025] "Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention"

Python 21 Updated Apr 27, 2025

MuyeHuang / VProChart

Python 8 Updated Jun 7, 2025

MuyeHuang / EvoChart

16 Updated Jan 1, 2025

lllllps / Deplot-LLM

This code is an example of how to use the DePlot model provided by the authors together with an LLM in your own Python files.

1 Updated Sep 6, 2023

Pandaaaa906 / RFL-MSD

Forked from JingMog/RFL-MSD

Official Implementation of our paper "RFL: Simplifying Chemical Structure Recognition with Ring-Free Language", accepted by AAAI 2025.

Python 2 Updated Dec 11, 2024

chenchenzi718 / DL_GCN

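A hand-written convolutional neural network kernel for node classification and link prediction on graphs. Experiments run on three datasets (cora, citeseer, ppi) and analyze how self-loops, number of layers, DropEdge, PairNorm, activation functions, and other factors affect the model's classification and prediction performance.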

Python 15 3 Updated May 10, 2023

JingMog / RFL-MSD

[AAAI'25 Oral] "RFL: Simplifying Chemical Structure Recognition with Ring-Free Language".

Python 16 3 Updated Jun 14, 2025