- Northeastern University (China)
- Shenyang, Liaoning, China
Starred repositories
[IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
[CVPR 2025 Highlight] MonSter: Marry Monodepth to Stereo Unleashes Power
Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025)
[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Convert PDF to markdown + JSON quickly with high accuracy
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Reading list for research topics in embodied vision
Ideas and thoughts about the fascinating Vision-and-Language Navigation
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…
A fork to add multimodal model training to open-r1
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
R1-onevision, a visual language model capable of deep CoT reasoning.
From nobody to large language model (LLM) hero. Stay tuned for more!
WWW 2025 Multimodal Intent Recognition for Dialogue Systems Challenge
An MCP server for querying the technical documentation of mainstream agent frameworks (supports both stdio and SSE transport protocols); covers langchain, llama-index, autogen, agno, openai-agents-sdk, mcp-doc, camel-ai, and crew-ai
A lightweight, powerful framework for multi-agent workflows
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.