JingMog (JsingMog) / Starred · GitHub
Python 430 37 Updated Jun 26, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,408 312 Updated May 13, 2025

[AAAI'25 Oral] "RFL: Simplifying Chemical Structure Recognition with Ring-Free Language".

Python 1 Updated Jan 20, 2025

Official PyTorch implementation of our paper "Multimodal Tree Decoder for Table of Contents Extraction in Document Images"

Python 8 2 Updated Apr 16, 2023

Code for paper: MATHS: Multimodal Transformer-based Human-readable Solver

Python 1 Updated Feb 26, 2024

A pipeline for the automatic construction of geometry problems along with step-by-step solutions.

Python 13 1 Updated Jun 26, 2025

The official implementation of our AAAI 2025 paper, DocMamba

Python 4 1 Updated Apr 29, 2025

Official implementation of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation

Python 229 15 Updated Mar 30, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,332 718 Updated Jun 27, 2025

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 622 13 Updated Jun 26, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,072 1,661 Updated Jun 28, 2025

Witness the aha moment of VLM with less than $3.

Python 3,806 289 Updated May 19, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,364 160 Updated Mar 20, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,757 380 Updated Jun 18, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,220 812 Updated May 15, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,410 2,237 Updated Feb 1, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,919 1,758 Updated Feb 26, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,659 1,016 Updated Jun 26, 2025

Flash浏览器 / Flash Browser

C# 3,736 195 Updated Jun 17, 2025

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 214 9 Updated Jan 9, 2025

[ICASSP 2025] "Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention"

Python 21 Updated Apr 27, 2025
Python 8 Updated Jun 7, 2025
16 Updated Jan 1, 2025

This code is an example of how to use the deplot model provided by the authors together with an LLM in your own Python files.

1 Updated Sep 6, 2023

Official Implementation of our paper "RFL: Simplifying Chemical Structure Recognition with Ring-Free Language", accepted by AAAI 2025.

Python 2 Updated Dec 11, 2024

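A hand-written convolutional neural network kernel for node classification and link prediction on graphs, with experiments on three datasets (cora, citeseer, ppi) and an analysis of how self-loops, number of layers, DropEdge, PairNorm, activation functions, and other factors affect the model's classification and prediction performance.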

Python 15 3 Updated May 10, 2023

[AAAI'25 Oral] "RFL: Simplifying Chemical Structure Recognition with Ring-Free Language".

Python 16 3 Updated Jun 14, 2025