gyxxyg

🏠

Working from home

Yongxin Guo gyxxyg

🏠

Working from home

Ph.D. Student at CUHKSZ; Research Engineer at Alibaba

49 followers · 69 following

https://gyxxyg.github.io/yongxinguo/

Achievements

Stars

ruixin31 / Rethink_RLVR

Python 232 13 Updated May 27, 2025

Paper2Poster / Paper2Poster

Open-source Multi-agent Poster Generation from Papers

Python 1,681 71 Updated Jun 4, 2025

TsinghuaC3I / MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 92 3 Updated May 30, 2025

hanyang1999 / discrete-diffusion-papers

A collection of papers on discrete diffusion models

124 2 Updated Jun 4, 2025

appletea233 / LLaVA-ST

[CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

43 1 Updated Feb 27, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 46,467 8,131 Updated Jun 3, 2025

LengSicong / MMR1

MMR1: Advancing the Frontiers of Multimodal Reasoning

159 5 Updated Mar 17, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 961 61 Updated May 28, 2025

camel-ai / owl

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 16,808 1,980 Updated Jun 4, 2025

mbzuai-oryx / Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,709 123 Updated Jun 2, 2025

Wang-Xiaodong1899 / CVPR25-MLLM-Paper-List

🔥CVPR 2025 Multimodal Large Language Models Paper List

143 4 Updated Mar 12, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 591 20 Updated Mar 18, 2025

Fancy-MLLM / R1-Onevision

R1-onevision, a visual language model capable of deep CoT reasoning.

Python 525 14 Updated Apr 13, 2025

KellerJordan / Muon

Muon: An optimizer for hidden layers in neural networks

Python 679 34 Updated May 27, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,958 1,382 Updated May 27, 2025

A980 maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 2,138 190 Updated Apr 9, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,408 610 Updated May 27, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

Cuda 11,586 837 Updated Apr 29, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,540 182 Updated Jun 4, 2025

lucasjinreal / Namo-R1

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

Python 203 20 Updated Apr 22, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,065 309 Updated May 11, 2025

SkyworkAI / MoE-plus-plus

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 222 7 Updated Oct 16, 2024

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,800 278 Updated May 15, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,949 101 Updated Jun 2, 2025

GAIR-NLP / LIMR

Python 202 8 Updated Feb 20, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 14,039 1,936 Updated Jun 4, 2025

TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 765 46 Updated May 14, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,983 1,132 Updated Jun 4, 2025

ZJU-LLMs / Foundations-of-LLMs

10,857 938 Updated Jan 14, 2025

modelscope / awesome-deep-reasoning

Collect every awesome work about r1!

Python 374 12 Updated May 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yongxin Guo gyxxyg

Achievements

Achievements

Block or report gyxxyg

Stars

ruixin31 / Rethink_RLVR

Paper2Poster / Paper2Poster

TsinghuaC3I / MARTI

hanyang1999 / discrete-diffusion-papers

appletea233 / LLaVA-ST

FoundationAgents / OpenManus

LengSicong / MMR1

bytedance / flux

camel-ai / owl

mbzuai-oryx / Awesome-LLM-Post-training

Wang-Xiaodong1899 / CVPR25-MLLM-Paper-List

turningpoint-ai / VisualThinker-R1-Zero

Fancy-MLLM / R1-Onevision

KellerJordan / Muon

Wan-Video / Wan2.1

A980 maitrix-org / llm-reasoners

deepseek-ai / DeepGEMM

deepseek-ai / FlashMLA

hiyouga / EasyR1

lucasjinreal / Namo-R1

om-ai-lab / VLM-R1

SkyworkAI / MoE-plus-plus

deepseek-ai / open-infra-index

Open-Reasoner-Zero / Open-Reasoner-Zero

GAIR-NLP / LIMR

huggingface / trl

TideDra / lmm-r1

volcengine / verl

ZJU-LLMs / Foundations-of-LLMs

modelscope / awesome-deep-reasoning