zchoi

🎯

Focusing

Haonan Zhang zchoi

🎯

Focusing

Ph.D. student. Research Interests: LLM-Agents, Vision-Language.

75 followers · 73 following

UESTC | TongYi Laboratory
Sichuan ⇌ Beijing
15:30 (UTC +08:00)
https://zchoi.github.io/

Achievements

Highlights

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

JimZAI / DETA-plus

Official implementation of the paper "Reliable Few-shot Learning under Dual Noises"

Python 3 Updated Apr 11, 2025

diankun-wu / Spatial-MLLM

Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Python 160 3 Updated May 30, 2025

haoni0812 / MDA

Python 80 2 Updated May 24, 2025

google-research / rlds

Jupyter Notebook 370 24 Updated Sep 26, 2024

TyroneLi / CUA_O3D

CVPR2025

Jupyter Notebook 9 Updated Apr 28, 2025

vaew / Awesome-spatial-visual-reasoning-MLLMs

Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)

Python 30 Updated May 31, 2025

moojink / openvla-oft

Forked from openvla/openvla

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 426 31 Updated Apr 28, 2025

kpertsch / rlds_dataset_builder

An example RLDS dataset builder for X-embodiment dataset conversion.

Python 169 190 Updated Jul 11, 2024

simpler-env / SimplerEnv

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 642 106 Updated Mar 28, 2025

aiming-lab / GRAPE

GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization

Python 129 6 Updated Apr 6, 2025

zchoi / OmniCharacter

[ACL25] Official codebase for "OmniCharacter:Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction" 🔥

4 Updated Feb 22, 2025

shiqichen17 / VLM_Merging

Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)

Python 51 2 Updated Jun 4, 2025

tengxiao1 / GSIL

How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective (EMNLP 2024)

Python 5 Updated Dec 7, 2024

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,452 240 Updated Jun 2, 2025

OSU-NLP-Group / LLM-Planner

[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

C 188 21 Updated Mar 26, 2025

bruno686 / Awesome-RL-based-LLM-Reasoning

Awesome RL-based LLM Reasoning

510 27 Updated May 4, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,129 255 Updated Jun 4, 2025

yfzhang114 / r1_reward

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 136 7 Updated May 9, 2025

MozerWang / AMPO

[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents

Python 30 3 Updated May 20, 2025

zhushiyun88 / teaching-boyfriend-llm

383 21 Updated May 8, 2025

deepseek-ai / DeepSeek-Prover-V2

1,133 79 Updated Apr 30, 2025

RainBowLuoCS / GUI-R1

Forked from ritzz-ai/GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Python 2 Updated May 710C 16, 2025