zchoi (Haonan Zhang) / Starred · GitHub
🎯
Focusing

Highlights

  • Pro

Official implementation of the paper "Reliable Few-shot Learning under Dual Noises"

Python 3 Updated Apr 11, 2025

Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Python 160 3 Updated May 30, 2025
Python 80 2 Updated May 24, 2025
Jupyter Notebook 370 24 Updated Sep 26, 2024

CVPR2025

Jupyter Notebook 9 Updated Apr 28, 2025

Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications)

Python 30 Updated May 31, 2025

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 426 31 Updated Apr 28, 2025

An example RLDS dataset builder for X-embodiment dataset conversion.

Python 169 190 Updated Jul 11, 2024

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 642 106 Updated Mar 28, 2025

GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization

Python 129 6 Updated Apr 6, 2025

[ACL25] Official codebase for "OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction" 🔥

4 Updated Feb 22, 2025

Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)

Python 51 2 Updated Jun 4, 2025

How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective (EMNLP 2024)

Python 5 Updated Dec 7, 2024

Official Repository of Absolute Zero Reasoner

Python 1,452 240 Updated Jun 2, 2025

[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

C 188 21 Updated Mar 26, 2025

Awesome RL-based LLM Reasoning

510 27 Updated May 4, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,129 255 Updated Jun 4, 2025

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 136 7 Updated May 9, 2025

[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents

Python 30 3 Updated May 20, 2025

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Python 2 Updated May 16, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 14,037 1,728 Updated Jun 4, 2025

[NeurIPS 2024] Agent Planning with World Knowledge Model

Python 139 11 Updated Dec 17, 2024

TTRL: Test-Time Reinforcement Learning

Python 589 43 Updated May 23, 2025

Powerful menu bar manager for macOS

Swift 19,511 345 Updated Jan 26, 2025

🚀 [Large Models] 🌏 Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour!

Python 3,721 372 Updated Apr 27, 2025

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Python 107 10 Updated May 5, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,549 184 Updated Jun 4, 2025

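"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment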

Jupyter Notebook 16,210 1,795 Updated May 29, 2025