8000 zhengyuan-xie (Zheng-Yuan Xie) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zhengyuan-xie's full-sized avatar
😅
😅

Block or report zhengyuan-xie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

849 38 Updated Jun 3, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Python 230 8 Updated Apr 27, 2025

AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation

Python 75 2 Updated Apr 1, 2025

[CVPR2025] Rethinking Query-based Transformer for Continual Image Segmentation

15 Updated Mar 3, 2025

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,315 599 Updated May 15, 2024
Python 364 10 Updated Apr 15, 2025

Lumina Robotics Talent Call | Lumina社区具身智能招贤榜 | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, full-time, etc

732 14 Updated Jun 6, 2025

official repo of paper: Drone Referring Localization: An Efficient Heterogeneous Spatial Feature Interaction Method For UAV Self-Localization

Python 53 5 Updated Mar 10, 2025

「TCSVT2021」A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization

Python 95 20 Updated Mar 7, 2024

「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

Python 134 14 Updated Sep 25, 2024

[AAAI2025] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints

Python 22 Updated May 28, 2025

[CVPR 25 Highlight & ECCV Workshop 24 Best Paper] RoboTwin Dual-arm Robot Manipulation Simulation Platform

Python 926 106 Updated May 27, 2025

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 1,238 126 Updated Jun 5, 2025
Python 10 Updated Jan 24, 2025
Python 3 Updated Feb 26, 2024

Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

Python 402 17 Updated May 29, 2025

Vision-and-Language Navigation in Continuous Environments using Habitat

Python 456 62 Updated Jan 7, 2025

Offical implementation of "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 19 1 Updated Jun 5, 2025

NLTK Data

Python 1,677 1,088 Updated Mar 10, 2025

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 13,110 980 Updated Jun 4, 2025

Compose multimodal datasets 🎹

Python 394 17 Updated Jun 1, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,723 6,251 Updated Jun 6, 2025

Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard".

C++ 27 2 Updated May 27, 2025

Grounding Image Matching in 3D with MASt3R

Python 2,218 181 Updated May 26, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,713 123 Updated Jun 5, 2025

[AAAI 2025] The official repository of our paper "GCD: Advancing Vision-Language Models for Incremental Object Detection via Global Alignment and Correspondence Distillation"

Python 8 Updated May 21, 2025

Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)

Python 13 Updated May 2, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,875 1,454 Updated May 29, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,232 185 Updated Jun 4, 2025
Next
0