8000 Robertwyq (王宇琪) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Robertwyq's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Robertwyq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 32,956 2,435 Updated Jun 27, 2025

Unified Vision-Language-Action Model

Python 74 1 Updated Jun 27, 2025

We introduce CausalVQA, a benchmark dataset for video question answering (VQA) composed of question-answer pairs that probe models’ understanding of causality in the physical world.

Python 23 2 Updated Jun 10, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 1,570 126 Updated Jun 20, 2025

Drive-Pi0 and DriveMoE on End-to-end Autonomous Driving

Python 52 5 Updated Jun 3, 2025

Open-source unified multimodal model

Python 4,360 363 Updated Jun 17, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,229 1,498 Updated Jun 26, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,321 193 Updated Jun 17, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 9,131 881 Updated Jun 24, 2025

End-to-End Driving with Online Trajectory Evaluation via BEV World Model

Python 90 4 Updated Apr 15, 2025

Embodied Chain of Thought: A robotic policy that reason to solve the task.

Python 269 11 Updated Apr 5, 2025

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Python 1,275 82 Updated Jun 27, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,093 356 Updated Mar 24, 2025

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 537 14 Updated May 22, 2025

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 14,840 1,306 Updated Jun 27, 2025

NVIDIA Isaac GR00T N1.5 is the world's first open foundation model for generalized humanoid robot reasoning and skills.

Jupyter Notebook 4,266 585 Updated Jun 18, 2025

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 526 25 Updated Jun 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,054 1,655 Updated Jun 27, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 15,306 2,036 Updated Jun 27, 2025

Embodied Reasoning Question Answer (ERQA) Benchmark

Python 170 7 Updated Mar 12, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 16,826 1,742 Updated Jun 7, 2025

Fully open reproduction of DeepSeek-R1

Python 24,898 2,313 Updated Jun 26, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,364 160 Updated Mar 20, 2025

Official implementation of Diffusion Policy Policy Optimization, arxiv 2024

Python 520 49 Updated Feb 4, 2025

[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

Python 798 56 Updated Jun 17, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,957 255 Updated Jun 16, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,027 516 Updated Jun 9, 2025
Next
0