-
westlake university
- Shilongshan Road No.18, Cloud Town, Xihu District, Hangzhou, Zhejiang, China.
Highlights
- Pro
Stars
Simulated experiments for "Real-Time Execution of Action Chunking Flow Policies".
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Interactive visualizations of the geometric intuition behind diffusion models.
🚀 A collection of utilities and tools for LeRobot.
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone", 线性代数的艺术中文版, 欢迎PR.
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
A comprehensive list of Implicit Representations, NeRF and 3D Gaussian Splatting papers relating to SLAM/Robotics domain, including papers, videos, codes, and related websites
3D Gaussian Splatting (3DGS) on fisheye cameras
A new markup-based typesetting system that is powerful and easy to learn.
A Python framework for accelerated simulation, data generation and spatial computing.
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
Embodied Reasoning Question Answer (ERQA) Benchmark
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"