Lists (2)
Sort Name ascending (A-Z)
Stars
An open-source AI agent that brings the power of Gemini directly into your terminal.
[ECCV 2024] 3D World Model for Autonomous Driving
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
A community-maintained Python framework for creating mathematical animations.
Official implementation of paper "AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning"
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Swift / Ultralytics…
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Latex code for making neural networks diagrams
❄️ Yun Portable Air Conditoner. 云空调,便携小空调,为你的夏日带去清凉!
😎 A curated list of awesome GitHub Profile which updates in real time
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🎓 Path to a free self-taught education in Computer Science!
Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory
The new spin-off of Visual Language Navigation.
A Python based lightweight robot simulator for the development of algorithms in robotics navigation, control, and learning.
Aligning Knowledge Graph with Visual Perception for Object-goal Navigation (ICRA 2024)
Falcon: A Remote Sensing Vision-Language Foundation Model
A Unreal Engine 5 (UE5) based plugin aiming to provide real-time visulization, management, editing, and scalable hybrid rendering of Guassian Splatting model.
A Unreal Engine 5 (UE5) based plugin aiming to provide real-time visulization, management, editing, and scalable hybrid rendering of Guassian Splatting model and Airsim Drones/Cars.
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.