Stars
An open-source AI agent that brings the power of Gemini directly into your terminal.
ROS2-Gazebo simulation package leveraging Mid360 and FASTLIO for navigation.
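YOLOv5 real-time smoke detection system
YOLOv5-based head, person, and helmet detection on construction sites: an object-detection system that recognizes safety helmets and restricted danger zones on construction sites. 🚀😆 Includes a very detailed tutorial on training YOLOv5 on your own dataset. 🚀😆 Visual interface added in 2021.3 ❗❗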
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Train embodied agents that can answer questions in environments
Hybrid A* Path Planner for the KTH Research Concept Vehicle
[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
A generative world for general-purpose robotics & embodied AI learning.
A simple demo of yolov5s running on rk3588/3588s using Python (about 72 frames/s).
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
ViPlanner: Visual Semantic Imperative Learning for Local Navigation
YOLOV5 semi-automatic annotation tool (Based on labelImg)
The project is a multi-threaded inference demo of Yolo running on the RK3588 platform, which has been adapted for reading video files and camera feeds. The demo uses the Yolov8n model for file infe…
YOLO ROS: Real-Time Object Detection for ROS
Deep learning for image processing, including classification, object detection, etc.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.