8000 WXONE / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View WXONE's full-sized avatar
🇦🇮
Out sick
🇦🇮
Out sick

Block or report WXONE

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for PerAct², a language-conditioned imitation learning agent designed for bimanual robotic manipulation using the RLBench environment. It includes dataset generation, training scripts, and eva…

Python 81 7 Updated Feb 23, 2025

RoboBrain 2.0: Advanced version of RoboBrain. See Better. Think Harder. Do Smarter. 🎉🎉🎉

Python 417 33 Updated Jul 14, 2025

AudioLDM training, finetuning, evaluation and inference.

Python 262 52 Updated Dec 13, 2024

Text-to-Audio/Music Generation

Python 2,468 200 Updated Sep 29, 2024

Pybullet and libero data collection related code.

1 Updated Jun 27, 2025

Official Code for RVT-2 and RVT

Jupyter Notebook 359 53 Updated Feb 14, 2025

PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 758 42 Updated Jul 15, 2025

Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers

Python 103 7 Updated May 19, 2025

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,146 157 Updated Jul 1, 2025

RLBench_ACT: Running ALoha ACT and Diffusion Policy in the RLBench Framework

C 93 8 Updated Apr 18, 2025

GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data

140 2 Updated May 7, 2025

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Python 1,319 97 Updated Jul 16, 2025

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 650 38 Updated Oct 22, 2024

A gym environment for ALOHA

Python 135 40 Updated Apr 2, 2025
Python 53 1 Updated Jan 13, 2025

🔥🔥🔥 专注于YOLO11,YOLOv8、TYOLOv12、YOLOv10、RT-DETR、YOLOv7、YOLOv5改进模型,Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀

Python 2,776 451 Updated Apr 7, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 21,482 2,182 Updated Jul 5, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,318 255 Updated Jun 12, 2025

Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory

Python 280 11 Updated Jun 20, 2025

Interactive Post-Training for Vision-Language-Action Models

Python 92 5 Updated Jun 4, 2025

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 823 100 Updated Sep 30, 2021

The official code base of Accommodating Audio Modality in CLIP for Multimodal Processing

Python 6 1 Updated Jan 16, 2024

A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Python 303 26 Updated May 28, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,046 229 Updated Jul 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 11,029 1,827 Updated Jul 16, 2025

Deep Reinforcement Learning for mobile robot navigation in ROS2 Gazebo simulator. Using DRL (SAC, TD3) neural networks, a robot learns to navigate to a random goal point in a simulated environment …

Python 111 11 Updated Jan 30, 2025

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Python 169 8 Updated Jun 25, 2025

《Python编程:从入门到实践》- python3.5

Jupyter Notebook 17 5 Updated May 7, 2020

Python编程 从入门到实践

Python 99 49 Updated Sep 5, 2018

classic books of computer science!

1,762 794 Updated May 29, 2024
Next
0