- legged_gym Public
Forked from leggedrobotics/legged_gym
Isaac Gym Environments for Legged Robots
Python Other Updated Jan 16, 2025
- Information_Directed_Sampling Public
Forked from szrlee/Information_Directed_Sampling
Implementation of Russo and Van Roy work on Information Directed Sampling (2017)
Python Updated Jan 14, 2025
- CUHKSZ-CSC4005 Public
Forked from tonyyxliu/CUHKSZ-CSC4005
Project Materials for CUHK(SZ) Course CSC4005: Parallel Programming
C++ MIT License Updated Dec 11, 2024
- QT Public
Forked from charleshsc/QT
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
Python Apache License 2.0 Updated Nov 10, 2024
- diffusion_policy Public
Forked from real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
Python MIT License Updated Jul 29, 2024
- AgentGym Public
Forked from WooooDyy/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
Python MIT License Updated Jun 12, 2024
- terrain-generator Public
Forked from leggedrobotics/terrain-generator
Python MIT License Updated May 31, 2024
- DeepSpeed Public
Forked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 Updated Apr 9, 2024
- LMFlow Public
Forked from OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Python Apache License 2.0 Updated Apr 8, 2024
- LaMo-2023 Public
Forked from srzer/LaMo-2023
Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".
Python MIT License Updated Mar 27, 2024
- trl Public
Forked from huggingface/trl
Train transformer language models with reinforcement learning.
Python Apache License 2.0 Updated Mar 13, 2024
- AgentVerse Public
Forked from OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
JavaScript Apache License 2.0 Updated Jan 18, 2024
- alfworld Public
Forked from alfworld/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Python MIT License Updated Jan 5, 2024
- overcooked_ai Public
Forked from HumanCompatibleAI/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
Jupyter Notebook MIT License Updated Dec 12, 2023
- mbpo_pytorch Public
Forked from Xingyu-Lin/mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
Python Updated Nov 10, 2023
- EfficientZero Public
Forked from YeWR/EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
Python GNU General Public License v3.0 Updated Sep 12, 2023
- tianshou Public
Forked from thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Python MIT License Updated Aug 9, 2023
- CORL Public
Forked from tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Python Apache License 2.0 Updated Aug 3, 2023
- IVR Public
Forked from ryanxhr/IVR
Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
Python MIT License Updated Jul 27, 2023
- DeFog Public
Forked from hukz18/DeFog
Code release for the ICLR 2023 conference paper "DeFog: Decision Transformer under Random Frame Dropping"
Python MIT License Updated Jul 10, 2023
- pan-motion-retargeting Public
Forked from hlcdyy/pan-motion-retargeting
Code for the paper "Pose-aware Attention Network for Flexible Motion Retargeting by Body Part" (TVCG 2023)
Python Updated Jun 25, 2023
- Metaworld Public
Forked from Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Python MIT License Updated Jun 2, 2023
- RORL Public
Forked from YangRui2015/RORL
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
Python MIT License Updated Feb 15, 2023
- slbo Public
Forked from roosephu/slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Python Other Updated Jan 23, 2023
- CHER Public
Forked from mengf1/CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
Python Updated Sep 23, 2022
- div-hindsight Public
Forked from TianhongDai/div-hindsight
This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay" [PRICAI 2021].
Python MIT License Updated Sep 19, 2022