jiawei415

3 followers · 1 following

Highlights

RobustDecisionTransformer Public

Python 2 1 Updated Mar 20, 2025
grpo_try Public

Shell Updated Feb 28, 2025
legged_gym Public
Forked from leggedrobotics/legged_gym

Isaac Gym Environments for Legged Robots

Python Other Updated Jan 16, 2025
Information_Directed_Sampling Public
Forked from szrlee/Information_Directed_Sampling

Implementation of Russo and Van Roy's work on Information Directed Sampling (2017)

Python Updated Jan 14, 2025
CUHKSZ-CSC4005 Public
Forked from tonyyxliu/CUHKSZ-CSC4005

Project Materials for CUHK(SZ) Course CSC4005: Parallel Programming

C++ MIT License Updated Dec 11, 2024
QT Public
Forked from charleshsc/QT

ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning

Python Apache License 2.0 Updated Nov 10, 2024
diffusion_policy Public
Forked from real-stanford/diffusion_policy

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Python MIT License Updated Jul 29, 2024
AgentGym Public
Forked from WooooDyy/AgentGym

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python MIT License Updated Jun 12, 2024
terrain-generator Public
Forked from leggedrobotics/terrain-generator

Python MIT License Updated May 31, 2024
DeepSpeed Public
Forked from deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python Apache License 2.0 Updated Apr 9, 2024
LMFlow Public
Forked from OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python Apache License 2.0 Updated Apr 8, 2024
LaMo-2023 Public
Forked from srzer/LaMo-2023

Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".

Python MIT License Updated Mar 27, 2024
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Mar 13, 2024
VCP Public

Python 1 Updated Jan 24, 2024
AgentVerse Public
Forked from OpenBMB/AgentVerse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript Apache License 2.0 Updated Jan 18, 2024
alfworld Public
Forked from alfworld/alfworld

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python MIT License Updated Jan 5, 2024
overcooked_ai Public
Forked from HumanCompatibleAI/overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

Jupyter Notebook MIT License Updated Dec 12, 2023
mbpo_pytorch Public
Forked from Xingyu-Lin/mbpo_pytorch

A PyTorch replication of the model-based reinforcement learning algorithm MBPO

Python Updated Nov 10, 2023
UWMSG Public
Forked from YangRui2015/UWMSG

Python Updated Oct 18, 2023
EfficientZero Public
Forked from YeWR/EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python GNU General Public License v3.0 Updated Sep 12, 2023
tianshou Public
Forked from thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

Python MIT License Updated Aug 9, 2023
CORL Public
Forked from tinkoff-ai/CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python Apache License 2.0 Updated Aug 3, 2023
IVR Public
Forked from ryanxhr/IVR

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Python MIT License Updated Jul 27, 2023
DeFog Public
Forked from hukz18/DeFog

Code release for the ICLR 2023 conference paper "DeFog: Decision Transformer under Random Frame Dropping"

Python MIT License Updated Jul 10, 2023
pan-motion-retargeting Public
Forked from hlcdyy/pan-motion-retargeting

Code for the paper "Pose-aware Attention Network for Flexible Motion Retargeting by Body Part" (TVCG 2023)

Python Updated Jun 25, 2023
Metaworld Public
Forked from Farama-Foundation/Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python MIT License Updated Jun 2, 2023
RORL Public
Forked from YangRui2015/RORL

Code for the NeurIPS 2022 paper "Robust Offline Reinforcement Learning via Conservative Smoothing"

Python MIT License Updated Feb 15, 2023
slbo Public
Forked from roosephu/slbo

Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees

Python Other Updated Jan 23, 2023
CHER Public
Forked from mengf1/CHER

Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)

Python Updated Sep 23, 2022
div-hindsight Public
Forked from TianhongDai/div-hindsight

This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay" [PRICAI 2021].

Python MIT License Updated Sep 19, 2022
