Stars
Code for the paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"
[ICML 2024] The official implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regularization method, in PyTorch
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
[AAMAS 2024 oral] Original PyTorch implementation of experiments in paper 'Normalization Enhances Generalization in Visual Reinforcement Learning'.
This is the repo of "RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization"
Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)
Code for the ICML 2023 paper "What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?".
[IJCAI'24 - FlagVNE] Implementation of our paper "FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation", accepted by IJCAI 2024.
A benchmark for offline goal-conditioned RL and offline RL
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Graph Attention Networks (https://arxiv.org/abs/1710.10903)
Deep reinforcement learning code for study. Currently covers only the following algorithms: DQN, C51, QR-DQN, IQN, QUOTA.
This repo implements our paper, "Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem"
This is the code for the paper "Learning Task-relevant Representations via Rewards and Real Actions for Reinforcement Learning".
Produce elegant shaded error bars in MATLAB
Code for https://jangirrishabh.github.io/lookcloser/
Official repository for the paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
Code for "Temporal Difference Learning for Model Predictive Control"
DMControl Generalization Benchmark
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Code for the paper "Deep Reinforcement Learning for Multiobjective Optimization".
Implementation of: Nazari, Mohammadreza, et al. "Deep Reinforcement Learning for Solving the Vehicle Routing Problem." arXiv preprint arXiv:1802.04240 (2018).
Reinforcement learning algorithms implemented for TensorFlow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]
Official implementation of paper "Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning"