-
Nanjing University of Posts and Telecommunications
- 江苏省南京市
- https://wild-firefox.github.io
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
This project implements a reinforcement learning (RL) framework for autonomous driving within the CARLA simulator.
Massively Parallel Deep Reinforcement Learning. 🔥
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Code for "Temporal Difference Learning for Model Predictive Control"
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈
Honor of Kings AI Open Environment of Tencent
FinRL®: Financial Reinforcement Learning. 🔥
FinRL®-Meta: Dynamic datasets and market environments for FinRL.
FinRL® Tutorials. Please star.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)
Official implementation of HARL algorithms based on PyTorch.
Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
Simple A3C implementation with pytorch + multiprocessing
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
A PyTorch Platform for Distributed RL
Multiagent Reinforcement Learning Research Project
Fully open reproduction of DeepSeek-R1
Generative Agents: Interactive Simulacra of Human Behavior