8000 wild-firefox (Gu Jiacheng) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View wild-firefox's full-sized avatar
💭
e-mail:wild_firefox@outlook.com
💭
e-mail:wild_firefox@outlook.com

Highlights

  • Pro

Block or report wild-firefox

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This project implements a reinforcement learning (RL) framework for autonomous driving within the CARLA simulator.

Jupyter Notebook 1 Updated Jun 19, 2025

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,069 930 Updated May 8, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,964 1,865 Updated Jun 16, 2025

Distributed RL System for LLM Reasoning

Python 1,869 100 Updated Jun 26, 2025

Code for "Temporal Difference Learning for Model Predictive Control"

Python 442 67 Updated Nov 25, 2023

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Python 569 130 Updated May 21, 2025

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 1,955 395 Updated Jun 4, 2025

mcc_second_guandan

Python 86 21 Updated Nov 17, 2022
Python 7 3 Updated Mar 3, 2022

Honor of Kings AI Open Environment of Tencent

Python 743 86 Updated Jul 17, 2024

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 12,006 2,816 Updated Jun 24, 2025

FinRL­®-Meta: Dynamic datasets and market environments for FinRL.

Python 1,578 681 Updated Jun 2, 2025

FinRL® Tutorials. Please star.

Jupyter Notebook 1,034 392 Updated Mar 28, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37,704 6,525 Updated Jun 26, 2025

Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)

Python 27 3 Updated Jun 8, 2022

Official implementation of HARL algorithms based on PyTorch.

Python 731 96 Updated Apr 27, 2025

A collection of MCP servers.

57,752 4,431 Updated Jun 22, 2025

Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".

Python 632 91 Updated Jul 16, 2022

Simple A3C implementation with pytorch + multiprocessing

Python 643 145 Updated Mar 10, 2023

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 56,923 7,917 Updated Jun 25, 2025

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 2,302 191 Updated Mar 13, 2025

A PyTorch Platform for Distributed RL

Python 747 115 Updated Sep 15, 2021

RL implementations

Jupyter Notebook 1,135 179 Updated Jun 26, 2025

Multiagent Reinforcement Learning Research Project

Python 211 37 Updated Oct 17, 2024

Fully open reproduction of DeepSeek-R1

Python 24,884 2,310 Updated Jun 26, 2025

Generative Agents: Interactive Simulacra of Human Behavior

19,203 2,581 Updated Aug 5, 2024
Next
0