wild-firefox (Gu Jiacheng) / Starred · GitHub

wild-firefox

E-mail: wild_firefox@outlook.com

Gu Jiacheng wild-firefox

9 followers · 4 following

Nanjing University of Posts and Telecommunications
Nanjing, Jiangsu Province, China
https://wild-firefox.github.io

Highlights

Pro

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

htliang517 / CARLA_RL

This project implements a reinforcement learning (RL) framework for autonomous driving within the CARLA simulator.

Jupyter Notebook 1 Updated Jun 19, 2025

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,069 930 Updated May 8, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,964 1,865 Updated Jun 16, 2025

inclusionAI / AReaL

Distributed RL System for LLM Reasoning

Python 1,869 100 Updated Jun 26, 2025

nicklashansen / tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

Python 442 67 Updated Nov 25, 2023

nicklashansen / tdmpc2

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Python 569 130 Updated May 21, 2025

TradeMaster-NTU / TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 1,955 395 Updated Jun 4, 2025

AltmanD / guandan_mcc

mcc_second_guandan

Python 86 21 Updated Nov 17, 2022

submit-paper / 3v3Snakes

Python 7 3 Updated Mar 3, 2022

tencent-ailab / hok_env

Honor of Kings AI Open Environment of Tencent

Python 743 86 Updated Jul 17, 2024

AI4Finance-Foundation / FinRL

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 12,006 2,816 Updated Jun 24, 2025

AI4Finance-Foundation / FinRL-Meta

FinRL®-Meta: Dynamic datasets and market environments for FinRL.

Python 1,578 681 Updated Jun 2, 2025

AI4Finance-Foundation / FinRL-Tutorials

FinRL® Tutorials. Please star.

Jupyter Notebook 1,034 392 Updated Mar 28, 2025

PandaAI-Tech / panda_factor

Python 1,209 45 Updated Jun 19, 2025

24mlight / a-share-mcp-is-just-i-need

Python 279 44 Updated May 12, 2025

24mlight / A_Share_investment_Agent

Python 1,787 490 Updated Jun 22, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37,704 6,525 Updated Jun 26, 2025

seolhokim / DistributedRL-Pytorch-Ray

Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)

Python 27 3 Updated Jun 8, 2022

PKU-MARL / HARL

Official implementation of HARL algorithms based on PyTorch.

Python 731 96 Updated Apr 27, 2025

punkpeye / awesome-mcp-servers

A collection of MCP servers.

57,752 4,431 Updated Jun 22, 2025

starry-sky6688 / MADDPG

PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".

Python 632 91 Updated Jul 16, 2022

MorvanZhou / pytorch-A3C

Simple A3C implementation with pytorch + multiprocessing

Python 643 145 Updated Mar 10, 2023

Metro1998 / hppo-in-traffic-signal-control

Python 57 3 Updated May 9, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 56,923 7,917 Updated Jun 25, 2025

opendilab / PPOxFamily

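PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to help you sort out the algorithm theory, untangle the code logic, and put decision AI into practice)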

Python 2,302 191 Updated Mar 13, 2025

facebookresearch / torchbeast

A PyTorch Platform for Distributed RL

Python 747 115 Updated Sep 15, 2021

Denys88 / rl_games

RL implementations

Jupyter Notebook 1,135 179 Updated Jun 26, 2025

binary-husky / hmp2g

Multiagent Reinforcement Learning Research Project

Python 211 37 Updated Oct 17, 2024

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,884 2,310 Updated Jun 26, 2025

joonspk-research / generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

19,203 2,581 Updated Aug 5, 2024
