-
University of Cambridge
- Cambridge, UK
- enjeeneer.io
- @enjeeneer
Highlights
- Pro
-
-
-
TinyZero Public
Forked from JerryWu-code/TinyZeroDeepseek R1 zero tiny version own reproduce on two A100s.
Python Apache License 2.0 UpdatedFeb 6, 2025 -
zero-shot-rl Public
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
-
amago Public
Forked from UT-Austin-RPL/amagoa simple and scalable agent for training adaptive policies with sequence-based RL
Python MIT License UpdatedSep 17, 2024 -
popjaxrl Public
Forked from luchris429/popjaxrlBenchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
Python UpdatedDec 5, 2023 -
PEARL Public
Code for "Low Emissions Building Control with Zero-Shot Reinforcement Learning" (AAAI 2023)
-
sutton_and_barto Public
Notes and solutions to exercises in Sutton and Barto's Reinforcement Learning textbook
-
sac Public
Forked from denisyarats/pytorch_sacPyTorch implementation of Soft Actor-Critic (SAC)
Python MIT License UpdatedSep 29, 2022 -
-
beobench Public
Forked from rdnfn/beobenchA toolkit providing easy and unified access to building control environments for reinforcement learning (RL).
Python MIT License UpdatedMar 30, 2022 -
-
HowQuicklyCanWeGetBackToThePub Public
Forked from ashwinahuja/HowQuicklyCanWeGetBackToThePubAn investigation into the impacts of the N501Y variant, population immunity and vaccine distribution strategies on COVID-19 spread
Jupyter Notebook MIT License UpdatedJan 13, 2021