10000 arnupretorius (Arnu Pretorius) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View arnupretorius's full-sized avatar

Block or report arnupretorius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 6 Updated Feb 24, 2025

Simple single-file baselines for Q-Learning in pure-GPU setting

Python 162 9 Updated Mar 19, 2025

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Python 168 16 Updated Apr 17, 2025

A python package for end-to-end geospatial machine learning using multispectral earth observation data such as NASA HLS and ESA Sentinel-2.

Python 24 21 Updated Feb 21, 2025
Python 92 11 Updated Jul 2, 2024

RewardBench: the first evaluation tool for reward models.

Python 568 71 Updated May 14, 2025

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 323 34 Updated May 4, 2025

SKAI is a machine learning based tool for performing automatic building damage assessments on aerial imagery of disaster sites.

Python 126 18 Updated May 15, 2025

JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️

Python 284 19 Updated Nov 16, 2024

Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]

Python 100 10 Updated Dec 5, 2023

A collection of MARL benchmarks based on TorchRL

Python 393 79 Updated May 13, 2025

🧭 COMPASS: Combinatorial Optimization with Policy Adaptation using Latent Space Search

Python 38 4 Updated Jun 21, 2024

Efficient baselines for autocurricula in JAX.

Python 187 15 Updated Aug 24, 2024

Multi-Agent Reinforcement Learning with JAX

Python 576 112 Updated May 13, 2025

Datasets with baselines for offline multi-agent reinforcement learning.

Python 167 14 Updated May 10, 2025

⚡ Flashbax: Accelerated Replay Buffers in JAX

Python 236 17 Updated Mar 27, 2025

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Python 57 5 Updated Oct 23, 2023

🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

Python 607 70 Updated May 12, 2025

A tool for aggregating and plotting MARL experiment data.

Python 77 6 Updated Jan 20, 2025

ESM2 protein language models in JAX/Flax

Python 17 5 Updated Oct 10, 2022

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 7,038 752 Updated Apr 8, 2025

Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function

Jupyter Notebook 13 2 Updated Nov 22, 2022

Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment

Python 367 81 Updated Sep 15, 2024

🧬 ManyFold: An efficient and flexible library for training and validating protein folding models

Python 80 9 Updated Dec 14, 2022

🌺 Population-Based Reinforcement Learning for Combinatorial Optimization

Python 73 15 Updated Feb 12, 2024

[NeurIPS 2022] Open source code for reusing prior computational work in RL.

Python 96 13 Updated Jul 5, 2023

A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers

51 8 Updated Jan 20, 2023

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 725 91 Updated May 7, 2025

Notebooks for the Practicals at the Deep Learning Indaba 2022.

Jupyter Notebook 175 44 Updated Apr 3, 2024
Python 22 18 Updated Nov 24, 2024
Next
0