8000 ignaciocases (Ignacio Cases) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ignaciocases's full-sized avatar

Block or report ignaciocases

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The Core Flight System (cFS)

CMake 932 257 Updated May 29, 2025

Preempt-RT Kernel Build Guide for NVIDIA Development Board

Shell 22 6 Updated Jun 21, 2024

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 387 28 Updated Apr 30, 2025

NeXT hardware emulator for a NeXT Cube and NeXT Station. Mirrored from SourceForge

C 81 13 Updated Dec 12, 2017

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 334 62 Updated May 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,192 6,180 Updated May 30, 2025

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 3,817 475 Updated May 28, 2025

Train transformer language models with reinforcement learning.

Python 13,972 1,922 Updated May 30, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,895 672 Updated May 30, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 754 101 Updated May 20, 2025

A project that provides help for using DeepMind's mctx on gym-style environments.

Python 60 11 Updated Nov 14, 2024

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 351 73 Updated Sep 6, 2024
Python 11 1 Updated Jun 29, 2021

Recources to build the MFOS - Noise Toaster Synth by Ray Wilson

HTML 12 1 Updated Mar 25, 2024

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 272 26 Updated May 26, 2024

A library for advanced large language model reasoning

Python 2,135 190 Updated Apr 9, 2025

An extensible benchmark for evaluating large language models on planning

PDDL 374 39 Updated Apr 24, 2025

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,383 156 Updated May 30, 2025

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,737 1,391 Updated Nov 4, 2024

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

Python 41 4 Updated Sep 19, 2022

Monte Carlo tree search in JAX

Python 2,491 203 Updated Apr 10, 2025

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python 900 139 Updated Dec 20, 2023

An implementation of MuZero in JAX.

Python 56 8 Updated Nov 8, 2022

MuZero

Python 2,649 649 Updated Sep 3, 2024

Dream to Control: Learning Behaviors by Latent Imagination

Python 642 75 Updated Jul 14, 2020

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python 2,573 481 Updated Apr 29, 2024

Really Fast End-to-End Jax RL Implementations

Python 877 69 Updated Sep 9, 2024

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Python 313 31 Updated May 26, 2025

RL Environments in JAX 🌍

Python 758 76 Updated May 30, 2025

Repository for the paper EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning (EACL'24)

5 Updated Mar 15, 2024
Next
0