ignaciocases

Ignacio Cases ignaciocases

21 followers · 9 following

http://web.stanford.edu/~cases/

Achievements

Stars

nasa / cFS

The Core Flight System (cFS)

CMake 932 257 Updated May 29, 2025

hmxf / RTJetson

Preempt-RT Kernel Build Guide for NVIDIA Development Board

Shell 22 6 Updated Jun 21, 2024

THUDM / WebRL

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 387 28 Updated Apr 30, 2025

probonopd / previous

NeXT hardware emulator for a NeXT Cube and NeXT Station. Mirrored from SourceForge

C 81 13 Updated Dec 12, 2017

ServiceNow / AgentLab

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 334 62 Updated May 28, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,192 6,180 Updated May 30, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 3,817 475 Updated May 28, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 13,972 1,922 Updated May 30, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,895 672 Updated May 30, 2025

ServiceNow / BrowserGym

🌎💪 BrowserGym, a Gym environment for web task automation

Python 754 101 Updated May 20, 2025

bwfbowen / muax

A project that provides help for using DeepMind's mctx on gym-style environments.

Python 60 11 Updated Nov 14, 2024

princeton-nlp / WebShop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 351 73 Updated Sep 6, 2024

chrisgrimm / muzero

Python 11 1 Updated Jun 29, 2021

Tom-Obvious / MFOS-NoiseToaster

Forked from samzeter/noise-toaster

Recources to build the MFOS - Noise Toaster Synth by Ray Wilson

HTML 12 1 Updated Mar 25, 2024

waterhorse1 / LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 272 26 Updated May 26, 2024

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 2,135 190 Updated Apr 9, 2025

karthikv792 / LLMs-Planning

An extensible benchmark for evaluating large language models on planning

PDDL 374 39 Updated Apr 24, 2025

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,383 156 Updated May 30, 2025

google / dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,737 1,391 Updated Nov 4, 2024

hr0nix / omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

Python 41 4 Updated Sep 19, 2022

google-deepmind / mctx

Monte Carlo tree search in JAX

Python 2,491 203 Updated Apr 10, 2025

YeWR / EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python 900 139 Updated Dec 20, 2023

Hwhitetooth / jax_muzero

An implementation of MuZero in JAX.

Python 56 8 Updated Nov 8, 2022

werner-duvaud / muzero-general

MuZero

Python 2,649 649 Updated Sep 3, 2024

google-research / dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Python 642 75 Updated Jul 14, 2020

kzl / decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python 2,573 481 Updated Apr 29, 2024

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 877 69 Updated Sep 9, 2024

MichaelTMatthews / Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Python 313 31 Updated May 26, 2025

RobertTLange / gymnax

RL Environments in JAX 🌍

Python 758 76 Updated May 30, 2025

kinjalbasu / explorer

Repository for the paper EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning (EACL'24)

5 Updated Mar 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ignacio Cases ignaciocases

Achievements

Achievements

Block or report ignaciocases

Stars

nasa / cFS

hmxf / RTJetson

THUDM / WebRL

probonopd / previous

ServiceNow / AgentLab

hiyouga / LLaMA-Factory

PKU-Alignment / align-anything

huggingface / trl

OpenRLHF / OpenRLHF

ServiceNow / BrowserGym

bwfbowen / muax

princeton-nlp / WebShop

chrisgrimm / muzero

Tom-Obvious / MFOS-NoiseToaster

waterhorse1 / LLM_Tree_Search

maitrix-org / llm-reasoners

karthikv792 / LLMs-Planning

opendilab / LightZero

google / dopamine

hr0nix / omega

google-deepmind / mctx

YeWR / EfficientZero

Hwhitetooth / jax_muzero

werner-duvaud / muzero-general

google-research / dreamer

kzl / decision-transformer

luchris429 / purejaxrl

MichaelTMatthews / Craftax

RobertTLange / gymnax

kinjalbasu / explorer