yuanjiayiy (Jiayi "Carrie" Yuan) / Starred · GitHub


Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"

Python 78 9 Updated May 22, 2025

Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"

Python 1 Updated Jun 12, 2025

Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"

Python 75 14 Updated Jan 23, 2025
JavaScript 3,292 1,317 Updated Jun 21, 2024

A reusable framework for successor features for transfer in deep reinforcement learning using Keras.

Python 44 11 Updated May 11, 2021

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Python 1,094 173 Updated Jul 18, 2024

Awesome Open-ended AI

291 31 Updated Jun 10, 2025

A Survey Analyzing Generalization in Deep Reinforcement Learning

34 Updated Oct 31, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,187 89 Updated Feb 19, 2025

OMNI: Open-endedness via Models of human Notions of Interestingness

Python 50 11 Updated Jan 28, 2025

A curated list of awesome advice for computer science Ph.D. applicants.

295 15 Updated Sep 12, 2021

RL Environments in JAX 🌍

Python 764 80 Updated May 30, 2025

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 7,244 773 Updated Apr 8, 2025

Multi-Agent Reinforcement Learning with Stable-Baselines3

Python 20 3 Updated Dec 3, 2021

Initial InvestESG environment implementation forked from pettingzoo

Jupyter Notebook 1 1 Updated Mar 30, 2025

Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas

Python 49 11 Updated Dec 8, 2022

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,767 295 Updated May 27, 2025

Robotic AI bare code. This is designed as a shared submodule of other projects. Try other repos that expose clearer interfaces (rai-python, robotics-course) first.

C++ 101 49 Updated Jun 11, 2025

Official codebase for Human Guided Exploration (HuGE)

Python 21 1 Updated Aug 16, 2023