Lists (1)
Sort Name ascending (A-Z)
Stars
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
Fancy Gym: Unifying interface for various RL benchmarks with support for Black Box approaches.
Examples of how to create colorful, annotated equations in Latex using Tikz.
We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effectively control these agents through verbal communication.
MineWorld: A Real-time interactive world model on Minecraft
一款专注于Ai翻译的工具,一键自动翻译RPG SLG游戏,Epub TXT小说,Srt Vtt Lrc字幕,Word MD文档等等复杂长文本。
[ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.
DSGBench is a game benchmark designed to evaluate LLM agents across diverse, dynamic environments, including games like StarCraft II and Werewolf. It tests agents' abilities in decision-making, str…
An environment based on JSBSIM aimed at one-to-one close air combat.
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab
Benchmarking RL generalization in an interpretable way.
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Collect some World Models for Autonomous Driving (and Robotic) papers.
pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行
Natural Language Reinforcement Learning
Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe