10000 LightningLeader / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View LightningLeader's full-sized avatar

Highlights

  • Pro

Block or report LightningLeader

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
1,177 86 Updated 10000 May 27, 2024

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

391 20 Updated Sep 12, 2024

Fancy Gym: Unifying interface for various RL benchmarks with support for Black Box approaches.

Python 37 17 Updated Apr 17, 2024
Python 10 Updated May 29, 2024

Examples of how to create colorful, annotated equations in Latex using Tikz.

TeX 3,838 219 Updated Jul 12, 2022
Python 6 3 Updated Aug 10, 2023

We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Python 23 3 Updated Feb 10, 2024

MineWorld: A Real-time interactive world model on Minecraft

Python 321 25 Updated May 8, 2025

一款专注于Ai翻译的工具,一键自动翻译RPG SLG游戏,Epub TXT小说,Srt Vtt Lrc字幕,Word MD文档等等复杂长文本。

Python 2,423 148 Updated May 14, 2025

[ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.

Python 46 2 Updated Apr 4, 2025

Robust Reinforcement Learning Suite

Python 29 Updated Dec 24, 2024

DSGBench is a game benchmark designed to evaluate LLM agents across diverse, dynamic environments, including games like StarCraft II and Werewolf. It tests agents' abilities in decision-making, str…

Python 9 Updated May 11, 2025

An environment based on JSBSIM aimed at one-to-one close air combat.

Python 354 115 Updated Apr 20, 2025

Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.

Python 7,430 1,336 Updated May 15, 2025

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.

TypeScript 26,301 2,257 Updated May 15, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 20,824 2,433 Updated Apr 30, 2025

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

Python 762 85 Updated Apr 20, 2025
Jupyter Notebook 22 Updated Sep 23, 2024

frp跨平台桌面客户端,可视化配置,轻松实现内网穿透! 支持所有frp版本

Vue 5,540 385 Updated Apr 30, 2025

Benchmarking RL generalization in an interpretable way.

Python 156 13 Updated Mar 10, 2025

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 577 51 Updated Apr 20, 2025

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,203 147 Updated Aug 3, 2023

Collect some World Models for Autonomous Driving (and Robotic) papers.

970 34 Updated May 13, 2025
Python 11 Updated Dec 12, 2024

pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行

Jupyter Notebook 20,921 5,431 Updated Jul 25, 2024

Natural Language Reinforcement Learning

Python 87 9 Updated Dec 19, 2024

Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).

Python 153 17 Updated Aug 21, 2024

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 2,866 297 Updated Apr 30, 2025
Next
0