Stars
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization
Paper list of multi-agent reinforcement learning (MARL)
Approaching (Almost) Any Machine Learning Problem
Mastering Diverse Domains through World Models
Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
A minimal implementation of Go-Explore without domain knowledge
Generative Planning Method (ICLR22)
Agent Learning Framework https://alf.readthedocs.io
Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
Everything you need about Active Learning (AL).
Code for Go-Explore: a New Approach for Hard-Exploration Problems
RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven explora…
Random Network Distillation pytorch
Simple Cartpole example writed with pytorch.
A curated list of awesome exploration RL resources (continually updated)
Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.