lidpeng (Dapeng) / Starred · GitHub

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python 256 29 Updated Nov 27, 2023

An attempt at implementing DeepNash

Python 5 2 Updated Apr 7, 2024

Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Python 18,831 1,892 Updated Apr 30, 2024

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1,341 77 Updated May 13, 2025

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

Python 308 29 Updated Oct 22, 2024

Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization

Python 143 20 Updated May 16, 2024

Paper list of multi-agent reinforcement learning (MARL)

4,346 751 Updated Oct 17, 2024

Approaching (Almost) Any Machine Learning Problem

7,933 1,111 Updated Mar 25, 2023

Mastering Diverse Domains through World Models

Python 1,831 309 Updated Apr 11, 2025

Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021

Python 65 13 Updated May 22, 2021

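Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-translation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…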

Python 68,477 8,344 Updated May 6, 2025

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Jupyter Notebook 68 9 Updated Jul 29, 2021

A minimal implementation of Go-Explore without domain knowledge

Python 15 5 Updated Apr 26, 2021

Generative Planning Method (ICLR22)

Python 8 1 Updated Apr 27, 2022

Agent Learning Framework https://alf.readthedocs.io

Python 322 54 Updated May 14, 2025

Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022

Jupyter Notebook 18 2 Updated Jul 10, 2023
Python 343 55 Updated Oct 12, 2022

Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.

Python 68 25 Updated Oct 21, 2023

Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization

Python 81 5 Updated Apr 13, 2023
Python 42 4 Updated Apr 13, 2023

Everything you need about Active Learning (AL).

884 80 Updated Jun 1, 2024

Code for Go-Explore: a New Approach for Hard-Exploration Problems

Python 566 101 Updated Dec 8, 2022

RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven explora…

Jupyter Notebook 394 19 Updated Apr 4, 2025

Random Network Distillation pytorch

Python 247 46 Updated Mar 4, 2019

Simple Cartpole example written with PyTorch.

Python 167 23 Updated Oct 29, 2019

A curated list of awesome exploration RL resources (continually updated)

474 14 Updated Feb 7, 2025

Code for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"

Python 10 1 Updated Jun 24, 2022

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 951 134 Updated Mar 17, 2025

Check out the new game server:

Python 3,438 1,317 Updated Sep 3, 2024