Nanjing University
Nanjing, Jiangsu, China
UTC +08:00
https://www.lamda.nju.edu.cn/pangjc
Lists (8)
🤖Autonomous Agent
Agent perceives its environment, takes actions autonomously to achieve goals, and may improve its performance with learning or acquiring knowledge.
Benchmark
Benchmark for experimental environments, algorithms, etc.
Efficiency
Implementing something with high efficiency.
Interesting tools 🔨
Some interesting tools.
Models🌲
Open-sourced foundation models, language models, multi-modal models.
Paper collections📚
Paper Implementation 📄
Released algorithm implementation.
Tutorial 📚
Tutorials or statistical list.
Stars
LAMDA-RL / KALM
Forked from CharlieBrown-v1/KALM
[NeurIPS'24] KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts
A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.
Official Code Repository for "InCLET: Large Language Model In-context Learning can Improve Embodied Instruction-following"
A live stream development of RL tuning for LLM agents
BabyAI platform. A testbed for training agents to understand and execute language commands.
A large-scale benchmark and learning environment.
Pre-trained Models of BWArea Model
AgentSociety: Large-scale Social Simulation to Understand Human Behaviors and Society through LLM-driven Agents
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Sky-T1: Train your own O1 preview model within $450
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
A lightweight framework for building LLM-based agents
SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?
Xiaomi Home Integration for Home Assistant
Train a 1B LLM with 1T tokens from scratch as an individual
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.
Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
A collection of offline reinforcement learning algorithms.
[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
An elegant PyTorch offline reinforcement learning library for researchers.