8000 lafmdp (Jing-Cheng Pang) · GitHub

More Web Proxy on the site http://driver.im/

lafmdp

Follow

🎯

Focusing

Jing-Cheng Pang lafmdp

🎯

Focusing

Follow

Ph.D. student at Nanjing University. Interested in reinforcement learning.

41 followers · 15 following

Nanjing University
NanJing, Jiangsu, China
13:55 (UTC +08:00)
https://www.lamda.nju.edu.cn/pangjc

Achievements

Achievements

Pinned Loading

Awesome-Papers-Autonomous-Agent Awesome-Papers-Autonomous-Agent Public

A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.

696 59
LAMDA-RL/ImagineBench LAMDA-RL/ImagineBench Public

A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.

Python 7
HIDIL HIDIL Public

[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"

Python 12 1
RLC RLC Public

[ICLR'24] Official code for "Language Model Self-improvement by Reinforcement Learning Contemplation".

Jupyter Notebook 7
TALAR TALAR Public

Forked from ppq12138/TALAR

[NeurIPS'23] Official code for "Natural Language-conditioned Reinforcement Learning with Task-related Language Development and Translation", NeurIPS 2023.

Python 2
KALM KALM Public

Forked from CharlieBrown-v1/KALM

[NeurIPS‘24] KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

1

0