lafmdp (Jing-Cheng Pang) / Starred · GitHub

[NeurIPS'24] KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

1 Updated Jan 20, 2025

A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.

Python 7 Updated May 27, 2025
Python 4 Updated Mar 26, 2025
Python 12 2 Updated May 20, 2025

Official Code Repository for 《InCLET: Large Language Model In-context Learning can Improve Embodied Instruction-following》

Python 3 1 Updated Mar 17, 2025

A live-stream development of RL tuning for LLM agents

Python 2,864 397 Updated May 23, 2025

BabyAI platform. A testbed for training agents to understand and execute language commands.

Python 728 151 Updated Oct 1, 2023

A large-scale benchmark and learning environment.

Python 1,392 270 Updated Jan 25, 2025
Python 6 1 Updated Mar 2, 2025

Pre-trained Models of BWArea Model

Python 9 Updated Sep 10, 2024

AgentSociety: Large-scale Social Simulation to Understand Human Behaviors and Society through LLM-driven Agents

Python 303 49 Updated May 20, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 16,391 1,683 Updated Apr 12, 2025

Supercharge yourself!

TypeScript 808 76 Updated May 27, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,253 324 Updated May 18, 2025
Python 7 3 Updated Apr 18, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,587 183 Updated Jan 30, 2025

A lightweight framework for building LLM-based agents

Python 2,134 218 Updated Mar 14, 2025

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Python 2,988 515 Updated May 22, 2025

Xiaomi Home Integration for Home Assistant

Python 19,861 1,015 Updated May 23, 2025

Train a 1B LLM with 1T tokens from scratch as an individual

Jupyter Notebook 660 70 Updated Apr 27, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,842 669 Updated May 27, 2025

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

3,074 285 Updated Dec 17, 2024

Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral.

Python 183 7 Updated Mar 22, 2025

Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"

Python 53 2 Updated Oct 10, 2024

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 3,383 262 Updated Jan 21, 2025

A collection of offline reinforcement learning algorithms.

Python 185 21 Updated Nov 26, 2024

[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 318 15 Updated Dec 22, 2024

An elegant PyTorch offline reinforcement learning library for researchers.

Python 337 38 Updated Apr 17, 2024