8000 fengpeng-yue / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View fengpeng-yue's full-sized avatar
  • Southern University of Science and Technology

Block or report fengpeng-yue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 1 1 Updated Jun 3, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,109 304 Updated Feb 27, 2025

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

578 26 Updated Nov 29, 2024

The official Python SDK for Model Context Protocol servers and clients

Python 15,362 1,916 Updated Jun 29, 2025
Python 6 3 Updated Dec 19, 2024
Python 35 3 Updated Feb 19, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 58,518 5,786 Updated Jun 30, 2025

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 363 74 Updated Sep 6, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,116 1,668 Updated Jun 30, 2025

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 479 63 Updated Jan 6, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,219 701 Updated Jun 19, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 459 32 Updated Jun 30, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 2,462 214 Updated Jun 23, 2025

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 318 33 Updated Aug 6, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 57,485 7,984 Updated Jun 28, 2025

AllenAI's post-training codebase

Python 3,031 409 Updated Jun 30, 2025

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI & Cluely (And other Open Sourced) System Prompts, Tools & AI Models.

63,090 18,538 Updated Jun 28, 2025

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,003 65 Updated May 16, 2025

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,130 445 Updated May 21, 2025

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

1,024 54 Updated Apr 24, 2024

The Open Source Memory Layer For Autonomous Agents

Jupyter Notebook 2,263 167 Updated Oct 22, 2024

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,709 206 Updated Jun 20, 2025

A-MEM: Agentic Memory for LLM Agents

Python 440 54 Updated Jun 27, 2025

Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

Python 35,674 3,637 Updated Jun 28, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 46,640 7,081 Updated Jun 30, 2025

Code implementation for paper "A-mem: Agentic Memory for LLM Agents"

Python 462 36 Updated May 25, 2025

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 19,633 1,916 Updated Jun 28, 2025

About Awesome things towards foundation agents. Papers / Repos / Blogs / ...

1,476 145 Updated Jun 8, 2025

We propose a pioneering benchmark to evaluate LLM agents' ability to improve over time in streaming scenarios

Python 47 7 Updated Oct 28, 2024

End-to-End Speech Processing Toolkit

Python 9,248 2,293 Updated Jun 20, 2025
Next
0