fengpeng-yue

fengpeng-yue

9 followers · 1 following

Southern University of Science and Technology

Achievements

Stars

DtYXs / verl

Forked from volcengine/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 1 1 Updated Jun 3, 2025

facebookresearch / jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,109 304 Updated Feb 27, 2025

apexrl / Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

578 26 Updated Nov 29, 2024

modelcontextprotocol / python-sdk

The official Python SDK for Model Context Protocol servers and clients

Python 15,362 1,916 Updated Jun 29, 2025

sanjibanc / leap_llm

Python 6 3 Updated Dec 19, 2024

sanjibanc / agent_prm

Python 35 3 Updated Feb 19, 2025

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 58,518 5,786 Updated Jun 30, 2025

princeton-nlp / WebShop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 363 74 Updated Sep 6, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,116 1,668 Updated Jun 30, 2025

alfworld / alfworld

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 479 63 Updated Jan 6, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,219 701 Updated Jun 19, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 459 32 Updated Jun 30, 2025

RUC-NLPIR / FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 2,462 214 Updated Jun 23, 2025

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 318 33 Updated Aug 6, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 57,485 7,984 Updated Jun 28, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 3,031 409 Updated Jun 30, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI & Cluely (And other Open Sourced) System Prompts, Tools & AI Models.

63,090 18,538 Updated Jun 28, 2025

Agent-RL / ReCall

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,003 65 Updated May 16, 2025

OpenBMB / ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,130 445 Updated May 21, 2025

taichengguo / LLM_MultiAgents_Survey_Papers

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

1,024 54 Updated Apr 24, 2024

kingjulio8238 / Memary

The Open Source Memory Layer For Autonomous Agents

Jupyter Notebook 2,263 167 Updated Oct 22, 2024

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,709 206 Updated Jun 20, 2025

agiresearch / A-mem

A-MEM: Agentic Memory for LLM Agents

Python 440 54 Updated Jun 27, 2025

mem0ai / mem0

Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

Python 35,674 3,637 Updated Jun 28, 2025

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 46,640 7,081 Updated Jun 30, 2025

WujiangXu / AgenticMemory

Code implementation for paper "A-mem: Agentic Memory for LLM Agents"

Python 462 36 Updated May 25, 2025

Fosowl / agenticSeek

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 19,633 1,916 Updated Jun 28, 2025

FoundationAgents / awesome-foundation-agents

About Awesome things towards foundation agents. Papers / Repos / Blogs / ...

1,476 145 Updated Jun 8, 2025

stream-bench / stream-bench

We propose a pioneering benchmark to evaluate LLM agents' ability to improve over time in streaming scenarios

Python 47 7 Updated Oct 28, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,248 2,293 Updated Jun 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fengpeng-yue

Achievements

Achievements

Block or report fengpeng-yue

Stars

DtYXs / verl

facebookresearch / jepa

apexrl / Diff4RLSurvey

modelcontextprotocol / python-sdk

sanjibanc / leap_llm

sanjibanc / agent_prm

infiniflow / ragflow

princeton-nlp / WebShop

volcengine / verl

alfworld / alfworld

OpenRLHF / OpenRLHF

langfengQ / verl-agent

RUC-NLPIR / FlashRAG

YuxiXie / MCTS-DPO

rasbt / LLMs-from-scratch

allenai / open-instruct

x1xhlol / system-prompts-and-models-of-ai-tools

Agent-RL / ReCall

OpenBMB / ToolBench

taichengguo / LLM_MultiAgents_Survey_Papers

kingjulio8238 / Memary

PeterGriffinJin / Search-R1

agiresearch / A-mem

mem0ai / mem0

microsoft / autogen

WujiangXu / AgenticMemory

Fosowl / agenticSeek

FoundationAgents / awesome-foundation-agents

stream-bench / stream-bench

espnet / espnet