-
Southern University of Science and Technology
Stars
DtYXs / verl
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
PyTorch code and models for V-JEPA self-supervised learning from video.
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"
The official Python SDK for Model Context Protocol servers and clients
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
verl: Volcano Engine Reinforcement Learning for LLMs
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI & Cluely (And other Open Sourced) System Prompts, Tools & AI Models.
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
The Open Source Memory Layer For Autonomous Agents
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Code implementation for paper "A-mem: Agentic Memory for LLM Agents"
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
About Awesome things towards foundation agents. Papers / Repos / Blogs / ...
We propose a pioneering benchmark to evaluate LLM agents' ability to improve over time in streaming scenarios