Southeast University, Nanjing
Stars
Code for the paper "Preserving Diversity in Supervised Fine-tuning of Large Language Models"
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor https://arxiv.org/pdf/2507.02592
Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
Train your Agent model via our easy and efficient framework
Official Repository of Absolute Zero Reasoner
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Official Repository of "Learning to Reason under Off-Policy Guidance"
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
An open protocol enabling communication and interoperability between opaque agentic applications.
Official Repo for Open-Reasoner-Zero
A curated list of 120+ LLM libraries, organized by category.
A live-streamed development of RL tuning for LLM agents
No fortress, purely open ground. OpenManus is Coming.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Making large AI models cheaper, faster and more accessible
Fully open reproduction of DeepSeek-R1
Witness the aha moment of VLM with less than $3.