AgentNetworkProtocol(ANP) is an open source protocol for agent communication. Our vision is to define how agents connect with each other, building an open, secure, and efficient collaboration netwo…

HTML 815 50 Updated Jun 2, 2025

Mondego / dafny-synthesis

[FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods

Dafny 48 Updated Jun 9, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,393 1,230 Updated Jun 13, 2025

mbzuai-oryx / Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,736 123 Updated Jun 5, 2025

qhliu26 / awesome-time-series-analysis

📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)

36 4 Updated Mar 11, 2025

SkyRiver-2000 / RuleArena

Codes and data for our paper - RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Python 9 Updated Jun 11, 2025

RayeRen / acad-homepage.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,016 4,023 Updated Jun 10, 2025

SkyRiver-2000 / TRAD-Official

[SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

Python 19 1 Updated Mar 28, 2024

b-k / py1040

A U.S. personal income tax calculator

Python 336 25 Updated Apr 21, 2024

pliang279 / awesome-phd-advice

Collection of advice for prospective and current PhD students

1,786 134 Updated Jul 10, 2024

leobeeson / llm_benchmarks

A collection of benchmarks and datasets for evaluating LLM.

459 29 Updated Jul 13, 2024

normster / llm_rules

RuLES: a benchmark for evaluating rule-following in language models

Python 226 15 Updated Feb 24, 2025

eth-sri / language-model-arithmetic

Controlled Text Generation via Language Model Arithmetic

Python 221 14 Updated Sep 15, 2024

Wuziyi616 / Graduate_Application

Documents used for grad school application

303 21 Updated Jul 6, 2021

tianyi-lab / Cherry_LLM

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 372 24 Updated Sep 6, 2024

OpenMOSS / Say-I-Dont-Know

[ICML'2024] Can AI Assistants Know What They Don't Know?

Python 81 9 Updated Feb 5, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 28,775 3,399 Updated Jan 26, 2025

OpenBMB / Eurus

Python 317 14 Updated Sep 18, 2024

FoundationAgents / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 56,359 6,745 Updated May 16, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,485 1,515 Updated Apr 29, 2025

THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,609 184 Updated Jan 30, 2025

web-arena-x / webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,024 158 Updated Feb 7, 2025

LlamaFamily / Llama-Chinese

Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用

Python 14,605 1,305 Updated Apr 6, 2025

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,788 473 Updated Oct 12, 2024

Farama-Foundation / miniwob-plusplus

MiniWoB++: a web interaction benchmark for reinforcement learning

HTML 321 53 Updated May 5, 2025

princeton-nlp / WebShop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 357 74 Updated Sep 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ruiwen Zhou SkyRiver-2000

Achievements