8000 SkyRiver-2000 (Ruiwen Zhou) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View SkyRiver-2000's full-sized avatar

Block or report SkyRiver-2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,507 7,947 Updated Jun 13, 2025

Fully open reproduction of DeepSeek-R1

Python 24,771 2,292 Updated Jun 2, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,893 1,491 Updated Apr 24, 2025

Simple RL training for reasoning

Python 3,625 271 Updated Apr 10, 2025

AgentNetworkProtocol(ANP) is an open source protocol for agent communication. Our vision is to define how agents connect with each other, building an open, secure, and efficient collaboration netwo…

HTML 815 50 Updated Jun 2, 2025

[FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods

Dafny 48 Updated Jun 9, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,393 1,230 Updated Jun 13, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,736 123 Updated Jun 5, 2025

📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)

36 4 Updated Mar 11, 2025

Codes and data for our paper - RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Python 9 Updated Jun 11, 2025

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,016 4,023 Updated Jun 10, 2025

[SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

Python 19 1 Updated Mar 28, 2024

A U.S. personal income tax calculator

Python 336 25 Updated Apr 21, 2024

Collection of advice for prospective and current PhD students

1,786 134 Updated Jul 10, 2024

A collection of benchmarks and datasets for evaluating LLM.

459 29 Updated Jul 13, 2024

RuLES: a benchmark for evaluating rule-following in language models

Python 226 15 Updated Feb 24, 2025

Controlled Text Generation via Language Model Arithmetic

Python 221 14 Updated Sep 15, 2024

Documents used for grad school application

303 21 Updated Jul 6, 2021

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 372 24 Updated Sep 6, 2024

[ICML'2024] Can AI Assistants Know What They Don't Know?

Python 81 9 Updated Feb 5, 2024

The official Meta Llama 3 GitHub site

Python 28,775 3,399 Updated Jan 26, 2025
Python 317 14 Updated Sep 18, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 56,359 6,745 Updated May 16, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,485 1,515 Updated Apr 29, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,609 184 Updated Jan 30, 2025

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,024 158 Updated Feb 7, 2025

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

Python 14,605 1,305 Updated Apr 6, 2025

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,788 473 Updated Oct 12, 2024

MiniWoB++: a web interaction benchmark for reinforcement learning

HTML 321 53 Updated May 5, 2025

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 357 74 Updated Sep 6, 2024
Next
0