rookielyb

3 followers · 8 following

Stars

multi-swe-bench / multi-swe-bench

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Python 206 21 Updated Jul 9, 2025

AIDotNet / OpenDeepWiki

OpenDeepWiki is the open-source version of the DeepWiki project, aiming to provide a powerful knowledge management and collaboration platform. The project is mainly developed using C# and TypeScrip…

C# 1,440 195 Updated Jul 11, 2025

HW-whistleblower / True-Story-of-Pangu

The true story of the heartache and darkness behind the development of the Noah Pangu large model.

10,782 1,329 Updated Jul 9, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,417 83 Updated Jul 11, 2025

microsoft / rStar

Python 585 50 Updated Apr 15, 2025

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 9,878 1,069 Updated Jul 12, 2025

NVIDIA / NeMo-Skills

A project to improve skills of large language models

Python 457 84 Updated Jul 12, 2025

zwhe99 / DeepMath

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 227 10 Updated Jun 4, 2025

ByteDance-Seed / Seed-Coder

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

527 37 Updated Jun 6, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,506 1,515 Updated Jun 26, 2025

JoeYing1019 / UltraTool

[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

Python 57 2 Updated Mar 27, 2024

ofirpress / self-ask

Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"

Jupyter Notebook 314 35 Updated Dec 28, 2023

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,232 1,168 Updated Jul 9, 2025

ucb-bar / hammer

Hammer: Highly Agile Masks Made Effortlessly from RTL

Python 284 65 Updated May 16, 2025

qiancheng0 / ToolRL

Python 267 18 Updated Jun 10, 2025

modelscope / MCPBench

The evaluation benchmark on MCP servers

Python 149 7 Updated May 21, 2025

modelcontextprotocol / servers

Model Context Protocol Servers

TypeScript 59,051 6,829 Updated Jul 11, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 30,764 3,535 Updated Jul 12, 2025

ComposioHQ / Composio-Function-Calling-Benchmark

Function Calling Benchmark & Testing

Jupyter Notebook 87 5 Updated Jul 10, 2024

THUDM / ComplexFuncBench

Complex Function Calling Benchmark.

Python 117 13 Updated Jan 20, 2025

inclusionAI / AReaL

Distributed RL System for LLM Reasoning

Python 1,987 115 Updated Jul 10, 2025

aburkov / theLMbook

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,835 299 Updated May 21, 2025

0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 641 38 Updated May 27, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,424 61 Updated May 11, 2025

rjust / defects4j

A Database of Real Faults and an Experimental Infrastructure to Enable Controlled Experiments in Software Engineering Research

Perl 853 338 Updated Jun 6, 2025

Qihoo360 / Light-R1

Python 729 47 Updated May 30, 2025

Qihoo360 / 360-LLaMA-Factory

Forked from hiyouga/LLaMA-Factory

adds Sequence Parallelism into LLaMA-Factory

Python 526 36 Updated Jul 8, 2025

agentica-project / rllm

open-thoughts / open-thoughts

datawhalechina / unlock-deepseek