rookielyb / Starred · GitHub

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Python 206 21 Updated Jul 9, 2025

OpenDeepWiki is the open-source version of the DeepWiki project, aiming to provide a powerful knowledge management and collaboration platform. The project is mainly developed using C# and TypeScrip…

C# 1,440 195 Updated Jul 11, 2025

The true story of the heartache and darkness behind the development of the Noah Pangu large model.

10,782 1,329 Updated Jul 9, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,417 83 Updated Jul 11, 2025
Python 585 50 Updated Apr 15, 2025

Go ahead and axolotl questions

Python 9,878 1,069 Updated Jul 12, 2025

A project to improve skills of large language models

Python 457 84 Updated Jul 12, 2025

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 227 10 Updated Jun 4, 2025

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

527 37 Updated Jun 6, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,506 1,515 Updated Jun 26, 2025

[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

Python 57 2 Updated Mar 27, 2024

Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"

Jupyter Notebook 314 35 Updated Dec 28, 2023

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,232 1,168 Updated Jul 9, 2025

Hammer: Highly Agile Masks Made Effortlessly from RTL

Python 284 65 Updated May 16, 2025
Python 267 18 Updated Jun 10, 2025

The evaluation benchmark on MCP servers

Python 149 7 Updated May 21, 2025

Model Context Protocol Servers

TypeScript 59,051 6,829 Updated Jul 11, 2025

Lightweight coding agent that runs in your terminal

Rust 30,764 3,535 Updated Jul 12, 2025

Function Calling Benchmark & Testing

Jupyter Notebook 87 5 Updated Jul 10, 2024

Complex Function Calling Benchmark.

Python 117 13 Updated Jan 20, 2025

Distributed RL System for LLM Reasoning

Python 1,987 115 Updated Jul 10, 2025

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,835 299 Updated May 21, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 641 38 Updated May 27, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,424 61 Updated May 11, 2025

A Database of Real Faults and an Experimental Infrastructure to Enable Controlled Experiments in Software Engineering Research

Perl 853 338 Updated Jun 6, 2025
Python 729 47 Updated May 30, 2025

Adds Sequence Parallelism to LLaMA-Factory

Python 526 36 Updated Jul 8, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,760 351 Updated Jul 8, 2025

Fully open data curation for reasoning models

Python 1,967 166 Updated Jun 5, 2025

Interpretation, extension, and reproduction of the DeepSeek series of work.

Python 661 53 Updated Mar 29, 2025