wyxscir / Starred · GitHub
🍒
  • beijing


A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

Python 180 4 Updated May 30, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 911 195 Updated May 30, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 596 36 Updated Jun 4, 2025

Open-source Multi-agent Poster Generation from Papers

Python 1,682 71 Updated Jun 4, 2025
Python 235 14 Updated May 27, 2025

Code for the paper: "Learning to Reason without External Rewards"

Python 243 21 Updated Jun 4, 2025

The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 102 4 Updated Jun 3, 2025
Python 8 Updated Jun 3, 2025

Code for paper "SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation"

5 Updated May 28, 2025

A benchmark for LLMs on complicated tasks in the terminal

Shell 142 26 Updated Jun 4, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 961 43 Updated Jun 4, 2025

[ACL 2025 Findings] Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering

Python 2 Updated May 21, 2025

Obsidian Weread Plugin is a plugin to sync Weread (微信读书) highlights and annotations into your Obsidian Vault.

TypeScript 1,444 85 Updated May 6, 2025
Python 196 9 Updated May 14, 2025

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 374 15 Updated May 17, 2025
Python 1 Updated May 19, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,405 58 Updated May 30, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 211 22 Updated Jun 3, 2025

Official Repository of Absolute Zero Reasoner

Python 1,450 240 Updated Jun 2, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 958 89 Updated Jun 1, 2025

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 136 7 Updated May 9, 2025
Python 10 Updated May 7, 2025

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

482 34 Updated May 15, 2025

Official repository for "Reinforcement Learning for Reasoning in Large Language Models with One Training Example"

Python 266 20 Updated Jun 2, 2025

YiZhao: A 2TB Open Financial Corpus. Data and tools for generating and inspecting YiZhao, a safe, high-quality, open-source bilingual financial corpus (Chinese and English).

Python 26 3 Updated Dec 12, 2024

GraphGen: A Scalable Approach to Domain-agnostic Labeled Graph Generation

C++ 58 16 Updated Jul 6, 2023

My learning notes/codes for ML SYS.

Python 2,389 149 Updated Jun 4, 2025

repo for paper https://arxiv.org/abs/2504.13837

Python 144 7 Updated May 24, 2025

TTRL: Test-Time Reinforcement Learning

Python 587 43 Updated May 23, 2025