yyht / Starred · GitHub
Python 48 3 Updated Jun 25, 2025

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents

Python 95 5 Updated Jun 25, 2025

slime is an LLM post-training framework aimed at scaling RL.

Python 445 19 Updated Jun 25, 2025

LeanRL is a fork of CleanRL in which selected PyTorch scripts are optimized for performance using compile and cudagraphs.

Python 598 26 Updated Oct 26, 2024

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25

Python 32 2 Updated Jun 16, 2025

ICML 2025 Spotlight

Python 206 11 Updated Jun 21, 2025

The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 145 13 Updated Jun 3, 2025

The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning

Python 278 12 Updated May 31, 2025
Python 156 18 Updated Jun 19, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,261 47 Updated Jun 14, 2025

A Python implementation of Hidden Topic Markov Model

Python 15 6 Updated May 6, 2018

c/ua is the Docker Container for Computer-Use AI Agents.

Python 8,779 392 Updated Jun 24, 2025
Python 55 2 Updated Jun 17, 2025

Official Repository of Absolute Zero Reasoner

Python 1,553 264 Updated Jun 2, 2025

Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users

Python 445 57 Updated Apr 15, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 670 23 Updated Jun 25, 2025

A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures

Python 58 3 Updated Jun 2, 2025

Latest Advances on System-2 Reasoning

Python 1,137 57 Updated Jun 8, 2025

This project is a proof of concept that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.

Python 87 21 Updated Jan 26, 2025

A package for sampling from Gibbs distributions during inference with LLMs.

Python 8 2 Updated Jun 12, 2025

A Chinese Open-Domain Dialogue System

Python 321 27 Updated Aug 16, 2023

Example models using DeepSpeed

Python 6,541 1,094 Updated Jun 21, 2025

Clustering for arbitrary data and dissimilarity function

Python 1 Updated Dec 21, 2021

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

Python 1,317 119 Updated Dec 1, 2023
Python 57 6 Updated Dec 18, 2022
Python 63 8 Updated Jun 9, 2022

OPD: Chinese Open-Domain Pre-trained Dialogue Model

Python 75 1 Updated Jun 5, 2023

Accessible large language models via k-bit quantization for PyTorch.

Python 7,154 708 Updated Jun 24, 2025

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

Python 1,448 72 Updated Dec 9, 2024

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Python 1,485 197 Updated Jun 18, 2025