8000 waterwaterrr / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View waterwaterrr's full-sized avatar

Block or report waterwaterrr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
JavaScript 1,172 97 Updated May 29, 2025

[ICML 2025] Official Implementation of GLIDER

Python 44 1 Updated May 27, 2025

Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Python 40 2 Updated May 22, 2025

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 17,220 1,861 Updated Apr 4, 2025

LLM Arena by KCORES team

HTML 826 38 Updated Apr 29, 2025

VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework

Python 339 15 Updated May 12, 2025

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 215 8 Updated Apr 2, 2025

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

414 11 Updated May 22, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 7,861 667 Updated May 31, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,280 52 Updated May 11, 2025

A series of technical report on Slow Thinking with LLM

Python 682 39 Updated May 27, 2025
Python 293 18 Updated May 31, 2025

Awesome RL-based LLM Reasoning

506 27 Updated May 4, 2025

🙌 OpenHands: Code Less, Make More

Python 56,973 6,415 Updated May 31, 2025
Python 731 32 Updated Apr 28, 2025

s1: Simple test-time scaling

Python 6,418 748 Updated May 19, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,306 305 Updated May 13, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,795 277 Updated May 15, 2025

Official Repo for Open-Reasoner-Zero

Python 1,938 101 Updated Apr 8, 2025

My learning notes/codes for ML SYS.

Python 2,325 144 Updated May 31, 2025

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】

Jupyter Notebook 10,486 1,274 Updated May 29, 2025

Deep Reinforcement Learning

3,916 627 Updated Dec 10, 2022

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,805 1,098 Updated May 31, 2025

LIMO: Less is More for Reasoning

Python 954 47 Updated Apr 6, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,347 155 Updated Mar 20, 2025

Simple RL training for reasoning

Python 3,601 267 Updated Apr 10, 2025

Fully open reproduction of DeepSeek-R1

Python 24,625 2,276 Updated May 28, 2025

ASCII generator (image to text, image to image, video to video)

Python 7,872 607 Updated Nov 22, 2024
Next
0