8000 cauyxy (Xinyu Yang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View cauyxy's full-sized avatar
🐵
Unacquainted with machine learning
🐵
Unacquainted with machine learning

Block or report cauyxy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A python module to repair invalid JSON from LLMs

Python 2,418 105 Updated Jul 7, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,300 50 Updated Jun 14, 2025
Python 303 18 Updated May 31, 2025
Lean 59 27 Updated Jul 3, 2025
2 Updated Oct 9, 2023

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 351 1 Updated Mar 5, 2025

Block Puzzle is a classic, puzzle game, made in Unity, where you have to put a randomly spawned blocks in suitable places.

C# 52 19 Updated May 27, 2021

Ef E3EF ficient Triton Kernels for LLM Training

Python 5,325 367 Updated Jul 8, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,528 1,740 Updated Jul 8, 2025

A series of math-specific large language models of our Qwen2 series.

Python 960 136 Updated Jan 11, 2025

[AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning

Python 33 Updated Apr 14, 2025

A flexible and efficient training framework for large-scale alignment tasks

Python 385 32 Updated Jul 8, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,608 172 Updated Jul 8, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,671 567 Updated Jul 8, 2025

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 2,801 532 Updated Apr 15, 2024

KenLM: Faster and Smaller Language Model Queries

C++ 2,630 524 Updated Mar 30, 2025

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,260 146 Updated Jul 7, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,748 628 Updated Jul 7, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 37,883 6,570 Updated Jul 8, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,292 708 Updated Jul 8, 2025

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

263 7 Updated Aug 20, 2023

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

Python 38 4 Updated Jan 12, 2024

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,745 3,533 Updated Jul 7, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,627 546 Updated May 3, 2024

[NeurlPS D&B 2024] Generative AI for Math: MathPile

Python 414 21 Updated Apr 4, 2025

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,832 493 Updated Nov 27, 2024

A framework for few-shot evaluation of language models.

Python 9,476 2,520 Updated Jul 7, 2025
Next
0