8000 zui-jiang (zuijiang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zui-jiang's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zui-jiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 274 20 Updated May 30, 2025

A survey on harmful fine-tuning attack for large language model

178 6 Updated May 27, 2025

Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!

Jupyter Notebook 15 Updated Aug 24, 2024

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,388 58 Updated May 30, 2025

Official Repository of LatentSeek

Python 29 2 Updated May 25, 2025

DeepRAG: Thinking to Retrieve Step by Step for Large Language Models

Python 12 1 Updated May 17, 2025

Graphical Java application for managing BibTeX and BibLaTeX (.bib) databases

Java 3,905 2,815 Updated May 30, 2025

Cleaner and Formatter for BibTeX files

TeX 969 75 Updated May 16, 2025

Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang

Python 53 6 Updated Nov 8, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,719 202 Updated May 27, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 953 43 Updated May 24, 2025

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 606 40 Updated May 29, 2025

A light-weight tool for evaluating LLMs in rule-based ways.

Python 53 3 Updated May 26, 2025

An Azure Function solution to crawl through all of your image files in GitHub and losslessly compress them. This will make the file size go down, but leave the dimensions and quality untouched. Onc…

C# 1,292 279 Updated Jan 28, 2025

Async pipelined version of Verl

Python 91 10 Updated Apr 8, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. AntRay is forked from ray, offering incremental new features on top …

Python 126 19 Updated May 29, 2025

Simple, modern and fast file watching and code reload in Python.

Python 1,993 109 Updated Apr 10, 2025

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Python 106 7 Updated Apr 17, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,136 392 Updated May 30, 2025
Python 45 2 Updated Apr 9, 2025

Debugging torch distributed program

Python 7 Updated Aug 30, 2024

Synchronized viewing, theater, live streaming, video

Go 1,504 134 Updated May 8, 2025

跟你的好友一起实时在线听播客!

TypeScript 502 42 Updated Apr 9, 2024

veRL: Volcano Engine Reinforcement Learning for LLM

Python 4 2 Updated May 28, 2025

Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).

Python 136 14 Updated Oct 1, 2023

Efficient Triton Kernels for LLM Training

Python 5,119 338 Updated May 30, 2025

A visuailzation tool to make deep understaning and easier debugging for RLHF training.

Python 203 7 Updated Feb 20, 2025

Official Repo for Open-Reasoner-Zero

Python 1,936 100 Updated Apr 8, 2025
Next
0