8000 haozheji (Haozhe Ji) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View haozheji's full-sized avatar
:shipit:
:shipit:

Highlights

  • Pro

Block or report haozheji

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 206 12 Updated Jun 8, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,990 806 Updated Jun 18, 2025

Minimal RLHF implementation built on top of minGPT.

Python 29 3 Updated Jul 4, 2024

Create Epic Math and Physics Animations From Text.

Python 986 110 Updated May 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,644 1,513 Updated Jun 18, 2025

This repository includes some detailed proofs of "Bias Variance Decomposition for KL Divergence".

4 Updated Sep 25, 2021

Scalable toolkit for efficient model alignment

Python 814 99 Updated May 31, 2025

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,772 278 Updated Dec 27, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,118 692 Updated Jun 17, 2025

Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS-2024, Oral)

Python 34 1 Updated Apr 21, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 897 64 Updated Feb 16, 2025

An in-browser, local-first Markdown resume builder.

TypeScript 617 115 Updated Jul 11, 2024
Python 23 4 Updated Sep 24, 2024

A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac

VBA 1,077 70 Updated Jan 30, 2025

A python Linear Programming API

Python 2,268 411 Updated May 30, 2025

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 23,107 3,636 Updated Apr 2, 2025

Puzzles for learning Triton

Jupyter Notebook 1,700 136 Updated Nov 18, 2024

Grok open release

Python 50,291 8,356 Updated Aug 30, 2024

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

706 43 Updated Jun 13, 2025

ICLR2023 - Tailoring Language Generation Models under Total Variation Distance

Python 21 1 Updated Feb 8, 2023

Some preliminary explorations of Mamba's context scaling.

Python 214 10 Updated Feb 8, 2024

Easy TOC creation for GitHub README.md

Shell 3,270 2,738 Updated Oct 12, 2024

Example models using DeepSpeed

Python 6,539 1,094 Updated Jun 18, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 50,006 8,141 Updated Jun 18, 2025

Robust recipes to align language models with human and AI preferences

Python 5,229 448 Updated Apr 30, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,669 481 Updated Jan 8, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,608 217 Updated Aug 11, 2024

DSPy: The framework for programming—not prompting—language models

Python 25,606 1,968 Updated Jun 18, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

3,990 243 Updated Apr 30, 2025

Inference code for CodeLlama models

Python 16,331 1,918 Updated Aug 12, 2024
Next
0