haozheji

Haozhe Ji haozheji

PhD student @ Tsinghua University HFer | THU EE | THU CoAI

98 followers · 53 following

Beijing, China
06:18 (UTC +08:00)
haozheji.github.io
@HaozJi

Achievements

x2 x2

Achievements

x2 x2

Highlights

Stars

sam-paech / slop-forensics

Jupyter Notebook 206 12 Updated Jun 8, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,990 806 Updated Jun 18, 2025

ttumiel / minRLHF

Minimal RLHF implementation built on top of minGPT.

Python 29 3 Updated Jul 4, 2024

HarleyCoops / Math-To-Manim

Create Epic Math and Physics Animations From Text.

Python 986 110 Updated May 30, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,644 1,513 Updated Jun 18, 2025

HolmesShuan / Bias-Variance-Decomposition-for-KL-Divergence

This repository includes some detailed proofs of "Bias Variance Decomposition for KL Divergence".

4 Updated Sep 25, 2021

NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment

Python 814 99 Updated May 31, 2025

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,772 278 Updated Dec 27, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,118 692 Updated Jun 17, 2025

d-tiapkin / gflownet-rl

Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS-2024, Oral)

Python 34 1 Updated Apr 21, 2024

princeton-nlp / SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 897 64 Updated Feb 16, 2025

Renovamen / oh-my-cv

An in-browser, local-first Markdown resume builder.

TypeScript 617 115 Updated Jul 11, 2024

EleutherAI / w2s

Python 23 4 Updated Sep 24, 2024

Jonathan-LeRoux / IguanaTex

A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac

VBA 1,077 70 Updated Jan 30, 2025

coin-or / pulp

A python Linear Programming API

Python 2,268 411 Updated May 30, 2025

gpakosz / .tmux

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 23,107 3,636 Updated Apr 2, 2025

srush / Triton-Puzzles

Puzzles for learning Triton

Jupyter Notebook 1,700 136 Updated Nov 18, 2024

xai-org / grok-1

Grok open release

Python 50,291 8,356 Updated Aug 30, 2024

Yangyi-Chen / Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

706 43 Updated Jun 13, 2025

thu-coai / TaiLr

ICLR2023 - Tailoring Language Generation Models under Total Variation Distance

Python 21 1 Updated Feb 8, 2023

jzhang38 / LongMamba

Some preliminary explorations of Mamba's context scaling.

Python 214 10 Updated Feb 8, 2024

ekalinin / github-markdown-toc

Easy TOC creation for GitHub README.md

Shell 3,270 2,738 Updated Oct 12, 2024

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,539 1,094 Updated Jun 18, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 50,006 8,141 Updated Jun 18, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,229 448 Updated Apr 30, 2025

CarperAI / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,669 481 Updated Jan 8, 2024

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,608 217 Updated Aug 11, 2024

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 25,606 1,968 Updated Jun 18, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,990 243 Updated Apr 30, 2025

meta-llama / codellama

Inference code for CodeLlama models

Python 16,331 1,918 Updated Aug 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Haozhe Ji haozheji

Achievements

Achievements

Highlights

Block or report haozheji

Stars

sam-paech / slop-forensics

deepseek-ai / DeepEP

ttumiel / minRLHF

HarleyCoops / Math-To-Manim

volcengine / verl

HolmesShuan / Bias-Variance-Decomposition-for-KL-Divergence

NVIDIA / NeMo-Aligner

tatsu-lab / alpaca_eval

OpenRLHF / OpenRLHF

d-tiapkin / gflownet-rl

princeton-nlp / SimPO

Renovamen / oh-my-cv

EleutherAI / w2s

Jonathan-LeRoux / IguanaTex

coin-or / pulp

gpakosz / .tmux

srush / Triton-Puzzles

xai-org / grok-1

Yangyi-Chen / Multimodal-AND-Large-Language-Models

thu-coai / TaiLr

jzhang38 / LongMamba

ekalinin / github-markdown-toc

deepspeedai / DeepSpeedExamples

vllm-project / vllm

huggingface / alignment-handbook

CarperAI / trlx

eric-mitchell / direct-preference-optimization

stanfordnlp / dspy

opendilab / awesome-RLHF

meta-llama / codellama