hannlp (Yuchen Han) / Starred · GitHub

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,615 1,324 Updated May 17, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,567 835 Updated Apr 29, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,775 106 Updated Apr 3, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,131 1,002 Updated May 23, 2025

Curated list of datasets and tools for post-training.

3,072 265 Updated Jan 29, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,286 305 Updated May 13, 2025

Simple RL training for reasoning

Python 3,583 266 Updated Apr 10, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,381 1,031 Updated May 23, 2025

🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts

Python 38 Updated Sep 29, 2024

Code for BLT research paper

Python 1,661 137 Updated May 22, 2025

Machine Learning Engineering Open Book

Python 13,776 829 Updated May 8, 2025

Cool Papers - Immersive Paper Discovery

JavaScript 539 11 Updated May 12, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,448 240 Updated May 23, 2025

🧑‍🚀 Summary of the world's best LLM resources (video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).

5,284 523 Updated May 23, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,799 663 Updated May 23, 2025

Next-Token Prediction is All You Need

Python 2,127 80 Updated Mar 17, 2025

O1 Replication Journey

1,992 65 Updated Jan 14, 2025

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python 106 8 Updated Oct 16, 2024

A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks

Jupyter Notebook 263 13 Updated Jul 30, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 6,579 551 Updated Apr 19, 2025

A generative speech model for daily dialogue.

Python 36,319 3,926 Updated May 23, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o.

Python 8,160 621 Updated Apr 27, 2025

LLM training in simple, raw C/CUDA

Cuda 26,661 3,064 Updated May 10, 2025

🚀 Power Your World with AI - Explore, Extend, Empower.

JavaScript 7,567 576 Updated May 1, 2025

A collection of 150+ surveys on LLMs

296 18 Updated Feb 19, 2025

A latent text-to-image diffusion model

Jupyter Notebook 70,718 10,444 Updated Jun 18, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,538 1,424 Updated May 22, 2025