8000 lidh15 (Denghao Li) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lidh15's full-sized avatar

Block or report lidh15

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pytorch implementation of BRECQ, ICLR 2021

Python 275 57 Updated Aug 1, 2021

Ongoing research training transformer models at scale

Python 12,575 2,834 Updated Jun 14, 2025

Distributed RL System for LLM Reasoning

Python 1,730 85 Updated Jun 13, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,646 348 Updated Mar 12, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,439 1,242 Updated Jun 14, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,328 55 Updated May 11, 2025

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 360 34 Updated Feb 22, 2025

EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection

Python 8 Updated Mar 6, 2025

Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)

Python 24 Updated Apr 2, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,364 308 Updated May 13, 2025

Train transformer language models with reinforcement learning.

Python 14,170 1,961 Updated Jun 13, 2025

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,834 163 Updated Apr 4, 2025

Code for paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning

Python 6 1 Updated May 21, 2025
Python 21 Updated Jun 6, 2025

Fast cosine similarity for Python

Python 8 1 Updated Jan 29, 2022

EvaByte: Efficient Byte-level Language Models at Scale

Python 101 6 Updated Apr 22, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,683 193 Updated Jun 14, 2025

The official GitHub page for the survey paper "A Survey of RWKV".

27 1 Updated Jan 7, 2025

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Python 1,113 81 Updated Jan 11, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 3,172 327 Updated Jun 13, 2025

Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning

Python 47 2 Updated Jun 11, 2025

This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 197 19 Updated Feb 23, 2025

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 642 44 Updated Apr 25, 2025

Agent S: an open agentic framework that uses computers like a human

Python 5,454 556 Updated Jun 10, 2025

PyTorch Implementation for Hyperbolic Fine-tuning for LLMs

Python 16 Updated Oct 24, 2024

[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

Python 39 3 Updated Apr 18, 2025

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 797 57 Updated Oct 1, 2024

Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"

Python 20 2 Updated May 20, 2025
Python 94 13 Updated Dec 6, 2024
Next
0