8000 AugF (AugF) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View AugF's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report AugF

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome LLMs on Device: A Comprehensive Survey

1,116 104 Updated Jan 12, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Python 3,019 395 Updated Jun 12, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,358 306 Updated May 13, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,891 1,491 Updated Apr 24, 2025

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 640 38 Updated Jul 22, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 15,087 1,984 Updated Jun 12, 2025

Python library implementing a trie data structure.

Python 40 8 Updated Mar 26, 2024

Python library implementing a trie data structure.

Python 822 130 Updated Apr 10, 2021

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 812 102 Updated Jun 9, 2025

Summarize existing representative LLMs text datasets.

1,284 130 Updated Mar 25, 2025

Curated list of datasets and tools for post-training.

3,146 268 Updated Jan 29, 2025

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).

Python 330 32 Updated Jun 1, 2023

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

559 129 Updated Oct 16, 2023

A quick guide (especially) for trending instruction finetuning datasets

3,106 205 Updated Nov 28, 2023

Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。

2,600 338 Updated May 30, 2023
Jupyter Notebook 201 10 Updated Oct 21, 2024

CA-LoRA: Adapting Existing LoRA for Compressed LLMs to Enable Efficient Multi-Tasking on Personal Devices (COLM 2024)

Python 8 1 Updated Oct 30, 2024

Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models

Python 62 3 Updated Nov 1, 2024

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,015 543 Updated May 9, 2025

📰 Must-read papers and blogs on Speculative Decoding ⚡️

788 46 Updated Jun 12, 2025

[TMLR 2024] Efficient Large Language Models: A Survey

1 Updated Jun 14, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 30,442 2,919 Updated Jun 12, 2025

Reaching LLaMA2 Performance with 0.1M Dollars

Python 983 80 Updated Jul 23, 2024

LLM training in simple, raw C/CUDA

Cuda 26,847 3,083 Updated May 10, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

55,271 5,898 Updated Jun 4, 2025

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 843 68 Updated Aug 27, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,124 174 Updated Mar 27, 2024
Next
0