- University of Electronic Science and Technology of China
- Chengdu
- space.keter.host
Starred repositories
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
thunlp / Seq1F1B
Forked from NVIDIA/Megatron-LM. Sequence-level 1F1B schedule for LLMs.
Lightweight coding agent that runs in your terminal
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
This project shares the technical principles of large language models along with hands-on experience (LLM engineering and real-world application deployment).
Robust Speech Recognition via Large-Scale Weak Supervision
DeepEP: an efficient expert-parallel communication library
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
Fully open reproduction of DeepSeek-R1
Puzzles for learning Triton; play with minimal environment configuration!
🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
LLM interview notes and answers: this repository records interview questions and reference answers for large language model (LLM) algorithm engineers.
A smarter cd command. Supports all major shells.
Unified KV Cache Compression Methods for Auto-Regressive Models
[EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.
AirLLM: 70B model inference on a single 4GB GPU
SGLang is a fast serving framework for large language models and vision language models.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs