8000 AndSonder (Chang Lu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View AndSonder's full-sized avatar
🎯
Focusing
🎯
Focusing
  • University of Electronic Science and Technology of China
  • Cheng Du

Highlights

  • Pro

Organizations

@sanyuankexie @neet-cv

Block or report AndSonder

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,281 647 Updated May 31, 2024

Sequence-level 1F1B schedule for LLMs.

Python 23 2 Updated Dec 24, 2024

跨平台桌宠 BongoCat,为桌面增添乐趣!

TypeScript 4,548 218 Updated May 16, 2025

Lightweight coding agent that runs in your terminal

TypeScript 23,299 2,341 Updated May 17, 2025

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 792 103 Updated Aug 20, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,523 414 Updated Apr 22, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 17,465 2,043 Updated May 1, 2025
Jupyter Notebook 8,366 599 Updated Jun 16, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 81,816 9,833 Updated May 13, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,662 768 Updated May 12, 2025

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…

TypeScript 60,847 12,779 Updated May 17, 2025

Fully open reproduction of DeepSeek-R1

Python 24,444 2,250 Updated May 17, 2025

大模型基础: 一文了解大模型基础知识

4,836 414 Updated Feb 24, 2025

Puzzles for learning Triton, play it with minimal environment configuration!

Python 313 36 Updated Dec 3, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 20,885 2,444 Updated Apr 30, 2025

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

1,240 310 Updated Dec 14, 2023

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

1,990 143 Updated Dec 26, 2024

A smarter cd command. Supports all major shells.

Rust 26,457 631 Updated May 16, 2025

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 1,066 138 Updated Jan 4, 2025

[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"

Python 93 6 Updated Nov 9, 2024

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.

Python 4,000 277 Updated May 15, 2025

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 5,771 457 Updated May 6, 2025

LLM全栈优质资源汇总

Shell 549 63 Updated Nov 25, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 7,487 824 Updated Apr 30, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,408 1,771 Updated May 17, 2025

🧊 一个可爱且任性的 B 站视频下载器

Python 1,376 116 Updated May 15, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,068 5,970 Updated May 16, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,500 7,449 Updated May 17, 2025
Next
0