Lists (2)
Sort Name ascending (A-Z)
Starred repositories
A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
Efficient implementation of DeepSeek Ops (Blockwise FP8 GEMM, MoE, and MLA) for AMD Instinct MI300X
A community-supported supercharged document management system: scan, index and archive all your documents
手写实现李航《统计学习方法》书中全部算法
FlashInfer: Kernel Library for LLM Serving
This is a repo with links to everything you'd ever want to learn about data engineering
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解
Open source process design kit for usage with SkyWater Technology Foundry's 130nm node.
Tracking RISC-V Actions on Education, Training, Courses, Monitorships, etc.
A framework for few-shot evaluation of language models.
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
awesome AI models with NCNN, and how they were converted ✨✨✨
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
Run evaluation on LLMs using human-eval benchmark
Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)
Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.
Notebooks for Hardware-Aware Training of Spiking Neural Networks. Open-Source Neuromorphic Circuit Design Tutorial at ESSCIRC 2023.
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
Recent Advances on Efficient Vision Transformers
An EDA toolchain for integrated core-memory interval thermal simulations of 2D, 2.5, and 3D multi-/many-core processors
Offers a toolset for comprehensive, multi-faceted large-scale data analysis and optimizations
Standalone Flash Attention v2 kernel without libtorch dependency