Stars
A minimal flash-attention implementation built with cutlass, intended for teaching
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.
A Chinese résumé written in Typst: concise syntax, attractive styling, ready to use out of the box, with an optional photo
A minimalist and extensible PyTorch extension for implementing custom backend operators in PyTorch.
AISystem covers AI systems: the full stack of low-level AI technologies, including AI chips, AI compilers, and AI inference and training frameworks
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
How to learn PyTorch and OneFlow
How to optimize some algorithms in CUDA.
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
Learning material for CMU10-714: Deep Learning System
A CUDA tutorial for learning CUDA programming from scratch
This repository gives a guideline for learning CUDA and TensorRT from the beginning.
Simple application that loads an OBJ file and an environment map and calculates & displays the corresponding 3rd order spherical harmonics coefficients
The IndoorHarmony-Dataset for Spatially-Varying Illumination-Aware Indoor Harmonization.
Python implementation of conversion between equirectangular, cubemap, and perspective projections. (equirect2cube, cube2equirect, equirect2perspec)
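Demos of functional modules from everyday Qt development, and of how to use Qt libraries and third-party libraries, such as FFmpeg, OpenCV, Web, QGraphicsView, OpenGL, network programming, databases, QSS style sheets, and QtChart chart drawing.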
Interactive HTML BOM generation plugin for KiCad, EasyEDA, Eagle, Fusion360 and Allegro PCB designer
You can get the time, weather, indoor temperature, indoor humidity, indoor pressure, indoor IAQ, QQ notes, and computer info on this device.
Official Electron build of draw.io
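🔥🔥 More than 1,000 classic computer science books, personal notes, and resources referenced in the author's articles published on various platforms. The books cover C/C++, Java, Python, Go, data structures and algorithms, operating systems, backend architecture, computer systems, databases, computer networking, design patterns, front end, assembly, and interview write-ups for both campus and experienced-hire recruiting.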
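哔哩下载姬 (downkyi), a video download tool for the Bilibili website. It supports batch downloads, 8K, HDR, and Dolby Vision, and provides a toolbox (audio/video extraction, watermark removal, etc.).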