- ByteDance Inc.
- Beijing, China
- qingyunqu.github.io
Stars
Distributed Compiler Based on Triton for Parallel Systems
Efficient Triton Kernels for LLM Training
Ongoing research training transformer models at scale
A code-searching tool similar to ack, but faster.
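This project aims to share the technical principles behind large language models and hands-on experience with them (LLM engineering and deploying LLM applications).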
The Torch-MLIR project aims to provide first-class support from the PyTorch ecosystem to the MLIR ecosystem.
A model compilation solution for various hardware
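"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Code is provided in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.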
Development repository for the Triton language and compiler
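An easy-to-understand TensorOp Matmul tutorial
A personal Chinese translation of the book "C++17 the complete guide", for study and exchange only; it will be removed on request if it infringes any rights.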
A technical report on convolution arithmetic in the context of deep learning
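workspace is a lightweight asynchronous execution framework based on C++11. It supports general-purpose asynchronous concurrent task execution, priority task scheduling, an adaptive dynamic thread pool, an efficient static thread pool, exception handling, and more.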
A high-performance, extensible Python AOT compiler.
Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.
"Effective Modern C++": Chinese translation, complete
Backward compatible ML compute opset inspired by HLO/MHLO
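The officially designated global GitHub repository for "Runology" (润学, the study of emigrating). It collects the aims, principles, theory, and assorted real-world examples of "running"; addresses three questions (why run, where to run, and how to run); and aspires to become the core religion and core belief of the new Chinese.
C++ learning resources, newly compiled in 2021: new features of C++11/14/17/20/23, introductory tutorials, recommended books, quality articles, study notes, video lessons, and more.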
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
A cheatsheet of modern C++ language and library features.