Stars
An open-source Python framework for creating, editing, and invoking Noisy Intermediate-Scale Quantum (NISQ) circuits.
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Fast and accurate object detection with end-to-end GPU optimization
Fast and accurate object detection with end-to-end GPU optimization