-
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 Updated Nov 26, 2024 -
mirage Public
Forked from mirage-project/mirage
A multi-level tensor algebra superoptimizer
C++ Apache License 2.0 Updated Sep 9, 2024 -
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ MIT License Updated Jul 16, 2024 -
FlexFlow-private Public
Forked from flexflow/flexflow-train
A distributed deep learning framework that supports flexible parallelization strategies.
-
TASO Public
The Tensor Algebra SuperOptimizer for Deep Learning
-
jax Public
Forked from jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Python Apache License 2.0 Updated Dec 18, 2022 -
sosp19ae Public
Artifacts for the SOSP'19 paper "Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions"
-
Python package built to ease deep learning on graph, on top of existing DL frameworks.
Python Apache License 2.0 Updated Nov 19, 2020 -
website Public
Forked from flexflow/website
website for flexflow.ai
SCSS MIT License Updated Nov 7, 2020 -
minimal-mistakes Public
Forked from mmistakes/minimal-mistakes
📐 Jekyll theme for building a personal site, blog, project documentation, or portfolio.
CSS MIT License Updated Jun 22, 2020 -
dlrm Public
Forked from facebookresearch/dlrm
An implementation of a deep learning recommendation model (DLRM)
Python MIT License Updated Sep 14, 2019 -
cudnn-training Public
Forked from tbennun/cudnn-training
A CUDNN minimal deep learning training code sample using LeNet.
-
clang Public
We implement our nan integer compiler based on clang