8000 GitHub - guacamoleo/Tensile: Stretching GPU performance for GEMMs and tensor contractions.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

guacamoleo/Tensile

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems (such as batched GEMM), and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as a backend library for rocBLAS. Tensile acts as the performance backbone for a wide variety of 'compute' applications running on AMD GPUs.

Note

The published documentation is available at Tensile in an organized, easy-to-read format, with search and a table of contents. The documentation source files reside in the Tensile/docs/src folder of this repository. As with all ROCm projects, the documentation is open source. For more information on contributing to the documentation, see Contribute to ROCm documentation 5F27 .

About

Stretching GPU performance for GEMMs and tensor contractions.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 51.6%
  • C++ 29.2%
  • Assembly 15.1%
  • TeX 1.4%
  • Shell 1.2%
  • CMake 0.9%
  • Other 0.6%
0