8000 Releases · eth-cscs/spla · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Releases: eth-cscs/spla

SPLA 1.2.1

21 Oct 12:23
1366d55
Compare
Choose a tag to compare

SPLA 1.2.1 Release Notes

Features

  • Added CMake option to specify the BLAS library

Bug fixes

  • Fixed an internal error being thrown for certain large sizes with pgemm_ssb

SPLA 1.2.0

04 Oct 17:30
891bbc2
Compare
Choose a tag to compare

SPLA 1.2.0 Release Notes

Features

  • Improved performance with multiple threads and small matrix sizes.
  • Added threshold parameter, below which gemm computations may be done on Host, even if GPU is set a processing unit.

Bug fixes

  • If OpenBLAS is compiled without thread-safety, multi-threading is disabled for gemm computation.

SPLA 1.1.1

16 Sep 09:29
446d5b8
Compare
Choose a tag to compare

SPLA 1.1.1 Release Notes

Bug fixes

  • Fixed CMake find module for AMD HIP libraries

SPLA 1.1.0

07 Sep 17:13
e2cf5c7
Compare
Choose a tag to compare

SPLA 1.1.0 Release Notes

Changes

  • Changed tiling strategy for host and GPU computations for improved performance
  • Rewrote and extended tests

Bug fixes

  • Fixed issue with pgemm_ssb if local k = 0
  • Added workaround for bug in ROCm 3.5 related to device pointer detection
  • Fixed bug with Intel MKL reporting invalid arguments with certain edge cases
  • Fixed parameter check, such that null pointers are valid if local slice size is 0

SPLA 1.0.0

04 Aug 07:23
eb9dc14
Compare
Choose a tag to compare
v1.0.0

change version to 1.0.0
0