-
SSpMV Public
Sparsity-aware SpMV that can adaptive use the optimal format and algorithm for specific input sparse matrix.
C++ UpdatedJan 23, 2025 -
CL-DB-GEMM Public
OpenCL based Double Buffer General Matrix-Matrix Multiplication Library
-
Paper: "STM-multifrontal QR: streaming task mapping multifrontal QR factorization empowered by GCN"
-
ALSparse Public
Forked from AlphaSparse/LibraryA sparse BLAS lib supporting multiple backends
C MIT License UpdatedJun 19, 2023 -
HPS-Cholesky Public
The implementation of paper: "HPS Cholesky: Hierarchical Parallelized Supernodal Cholesky with Adaptive Parameters"
C UpdatedMay 16, 2023 -
Cpp_Primer_Practice Public
Forked from applenob/Cpp_Primer_Practice搞定C++:punch:。C++ Primer 中文版第5版学习仓库,包括笔记和课后练习答案。
C++ UpdatedDec 28, 2022 -
CLTune Public
Forked from CNugteren/CLTuneCLTune: An automatic OpenCL & CUDA kernel tuner
C++ Other UpdatedDec 12, 2022 -
clBLAS_Modified Public
It's a modified version of clBLAS2.12.0 that fixed some error and BUGs for the release version.
C Apache License 2.0 UpdatedSep 28, 2022 -
clMagma_Modified Public
Single precision implementation of clMagma library based on clBLASt
Fortran Other UpdatedSep 28, 2022 -
Benchmark_SpMV_using_CSR5 Public
Forked from weifengliu-ssslab/Benchmark_SpMV_using_CSR5CSR5-based SpMV on CPUs, GPUs and Xeon Phi
C++ MIT License UpdatedAug 16, 2022 -
OpenCL-Guide Public
Forked from KhronosGroup/OpenCL-GuideA guide to help developers get up and running quickly with the OpenCL programming framework
CMake Creative Commons Attribution 4.0 International UpdatedMay 12, 2022 -
spECK Public
Forked from GPUPeople/spECKEfficient SpGEMM on GPU using CUDA and CSR
Cuda MIT License UpdatedJan 15, 2022 -
LAFF-On-PfHP Public
Forked from ULAFF/LAFF-On-PfHPRepository for "LAFF-On Programming for High Performance"
C BSD 3-Clause "New" or "Revised" License UpdatedNov 21, 2021 -
-
-
perf-tools Public
Forked from brendangregg/perf-toolsPerformance analysis tools based on Linux perf_events (aka perf) and ftrace
Shell GNU General Public License v2.0 UpdatedAug 3, 2021 -
clBLAS Public
Forked from clMathLibraries/clBLASa software library containing BLAS functions written in OpenCL
C++ Apache License 2.0 UpdatedApr 28, 2021 -
-
opentuner Public
Forked from jansel/opentunerAn extensible framework for program autotuning
Python MIT License UpdatedMar 31, 2021 -
-
how-to-optimize-gemm Public
Forked from tpoisonooo/how-to-optimize-gemmARM RowMajor sgemm optimization
Fortran MIT License UpdatedDec 24, 2020 -
clFFT_Modified Public
Forked from clMathLibraries/clFFTa software library containing FFT functions written in OpenCL
C++ Apache License 2.0 UpdatedAug 23, 2020 -
SuiteSparse Public
Forked from DrTimothyAldenDavis/SuiteSparseThe official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University
C Other UpdatedApr 8, 2020 -
ai-edu Public
Forked from microsoft/ai-eduAI education materials for Chinese students, teachers and IT professionals.
Jupyter Notebook Other UpdatedMar 6, 2020 -
suitesparse-metis-for-windows Public
Forked from jlblancoc/suitesparse-metis-for-windowsCMake scripts for painless usage of SuiteSparse+METIS from Visual Studio and the rest of Windows/Linux/OSX IDEs supported by CMake
C BSD 3-Clause "New" or "Revised" License UpdatedJan 21, 2020 -
-
BLASX Public
Forked from linnanwang/BLASXa heterogeneous multiGPU level-3 BLAS library
C UpdatedDec 9, 2019 -
nn-from-scratch Public
Forked from dennybritz/nn-from-scratchImplementing a Neural Network from Scratch
Jupyter Notebook UpdatedNov 8, 2019 -
UGATIT Public
Forked from taki0112/UGATITOfficial Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation
Python MIT License UpdatedOct 16, 2019 -