ozturkosu (Muhammed Emin Ozturk) / Starred · GitHub
🎯
Focusing
  • University of Utah
  • Salt Lake City

Organizations

@AARInternal

Starred repositories

[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Python 205 15 Updated May 1, 2025

QUDA is a library for performing calculations in lattice QCD on GPUs.

C++ 326 110 Updated Jul 12, 2025

CUDA Kernel Benchmarking Library

Cuda 680 80 Updated Jul 11, 2025

AMD's graph optimization engine.

C++ 229 103 Updated Jul 13, 2025

AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.

Fortran 225 52 Updated Jul 10, 2025

LLVM/MLIR based compiler instrumentation of AMD GPU kernels

C++ 6 Updated Jun 5, 2025

Advanced Profiling and Analytics for AMD Hardware

Python 159 64 Updated Jul 12, 2025

AI Tensor Engine for ROCm

Python 230 66 Updated Jul 13, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,647 875 Updated Apr 29, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 382 186 Updated Jul 11, 2025

Next generation FFT implementation for ROCm

C++ 195 93 Updated Jul 11, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,218 725 Updated Jul 13, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,913 1,316 Updated Jul 6, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

Python 246 166 Updated Jul 13, 2025

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 4,117 566 Updated Jul 11, 2025

GPU-accelerated compiler

Futhark 347 10 Updated Mar 20, 2024

AMD SMI

C++ 78 43 Updated Jul 12, 2025

ROCm Communication Collectives Library (RCCL)

C++ 347 159 Updated Jul 13, 2025

C++ Insights - See your source code with the eyes of a compiler

C++ 4,326 254 Updated Jun 26, 2025

Trio – a friendly Python library for async concurrency and I/O

Python 6,605 358 Updated Jul 9, 2025

MLIR 148 42 Updated Jul 11, 2025

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,877 191 Updated Mar 11, 2025

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

C++ 2,249 459 Updated Jul 12, 2025

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

C++ 1,311 195 Updated Apr 14, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 121 75 Updated Jul 11, 2025

ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime

C++ 260 121 Updated Jul 12, 2025

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

Python 1,903 286 Updated Jul 7, 2025