8000 atomic-operation (hyv5) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View atomic-operation's full-sized avatar

Organizations

@IBM

Block or report atomic-operation

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A curated list of awesome Qt and QML libraries, resources, projects, and shiny things.

2,377 365 Updated Jun 18, 2025

LLVM tutorial in Rust language

Rust 1,208 87 Updated Apr 3, 2024

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,203 721 Updated Jul 4, 2025

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,282 1,004 Updated Jul 2, 2025

PyTorch Neural Network eXchange

Python 595 36 Updated Jun 12, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,784 1,295 Updated Jul 3, 2025

Zero Bubble Pipeline Parallelism

Python 401 26 Updated May 7, 2025

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,280 332 Updated May 16, 2023

Transformer related optimization, including BERT, GPT

C++ 6,226 909 Updated Mar 27, 2024

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,526 202 Updated Apr 7, 2025

Serverless LLM Serving for Everyone.

Python 498 49 Updated Jul 3, 2025

✨✨✨ Geeker Admin,基于 Vue3.4、TypeScript、Vite5、Pinia、Element-Plus 开源的一套后台管理框架。

Vue 7,751 1,605 Updated Sep 14, 2024

Tensor library for machine learning

C++ 12,773 1,271 Updated Jul 3, 2025

LLM inference in C/C++

C++ B434 82,543 12,252 Updated Jul 4, 2025

An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation

Python 848 97 Updated Dec 27, 2023

⚡ Based on yolo's ultra-lightweight universal target detection algorithm, the calculation amount is only 250mflops, the ncnn model size is only 666kb, the Raspberry Pi 3b can run up to 15fps+, and …

C 2,055 431 Updated Aug 11, 2021

www.giantpandacv.com

Python 151 31 Updated Jun 20, 2024

Serving multiple LoRA finetuned LLM as one

Python 1,067 52 Updated May 8, 2024

A code generator for array-based code on CPUs and GPUs

Python 608 76 Updated Jul 4, 2025

how to learn PyTorch and OneFlow

440 27 Updated Mar 22, 2024

how to optimize some algorithm in cuda.

Cuda 2,300 209 Updated Jun 27, 2025

compiler learning resources collect.

Python 2,433 351 Updated Mar 19, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,804 2,214 Updated Jul 2, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,911 1,760 Updated Jul 1, 2025

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 875 165 Updated Dec 30, 2024

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,417 3,612 Updated Jul 4, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,726 4,276 Updated Jun 27, 2025

Official inference library for Mistral models

Jupyter Notebook 10,334 930 Updated Mar 20, 2025

NumPy & SciPy for GPU

Python 10,307 923 Updated Jul 3, 2025
Next
0