8000 atomic-operation (hyv5) / Starred · GitHub

More Web Proxy on the site http://driver.im/

atomic-operation

Follow

hyv5 atomic-operation

Follow

IBMer

9 followers · 16 following

@IBM
Dalian
www.hyv5.cn

Achievements

Achievements

Organizations

Starred repositories

mikalv / awesome-qt-qml

A curated list of awesome Qt and QML libraries, resources, projects, and shiny things.

2,377 365 Updated Jun 18, 2025

jauhien / iron-kaleidoscope

LLVM tutorial in Rust language

Rust 1,208 87 Updated Apr 3, 2024

iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,203 721 Updated Jul 4, 2025

Oneflow-Inc / oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,282 1,004 Updated Jul 2, 2025

pnnx / pnnx

PyTorch Neural Network eXchange

Python 595 36 Updated Jun 12, 2025

NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 7,784 1,295 Updated Jul 3, 2025

sail-sg / zero-bubble-pipeline-parallelism

Forked from NVIDIA/Megatron-LM

Zero Bubble Pipeline Parallelism

Python 401 26 Updated May 7, 2025

Tony-Tan / CUDA_Freshman

Cuda 2,479 473 Updated Jan 16, 2024

bytedance / lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,280 332 Updated May 16, 2023

NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 6,226 909 Updated Mar 27, 2024

Tencent / TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,526 202 Updated Apr 7, 2025

ServerlessLLM / ServerlessLLM

Serverless LLM Serving for Everyone.

Python 498 49 Updated Jul 3, 2025

HalseySpicy / Geeker-Admin

✨✨✨ Geeker Admin，基于 Vue3.4、TypeScript、Vite5、Pinia、Element-Plus 开源的一套后台管理框架。

Vue 7,751 1,605 Updated Sep 14, 2024

ggml-org / ggml

Tensor library for machine learning

C++ 12,773 1,271 Updated Jul 3, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ B434 82,543 12,252 Updated Jul 4, 2025

OpenBMB / ProAgent

An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation

Python 848 97 Updated Dec 27, 2023

dog-qiuqiu / Yolo-Fastest

⚡ Based on yolo's ultra-lightweight universal target detection algorithm, the calculation amount is only 250mflops, the ncnn model size is only 666kb, the Raspberry Pi 3b can run up to 15fps+, and …

C 2,055 431 Updated Aug 11, 2021

BBuf / giantpandacv.com

www.giantpandacv.com

Python 151 31 Updated Jun 20, 2024

punica-ai / punica

Serving multiple LoRA finetuned LLM as one

Python 1,067 52 Updated May 8, 2024

inducer / loopy

A code generator for array-based code on CPUs and GPUs

Python 608 76 Updated Jul 4, 2025

BBuf / how-to-learn-deep-learning-framework

how to learn PyTorch and OneFlow

440 27 Updated Mar 22, 2024

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 2,300 209 Updated Jun 27, 2025

BBuf / tvm_mlir_learn

compiler learning resources collect.

Python 2,433 351 Updated Mar 19, 2025

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,804 2,214 Updated Jul 2, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 20,911 1,760 Updated Jul 1, 2025

alibaba / BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 875 165 Updated Dec 30, 2024

apache / tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,417 3,612 Updated Jul 4, 2025

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 21,726 4,276 Updated Jun 27, 2025

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 10,334 930 Updated Mar 20, 2025

cupy / cupy

NumPy & SciPy for GPU

Python 10,307 923 Updated Jul 3, 2025

Starred topics

Machine learning

Awesome Lists

0