haohui (Haohui Mai) / Starred · GitHub

Haohui Mai haohui

82 followers · 12 following

San Francisco
http://haohui.me

Achievements

Highlights

Pro

Stars

causalflow-ai / petit-kernel

Optimized FP16/BF16 x FP4 GPU kernels for AMD GPUs

C++ 9 1 Updated Jul 3, 2025

Repeerc / flash-attention-v2-RDNA3-minimal

A simple Flash Attention v2 implementation with ROCm (RDNA3 GPU, rocWMMA), mainly used for Stable Diffusion (ComfyUI) in Windows ZLUDA environments.

Python 43 6 Updated Aug 25, 2024

RRZE-HPC / gpu-benches

collection of benchmarks to measure basic GPU capabilities

C++ 387 56 Updated Feb 11, 2025

XiongjieDai / GPU-Benchmarks-on-LLM-Inference

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 1,681 66 Updated May 13, 2024

ztxz16 / fastllm

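fastllm is a high-performance large-model inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and mixed-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. A dual-socket 9004/9005 server with a single GPU serves the original full-size, full-precision DeepSeek model at 20 tps for a single request; the INT4-quantized model reaches 30 tps for a single request and 60+ tps with multiple concurrent requests.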

C++ 3,731 379 Updated Jul 2, 2025

zylon-ai / private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 56,187 7,537 Updated Nov 13, 2024

v2ray / v2ray-core

A platform for building proxies to bypass network restrictions.

Go 46,156 8,915 Updated May 28, 2025

openhardwaremonitor / openhardwaremonitor

Open Hardware Monitor

C# 6,192 1,291 Updated Jul 13, 2024

CalcProgrammer1 / NVFC

Forked from graphitemaster/NVFC

OpenSource tool for monitoring, configuring and overclocking NVIDIA GPUs

C 2 Updated Feb 21, 2020

mc-imperial / gpuverify

GPUVerify: a Verifier for GPU Kernels

C# 62 16 Updated Jul 28, 2022

cloudcores / CuAssembler

An unofficial CUDA assembler, for all generations of SASS, hopefully :)

Python 509 86 Updated Apr 20, 2023

scarsty / kys-cpp

A C++ remake of 《金庸群侠传》 (Heroes of Jin Yong), now complete

C++ 2,748 388 Updated Jul 4, 2025

fail0verflow / radeon-tools

Radeon reverse engineering tools

Python 150 17 Updated Mar 29, 2020

decodecudabinary / Decoding-CUDA-Binary

C++ 52 13 Updated Nov 21, 2019

envytools / envytools

Tools for people envious of nvidia's blob driver.

C 477 96 Updated Oct 26, 2023

pdziepak / sopt

C++ 9 Updated Aug 23, 2019

daadaada / turingas

Assembler for NVIDIA Volta and Turing GPUs

Python 223 40 Updated Jan 13, 2022

ConsenSysDiligence / mythril

Mythril is a symbolic-execution-based security analysis tool for EVM bytecode. It detects security vulnerabilities in smart contracts built for Ethereum and other EVM-compatible blockchains.

Python 4,058 776 Updated Jun 9, 2025

uber-archive / AthenaX

SQL-based streaming analytics platform at scale

Java 1,225 286 Updated Jun 21, 2020

scipr-lab / libsnark

C++ library for zkSNARKs

C++ 1,880 591 Updated Jun 12, 2025

aws / aws-fpga

Official repository of the AWS EC2 FPGA Hardware and Software Development Kit

SystemVerilog 1,575 525 Updated Jul 1, 2025

facebookarchive / beringei

Beringei is a high performance, in-memory storage engine for time series data.

C++ 3,167 293 Updated Jul 11, 2018

nicehash / nheqminer

Equihash miner for NiceHash

C++ 769 579 Updated Dec 27, 2018

Piyush3dB / awesome-deep-computation

A curated list of Deep Learning hardware, cycle/memory optimisation techniques

41 14 Updated Aug 9, 2016

ReFirmLabs / binwalk

Firmware Analysis Tool

Rust 12,661 1,659 Updated Apr 14, 2025

ExpressOS / expressos

The ExpressOS kernel

C# 16 5 Updated Jun 7, 2013

mrroach9 / Rowell

A pure front-end web UI for you-know-which bbs.

JavaScript 26 10 Updated Mar 7, 2016
