8000 YiminGao0113 (Yimin Gao) / Starred · GitHub

More Web Proxy on the site http://driver.im/

YiminGao0113

Follow

Yimin Gao YiminGao0113

Follow

9 followers · 8 following

University of Virginia
yg9bq@virginia.edu
@YiminGao

Achievements

Achievements

Stars

Richielee630 / TMMA

TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models, optimized for edge deployment on Xilinx KV260.

Jupyter Notebook 8 1 Updated Mar 24, 2025

rohan-paul / Quantize-GPT-LLM-models-with-HuggingFace-transformers

Quantize GPT LLM models with HuggingFace transformers

Python 5 1 Updated Jan 5, 2024

YiminGao0113 / SIMD_Multipliers

Verilog 3 Updated Apr 3, 2024

wtfpad / wtfpad

Source code to simulate WTF-PAD on a set of web traffic traces.

Python 23 10 Updated Jul 14, 2020

pylls / padding-machines-for-tor

Project repository for creating padding machines for Tor to defend against website fingerprinting

Python 22 1 Updated Nov 26, 2020

spcl / gemm_hls

Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.

C++ 347 58 Updated Jan 20, 2025

diwu1990 / uSystolic-Sim

A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.

C++ 79 15 Updated Nov 7, 2021

abdelazeem201 / Systolic-array-implementation-in-RTL-for-TPU

IC implementation of Systolic Array for TPU

Verilog 247 30 Updated Oct 21, 2024

Xilinx / Vitis_Libraries

Vitis Libraries

C++ 978 380 Updated Jun 5, 2025

VeriGOOD-ML / public

Verilog 59 11 Updated Apr 30, 2025

casys-kaist / LLMServingSim

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Python 118 20 Updated Jun 6, 2025

scalesim-project / scale-sim-v2

Repository to host and maintain scale-sim-v2 code

Python 301 115 Updated Apr 23, 2025

Xilinx / Vitis_Accel_Examples

Vitis_Accel_Examples

Makefile 545 217 Updated Jun 12, 2025

skudlur / pes_sysarray

Systolic Array implementation for ASIC Course

Verilog 7 2 Update AEC7 d Nov 26, 2023

abdelfattah-lab / BitMoD-HPCA-25

Python 41 7 Updated Dec 16, 2024

arc-research-lab / Aries

ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)

C++ 31 8 Updated Jun 8, 2025

HLSTransform / submission

C 93 17 Updated Jan 11, 2024

cornell-zhang / allo

Allo: A Programming Model for Composable Accelerator Design

Python 239 44 Updated Jun 12, 2025

SuperLiaoXH / SystolicArray-2D-FP16

基于FP16的二维脉动阵列电路设计

SystemVerilog 11 Updated Feb 23, 2023

cathalmccabe / PYNQ_tutorials

Jupyter Notebook 92 16 Updated Nov 30, 2023

DeepWok / mase

Machine-Learning Accelerator System Exploration Tools

Python 166 83 Updated Jun 5, 2025

sun-jay / FPGA-Hardware-NN-Accelerator

Verilog 2 3 Updated May 1, 2025

neasotho / SystolicArray_FPGA

Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board

Tcl 14 4 Updated Jun 23, 2020

bsc-loca / sauria

SAURIA (Systolic-Array tensor Unit for aRtificial Intelligence Acceleration) is an open-source Convolutional Neural Network accelerator based on a GeMM systolic array engine.

SystemVerilog 44 12 Updated Oct 1, 2024

wavedrom / wavedrom.github.io

Digital timing diagram editor

JavaScript 1,008 166 Updated Jan 29, 2025

erihsu / INT_FP_MAC

INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.

Verilog 103 14 Updated Sep 27, 2020

hankshyu / RISC-V_MAC

MAC system with IEEE754 compatibility

Verilog 13 Updated Nov 22, 2023

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 838 67 Updated Sep 4, 2024

tensil-ai / tensil

Open source machine learning accelerators

Scala 380 30 Updated Mar 24, 2024

ChrisTitusTech / winutil

Chris Titus Tech's Windows Utility - Install Programs, Tweaks, Fixes, and Updates

PowerShell 35,775 1,981 Updated Jun 10, 2025

0