8000 YiminGao0113 (Yimin Gao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View YiminGao0113's full-sized avatar

Block or report YiminGao0113

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models, optimized for edge deployment on Xilinx KV260.

Jupyter Notebook 8 1 Updated Mar 24, 2025

Quantize GPT LLM models with HuggingFace transformers

Python 5 1 Updated Jan 5, 2024
Verilog 3 Updated Apr 3, 2024

Source code to simulate WTF-PAD on a set of web traffic traces.

Python 23 10 Updated Jul 14, 2020

Project repository for creating padding machines for Tor to defend against website fingerprinting

Python 22 1 Updated Nov 26, 2020

Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.

C++ 347 58 Updated Jan 20, 2025

A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.

C++ 79 15 Updated Nov 7, 2021

IC implementation of Systolic Array for TPU

Verilog 247 30 Updated Oct 21, 2024

Vitis Libraries

C++ 978 380 Updated Jun 5, 2025
Verilog 59 11 Updated Apr 30, 2025

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Python 118 20 Updated Jun 6, 2025

Repository to host and maintain scale-sim-v2 code

Python 301 115 Updated Apr 23, 2025

Vitis_Accel_Examples

Makefile 545 217 Updated Jun 12, 2025

Systolic Array implementation for ASIC Course

Verilog 7 2 Update AEC7 d Nov 26, 2023

ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)

C++ 31 8 Updated Jun 8, 2025

Allo: A Programming Model for Composable Accelerator Design

Python 239 44 Updated Jun 12, 2025

基于FP16的二维脉动阵列电路设计

SystemVerilog 11 Updated Feb 23, 2023
Jupyter Notebook 92 16 Updated Nov 30, 2023

Machine-Learning Accelerator System Exploration Tools

Python 166 83 Updated Jun 5, 2025

Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board

Tcl 14 4 Updated Jun 23, 2020

SAURIA (Systolic-Array tensor Unit for aRtificial Intelligence Acceleration) is an open-source Convolutional Neural Network accelerator based on a GeMM systolic array engine.

SystemVerilog 44 12 Updated Oct 1, 2024

Digital timing diagram editor

JavaScript 1,008 166 Updated Jan 29, 2025

INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.

Verilog 103 14 Updated Sep 27, 2020

MAC system with IEEE754 compatibility

Verilog 13 Updated Nov 22, 2023

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 838 67 Updated Sep 4, 2024

Open source machine learning accelerators

Scala 380 30 Updated Mar 24, 2024

Chris Titus Tech's Windows Utility - Install Programs, Tweaks, Fixes, and Updates

PowerShell 35,775 1,981 Updated Jun 10, 2025
Next
0