chae-sy

chae-sy

0 followers · 2 following

Achievements

Starred repositories

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,017 249 Updated May 9, 2025

efficient-ai-study / efficient-ai-study

87 11 Updated Mar 28, 2024

gnodipac886 / ViT-FPGA-TPU

FPGA based Vision Transformer accelerator (Harvard CS205)

SystemVerilog 119 14 Updated Feb 11, 2025

Buck008 / Transformer-Accelerator-Based-on-FPGA

You can run it on pynq z1. The repository contains the relevant Verilog code, Vivado configuration and C code for sdk testing. The size of the systolic array can be changed, now it is 16X16.

Verilog 173 17 Updated Mar 24, 2024

mflowgen / mflowgen

mflowgen -- A Modular ASIC/FPGA Flow Generator

Python 254 60 Updated Feb 24, 2025

mflowgen / skywater-130nm

ASIC Design kit for Skywater 130 for use with mflowgen

Verilog 12 5 Updated Mar 12, 2023

google / skywater-pdk

Open source process design kit for usage with SkyWater Technology Foundry's 130nm node.

Python 3,156 413 Updated Oct 28, 2024

jha-lab / acceltran

[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers

Python 43 10 Updated Nov 22, 2023

mit-han-lab / smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,410 174 Updated Jul 12, 2024

Xilinx / brevitas

Brevitas: neural network quantization in PyTorch

Python 1,319 215 Updated May 21, 2025

horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python 1,672 134 Updated Apr 23, 2025

jaywonchung / Verilog-Harvard-CPU

Verilog implementation of various types of CPUs

Verilog 51 6 Updated Sep 27, 2019

VLSIDA / OpenRAM

An open-source static random access memory (SRAM) compiler.

Python 904 222 Updated Apr 1, 2025

mflowgen / freepdk-45nm

ASIC Design Kit for FreePDK45 + Nangate for use with mflowgen

Verilog 175 37 Updated Mar 8, 2020

rsnikhil / Learn_Bluespec_and_RISCV_Design

Textbook and full source codes to learn basics of RISC-V pipelined CPU design using the Bluespec Hardware Design Language(s)

Verilog 74 12 Updated Sep 13, 2024

chenxm1986 / PIM-Toolchain

EDA toolchain for processing-in-memory architectures, including an architecture synthesizer, a compiler, and a simulator

12 2 Updated Nov 20, 2024

lixixi-jook / PIMSYN-NN

An Automatic Synthesis Tool for PIM-based CNN Accelerators.

Python 12 2 Updated Feb 29, 2024

debtanu09 / fmultiplier

This is the verilog implementation of IEEE 754 32 bit floating point multiplier

Verilog 8 1 Updated Jul 12, 2020

olofk / serv

SERV - The SErial RISC-V CPU

Verilog 1,587 220 Updated May 16, 2025

openhwgroup / cva6

The CORE-V CVA6 is an Application class 6-stage RISC-V CPU capable of booting Linux

Assembly 2,480 771 Updated May 22, 2025

stnolting / neoTRNG

🎲 A Tiny and Platform-Independent True Random Number Generator for any FPGA (and ASIC).

VHDL 183 23 Updated Jan 19, 2025

hughperkins / VeriGPU

OpenSource GPU, in Verilog, loosely based on RISC-V ISA

SystemVerilog 997 114 Updated Nov 22, 2024

abdelazeem201 / Systolic-array-implementation-in-RTL-for-TPU

IC implementation of Systolic Array for TPU

Verilog 240 30 Updated Oct 21, 2024

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 22,899 3,288 Updated Mar 5, 2025

chipsalliance / chisel

Chisel: A Modern Hardware Design Language

Scala 4,273 628 Updated May 22, 2025

sfu-arch / SPAGHETTI

RTL generator for SpGEMM

Verilog 12 1 Updated Feb 2, 2021

fengbintu / Neural-Networks-on-Silicon

This is originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning and computer architecture.

1,941 386 Updated Jan 6, 2025

WheatBeer / play_with_dramsim3

MEMORY CENTRIC SYSTEMS FOR AI(CSI6207-01) Lecture at Yonsei(20-1)

TeX 9 2 Updated Apr 6, 2022

umd-memsys / DRAMsim3

DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator

C++ 370 155 Updated Aug 3, 2024

PSAL-POSTECH / ONNXim

ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference

C++ 115 25 Updated Feb 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chae-sy

Achievements

Achievements

Block or report chae-sy

Starred repositories

mit-han-lab / llm-awq

efficient-ai-study / efficient-ai-study

gnodipac886 / ViT-FPGA-TPU

Buck008 / Transformer-Accelerator-Based-on-FPGA

mflowgen / mflowgen

mflowgen / skywater-130nm

google / skywater-pdk

jha-lab / acceltran

mit-han-lab / smoothquant

Xilinx / brevitas

horseee / Awesome-Efficient-LLM

jaywonchung / Verilog-Harvard-CPU

VLSIDA / OpenRAM

mflowgen / freepdk-45nm

rsnikhil / Learn_Bluespec_and_RISCV_Design

chenxm1986 / PIM-Toolchain

lixixi-jook / PIMSYN-NN

debtanu09 / fmultiplier

olofk / serv

openhwgroup / cva6

stnolting / neoTRNG

hughperkins / VeriGPU

abdelazeem201 / Systolic-array-implementation-in-RTL-for-TPU

lucidrains / vit-pytorch

chipsalliance / chisel

sfu-arch / SPAGHETTI

fengbintu / Neural-Networks-on-Silicon

WheatBeer / play_with_dramsim3

umd-memsys / DRAMsim3

PSAL-POSTECH / ONNXim

Starred topics

blog-theme