FUE5 is a fan-made project with the goal to see what would Factorio look like and behave in 3D. This project has no affiliation with the official Factorio game.

1,851 64 Updated Jun 16, 2023

opencomputeproject / FP8

10 4 Updated Jun 23, 2023

ehw-fit / evoapproxlib

Library of approximate arithmetic circuits

Verilog 55 19 Updated Sep 8, 2022

ehw-fit / tf-approximate

Approximate layers - TensorFlow extension

C 27 12 Updated Apr 14, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 53,073 6,499 Updated Jun 27, 2025

jmluu / Awesome-Efficient-Training

A collection of research papers on efficient training of DNNs

70 8 Updated Jul 6, 2022

microsoft / microxcaling

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 251 34 Updated Jun 18, 2025

AaronJing / Gemmini-SEA

Scala 4 Updated May 11, 2024

xai-org / grok-1

Grok open release

Python 50,288 8,354 Updated Aug 30, 2024

dawsonjon / fpu

synthesiseable ieee 754 floating point library in verilog

Verilog 647 158 Updated Mar 13, 2023

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 15,969 2,071 Updated Jun 27, 2025

lilianweng / transformer-tensorflow

Implementation of Transformer Model in Tensorflow

Python 470 90 Updated Mar 25, 2023

Cjkkkk / CUDA_gemm

A simple high performance CUDA GEMM implementation.

Cuda 382 42 Updated Jan 4, 2024

zzh8829 / yolov3-tf2

YoloV3 Implemented in Tensorflow 2.0

Jupyter Notebook 2,512 898 Updated Aug 30, 2024

miemie2013 / Keras-YOLOv4

yolov4 42.0% mAP.ppyolo 45.1% mAP.

Python 445 127 Updated Dec 17, 2020

LongxingTan / tfyolo

tfyolo: Efficient Implementation of Yolov5 in TensorFlow

Python 233 72 Updated Apr 3, 2024

strutive07 / transformer-tensorflow2.0

transformer in tensorflow 2.0

Jupyter Notebook 64 21 Updated Apr 30, 2021

hxuaj / tf2-faster-rcnn

This is a fast and concise implementation of Faster R-CNN with TensorFlow2.

Python 26 10 Updated Mar 21, 2023

borumyk / Perceptron-Heros

Comp9444 - cv project

Jupyter Notebook 2 1 Updated Aug 2, 2022

Huanghongru / SGEMM-Implementation-and-Optimization

📝 Some source code about matrix multiplication implementation on CUDA

Cuda 34 9 Updated Sep 12, 2018

e-dupuis / awesome-approximate-dnn

Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment

26 6 Updated May 15, 2024

ucb-bar / gemmini

Berkeley's Spatial Array Generator

Scala 978 201 Updated Apr 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Iceberg AaronJing

Achievements

Achievements

Highlights

Block or report AaronJing

Stars

upstash / context7

AaronJing / ApproxTrain

leimao / CUDA-GEMM-Optimization

usyd-fsalab / fp6_llm

jax-ml / ml_dtypes

akamaster / pytorch_resnet_cifar10

hngenc / stellar

jonatasgrosman / findpapers

FUE5BASE / FUE5