(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)

Python 165 9 Updated May 9, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 17,471 1,693 Updated May 22, 2025

zeux / calm

CUDA/Metal accelerated language model inference

C 578 26 Updated Apr 10, 2025

warda-rahim / articles

B3C9

Jupyter Notebook 7 4 Updated Mar 5, 2023

pyamsoft / pstate-frequency

Easily control Intel p-state driver on Linux

Shell 180 22 Updated Feb 25, 2025

qianyuzqy / IADG

(CVPR 2023) Instance-Aware Domain Generalization for Face Anti-Spoofing

Python 83 4 Updated Oct 19, 2023

bchiang7 / v4

Fourth iteration of my personal website built with Gatsby

JavaScript 7,891 4,006 Updated Jul 28, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,383 6,868 Updated Dec 9, 2024

jagregory / abrash-black-book

Markdown source for Michael Abrash's Graphics Programming Black Book

CSS 4,591 336 Updated Jun 20, 2023

ronny-rentner / UltraDict

Sychronized, streaming Python dictionary that uses shared memory as a backend

Python 284 24 Updated Feb 28, 2025

gfickel / multiprocessing_lib

Python 7 1 Updated Nov 29, 2023

gustavofuhr / miyagi_pytorch_trainer

A pytorch trainer with a range of choice for backbones, losses, metrics and wandb sweeps.

Python 10 1 Updated Aug 21, 2023

satellite-image-deep-learning / techniques

Techniques for deep learning with satellite & aerial imagery

9,363 1,570 Updated May 20, 2025

facebookresearch / omnivore

Omnivore: A Single Model for Many Visual Modalities

Python 564 39 Updated Nov 12, 2022

SergeyMakeev / slot_map

A slot map is a high-performance associative container with persistent unique 32/64 bit keys to access stored values.

C++ 291 13 Updated Jan 10, 2024

amb5l / 6502_65C02_functional_tests

Forked from Klaus2m5/6502_65C02_functional_tests

Tests for all valid opcodes of the 6502 and 65C02 processor

Makefile 34 10 Updated Jun 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Guilherme Fickel gfickel

Achievements

Achievements

Block or report gfickel

Stars

jart / json.cpp

lucidrains / x-transformers

lucidrains / vit-pytorch

opooladz / Momentum-SAM-ScheduleFree

open-telemetry / opentelemetry-ebpf-profiler

haanjack / mnist-cudnn

johnma2006 / mamba-minimal

facebookresearch / schedule_free

mackron / miniaudio

Yuliang-Liu / Monkey

amazon-science / chronos-forecasting

fkodom / grouped-query-attention-pytorch