- France
- armandpl.com
Stars
Source code for "Improving the Diffusability of Autoencoders" [ICML 2025]
A digital logic designer and circuit simulator.
Efficient triton implementation of Native Sparse Attention.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
A bunch of kernels that might make stuff slower ๐
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
adafruit / uf2-samdx1
Forked from microsoft/uf2-samdx1MSC bootloader (based on UF2) for SAMD21
nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)
Muon is an optimizer for hidden layers in neural networks
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!
A library to analyze PyTorch traces.
AI Robotics tutorials for hobbyists
Solve puzzles. Improve your pytorch.
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: ๐บ๐ธ ๐จ๐ณ ๐ฏ๐ต ๐ฎ๐น ๐ฐ๐ท ๐ท๐บ ๐ง๐ท ๐ช๐ธ
A Rust Embedded-HAL for the rp series microcontrollers
PCB files for Adafruit QT Py SAMD21 microcontroller.
This repo aims to help Logseq users to sync their data with Git and GitHub.
A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: http://trello.com/b/8txSM12G/roadmap
Your ultimate guide to setting up Zed with Vim mode, tailored settings, and key bindings for a seamless coding experience
Stop messing around with finicky sampling parameters and just use DRยตGS!