Stars
A Chrome extension that enables virtual fashion try-on using FASHN AI technology. Simply hover over clothing images on any website to try them on virtually with your uploaded model image.
AirLLM 70B inference with single 4GB GPU
Unofficial precompiled binaries of the Apple MLX library.
Control the platform power state of your Apple Silicon Mac.
riffpga -- write FPGA bitstreams through a USB drive, get USB serial and dynamic clocking in a platform independent way
The highest performace Cray-like RISC-V Vector in the world.
Retro emulation for the ODROID-GO and other ESP32 devices
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
VPTQ, A Flexible and Extreme low-bit quantization algorithm
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
UE5's Nanite implementation using WebGPU. Includes the meshlet LOD hierarchy, software rasterizer and billboard impostors. Culling on both per-instance and per-meshlet basis.
Text-to-Music Generation with Rectified Flow Transformers
Skip enables the creation of native SwiftUI apps for iOS and Android
Virtual whiteboard for sketching hand-drawn like diagrams
Most modern mobile touch slider with hardware accelerated transitions
PlayStation 4 emulator for Windows, Linux and macOS written in C++
Fast parallel LLM inference for MLX
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.