More
Lists (1)
Sort Name ascending (A-Z)
Stars
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
Gateway API Inference Extension
From the Transistor to the Web Browser, a rough outline for a 12 week course
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Containerization is a Swift package for running Linux containers on macOS.
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
Everything we actually know about the Apple Neural Engine (ANE)
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
llm-d is a Kubernetes-native high-performance distributed LLM inference framework
A CNI meta-plugin for multi-homed pods in Kubernetes
mutatio.dev is an open source platform to systematically test, measure, and optimize LLM prompts.
rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
A lightweight design for computation-communication overlap.
real time face swap and one-click video deepfake with only a single image
Supercharge Your LLM with the Fastest KV Cache Layer
A course of learning LLM inference serving on Apple Silicon for systems engineers.
A dynamic library providing Virtualization-based process isolation capabilities
Accelerate LLM preference tuning via prefix sharing with a single line of code
macOS: mount any linux-supported filesystem read/write using NFS and a microVM
FlatBuffers: Memory Efficient Serialization Library