Stars
Bootable Llamafile inference server with model weights built in. Experimental: not bootable yet!
Model swapping for llama.cpp (or any local OpenAI-compatible server)
LostRuins / koboldcpp
Forked from ggml-org/llama.cpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.