TheBloke's Docker templates

Updated to latest ExLlama code, fixing issue with SuperHOT GPTQs
ExLlama now automaticaly updates on boot, like text-generation-webui already did
- This should result in the template automatically supporting new ExLlama features in future

Major update to the template
text-generation-webui is now integrated with:
- AutoGPTQ with support for all Runpod GPU types
- ExLlama, turbo-charged Llama GPTQ engine - performs 2x faster than AutoGPTQ (Llama 4bit GPTQs only)
- CUDA-accelerated GGML support, with support for all Runpod systems and GPUs.
All text-generation-webui extensions are included and supported (Chat, SuperBooga, Whisper, etc).
text-generation-webui is always up-to-date with the latest code and features.
Automatic model download and loading via environment variable MODEL.
Pass text-generation-webui parameters via environment variable UI_ARGS.

Runpod: TheBloke's Local LLMs UI

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github		.github
conf-files		conf-files
cuda11.8.0-ubuntu22.04-oneclick-rp		cuda11.8.0-ubuntu22.04-oneclick-rp
cuda11.8.0-ubuntu22.04-oneclick		cuda11.8.0-ubuntu22.04-oneclick
cuda11.8.0-ubuntu22.04-pytorch-conda		cuda11.8.0-ubuntu22.04-pytorch-conda
cuda11.8.0-ubuntu22.04-pytorch		cuda11.8.0-ubuntu22.04-pytorch
cuda11.8.0-ubuntu22.04-textgen		cuda11.8.0-ubuntu22.04-textgen
cuda11.8.0-ubuntu22.04-train		cuda11.8.0-ubuntu22.04-train
imgs		imgs
scripts		scripts
LICENSE		LICENSE
README.md		README.md
README_Runpod_LocalLLMsUI.md		README_Runpod_LocalLLMsUI.md
README_Runpod_LocalLLMsUIandAPI.md		README_Runpod_LocalLLMsUIandAPI.md