Stars
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo
Fast and lightweight DOM diffing/patching (no virtual DOM needed)
Writing Extension for Text Generation WebUI
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
llama3 implementation one matrix multiplication at a time
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors, now in oobabooga text generation webui!
Memoir+, a persona memory extension for Text Gen Web UI.
Web page with political compass quiz results for open LLMs
Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…
Tools for merging pretrained large language models.
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with Hugging Face Transformers
Official implementation of Half-Quadratic Quantization (HQQ)
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but supports a variety of advanced features, such as a settings page, low VRAM support, D…
Distribute and run LLMs with a single file.
A web search extension for Oobabooga's text-generation-webui (now with nougat)
jllllll / flash-attention
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention - Windows wheels
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing the LLM's tendency to fixate on a single word, phrase, or se…
Integrate image generation capabilities to text-generation-webui using Stable Diffusion.
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
A natural language interface for computers