Stars
TAG-Bench: A benchmark for table-augmented generation (TAG)
Efficient and Scalable Estimation of Tool Representations in Vector Space
[EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
The official repo for "LLoCo: Learning Long Contexts Offline"
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
An extremely fast Python linter and code formatter, written in Rust.
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
[EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!
FacTool: Factuality Detection in Generative AI
Port of OpenAI's Whisper model in C/C++
CoreNet: A library for training deep neural networks
SoTA LLM for converting natural language questions to SQL queries
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Code examples and resources for DBRX, a large language model developed by Databricks
A natural language interface for computers
Robust recipes to align language models with human and AI preferences
⚡ A Fast, Extensible Progress Bar for Python and CLI
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
🦜🔗 Build context-aware reasoning applications
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.