Starred repositories
aider is AI pair programming in your terminal
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
DSPy: The framework for programming—not prompting—language models
[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
A Bulletproof Way to Generate Structured JSON from Language Models
StableLM: Stability AI Language Models
LAVIS - A One-stop Library for Language-Vision Intelligence
LLaMA: Open and Efficient Foundation Language Models
Code and documentation to train Stanford's Alpaca models, and generate the data.
Instruct-tune LLaMA on consumer hardware
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to large…
tloen / llama-int8
Forked from meta-llama/llama
Quantized inference code for LLaMA models
UnifiedQA: Crossing Format Boundaries With a Single QA System
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
Large Language Model Text Generation Inference
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Port of OpenAI's Whisper model in C/C++
Multilingual/multidomain question generation datasets, models, and Python library for question generation.
🥤🧑🏻🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"
Cramming the training of a (BERT-type) language model into limited compute.
The official Notion API client library, but rewritten in Python! (sync + async)