Stars
Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.
Training PyTorch models with differential privacy
Robust recipes to align language models with human and AI preferences
A framework for prompt tuning using Intent-based Prompt Calibration
Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease of use, backed by research.
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector…
An extremely fast Python package and project manager, written in Rust.
The Security Toolkit for LLM Interactions
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Sparsity-aware deep learning inference runtime for CPUs
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Examples of programs built using Modal
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of performing real-time speech generation.
A high-throughput and memory-efficient inference and serving engine for LLMs
TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a simple linear probe-based method and a more sophisticated m…
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
TensorFlow code and pre-trained models for BERT
An agent benchmark with tasks in a simulated software company.
DSPy: The framework for programming—not prompting—language models
Code for "Learning to Generate Reviews and Discovering Sentiment"
Python and TypeScript library for integrating the Stripe API into agentic workflows