Stars
SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open sour…
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
A binary encoder / decoder implementation in Rust.
PyTorch implementation of the paper "Circle Loss: A Unified Perspective of Pair Similarity Optimization"
Retrieval and Retrieval-augmented LLMs
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
State-of-the-Art Text Embeddings
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Benchmarks of approximate nearest neighbor libraries in Python
The Rust OpenTelemetry implementation
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
Lock-free SPSC FIFO ring buffer with direct access to inner data
Custom hash algorithm used by rustc (plus hashmap/set aliases): fast, deterministic, not secure
A lightweight data processing framework built on DuckDB and 3FS.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient MLA decoding kernels
Prometheus instrumentation library for Rust applications
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
A library for efficient similarity search and clustering of dense vectors.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production