Stars
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
Metering and Billing for AI, API and DevOps. Collect and aggregate millions of usage events in real-time and enable usage-based billing.
Prompts for our Grok chat assistant and the `@grok` bot on X.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
High accuracy RAG for answering questions from scientific documents with citations
JUCE is an open-source cross-platform C++ application framework for desktop and mobile applications, including VST, VST3, AU, AUv3, LV2 and AAX audio plug-ins.
The AI Browser Automation Framework
AI computer use powered by open source LLMs and E2B Desktop Sandbox
open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for designing complex, interactive environments where agents can act,…
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Python tool for converting fi 8000 les and office documents to Markdown.
A benchmark to evaluate language models on questions I've previously asked them to solve.
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and mo…
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse
A high-throughput and memory-efficient inference and serving engine for LLMs
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api
AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording
A guide for technical professionals looking to start consulting
Questions that I ask myself at the end of each year and each decade.