Stars
A curated list of awesome DuckLake tools and resources
We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理
A curated list of awesome academic researches and industrial materials about Artificial Intelligence for IT Operations (AIOps).
ToolHive makes deploying MCP servers easy, secure and fun
An Open-source RL System from ByteDance Seed and Tsinghua AIR
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
🥥 Coco AI App - Search, Connect, Collaborate, Your Personal AI Search and Assistant, all in one space.
TransMLA: Multi-Head Latent Attention Is All You Need
Fast inference engine for Transformer models
A blazing fast inference solution for text embeddings models
High-performance retrieval engine for unstructured data
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.
glen-amd / flashinfer
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Supercharge Your LLM with the Fastest KV Cache Layer
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
A set of scripts to grab public datasets from resources related to arXiv
The Granite Guardian models are designed to detect risks in prompts and responses.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
16-fold memory access reduction with nearly no loss
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
The Security Toolkit for LLM Interactions
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Get your documents ready for gen AI
A machine learning software for extracting information from scholarly documents
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
OpenResearcher, an advanced Scientific Research Assistant