8000 huangqingxin / Starred · GitHub

More Web Proxy on the site http://driver.im/

huangqingxin

Follow

huangqingxin

Follow

3 followers · 18 following

Stars

esadek / awesome-ducklake

A curated list of awesome DuckLake tools and resources

32 2 Updated Jun 30, 2025

SpursGoZmy / Awesome-Tabular-LLMs

We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理

506 36 Updated Jun 19, 2025

OpsPAI / awesome-AIOps

A curated list of awesome academic researches and industrial materials about Artificial Intelligence for IT Operations (AIOps).

266 36 Updated Feb 12, 2025

stacklok / toolhive

ToolHive makes deploying MCP servers easy, secure and fun

Go 706 63 Updated Jul 5, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,406 59 Updated May 11, 2025

thu-pacman / chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,154 77 Updated Jul 4, 2025

infinilabs / coco-app

🥥 Coco AI App - Search, Connect, Collaborate, Your Personal AI Search and Assistant, all in one space.

TypeScript 483 50 Updated Jul 4, 2025

SearchSavior / OpenArc

Lightweight Inference server for OpenVINO

Python 187 6 Updated Jul 4, 2025

fxmeng / TransMLA

TransMLA: Multi-Head Latent Attention Is All You Need

Python 323 22 Updated Jul 4, 2025

OpenNMT / CTranslate2

Fast inference engine for Transformer models

C++ 3,893 368 Updated Apr 8, 2025

huggingface / text-embeddings-inference

A blazing fast inference solution for text embeddings models

Rust 3,763 284 Updated Jul 3, 2025

D-Star-AI / dsRAG

High-performance retrieval engine for unstructured data

Python 1,431 110 Updated Jun 18, 2025

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 851 70 Updated Sep 4, 2024

glen-amd / flashinfer

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 1 1 Updated Feb 25, 2025

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 2,469 291 Updated Jul 4, 2025

michaelfeil / infinity

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 2,290 154 Updated Jul 3, 2025

mattbierbaum / arxiv-public-datasets

A set of scripts to grab public datasets from resources related to arXiv

Python 451 73 Updated May 20, 2024

ibm-granite / granite-guardian

The Granite Guardian models are designed to detect risks in prompts and responses.

Jupyter Notebook 973E 88 10 Updated Jun 25, 2025

Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 46,059 4,609 Updated Jul 3, 2025

pavel-denisov-fraunhofer / docling-ibm-models

Forked from docling-project/docling-ibm-models

Python 2 Updated Apr 14, 2025

andy-yang-1 / DoubleSparse

16-fold memory access reduction with nearly no loss

Python 100 8 Updated Mar 26, 2025

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,161 704 Updated Jul 4, 2025

protectai / llm-guard

The Security Toolkit for LLM Interactions

Python 1,798 236 Updated Jun 30, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,502 295 Updated Jul 4, 2025

dataelement / bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 9,021 1,476 Updated Jul 4, 2025

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 58,983 5,857 Updated Jul 4, 2025

docling-project / docling

Get your documents ready for gen AI

Python 33,680 2,238 Updated Jul 4, 2025

kermitt2 / grobid

A machine learning software for extracting information from scholarly documents

Java 4,165 493 Updated Jul 4, 2025

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,745 197 Updated Apr 9, 2025

GAIR-NLP / OpenResearcher

OpenResearcher, an advanced Scientific Research Assistant

HTML 450 39 Updated Oct 10, 2024

0