8000 huangqingxin / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View huangqingxin's full-sized avatar

Block or report huangqingxin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of awesome DuckLake tools and resources

32 2 Updated Jun 30, 2025

We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理

506 36 Updated Jun 19, 2025

A curated list of awesome academic researches and industrial materials about Artificial Intelligence for IT Operations (AIOps).

266 36 Updated Feb 12, 2025

ToolHive makes deploying MCP servers easy, secure and fun

Go 706 63 Updated Jul 5, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,406 59 Updated May 11, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,154 77 Updated Jul 4, 2025

🥥 Coco AI App - Search, Connect, Collaborate, Your Personal AI Search and Assistant, all in one space.

TypeScript 483 50 Updated Jul 4, 2025

Lightweight Inference server for OpenVINO

Python 187 6 Updated Jul 4, 2025

TransMLA: Multi-Head Latent Attention Is All You Need

Python 323 22 Updated Jul 4, 2025

Fast inference engine for Transformer models

C++ 3,893 368 Updated Apr 8, 2025

A blazing fast inference solution for text embeddings models

Rust 3,763 284 Updated Jul 3, 2025

High-performance retrieval engine for unstructured data

Python 1,431 110 Updated Jun 18, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 851 70 Updated Sep 4, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1 1 Updated Feb 25, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 2,469 291 Updated Jul 4, 2025

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 2,290 154 Updated Jul 3, 2025

A set of scripts to grab public datasets from resources related to arXiv

Python 451 73 Updated May 20, 2024

The Granite Guardian models are designed to detect risks in prompts and responses.

Jupyter Notebook 973E 88 10 Updated Jun 25, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 46,059 4,609 Updated Jul 3, 2025

16-fold memory access reduction with nearly no loss

Python 100 8 Updated Mar 26, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,161 704 Updated Jul 4, 2025

The Security Toolkit for LLM Interactions

Python 1,798 236 Updated Jun 30, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,502 295 Updated Jul 4, 2025

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 9,021 1,476 Updated Jul 4, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 58,983 5,857 Updated Jul 4, 2025

Get your documents ready for gen AI

Python 33,680 2,238 Updated Jul 4, 2025

A machine learning software for extracting information from scholarly documents

Java 4,165 493 Updated Jul 4, 2025

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,745 197 Updated Apr 9, 2025

OpenResearcher, an advanced Scientific Research Assistant

HTML 450 39 Updated Oct 10, 2024
Next
0