Stars
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Toolkit for linearizing PDFs for LLM datasets/training
A simple, easy-to-hack GraphRAG implementation
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的mcp框架。
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
PostgreSQL PL/PGSQL function that generates table DDL for the given schema/table.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Fast, light, simple Docker containers & Linux machines
Azure AD Client Credentials with Certificate code examples
Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
A modular graph-based Retrieval-Augmented Generation (RAG) system
史上最大规模1.4亿知识图谱数据免费下载,知识图谱,通用知识图谱,融合了两千五百多万的实体,拥有亿级别的实体属性关系。
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
The Security Toolkit for LLM Interactions
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
DSPy: The framework for programming—not prompting—language models
A Django content management system focused on flexibility and user experience
使用深度学习方法解析问题 知识图谱存储 查询知识点 基于医疗垂直领域的对话系统
World's most advanced database DevSecOps solution for Developer, Security, DBA and Platform Engineering teams. The GitHub/GitLab for database DevSecOps.
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
Precision Medicine Knowledge Graph (PrimeKG)
Retrieval and Retrieval-augmented LLMs
PULSE: Pretrained and Unified Language Service Engine
High-speed Large Language Model Serving for Local Deployment