prep
OCR & Document Extraction using vision models
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…
This is a repo with links to everything you'd ever want to learn about data engineering
RAG that intelligently adapts to your use case, data, and queries
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
Neo4j graph construction from unstructured data using LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Guide for fine-tuning Llama/Mistral/CodeLlama models and more
Label, clean and enrich text datasets with LLMs.
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.
A curated list of awesome open-source libraries for production LLM
Natural Language Processing with Large Language Models