Stars
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.
Build Real-Time Knowledge Graphs for AI Agents
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
LLMs-from-scratch项目中文翻译
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
💫 Industrial-strength Natural Language Processing (NLP) in Python
Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.
Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and opensource projects) on graph-based retrieval-augmented generation.
Medical Graph RAG: Graph RAG for the Medical Data
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Neo4j graph construction from unstructured data using LLMs
A powerful tool for creating fine-tuning datasets for LLM
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
rockset / rocksdb-cloud
Forked from facebook/rocksdbA library that provides an embeddable, persistent key-value store for fast storage optimized for AWS
The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
Pholcus幽灵蛛是一款Go语言编写的爬虫软件框架(含GUI界面),优雅的爬虫规则、可控的高并发、任意的批量任务、多种输出方式、大量Demo,并且考虑了支持分布式布局。
Automated testing to find logic and performance bugs in database systems
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...