Stars
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)
Learn Rust dark magics by implementing an expression framework in database systems
Best practice and tips & tricks to write scientific papers in LaTeX, with figures generated in Python or Matlab.
A course to build distributed key-value service based on TiKV model
OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
Source code for Neural Information Processing Systems (NeurIPS) 2018 paper "Multi-Task Learning as Multi-Objective Optimization"
Balsa is a learned SQL query optimizer. It tailor optimizes your SQL queries to find the best execution plans for your hardware and engine.
🏅State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes
Expand your Training Limits! Generating Training Data for ML-based Data Management
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.
The official home of the Presto distributed SQL query engine for big data