Stars
Go implementation of the Ethereum protocol
Manuscript is a revolutionary blockchain data streaming framework. With Manuscript, you can seamlessly integrate on-chain and off-chain data into target data storage for unrestricted querying and a…
Bigtop Manager is a modern, AI-driven web application designed to simplify the complexity of big data cluster management.
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to text correction and works out of the box.
The next generation of cloud-native big data management expert, aiming to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.
Apache Doris is an easy-to-use, high-performance, unified analytics database.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
The Metadata Platform for your Data and AI Stack
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Apache Spark - A unified analytics engine for large-scale data processing
An easy-to-use dynamic service discovery, configuration, and service management platform for building AI cloud-native applications.
Python - 100 Days from Novice to Master