- Beijing, China
Highlights
- Pro
Stars
Fluss is a streaming storage built for real-time analytics.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).
A native Rust library for Delta Lake, with bindings into Python
InfluxData's core functionality for InfluxDB Edge and IOx
Apache DataFusion Comet Spark Accelerator
Database connectivity API standard and libraries for Apache Arrow
✅ The programmer-friendly testing framework for Java and the JVM
Transmute-free Rust library to work with the Arrow format
Notes talking about the design and implementation of Apache Spark
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
Speech recognition module for Python, supporting several engines and APIs, online and offline.
DuckDB is an analytical in-process SQL database management system
A lightweight framework for golang object (struct) serialization (mapping). Inspired heavily by marshmallow (a Python library).
Scalable datastore for metrics, events, and real-time analytics
An opinionated list of awesome Python frameworks, libraries, software and resources.
What-happens-when 的中文翻译,原仓库 https://github.com/alex/what-happens-when
A simple wrapper for go-chi, aims at writing RESTful API in an elegant way~
Generate test data from SQL files before testing and clear it after finished.
Papers from the computer science community to read and discuss.
All Algorithms implemented in Python