Stars
OpenAPI Generator allows generation of API client libraries (SDK generation), server stubs, documentation and configuration automatically given an OpenAPI Spec (v2, v3)
Yosegi is a schema-less columnar storage format. It provides a flexible representation like JSON and efficient reading similar to other columnar storage formats.
Data governance through the AWS Lake Formation credential vending API
A set of Docker images that include popular frameworks for machine learning, data science and visualization.
The fastest knowledge base for growing teams. Beautiful, realtime collaborative, feature packed, and markdown compatible.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
A blazing-fast query execution engine that speaks the Apache Spark language and has Arrow DataFusion at its core.
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
A collection of awesome web crawlers and spiders in different languages
Scrapy, a fast high-level web crawling & scraping framework for Python.
No fortress, purely open ground. OpenManus is Coming.
An Apache Spark Structured Streaming S3 connector for reading S3 files using Amazon S3 event notifications to AWS SQS
magic-trace collects and displays high-resolution traces of what a process is doing
The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark, Flink and others, when used with the Iceberg Table format
A book about data analysis and trading strategies for EVE Online in-game markets. Online version: https://orbitalenterprises.github.io/eve-market-strategies/index.html