Stars
The Metadata Platform for your Data and AI Stack
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
This is a repo with links to everything you'd ever want to learn about data engineering
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Open, Multi-modal Catalog for Data & AI
ApiCurio Schema Registry implementation for DataHub Kafka ingestion source
A microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes
A microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes
Low-code tool for automating actions on real time data | Stream processing for the users.
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
Caffe2 is a lightweight, modular, and scalable deep learning framework.