- deltacat Public
  Forked from ray-project/deltacat. A Pythonic Data Catalog powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
  Python · Apache License 2.0 · Updated May 10, 2025
- ray Public
  Forked from ray-project/ray. A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning libr…
- iceberg Public
  Forked from apache/iceberg. Apache Iceberg
  Java · Apache License 2.0 · Updated Apr 30, 2024
- iceberg-python Public
  Forked from apache/iceberg-python. Apache PyIceberg
  Python · Apache License 2.0 · Updated Jan 24, 2024
- delta Public
  Forked from delta-io/delta. An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
  HTML · Apache License 2.0 · Updated Nov 22, 2023
- hudi Public
  Forked from apache/hudi. Upserts, Deletes And Incremental Processing on Big Data.
  Java · Apache License 2.0 · Updated Nov 22, 2023
- delta-rs Public
  Forked from delta-io/delta-rs. A native Rust library for Delta Lake, with bindings into Python
  Rust · Apache License 2.0 · Updated Nov 22, 2023
- amazon-ray Public
  Forked from amzn/amazon-ray. Staging area for ongoing enhancements to Ray focused on improving integration with AWS and other Amazon technologies.
  Python · Apache License 2.0 · Updated Mar 18, 2023
- arrow Public
  Forked from apache/arrow. Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
  C++ · Apache License 2.0 · Updated Oct 11, 2022
- beam Public
  Forked from apache/beam. Apache Beam is a unified programming model for Batch and Streaming
  Java · Apache License 2.0 · Updated Feb 23, 2022
- ray_beam_runner Public
  Forked from ray-project/ray_beam_runner. Ray-based Apache Beam runner
  Python · Apache License 2.0 · Updated Dec 3, 2021