Stars
A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your ML and analytics workloads.
An extensible, state of the art columnar file format. Formerly at @spiraldb, now a Linux Foundation project.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
A vendor neutral GPU multiplexing tool driven by VFIO & YAML.
Distributed query engine providing simple and reliable data processing for any modality and scale
Staging area for ongoing enhancements to Ray focused on improving integration with AWS and other Amazon technologies.
Modin: Scale your Pandas workflows by changing a single line of code
A toolkit to run Ray applications on Kubernetes
Convenient pyarrow operations following the Pandas API
Secure and fast microVMs for serverless computing.
a fast, scalable, multi-language and extensible build system
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.