Snowflake
- polaris Public
  Forked from apache/polaris
  The interoperable, open source catalog for Apache Iceberg
  Java · Apache License 2.0 · Updated Jun 10, 2025
- spark Public
  Forked from apache/spark
  Mirror of Apache Spark
  Scala · Apache License 2.0 · Updated Oct 14, 2022
- arrow Public
  Forked from apache/arrow
  Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
  C++ · Apache License 2.0 · Updated Sep 21, 2022
- parquet-format Public
  Forked from apache/parquet-format
  Apache Parquet
  Java · Apache License 2.0 · Updated Mar 24, 2022
- dataproc-initialization-actions Public
  Forked from GoogleCloudDataproc/initialization-actions
  Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
- hadoop Public
  Forked from apache/hadoop
  Mirror of Apache Hadoop
  Java · Apache License 2.0 · Updated Apr 4, 2018
- zlib Public
  Forked from madler/zlib
  A massively spiffy yet delicately unobtrusive compression library.
  C · Updated Feb 15, 2018
- appengine-flask-skeleton Public
  Forked from googlearchive/appengine-flask-skeleton
  A skeleton for creating Python applications using the Flask framework on App Engine
  Python · Apache License 2.0 · Updated Dec 6, 2017
- cloud-bigtable-examples Public
  Forked from GoogleCloudPlatform/cloud-bigtable-examples
  Examples of how to use Cloud Bigtable both with GCE map/reduce as well as stand alone applications.
  Java · Apache License 2.0 · Updated Dec 6, 2017
- hive Public
  Forked from apache/hive
  Mirror of Apache Hive
  Java · Apache License 2.0 · Updated Apr 27, 2017
- airflow-gcp-examples Public
  Forked from alexvanboxel/airflow-gcp-examples
  Repository with examples and smoke tests for the GCP Airflow operators and hooks
  Python · Apache License 2.0 · Updated Jan 15, 2017
- bigtop Public
  Forked from apache/bigtop
  Mirror of Apache Bigtop
  Java · Apache License 2.0 · Updated Nov 29, 2016
- zeppelin Public
  Forked from apache/zeppelin
  Mirror of Apache Zeppelin
  Java · Apache License 2.0 · Updated Nov 22, 2016
- bigdata-interop Public
  Forked from GoogleCloudDataproc/hadoop-connectors
  Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
- hbase Public
  Forked from apache/hbase
  Mirror of Apache HBase
  Java · Apache License 2.0 · Updated Sep 11, 2016
- spark-csv Public
  Forked from databricks/spark-csv
  CSV data source for Spark SQL and DataFrames
- bdutil Public
  Forked from GoogleCloudDataproc/bdutil
- spark-dataflow Public
  Forked from hougs/spark-dataflow
  Provides a Spark backend for executing Dataflow pipelines.
- codelabs Public
  Forked from reprogrammer/codelabs
  Codelabs in various languages demonstrating usage of several tools & systems upon genomics data.