jx2lee's list / data · GitHub
🤭
daeater

Stars

data

72 repositories
C++ 51 24 Updated Jun 20, 2025
C++ 289 62 Updated Jun 25, 2025
Python 141 44 Updated Jun 3, 2025

A playground for running duckdb as a stateless query engine over a data lake

Python 206 8 Updated Jan 10, 2024

Dremio - the missing link in modern data

Java 1,435 451 Updated May 1, 2025

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java 1,247 153 Updated Jun 28, 2025

The platform that powers Airbyte. Please file issues in https://github.com/airbytehq/airbyte

Kotlin 256 326 Updated Jun 27, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 18,542 4,614 Updated Jun 28, 2025

DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS

HCL 762 273 Updated Jun 27, 2025

An Open Standard for lineage metadata collection

Java 1,992 348 Updated Jun 27, 2025

do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.

Python 853 76 Updated Apr 5, 2024

📊 Cube’s universal semantic layer platform is the next evolution of OLAP technology for AI, BI, spreadsheets, and embedded analytics

Rust 18,648 1,849 Updated Jun 27, 2025

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 2,445 227 Updated Jun 27, 2025

A modular SQL linter and auto-formatter with support for multiple dialects and templated code.

Python 8,962 840 Updated Jun 28, 2025

Compare tables within or across databases

Python 2,974 289 Updated May 17, 2024

DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time.

47 49 Updated May 2, 2025

Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.

Jupyter Notebook 31,547 6,704 Updated Jun 27, 2025

A Singer (https://singer.io) target that writes data to Google BigQuery.

Python 39 121 Updated Mar 8, 2021

A provider package for kafka

Python 37 15 Updated Jul 31, 2023

PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially…

Python 2,689 312 Updated Nov 28, 2024

DuckDB is an analytical in-process SQL database management system

C++ 30,583 2,404 Updated Jun 27, 2025

The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, it has since helped thousands of companies create production-…

Shell 693 484 Updated Oct 15, 2024

Repository of helm charts for deploying DataHub on a Kubernetes cluster

Mustache 190 258 Updated Jun 24, 2025

A web UI for Debezium; Please log issues at https://issues.redhat.com/browse/DBZ.

TypeScript 347 103 Updated Apr 29, 2025

Singer.io Tap for MySQL

Python 54 50 Updated Oct 30, 2024

A series of DAGs/Workflows to help maintain the operation of Airflow

Python 1,725 398 Updated Jun 18, 2024

Web tool for operating kafka connect https://hub.docker.com/r/officialkakao/kafka-connect-web

Vue 114 13 Updated Oct 17, 2023

This repository is a getting started guide to Singer.

Makefile 1,308 144 Updated Sep 3, 2024

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Java 11,548 2,686 Updated Jun 26, 2025

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python 2,119 177 Updated Jun 28, 2025