8000 gbaf's list / etl · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View gbaf's full-sized avatar

Block or report gbaf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

etl

20 repositories

⚡ Fetching and realtime data exchange framework.

TypeScript 1,077 27 Updated Jul 1, 2025

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

Python 14,990 800 Updated Jun 11, 2025

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 600+ plugins. Alternative to Airflow, n8n, Rundeck, VMware vRA, Zapier ...

Java 19,492 1,609 Updated Jul 1, 2025

Easily setup logical replication and switchover to new database with minimal downtime

Ruby 978 21 Updated Jun 23, 2025

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Python 28,557 628 Updated Jul 1, 2025

🦎 A multi-protocol edge & service proxy. Seamlessly interface web apps, IoT clients, & microservices to Apache Kafka® via declaratively defined, stateless APIs.

Java 595 62 Updated Jun 28, 2025

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,855 2,422 Updated Jul 1, 2025

Blazing-fast Data-Wrangling toolkit

Rust 2,915 82 Updated Jul 1, 2025

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 19,670 1,858 Updated Jul 1, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 40,800 15,257 Updated Jul 1, 2025

High-performance, low-footprint SQL database written in C++. Process millions of rows per second from Kafka/Pulsar, Iceberg, or ClickHouse, and seamlessly write results back. Supports powerful feat…

C++ 1,837 85 Updated Jun 6, 2025

An open-source, low-code machine learning library in Python

Jupyter Notebook 9,393 1,813 Updated Apr 21, 2025

chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse

C++ 2,406 84 Updated Jun 30, 2025

A portable accelerated data query and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

Rust 2,462 134 Updated Jul 1, 2025

A flexible distributed key-value database that is optimized for caching and other realtime workloads.

C 22,120 931 Updated Jul 1, 2025

ParadeDB is a modern Elasticsearch alternative built on Postgres. Built for real-time, update-heavy workloads.

Rust 7,231 248 Updated Jul 1, 2025

A Python framework for defining and querying BI models in your data warehouse

Python 166 7 Updated Jan 15, 2025

Open-source BI for engineers

Rust 2,315 58 Updated Feb 17, 2025

Maestro: Netflix’s Workflow Orchestrator

Java 3,489 220 Updated Jul 1, 2025

Trench — Open-Source Analytics Infrastructure. A single production-ready Docker image built on ClickHouse, Kafka, and Node.js for tracking events, page views. Easily build product analytics dashboa…

TypeScript 1,572 54 Updated May 1, 2025
0