Stars
基于当前互联网最热的springboot微服务架构,采用丰富的vue、iview等前端组件打造的kettle调度监控服务平台,解决了企业实际数据抽取业务场景中,无法实现kettle web端配置、调用、监控的痛点
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI
Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.
The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
The Metadata Platform for your Data and AI Stack
基于spark-ml,spark-mllib,spark-streaming的推荐算法实现
A DBAPI and SQLAlchemy dialect for Elasticsearch
Convert your (Beamer) PDF slides to (Powerpoint) PPTX
📊 Cube’s universal semantic layer platform is the next evolution of OLAP technology for AI, BI, spreadsheets, and embedded analytics
🚀 ice.js: The Progressive App Framework Based On React(基于 React 的渐进式应用框架)
Vue数据可视化组件库(类似阿里DataV,大屏数据展示),提供SVG的边框及装饰、图表、水位图、飞线图等组件,简单易用,长期更新(React版已发布)
🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。采购商业版、加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo
懒松鼠Flink-Boot 脚手架让Flink全面拥抱Spring生态体系,使得开发者可以以Java WEB开发模式开发出分布式运行的流处理程序,懒松鼠让跨界变得更加简单。懒松鼠旨在让开发者以更底上手成本(不需要理解分布式计算的理论知识和Flink框架的细节)便可以快速编写业务代码实现。为了进一步提升开发者使用懒松鼠脚手架开发大型项目的敏捷的度,该脚手架默认集成Spring框架进行Bean管…
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Davinci is a DVsaaS (Data Visualization as a Service) Platform
Moonbox is a DVtaaS (Data Virtualization as a Service) Platform
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、数据源信息加密等。
Java应用性能监控系统,使用JMX实现,实现了类加载监控、内存监控、线程监控、GC监控