Stars
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Apache Spark - A unified analytics engine for large-scale data processing
Multi-container environment with Hadoop, Spark and Hive
The official home of the Presto distributed SQL query engine for big data
A high performance caching library for Java
A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others
Java binary serialization and cloning: fast, efficient, automatic
Free and Open Source, Distributed, RESTful Search Engine
📚Java核心知识点整理(包括Java基础、JVM、数据库、计算机网络、算法、操作系统、设计模式、系统设计、框架原理)
😮 Core Interview Questions & Answers For Experienced Java(Backend) Developers | 互联网 Java 工程师进阶知识完全扫盲:涵盖高并发、分布式、高可用、微服务、海量数据处理等领域知识
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
The java implementation of Apache Dubbo. An RPC and microservice framework.
Spring Boot Project for Apache Dubbo