Stars
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platfor…
FastAPI framework, high performance, easy to learn, fast to code, ready for production
A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.
A data generator source connector for Flink SQL based on data-faker.
An evolving description of general best practices for backend development.
A simplified, lightweight ETL Framework based on Apache Spark
What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolation levels.
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
Data ingestion library for Amundsen to build graph and search index
Library for managing service-level fault isolation using Amazon Route 53.
Pre-processing and training scripts for the Tarteel Dataset
[NOT MAINTAINED] This script creates a NATed or Bridged WiFi Access Point.
System design interview for IT companies
Python library that makes it easy for data scientists to create charts.
Build and run Docker containers leveraging NVIDIA GPUs
An open source library for face detection in images. The face detection speed can reach 1000FPS.
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
Bootstrap yourself to write an OS from scratch. A book for self-learners.
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Uses a Convolutional Neural Network to detect duplicate questions in the public Quora dataset.
An opinionated intermediate/advanced Git book
Tutorials and programming exercises for learning Q# and quantum computing
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Over 400 software engineering companies that are easy to apply to
Curated list of project-based tutorials