Starred repositories
Python library for portfolio optimization built on top of scikit-learn
A curated list of open source tools used in analytics platforms and data engineering ecosystem
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
The official home of the Presto distributed SQL query engine for big data
DiceDB is an open-source, fast, reactive, in-memory database optimized for modern hardware.
A toolkit for machine learning from time series
Download market data from Yahoo! Finance's API
Python client library for FaaSKeeper, the serverless ZooKeeeper.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Databricks SQL Connector for Python
CLOMonitor is a tool that periodically checks open source projects repositories to verify they meet certain project health best practices
A curated list of free courses with certifications. Also available at https://free-certifications.com/
Hackathon starter project for Flask applications
What happens behind the scenes when we type www.google.com in a browser?
Discover great opportunities to become a Cloud Native contributor
My small cheatsheets for data science, ML, computer science and more.
Systems design is the process of defining the architecture, modules, interfaces, and data for a system to satisfy specified requirements. Systems design could be seen as the application of systems …
Kroxylicious, the snappy open source proxy for Apache Kafka®
📕machine learning tech collections at Microsoft and subsidiaries.
Image colorization using neural networks.
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Projects and e-book for our course, REST APIs with Flask and Python
Only valid pull requests will be allowed. Use python only and readme changes will not be accepted.