Stars
Linux OS for Azure 1P services and edge appliances
Repository for Demonstration/Tutorial needs of CBL-Mariner
Enabling the Windows Subsystem for Linux to include support for Wayland and X server related scenarios
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Codes for data visualizations posted on reddit and twitter
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
A Clojure dataframe library that runs on Spark
R package to create internally consistent, mini version of CRAN
Forecasting Functions for Time Series and Linear Models
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
Spark: The Definitive Guide's Code Repository
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Using R's tidytext package to inspect sentiment of Reddit comments for Smithsonianmag.com.
Inspired by ProPublica and Google's Election DataBot, an interactive exploration of the 2016 midterms, this Shiny app acts as a dynamic interface to explore 2020 Democratic candidate tweets, Google…
📊 Path to a free self-taught education in Data Science!
Data that I have acquired (created, gathered, scraped, etc.) and want to share in a more organized or clean format.
Core repo for election results data acquisition, transformation and output.
The Washington Post collected data on more than 52,000 criminal homicides over the past decade in 50 of the largest American cities.
A topic-centric list of HQ open datasets.