Stars
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Machine Learning Engineering Open Book
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Machine Learning Journal for Intermediate to Advanced Topics.
Fully managed Apache Parquet implementation
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A reference .NET application implementing an eCommerce site
Notes talking about the design and implementation of Apache Spark
This repo is used for servicing PR's for .NET Core 2.1 and 3.1. Please visit us at https://github.com/dotnet/runtime
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
A Java parser for Tom's Obvious, Minimal Language (TOML).
Documentation for extending Visual Studio with new types of projects.
A library that provides an embeddable, persistent key-value store for fast storage.
Trill is a single-node query processor for temporal or streaming data.
HyperLogLog-based set cardinality estimation library
.NET API reference documentation (.NET 5+, .NET Core, .NET Framework)
A simple hotel reservation system demonstrating WPF MVVM fundamentals.
A repo for upcoming changes to extensibility in Visual Studio, the new extensibility model, and language server protocol.
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
ShellProgressBar - display progress in your console application
Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretation based) engines and compiling engines.
Generate diagrams from textual description