Starred repositories
Repository for the Lux AI Challenge, season 3 @NeurIPS 24. Hosted on @kaggle
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
Solution of Kaggle competition: LMSYS - Chatbot Arena Human Preference Predictions
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Notebooks that support blog posts and tech talks on Dask / Coiled.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Open source platform for the machine learning lifecycle
The Python micro framework for building web applications.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
DSPy: The framework for programming—not prompting—language models
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Statsmodels: statistical modeling and econometrics in Python
Catalog, compose, and ship ML—Python simplicity, SQL scale.
Production-Grade Container Scheduling and Management
Create an open source toy dataset for finetuning LLMs with reasoning abilities
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
AI's query engine - Platform for building AI that can answer questions over large scale federated data. - The only MCP Server you'll ever need
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Codecademy Docs is a collection of information for all things code. 📕
🚀✨ Help beginners to contribute to open source projects
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
A code-first agent framework for seamlessly planning and executing data analytics tasks.