- Philadelphia, PA
- in/luke-chesley2070
Stars
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
Cross-platform, fast, feature-rich, GPU based terminal
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Official repository for our work on micro-budget training of large-scale diffusion models.
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
Efficient Triton Kernels for LLM Training
A playbook for systematically maximizing the performance of deep learning models.
llama3 implementation one matrix multiplication at a time
A PyTorch native library for large-scale model training
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
thepowerfuldeez / OLMo
Forked from allenai/OLMo
My fork of Allen AI's OLMo for educational purposes.
Build a RAG (Retrieval Augmented Generation) pipeline from scratch and have it all run locally.
A collection of learning resources for curious software engineers
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Time series forecasting with PyTorch
You like pytorch? You like micrograd? You love tinygrad! ❤️
Google Research
🌙 LunarVim is an IDE layer for Neovim. Completely free and community driven.
Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023