Stars
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI & Cluely (And other Open Sourced) System Prompts, Tools & AI Models.
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
Official implementation of Inductive Moment Matching
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
A Conversational Speech Generation Model
A Python package that makes it easy for developers to create AI apps powered by various AI providers.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Font files available from Google Fonts, and a public issue tracker for all things Google Fonts
Simple, unified interface to multiple Generative AI providers
A simple screen parsing tool towards pure vision based GUI agent
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Entropy Based Sampling and Parallel CoT Decoding
notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification dataset and the transformers library
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Morphological processing for languages of the Horn of Africa
Refine high-quality datasets and visual AI models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
this repository accompanies the book "Grokking Deep Learning"
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
Master programming by recreating your favorite technologies from scratch.
Stable Diffusion web UI
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
We write your reusable computer vision tools. 💜