Stars
The official repo of Pai-Megatron-Patch for large-scale LLM & VLM training, developed by Alibaba Cloud.
Stopwords collection for all languages
Decoder-Only Transformer Policy for Behavioral Cloning
Ongoing research training transformer models at scale
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
A feature-rich command-line audio/video downloader
Large World Model -- Modeling Text and Video with Millions Context
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
Implementation of MagViT2 Tokenizer in PyTorch
A framework for few-shot evaluation of language models.
Example models using DeepSpeed
Fast and memory-efficient exact attention
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Making large AI models cheaper, faster and more accessible
Transformer-related optimization, including BERT and GPT
Streamlit — A faster way to build and share data apps.
Streamlit tool to explore COCO datasets
Elyra extends JupyterLab with an AI-centric approach.
Kubebuilder - SDK for building Kubernetes APIs using CRDs
A dependency-injection-based application framework for Go.
RocksDB/LevelDB-inspired key-value database in Go