-
HuggingFace
- France
- 3outeille.github.io
- @FerdinandMom
- @FerdinandMom
-
quack Public
Forked from Dao-AILab/quackA Quirky Assortment of CuTe Kernels
Python Apache License 2.0 UpdatedJul 10, 2025 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedJul 4, 2025 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedJun 30, 2025 -
-
prime Public
Forked from PrimeIntellect-ai/primeprime is a framework for efficient, globally distributed training of AI models over the internet.
-
DualPipe Public
Forked from deepseek-ai/DualPipeA bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
Python MIT License UpdatedFeb 27, 2025 -
nccl Public
Forked from NVIDIA/ncclOptimized primitives for collective multi-GPU communication
C++ Other UpdatedFeb 17, 2025 -
picotron-deepseek Public
Forked from huggingface/picotronMinimalistic 4D-parallelism distributed training framework for education purpose
Python Apache License 2.0 UpdatedJan 24, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedJan 17, 2025 -
nccl-tests Public
Forked from NVIDIA/nccl-testsNCCL Tests
Cuda BSD 3-Clause "New" or "Revised" License UpdatedDec 12, 2024 -
EasyContext Public
Forked from jzhang38/EasyContextMemory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Python Apache License 2.0 UpdatedSep 27, 2024 -
-
ring-attention-pytorch Public
Forked from lucidrains/ring-attention-pytorchImplementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Python MIT License UpdatedSep 25, 2024 -
-
litgpt Public
Forked from Lightning-AI/litgpt20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Python Apache License 2.0 UpdatedAug 12, 2024 -
dust Public
Forked from kelpsyberry/dustA Nintendo DS emulator written in Rust for desktop devices and the web, with debugging features and a focus on accuracy
Rust GNU General Public License v3.0 UpdatedAug 6, 2024 -
fms-fsdp Public
Forked from foundation-model-stack/fms-fsdpDemonstrate throughput of PyTorch FSDP
Python Apache License 2.0 UpdatedJul 5, 2024 -
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
Python Apache License 2.0 UpdatedJun 14, 2024 -
diloco_simple Public
Forked from PrimeIntellect-ai/diloco_simpletorch implementation of diloco
Python UpdatedMay 31, 2024 -
lighteval Public
Forked from huggingface/lightevalLightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Python MIT License UpdatedMay 23, 2024 -
fsdp_tp_example Public
minimum working example of fsdp and tp (using dmesh and process groups)
-
torchtitan Public
Forked from pytorch/torchtitanA native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 2, 2024 -
veScale Public
Forked from volcengine/veScaleA PyTorch Native LLM Training Framework
Python Apache License 2.0 UpdatedApr 26, 2024 -
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedApr 16, 2024 -
llm-viz Public
Forked from bbycroft/llm-viz3D Visualization of an GPT-style LLM
TypeScript UpdatedApr 11, 2024 -
llamafile Public
Forked from Mozilla-Ocho/llamafileDistribute and run LLMs with a single file.
C++ Other UpdatedApr 6, 2024 -
mad-lab Public
Forked from athms/mad-labA MAD laboratory to improve AI architecture designs 🧪
Python MIT License UpdatedMar 28, 2024 -
-
Tune-A-Video Public
Forked from showlab/Tune-A-Video[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jupyter Notebook Apache License 2.0 UpdatedMar 17, 2024 -
Open-Sora Public
Forked from hpcaitech/Open-SoraOpen-Sora, an open-source initiative dedicated to efficiently reproducing OpenAI's Sora
Python Apache License 2.0 UpdatedMar 17, 2024