- Guangzhou, China (UTC +08:00)
- http://gameofdimension.com/
- cache-dit Public
Forked from vipshop/cache-dit
🤗 A Training-free and Easy-to-use Cache Acceleration Toolbox for DiTs: DBCache, DBPrune, TaylorSeer, FBCache, etc. 🔥
Python Other Updated Jul 4, 2025
- flux Public
Forked from bytedance/flux
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
C++ Apache License 2.0 Updated Jun 12, 2025
- ParaAttention Public
Forked from chengzeyi/ParaAttention
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
Python Other Updated Jun 5, 2025
- verl Public
Forked from volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
- transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Mar 25, 2025
- tiny-grpo Public
A torch-native GRPO training example
- ascend-picotron Public
Forked from huggingface/picotron
Minimalistic 4D-parallelism distributed training framework for educational purposes. Adapted to Ascend NPU.
Python Apache License 2.0 Updated Feb 21, 2025
- limulidae Public
Benchmark GPU/NPU FLOPs and bandwidth
- prodigy Public
Forked from konstmish/prodigy
The Prodigy optimizer and its variants for training neural networks.
Python MIT License Updated Jan 14, 2025
- kapok Public
Distributed inference for DiTs, in plain PyTorch
Python Apache License 2.0 Updated Jan 11, 2025
- tutorials Public
Forked from pytorch/tutorials
PyTorch tutorials.
Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Jan 8, 2025
- diffusers Public
Forked from huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Python Apache License 2.0 Updated Jan 2, 2025
- torchtitan Public
Forked from pytorch/torchtitan
A native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License Updated Nov 9, 2024
- gpu_benchmark Public
Forked from mag-/gpu_benchmark
NPU/GPU peak FLOPs benchmark
Python Updated Nov 1, 2024
- pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other Updated Oct 15, 2024
- lectures Public
Forked from gpu-mode/lectures
Material for gpu-mode lectures
Jupyter Notebook Apache License 2.0 Updated Oct 1, 2024
- PIDM Public
Forked from ankanbhunia/PIDM
Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)
Jupyter Notebook MIT License Updated Jun 11, 2024
- DeepSpeed Public
Forked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 Updated May 31, 2024
- vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
- stable-diffusion-webui-colab Public
Forked from camenduru/stable-diffusion-webui-colab
stable diffusion webui colab
Jupyter Notebook The Unlicense Updated Aug 31, 2023