-
mixtral-offloading Public
Forked from dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
Jupyter Notebook MIT License Updated Apr 24, 2024 -
Opensubtitles_dataset Public
downloads and parses subtitle dataset from opensubtitles.org
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python Apache License 2.0 Updated Nov 14, 2023 -
Megatron-LM Public
Forked from huggingface/Megatron-LM
Ongoing research training transformer models at scale
Python Other Updated Sep 5, 2023 -
guesslang Public
Forked from yoeo/guesslang
Detect the programming language of a source code
Python MIT License Updated Aug 8, 2023 -
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Jan 21, 2023 -
transformers-bloom-inference Public
Forked from RezaYazdaniAminabadi/transformers-bloom-inference
Fast Inference Solutions for BLOOM
Python Apache License 2.0 Updated Nov 21, 2022 -
example-sphinx-basic Public
Forked from readthedocs-examples/example-sphinx-basic
A basic Sphinx project for Read the Docs
Python Updated Jul 21, 2022 -
example-mkdocs-basic Public
Forked from readthedocs-examples/example-mkdocs-basic
A basic MkDocs project for Read the Docs
Python Updated Jul 8, 2022 -
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated May 14, 2022 -
mesh-transformer-jax Public
Forked from kingoflolz/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
Python Apache License 2.0 Updated Apr 27, 2022 -
mup Public
Forked from microsoft/mup
maximal update parametrization (µP)
Jupyter Notebook MIT License Updated Mar 8, 2022 -
fish Public
An independent replication of `Training Neural Networks with Fixed Sparse Masks` by Sung et al.
Python Updated Dec 14, 2021 -
lm_dataloader Public
Dataloader tools for language modelling
-
image-dl Public
A fast and simple image downloader in Python
-
pbar-pool Public
A straightforward, dependency-free way to update multiple progress bars with Python's multiprocessing library.
-
mesh Public
Forked from tensorflow/mesh
Mesh TensorFlow: Model Parallelism Made Easier
Python Apache License 2.0 Updated Apr 1, 2021 -
youtube_subtitle_dataset Public
YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training
-
stylegan2 Public
Forked from shawwn/stylegan2
StyleGAN2 - Official TensorFlow Implementation
-
Yandex-Image-Scraper Public
Some tools for scraping images from Yandex image search
Python Updated Jul 4, 2020 -