-
Megatron-DeepSpeed Public
Forked from deepspeedai/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python · Other · Updated Jan 30, 2023 -
OFA Public
Forked from OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Python · Apache License 2.0 · Updated Jul 25, 2022 -
libai Public
Forked from Oneflow-Inc/libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Python · Apache License 2.0 · Updated Jul 6, 2022 -
fashion-mnist Public
Forked from zalandoresearch/fashion-mnist
A MNIST-like fashion product database. Benchmark 👉
Python · MIT License · Updated May 9, 2019 -
tf-nlp-blocks Public
Forked from hanxiao/tf-nlp-blocks
Some frequently used NLP blocks I implemented
Python · MIT License · Updated Mar 20, 2019 -
bert_chinese_pytorch Public
Forked from duanzhihua/bert_chinese_pytorch
BERT for Chinese text classification
Python · Updated Dec 11, 2018 -
encoding-blocks Public
Forked from hanxiao/encoding-blocks
Code for my blog post
Python · MIT License · Updated Aug 14, 2018