-
nvidia-resiliency-ext Public
Forked from NVIDIA/nvidia-resiliency-extNVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…
Python Other UpdatedFeb 7, 2025 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedDec 9, 2024 -
NeMo-Aligner Public
Forked from NVIDIA/NeMo-AlignerScalable toolkit for efficient model alignment
Python Apache License 2.0 UpdatedNov 22, 2024 -
NeMo Public
Forked from NVIDIA/NeMoNeMo: a toolkit for conversational AI
Python Apache License 2.0 UpdatedSep 6, 2024 -
logging Public
Forked from mlcommons/loggingMLPerf™ logging library
Python Other UpdatedOct 2, 2023 -
training Public
Forked from mlcommons/trainingReference implementations of MLPerf™ training benchmarks
Python Other UpdatedNov 29, 2022