LLMs Research/Engineering
Pinned
-
rm-for-rank-torchtune
Public. TorchTune recipes for ranking using RM: an ORPO recipe (single GPU and DDP), DDP for DPO (to avoid an existing bug in FSDP), and ranking evals.
Python 3
-
distributed-llm-code-samples
Public. Code samples showing how to distribute LLM training across GPUs and nodes.
Python
-
torchtune
Public. Forked from pytorch/torchtune.
A Native-PyTorch Library for LLM Fine-tuning
Python