8000 martin-kukla (Martin Kukla) · GitHub

More Web Proxy on the site http://driver.im/

martin-kukla

Follow

Martin Kukla martin-kukla

Follow

LLMs Research/Engineering

1 follower · 0 following

Achievements

Achievements

Pinned Loading

pre-tjax pre-tjax Public

Transformers written from first principle in JAX/Torch.Func/Triton; Comparison of their training efficiency on 1GPU

Python 1
rm-for-rank-torchtune rm-for-rank-torchtune Public

TorchTune recipes for ranking using RM: ORPO recipe (single GPU + DDP) + DDP for DPO (to avoid existing bug in FSDP) + ranking evals

Python 3
distributed-llm-code-samples distributed-llm-code-samples Public

Code samples on how to distribute the LLM training between GPUs/nodes

Python
torchtune torchtune Public

Forked from pytorch/torchtune

A Native-PyTorch Library for LLM Fine-tuning

Python

0