Stars
Large Language Model (LLM) Systems Paper List
Introduction to Machine Learning Systems
Latency and Memory Analysis of Transformer Models for Training and Inference
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.