Tags: sail-sg/oat
Tags
Upgrade vllm for more efficient collocation (#34) * upgrade vllm & adopt collective_rpc * use .float() for kl & increase timeout to 60m * speed up minibatch training * add constant lr scheduler * update * updates * fix non_eos detection * changes * minor * update * ratio * updates