Stars
Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations.
NVIDIA GPU monitor with GDDR6 memory temperature support.
This is our own implementation of 'Layer Selective Rank Reduction'
In-depth tutorials for implementing deep learning models on your own with PyTorch.