Popular repositories Loading
-
-
-
re-grpo
re-grpo PublicForked from open-thought/tiny-grpo
Minimal hackable GRPO implementation
Python 3
-
-
simpleRL-reason
simpleRL-reason PublicForked from hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Python 1
Repositories
Showing 10 of 46 repositories
- gpu-school Public
researchim-ai/gpu-school’s past year of commit activity - building-sim-py Public
researchim-ai/building-sim-py’s past year of commit activity - financial-agent Public
researchim-ai/financial-agent’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…