-
Notifications
You must be signed in to change notification settings - Fork 12
Issues: NVIDIA/NeMo-RL
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Qwen3 Moe with Megatron backend
help wanted
Extra attention is needed
#424
opened May 20, 2025 by
terrykong
gemma-3-4b-it got nan probs_ratio in both FSDP1/FSDP2
bug
Something isn't working
#419
opened May 20, 2025 by
yuki-666
[Feature] Explicit failure for unmatched model and checkpoints
bug
Something isn't working
#415
opened May 19, 2025 by
KiddoZhu
Support MoE models in FSDP2
enhancement
New feature or request
new model
#413
opened May 19, 2025 by
yuki-666
Isolated ray worker creation doesn't allow us to patch functions easily anymore.
bug
Something isn't working
#408
opened May 16, 2025 by
SahilJain314
Create RL playbook as a follow up to DAPT
enhancement
New feature or request
good first issue
Good for newcomers
#404
opened May 16, 2025 by
snowmanwwg
Allow proper vocab padding to permit training on 32+ nodes
bug
Something isn't working
#403
opened May 15, 2025 by
alexandery-nvidia
DCP to HF script should also propagate the tokenizer
bug
Something isn't working
#395
opened May 15, 2025 by
terrykong
[Feature] Add LLM as Judge Environment
enhancement
New feature or request
#392
opened May 15, 2025 by
yashaswikarnati
Gemma 27B OOM with dynamic batching (in get_logprobs)
bug
Something isn't working
#383
opened May 14, 2025 by
terrykong
6684
[Feature Request] Add configurable New feature or request
resume_if_exists
flag to control automatic checkpoint resumption
enhancement
#345
opened May 9, 2025 by
go5paopao
Add train and val conversations (ie completions) to a wandb table
#339
opened May 8, 2025 by
alexandery-nvidia
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-04-21.