
verl v0.2.1 & v0.3 release checklist #354


Closed · 11 of 17 tasks
eric-haibin-lin opened this issue Feb 23, 2025 · 21 comments

eric-haibin-lin (Collaborator) commented Feb 23, 2025:

v0.2.1

v0.3

Feel free to propose features (contributions are welcome!)

BearBiscuit05 (Collaborator):

How can I help with the 'tool calling examples' part?

eric-haibin-lin (Collaborator, Author):

> How can I help with the 'tool calling examples' part?

Related to: #344 and #340.

Under the hood, chat calls generate, so the design is expected to work; we just need to provide a working, stable example.
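
For reference, a minimal sketch of what such an example might look like with vLLM's offline `LLM.chat` API (the model name and sampling settings below are placeholders, not verl's actual configuration):

```python
from vllm import LLM, SamplingParams

# LLM.chat applies the model's chat template to the messages and then
# runs the same generation path as generate.
llm = LLM(model="Qwen/Qwen2.5-7B-Instruct")  # placeholder model

messages = [{"role": "user", "content": "What is 2 + 2?"}]
outputs = llm.chat(messages, SamplingParams(temperature=0.0, max_tokens=64))
print(outputs[0].outputs[0].text)
```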

liyu199809:

Will Megatron context parallelism be supported in the future?

vermouth1992 (Collaborator):

> Will Megatron context parallelism be supported in the future?

Yes. We will use mcore, which supports CP (context parallelism), by default.

casper-hansen:

@BearBiscuit05 See #344, where I outlined the main challenge. I think it should be relatively straightforward if veRL can start using chat, or if vLLM directly adds support for tool calling in generate.

I imagine we could have GRPO-trained reasoners in the future that learn when to use tools as part of their <think> tags, e.g. to execute code for a feedback loop or to retrieve additional information.

vermouth1992 (Collaborator):

> @BearBiscuit05 See #344, where I outlined the main challenge. I think it should be relatively straightforward if veRL can start using chat, or if vLLM directly adds support for tool calling in generate.
>
> I imagine we could have GRPO-trained reasoners in the future that learn when to use tools as part of their <think> tags, e.g. to execute code for a feedback loop or to retrieve additional information.

I talked to a vLLM maintainer yesterday. It seems there should be no blocker if we switch from generate to chat. Would you mind giving it a try, calling chat using SPMD-style offline inference?
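
A rough sketch of such an experiment, assuming a vLLM version recent enough that `LLM.chat` accepts a `tools` argument (check your installed version); the tool schema and model below are made-up examples:

```python
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2.5-7B-Instruct")  # placeholder model

# Hypothetical tool schema in the OpenAI function-calling format.
tools = [{
    "type": "function",
    "function": {
        "name": "run_python",
        "description": "Execute a Python snippet and return stdout.",
        "parameters": {
            "type": "object",
            "properties": {"code": {"type": "string"}},
            "required": ["code"],
        },
    },
}]

messages = [{"role": "user", "content": "Compute the 10th Fibonacci number."}]
outputs = llm.chat(messages, SamplingParams(max_tokens=256), tools=tools)
# The output text may contain a tool call for the trainer to parse and execute.
print(outputs[0].outputs[0].text)
```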

BearBiscuit05 (Collaborator):

I'm not very familiar with inference, but I think I'm starting to get the hang of it. Does this mean I need to build a new chat function that adds extra params for tool calls before invoking generate? Or should I just replace generate directly with the chat function from vLLM?

casper-hansen:

You should be able to replace generate directly with chat. The only problem is that we currently pass tokenized inputs into generate, whereas chat expects List[ChatCompletionContentPartTextParam] or List[List[ChatCompletionContentPartTextParam]]. I'm not sure what the best design would be in this case.

Case 1: Detokenize the tokenized inputs we use for generate (see the sketch after the type definition below).
Case 2: Change veRL to not tokenize datasets beforehand (a relatively big change).

```python
from typing_extensions import Literal, Required, TypedDict


class ChatCompletionContentPartTextParam(TypedDict, total=False):
    text: Required[str]
    """The text content."""

    type: Required[Literal["text"]]
    """The type of the content part."""
```

vermouth1992 (Collaborator) commented Feb 24, 2025:

The second option would incur significant overhead from tokenizing on the fly (typically a 2x slowdown in generation, which is basically unacceptable). I guess we will need to seek a solution for Case 1.

BearBiscuit05 (Collaborator):

Got it. I'll give it a try.

liyu199809:

> Will Megatron context parallelism be supported in the future?
>
> Yes. We will use mcore, which supports CP, by default.

It seems that context parallelism has not been implemented in the model part yet. Is this feature currently available?

BearBiscuit05 (Collaborator):

> Will Megatron context parallelism be supported in the future?
>
> Yes. We will use mcore, which supports CP, by default.
>
> It seems that context parallelism has not been implemented in the model part yet. Is this feature currently available?

Not right now, but if you check this roadmap, once verl upgrades MCore, CP will be supported.

casper-hansen:

Is it possible to optimize startup time? I noticed that when using veRL, launching a job is significantly slower than with Hugging Face TRL.
#384

maksimstw (Contributor) commented Mar 2, 2025:

Disabling torch.compile would be useful, as torch.compile can also hang PPO training when use_remove_padding is enabled. #387

eric-haibin-lin (Collaborator, Author):

> Disabling torch.compile would be useful, as torch.compile can also hang PPO training when use_remove_padding is enabled. #387

@maksimstw thanks for the feedback! Would you like to provide a PR with this option?

Llipengll:

When will you release the "sglang integration" part?

JarvisFei:

> v0.2.1
>
> v0.3
>
> Feel free to propose features (contributions are welcome!)

How do I install v0.3?

PeterSH6 pushed a commit that referenced this issue Mar 14, 2025
## Summary

Provide an option in the config to turn off the `torch.compile` used
in `dp_actor.py`.

## Usage

Add the following line to the driver or CLI scripts to turn off
`torch.compile`:
```python
+actor_rollout_ref.actor.use_torch_compile=False
```
Otherwise, `torch.compile` will be used by default.

## Related Issue

#354 #245

---------

Signed-off-by: Hongpeng Guo <hpguo@anyscale.com>
hongpeng-guo (Collaborator):

> add an option to remove the call of torch.compile

Item resolved in #554.

eric-haibin-lin (Collaborator, Author):

Hi @JarvisFei, v0.3 is not fully released yet, but you are welcome to try the verl main branch by installing from source with `pip install -e .`

eric-haibin-lin (Collaborator, Author):

As we are already making quite some progress on the main branch, I suggest we freeze the code this week for v0.3 and push the rest of the features to v0.4.

eric-haibin-lin (Collaborator, Author):

Moving discussions to #710

@vermouth1992 vermouth1992 unpinned this issue Apr 2, 2025
wangyuchen333 pushed a commit to wangyuchen333/verl that referenced this issue Apr 25, 2025 (volcengine#554)
histmeisah pushed a commit to SJTU-IAAR/verl that referenced this issue Apr 27, 2025 (volcengine#554)
yumc-afk pushed a commit to yumc-afk/verl that referenced this issue May 18, 2025 (volcengine#554)