[RLlib] Upgrade RLlink protocol for external env/simulator training. #53550

sven1977 · 2025-06-04T12:12:25Z

Upgrade RLlink protocol for external env/simulator training.

Why are these changes needed?

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Copilot

Pull Request Overview

This PR upgrades the RLlink protocol for external environment/simulator training by deprecating legacy example scripts and refactoring internal APIs to use the new rllink protocol and associated message‐packing via msgpack.

Removal of outdated Unity3D and CartPole example scripts.
Refactoring of annotations (e.g., replacing @publicapi with @OldAPIStack) and protocol messaging (using new RLlink API in rllink.py).
Introduction of new modules (rllink.py and rllib_gateway.py) and updates in algorithm_config.py.

Reviewed Changes

Copilot reviewed 16 out of 16 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
rllib/examples/envs/external_envs/*.py	Removed outdated example scripts for external env training.
rllib/env/wrappers/unity3d_env.py	Updated API annotation and trimmed outdated docstrings.
rllib/env/utils/external_env_protocol.py	Redirected RLlink import to new rllink implementation.
rllib/env/tcp_client_inference_env_runner.py	Updated message functions to use new RLlink API.
rllib/env/policy_server_input.py, rllib/env/policy_client.py	Removed legacy docstrings and updated annotations.
rllib/env/external/rllink.py, rllib/env/external/rllib_gateway.py	New modules for upgraded protocol and external gateway.
rllib/algorithms/algorithm_config.py	Minor comment fix and additional handling for spaces.

Comments suppressed due to low confidence (1)

rllib/algorithms/algorithm_config.py:327

Typo in comment: 'reaplce' should be corrected to 'replace'.

# TODO (sven): Once new ormsgpack system in place, reaplce the string

Signed-off-by: sven1977 <svenmika1977@gmail.com>

simonsays1980

LGTM. Some small nits. Thanks for the work @sven1977 !

simonsays1980 · 2025-06-05T11:06:17Z

rllib/algorithms/algorithm_config.py

-            if isinstance(env, gym.vector.VectorEnv):
-                rl_module_spec.observation_space = env.single_observation_space
-            rl_module_spec.action_space = env.single_action_space
+        if rl_module_spec.observation_space is None:


Dumb question: Can't we reuse the self.obs_space, self.action_space here?

simonsays1980 · 2025-06-05T11:07:51Z

rllib/env/external/rllib_gateway.py

+        self._prev_action = None
+        self._prev_extra_model_outputs = None
+
+        def _connecto_to_server_thread_func():


_connecto_... -> _connect_to_...? :)

great catch. Fixed

simonsays1980 · 2025-06-05T11:11:02Z

rllib/env/external/rllib_gateway.py

+            next_observation: The current observation, from which the action should be
+                computed. Note that first, `observation`, the previously returned
+                action, `prev_reward`, and `terminated/truncated` are logged with the running
+                episdode through `Episode.add_env_step()`, then the env-to-module


"epsidode" -> "episode"

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…ade_rllink_protocol

Signed-off-by: sven1977 <svenmika1977@gmail.com>

…ade_rllink_protocol

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

8e7f895

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Copilot AI review requested due to automatic review settings June 4, 2025 12:12

sven1977 requested a review from a team as a code owner June 4, 2025 12:12

sven1977 assigned simonsays1980 Jun 4, 2025

Copilot AI reviewed Jun 4, 2025

View reviewed changes

wip

22a8d8c

Signed-off-by: sven1977 <svenmika1977@gmail.com>

simonsays1980 approved these changes Jun 5, 2025

View reviewed changes

sven1977 enabled auto-merge (squash) June 5, 2025 13:30

github-actions bot added the go add ONLY when ready to merge, run all tests label Jun 5, 2025

wip

bdb55b8

Signed-off-by: sven1977 <svenmika1977@gmail.com>

github-actions bot disabled auto-merge June 5, 2025 13:37

sven1977 added 7 commits June 5, 2025 16:24

LINT

87d4527

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

f1d7f07

Signed-off-by: sven1977 <svenmika1977@gmail.com>

stabilize CI

689e311

Signed-off-by: sven1977 <svenmika1977@gmail.com>

LINT

18f5eda

Signed-off-by: sven1977 <svenmika1977@gmail.com>

Merge branch 'master' of https://github.com/ray-project/ray into upgr…

61c2fa2

…ade_rllink_protocol

wip

51b6c03

Signed-off-by: sven1977 <svenmika1977@gmail.com>

wip

e9fc111

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 requested a review from a team as a code owner June 13, 2025 09:04

51

39d640e

Signed-off-by: sven1977 <svenmika1977@gmail.com>

sven1977 enabled auto-merge (squash) June 13, 2025 11:16

sven1977 added 2 commits June 17, 2025 14:49

Merge branch 'master' of https://github.com/ray-project/ray into upgr…

0e7f570

…ade_rllink_protocol

wip

5175046

Signed-off-by: sven1977 <svenmika1977@gmail.com>

github-actions bot disabled auto-merge June 17, 2025 13:03

Merge branch 'master' into upgrade_rllink_protocol

15b7e82

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RLlib] Upgrade RLlink protocol for external env/simulator training. #53550

[RLlib] Upgrade RLlink protocol for external env/simulator training. #53550

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[RLlib] Upgrade RLlink protocol for external env/simulator training. #53550

Are you sure you want to change the base?

[RLlib] Upgrade RLlink protocol for external env/simulator training. #53550

Uh oh!

Conversation

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!