[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests #53974

MortalHappiness · 2025-06-20T16:01:11Z

Why are these changes needed?

Part of KubeRay release process.

KubeRay version updated to v1.4.0.
Ray version updated to 2.46.0.
Update autoscaler v2 yaml template to use new autoscalerOptions.version = v2 instead of setting the RAY_enable_autoscaler_v2 environment variable.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Chi-Sheng Liu <chishengliu@chishengliu.com>

Signed-off-by: Rueian <rueiancsie@gmail.com>

Copilot

Pull Request Overview

This PR updates the Kubernetes-based Ray testing infrastructure to use KubeRay v1.4.0 and Ray v2.46.0, and switches the autoscaler v2 template to use the new autoscalerOptions.version field instead of an environment variable.

Bump KubeRay and Ray versions in test scripts and YAML templates
Remove RAY_enable_autoscaler_v2 env var and enable autoscaler v2 via version: v2
Ensure the kuberay-system namespace is set and created before running operator tests

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
release/k8s_tests/run_gcs_ft_on_k8s.py	Updated `KUBERAY_VERSION` from v1.2.2 to v1.4.0
python/ray/tests/kuberay/test_files/ray-cluster.autoscaler-v2-template.yaml	Updated Ray image/tag to 2.46.0, enabled `version: v2`, removed old env var
python/ray/tests/kuberay/test_files/ray-cluster.autoscaler-template.yaml	Updated Ray image/tag to 2.46.0
python/ray/autoscaler/kuberay/init-config.sh	Bumped `KUBERAY_BRANCH` and `OPERATOR_TAG` to v1.4.0; added namespace override
ci/k8s/run-operator-tests.sh	Added `kubectl create namespace kuberay-system`

Copilot · 2025-06-21T10:51:59Z

ci/k8s/run-operator-tests.sh

@@ -17,6 +17,7 @@ kind load docker-image ray-ci:kuberay-test
 # python python/ray/tests/kuberay/setup/setup_kuberay.py

 bash python/ray/autoscaler/kuberay/init-config.sh
+kubectl create namespace kuberay-system


Make namespace creation idempotent to avoid failures when rerunning tests. For example, you could use kubectl apply -f - with a namespace manifest or append || true to ignore 'already exists' errors.

Suggested change

kubectl create namespace kuberay-system

cat <<EOF | kubectl apply -f -

apiVersion: v1

kind: Namespace

metadata:

name: kuberay-system

EOF

Copilot · 2025-06-21T10:52:00Z

release/k8s_tests/run_gcs_ft_on_k8s.py

@@ -51,7 +51,7 @@ def generate_cluster_variable():

 def check_kuberay_installed():
    # Make sure the ray namespace exists
-    KUBERAY_VERSION = "v1.2.2"
+    KUBERAY_VERSION = "v1.4.0"


[nitpick] Consider centralizing version strings (e.g., KubeRay and Ray versions) into a shared configuration or environment variable to reduce duplication and simplify future bumps across scripts and templates.

Copilot · 2025-06-21T10:52:00Z

python/ray/autoscaler/kuberay/init-config.sh

@@ -24,6 +24,7 @@ DIR=$(mktemp -d -t "kuberay-XXXXXX")
    (
        cd kuberay/ray-operator/config/default
        kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"


[nitpick] It may be clearer to handle namespace overrides via a separate overlay or document the reason for this change in a comment, preventing unexpected modifications to base manifests.

Suggested change

kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"

kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"

# Set the namespace to 'kuberay-system' for the operator. This is required for the current deployment setup.

kevin85421

I chatted with @rueian offline. The issue is because:

The KubeRay installs in default namespace instead of ray-system namespace since v1.3.0.
We forget to update the Ray repository to v1.3.0.

kevin85421 · 2025-06-22T05:29:34Z

cc @jjyao @edoakes would you mind merging this PR? Thanks!

…ray-project#53974) Signed-off-by: Chi-Sheng Liu <chishengliu@chishengliu.com> Signed-off-by: Rueian <rueiancsie@gmail.com> Co-authored-by: Rueian <rueiancsie@gmail.com>

[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests

9f100c9

Signed-off-by: Chi-Sheng Liu <chishengliu@chishengliu.com>

MortalHappiness added the go add ONLY when ready to merge, run all tests label Jun 20, 2025

MortalHappiness and others added 2 commits June 21, 2025 09:31

Exclude kuberay-operator pod from get all pods

3d5c842

Signed-off-by: Chi-Sheng Liu <chishengliu@chishengliu.com>

Install kuberay-operator in a separate namespace for tests

a3027c8

Signed-off-by: Rueian <rueiancsie@gmail.com>

rueian force-pushed the feature/kuberay-1.4.0-autoscaler-tests branch from f30d792 to a3027c8 Compare June 21, 2025 03:58

MortalHappiness marked this pull request as ready for review June 21, 2025 10:51

Copilot AI review requested due to automatic review settings June 21, 2025 10:51

MortalHappiness requested a review from a team as a code owner June 21, 2025 10:51

MortalHappiness assigned kevin85421 Jun 21, 2025

Copilot AI reviewed Jun 21, 2025

View reviewed changes

kevin85421 approved these changes Jun 22, 2025

View reviewed changes

jjyao merged commit cfa531f into ray-project:master Jun 22, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests #53974

[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests #53974

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

-kubectl create namespace kuberay-system
+cat <<EOF | kubectl apply -f -
+apiVersion: v1
+kind: Namespace
+metadata:
+  name: kuberay-system
+EOF

	kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"
	kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"
	# Set the namespace to 'kuberay-system' for the operator. This is required for the current deployment setup.

[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests #53974

[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests #53974

Uh oh!

Conversation

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!