8000 [Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests by MortalHappiness · Pull Request #53974 · ray-project/ray · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[Test][KubeRay] Update KubeRay version to v1.4.0 for autoscaler tests #53974

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Conversation

MortalHappiness
Copy link
Member
@MortalHappiness MortalHappiness commented Jun 20, 2025

Why are these changes needed?

Part of KubeRay release process.

  • KubeRay version updated to v1.4.0.
  • Ray version updated to 2.46.0.
  • Update autoscaler v2 yaml template to use new autoscalerOptions.version = v2 instead of setting the RAY_enable_autoscaler_v2 environment variable.

Related issue number

Checks

  • I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: Chi-Sheng Liu <chishengliu@chishengliu.com>
@MortalHappiness MortalHappiness added the go add ONLY when ready to merge, run all tests label Jun 20, 2025
MortalHappiness and others added 2 commits June 21, 2025 09:31
Signed-off-by: Chi-Sheng Liu <chishengliu@chishengliu.com>
Signed-off-by: Rueian <rueiancsie@gmail.com>
@rueian rueian force-pushed the feature/kuberay-1.4.0-autoscaler-tests branch from f30d792 to a3027c8 Compare June 21, 2025 03:58
@MortalHappiness MortalHappiness marked this pull request as ready for review June 21, 2025 10:51
@Copilot Copilot AI review requested due to automatic review settings June 21, 2025 10:51
@MortalHappiness MortalHappiness requested a review from a team as a code owner June 21, 2025 10:51
Copy link
Contributor
@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR updates the Kubernetes-based Ray testing infrastructure to use KubeRay v1.4.0 and Ray v2.46.0, and switches the autoscaler v2 template to use the new autoscalerOptions.version field instead of an environment variable.

  • Bump KubeRay and Ray versions in test scripts and YAML templates
  • Remove RAY_enable_autoscaler_v2 env var and enable autoscaler v2 via version: v2
  • Ensure the kuberay-system namespace is set and created before running operator tests

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file
File Description
release/k8s_tests/run_gcs_ft_on_k8s.py Updated KUBERAY_VERSION from v1.2.2 to v1.4.0
python/ray/tests/kuberay/test_files/ray-cluster.autoscaler-v2-template.yaml Updated Ray image/tag to 2.46.0, enabled version: v2, removed old env var
python/ray/tests/kuberay/test_files/ray-cluster.autoscaler-template.yaml Updated Ray image/tag to 2.46.0
python/ray/autoscaler/kuberay/init-config.sh Bumped KUBERAY_BRANCH and OPERATOR_TAG to v1.4.0; added namespace override
ci/k8s/run-operator-tests.sh Added kubectl create namespace kuberay-system

@@ -17,6 +17,7 @@ kind load docker-image ray-ci:kuberay-test
# python python/ray/tests/kuberay/setup/setup_kuberay.py

bash python/ray/autoscaler/kuberay/init-config.sh
kubectl create namespace kuberay-system
Copy link
Preview
Copilot AI Jun 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Make namespace creation idempotent to avoid failures when rerunning tests. For example, you could use kubectl apply -f - with a namespace manifest or append || true to ignore 'already exists' errors.

Suggested change
kubectl create namespace kuberay-system
cat <<EOF | kubectl apply -f -
apiVersion: v1
kind: Namespace
metadata:
name: kuberay-system
EOF

Copilot uses AI. Check for mistakes.

@@ -51,7 +51,7 @@ def generate_cluster_variable():

def check_kuberay_installed():
# Make sure the ray namespace exists
KUBERAY_VERSION = "v1.2.2"
KUBERAY_VERSION = "v1.4.0"
Copy link
Preview
Copilot AI Jun 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] Consider centralizing version strings (e.g., KubeRay and Ray versions) into a shared configuration or environment variable to reduce duplication and simplify future bumps across scripts and templates.

Copilot uses AI. Check for mistakes.

@@ -24,6 +24,7 @@ DIR=$(mktemp -d -t "kuberay-XXXXXX")
(
cd kuberay/ray-operator/config/default
kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"
Copy link
Preview
Copilot AI Jun 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[nitpick] It may be clearer to handle namespace overrides via a separate overlay or document the reason for this change in a comment, preventing unexpected modifications to base manifests.

Suggested change
kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"
kustomize edit set image kuberay/operator=quay.io/kuberay/operator:"$OPERATOR_TAG"
# Set the namespace to 'kuberay-system' for the operator. This is required for the current deployment setup.

Copilot uses AI. Check for mistakes.

Copy link
Member
@kevin85421 kevin85421 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I chatted with @rueian offline. The issue is because:

  1. The KubeRay installs in default namespace instead of ray-system namespace since v1.3.0.
  2. We forget to update the Ray repository to v1.3.0.

@kevin85421
Copy link
Member

cc @jjyao @edoakes would you mind merging this PR? Thanks!

@jjyao jjyao merged commit cfa531f into ray-project:master Jun 22, 2025
5 checks passed
minerharry pushed a commit to minerharry/ray that referenced this pull request Jun 27, 2025
…ray-project#53974)

Signed-off-by: Chi-Sheng Liu <chishengliu@chishengliu.com>
Signed-off-by: Rueian <rueiancsie@gmail.com>
Co-authored-by: Rueian <rueiancsie@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
go add ONLY when ready to merge, run all tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants
0