Enable setting OS disk size in Azure #45867
base: master
Signed-off-by: Vik Paruchuri <github@vikas.sh>
can we get an approval or your team's perspective on this?
this sounds like a good idea to me!
@ericl @architkulkarni @hongchaodeng 12/4/2024 - I confirmed this PR works with Ray 2.39.0 on Azure for both head and worker nodes. However, @VikParuchuri, please modify your PR so that the JSON passes null if the user does not set the disk size, since the default sizes seem to creep up over time -- current defaults are at 150GB.
@@ -44,6 +44,13 @@
"description": "The version of the VM image"
}
},
"diskSizeGB": {
"type": "int",
"defaultValue": 64,
however, @VikParuchuri modify your PR so the json passes a null if the user does not set the disk size since the default sizes seem to creep up over time -- current defaults are at 150GB.
Should we address this comment?
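One way the requested behavior could look on the Python side is sketched below. This is an illustration only, not the PR's code: the function name `build_disk_parameter` and the config key `disk_size_gb` are hypothetical. It assumes the usual ARM deployment parameter shape of `{"name": {"value": ...}}`. Whether the template accepts null for an `int` parameter was not confirmed in this thread, so the template itself would likely need a matching change.

```python
import json
from typing import Any, Dict, Optional


def build_disk_parameter(
    node_config: Dict[str, Any],
) -> Dict[str, Dict[str, Optional[int]]]:
    """Return an ARM-style parameter entry for the OS disk size.

    `disk_size_gb` is a hypothetical config key used only for this sketch.
    """
    size = node_config.get("disk_size_gb")  # None when the user did not set it
    if size is not None:
        size = int(size)
        if size <= 0:
            raise ValueError("disk size must be a positive number of GB")
    # Unset sizes stay None, which json.dumps serializes as null, so no
    # default value is hardcoded on the client side.
    return {"diskSizeGB": {"value": size}}


print(json.dumps(build_disk_parameter({})))
print(json.dumps(build_disk_parameter({"disk_size_gb": 128})))
```

With no size configured, the serialized parameter carries null instead of a fixed 64, which would leave the choice of default to Azure.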
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
@VikParuchuri do you have time to address the review comments? Thanks!
Stale PR, let me know once you update it.
This pull request has been automatically marked as stale because it has not had recent activity. You can always ask for help on our discussion forum or Ray's public Slack channel. If you'd like to keep this open, just leave any comment, and the stale label will be removed.
Why are these changes needed?
Currently, the default OS disks on Azure have a capacity of around 30GB. This often causes cluster creation and tasks to fail, because many Ray Docker images and support files are around this size. Even when creation succeeds, the small disk prevents object spilling.
This PR enables specifying the OS disk size to get around these limitations. It also raises the default OS disk size to 64GB.
Related issue number
This is a related discussion.
Checks
git commit -s
) in this PR.scripts/format.sh
to lint the changes in this PR.method in Tune, I've added it in
doc/source/tune/api/
under thecorresponding
.rst
file.