[Data] Fix parallelism deriving heuristic to ensure parallelism stays w/in min/max bounds #47695

alexeykudinkin · 2024-09-17T00:24:39Z

Why are these changes needed?

Currently, the min/max parallelism bounds aren't enforced correctly: on large enough clusters we scale out too aggressively, based purely on the number of available CPUs and disregarding the target block sizes.

This change:

- Fixes the parallelism detection heuristic so that it respects the min/target block sizes
- Makes the defaults of the block-size configs configurable via env vars
- Raises the default min block size from 1 MB to 16 MB
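
The intended behavior described above can be sketched roughly as follows. This is not the code from this PR: `derive_parallelism`, the `EXAMPLE_*` env var names, the 128 MiB max block size, and the 2x-CPUs preference are all assumptions made for illustration. Only the names `min_safe_parallelism` and `max_reasonable_parallelism` are borrowed from the diff under review.

```python
import os

MiB = 1024 * 1024

# Hypothetical env var names; the real ones are defined by the PR, not here.
DEFAULT_MIN_BLOCK_SIZE = int(os.environ.get("EXAMPLE_TARGET_MIN_BLOCK_SIZE", 16 * MiB))
# The 128 MiB value is an illustrative assumption.
DEFAULT_MAX_BLOCK_SIZE = int(os.environ.get("EXAMPLE_TARGET_MAX_BLOCK_SIZE", 128 * MiB))


def derive_parallelism(
    data_size_bytes: int,
    available_cpus: int,
    min_block_size: int = DEFAULT_MIN_BLOCK_SIZE,
    max_block_size: int = DEFAULT_MAX_BLOCK_SIZE,
) -> int:
    """Pick a block count that stays within bounds implied by the block sizes."""
    # Fewest blocks that keep every block at or below max_block_size
    # (integer ceiling division).
    min_safe_parallelism = max(
        1, (data_size_bytes + max_block_size - 1) // max_block_size
    )
    # Most blocks that keep every block at or above min_block_size.
    max_reasonable_parallelism = max(1, data_size_bytes // min_block_size)
    # CPU-based preference; the 2x multiplier is an assumption of this sketch.
    cpu_based = max(1, 2 * available_cpus)
    # Cap the CPU-based value first, then apply the lower bound, so the
    # lower bound wins if the two bounds ever fail to overlap.
    return max(min_safe_parallelism, min(cpu_based, max_reasonable_parallelism))
```

With 16 MiB and 128 MiB bounds, a 1 GiB dataset on a 1000-CPU cluster resolves to 64 blocks in this sketch instead of 2000, which is the kind of over-scaling the description refers to.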

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

raulchen · 2024-10-18T01:24:57Z

python/ray/data/_internal/util.py

        assert (
            min_safe_parallelism <= max_reasonable_parallelism
        ), f"Parallelism boundaries have to overlap: {estimation_context}"

nit, use an if statement to avoid generating the error message string when not needed.
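
A minimal sketch of what the suggested `if` form might look like. The wrapper function `check_parallelism_bounds` is hypothetical and exists only to make the snippet self-contained; the three variable names come from the diff above.

```python
def check_parallelism_bounds(
    min_safe_parallelism: int,
    max_reasonable_parallelism: int,
    estimation_context: object,
) -> None:
    # Hypothetical helper: the message is built only inside the failing branch.
    if min_safe_parallelism > max_reasonable_parallelism:
        raise AssertionError(
            f"Parallelism boundaries have to overlap: {estimation_context}"
        )
```

One difference worth noting: `assert` statements are stripped when Python runs with `-O`, whereas an explicit `if` check like this one still runs.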

stale · 2025-02-01T01:11:44Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

If you'd like to keep this open, just leave any comment, and the stale label will be removed.

stale · 2025-04-26T01:25:05Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

If you'd like to keep this open, just leave any comment, and the stale label will be removed.

github-actions · 2025-06-02T00:34:40Z

This pull request has been automatically marked as stale because it has not had
any activity for 14 days. It will be closed in another 14 days if no further activity occurs.
Thank you for your contributions.

You can always ask for help on our discussion forum or Ray's public slack channel.

If you'd like to keep this open, just leave any comment, and the stale label will be removed.

github-actions · 2025-06-17T00:31:59Z

This pull request has been automatically closed because there has been no more activity in the 14 days
since being marked stale.

Please feel free to reopen or open a new pull request if you'd still like this to be addressed.

Again, you can always ask for help on our discussion forum or Ray's public slack channel.

Thanks again for your contribution!

alexeykudinkin requested review from ericl, scv119, c21, amogkam, scottjlee, bveeramani, raulchen, stephanie-wang and omatthew98 as code owners September 17, 2024 00:24

alexeykudinkin added the go label ("add ONLY when ready to merge, run all tests") Sep 17, 2024

alexeykudinkin assigned raulchen Sep 17, 2024

alexeykudinkin force-pushed the ak/prlsm-drv-fix branch 2 times, most recently from 206c0a6 to b99aea7 Compare October 1, 2024 07:26

alexeykudinkin force-pushed the ak/prlsm-drv-fix branch from e240215 to 8f30820 Compare October 16, 2024 21:23

raulchen approved these changes Oct 18, 2024

alexeykudinkin linked an issue Nov 12, 2024 that may be closed by this pull request

[data] ray.data.read_parquet not reading in one file per block #47437

Open

alexeykudinkin mentioned this pull request Nov 12, 2024

[data] ray.data.read_parquet not reading in one file per block #47437

Open

stale bot added the stale label ("The issue is stale. It will be closed within 7 days unless there are further conversation") Feb 1, 2025

alexeykudinkin removed the stale label ("The issue is stale. It will be closed within 7 days unless there are further conversation") Feb 7, 2025

pcmoritz requested a review from a team as a code owner March 26, 2025 22:34

stale bot added the stale label ("The issue is stale. It will be closed within 7 days unless there are further conversation") Apr 26, 2025

alexeykudinkin added 2 commits May 7, 2025 15:34

Making block-size configs' defaults env-var configurable

2eedffc

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

lint

51ddca6

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

alexeykudinkin force-pushed the ak/prlsm-drv-fix branch from eaab6c6 to 51ddca6 Compare May 7, 2025 22:35

alexeykudinkin removed the stale label ("The issue is stale. It will be closed within 7 days unless there are further conversation") May 7, 2025

alexeykudinkin removed request for ericl, scv119 and stephanie-wang May 7, 2025 22:35

alexeykudinkin removed request for c21, scottjlee, amogkam, omatthew98 and bveeramani May 7, 2025 22:35

github-actions bot added the stale label ("The issue is stale. It will be closed within 7 days unless there are further conversation") Jun 2, 2025

github-actions bot closed this Jun 17, 2025

alexeykudinkin reopened this Jun 19, 2025

github-actions bot removed the stale label ("The issue is stale. It will be closed within 7 days unless there are further conversation") Jun 20, 2025

richardliaw added the data label ("Ray Data-related issues") Jun 28, 2025
