Open
Description
What is the problem?
We currently make a few assumptions about 1-CPU tasks always being feasible (for example in our heartbeat code).
You can imagine that this could break down for an extremely elastic cluster which has num_cpus=0
on the head node.
Ray version and other system information (Python version, TensorFlow version, OS):
Reproduction (REQUIRED)
Please provide a short code snippet (less than 50 lines if possible) that can be copy-pasted to reproduce the issue. The snippet should have no external library dependencies (i.e., use fake or mock data / environments):
If the code snippet cannot be run by itself, the issue will be closed with "needs-repro-script".
- I have verified my script runs in a clean environment and reproduces the issue.
- I have verified the issue also occurs with the latest wheels.