[Core] Bad traceback on failure to reconnect to GCS server. · Issue #15235 · ray-project/ray · GitHub
[Core] Bad traceback on failure to reconnect to GCS server. #15235
Open
@clarkzinzow


What is the problem?

When a GCS client RPC fails, the client attempts to reconnect to the GCS server. If that reconnection also fails, the client logs a fatal error containing the last attempted GCS address and port. However, multiple such crashes have been reported in which the logged GCS address is the empty string and the port is 0, suggesting a bug in this logic:

2021-03-22T21:35:51Z o F0322 14:35:51.321081     9   315 service_based_gcs_client.cc:207] Couldn't reconnect to GCS server. The last attempted GCS server address was :0
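
To illustrate how a message ending in `:0` can arise, here is a minimal C++ sketch. It assumes the address is held as an (IP, port) pair that stays default-constructed when the address lookup itself fails. That is an assumption, not a confirmed root cause. The names `Address`, `FormatAddress`, `IsResolved`, and `ReconnectFailureMessage` are hypothetical and are not taken from `service_based_gcs_client.cc`.

```cpp
#include <string>
#include <utility>

// Hypothetical representation of a GCS address: (IP, port).
// A default-constructed value is ("", 0).
using Address = std::pair<std::string, int>;

// Formats an address as "ip:port", matching the shape seen in the log line.
std::string FormatAddress(const Address &address) {
  return address.first + ":" + std::to_string(address.second);
}

// Returns true only if the address looks like it was actually populated.
bool IsResolved(const Address &address) {
  return !address.first.empty() && address.second > 0;
}

// Builds the failure message, separating "address was never obtained"
// from "address was obtained but the server was unreachable".
std::string ReconnectFailureMessage(const Address &last_attempted) {
  const std::string prefix = "Couldn't reconnect to GCS server. ";
  if (!IsResolved(last_attempted)) {
    return prefix + "The GCS server address could not be obtained.";
  }
  return prefix + "The last attempted GCS server address was " +
         FormatAddress(last_attempted);
}
```

Under these assumptions, `FormatAddress(Address())` yields `:0`, the same suffix as in the log line, while `ReconnectFailureMessage(Address())` reports that the address was never obtained instead of printing an empty one.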

Reproduction (REQUIRED)

TODO: If not an easy fix, get a minimal reproduction.

  • I have verified my script runs in a clean environment and reproduces the issue.
  • I have verified the issue also occurs with the latest wheels.


Labels

  • P3: Issue moderate in impact or severity
  • bug: Something that is supposed to be working; but isn't
  • core: Issues that should be addressed in Ray Core
  • fix-error-msg: This issue has a bad error message that should be improved.
  • pending-cleanup: This issue is pending cleanup. It will be removed in 2 weeks after being assigned.
