8000 [Tune|RLlib] Add error-tolerant version of PB2 · Issue #40787 · ray-project/ray · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
[Tune|RLlib] Add error-tolerant version of PB2  #40787
Open
@simonsays1980

Description

@simonsays1980

Description

PB2 is running smoothly as long as no error in a single trial occurs. As very often certain combinations of hyperparameters are not tested out beforehand - this is exactly why users turn towards a tuning algorithm - they could lead to a trial that errors out. Specifically in PB2|PBT this is very severe because the results from all runs are used to get the best breeds over certain periods of running.

If now trials error out, the PB2 algorithm runs with only (num_samples - num_errored_out) trials which could in edge cases lead to runs with only a single trial - and then there is no sense anymore of PB2.

Use case

In some cases it is not obvious which hyperparameter cases are actually erroring out and figuring out which do, costs a lot of resources. In such cases users would prefer to run PB2 with many hyperparameter combinations and let the tuning algorithm simply "step over" errored trials by starting a new one with another randomly drawn hyperparameter combination (maybe with an appropriate bayesian approach).

This would could help to find a good combination of hyperparameters even in difficult settings.

Metadata

Metadata

Assignees

No one assigned

    Labels

    P3Issue moderate in impact or severityenhancementRequest for new feature and/or capabilitypending-cleanupThis issue is pending cleanup. It will be removed in 2 weeks after being assigned.tuneTune-related issues

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0