8000 Idea to improve training · Issue #32 · lifrordi/DeepStack-Leduc · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Idea to improve training #32
Open
Open
@happypepper

Description

@happypepper

Right now DeepStack is using masked huber loss to compute the loss where the bucket is given weight 0 if impossible and 1 if possible.
What if we changed the mask so it can be any value between 0 and 1 weighted by how likely that bucket is?

So if there are 2 buckets A and B that both have error of 0.5, but bucket A has range probability 0.01, and bucket B has probability 0.0001, it would give 100x more importance to updating bucket A's CFV to become closer to its target.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0