8000 GitHub - theophilegervet/discrete-off-policy-evaluation: Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

theophilegervet/discrete-off-policy-evaluation

Repository files navigation

discrete-off-policy-evaluation

Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.

Estimators:

  • Importance sampling (IS)
  • Weighted importance sampling (WIS)
  • Per-decision importance sampling (PDIS)
  • Weighted per-decision importance sampling (WPDIS)
  • Fitted Q evaluation (FQE)
  • Doubly robust (DR)
  • Weighted doubly robust (WDR)

About

Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0