Closed
Description
Right now, action precision (alpha
) in case of stochastic action sampling, is defaulted to alpha = 16.0
within the function of the control
module sample_action(..., alpha = 16.0)
. Change this so that an instance of Agent.agent
itself stores the action precision, and calls it in its method self.sample_action()
.
Originally posted by @helenegu in #4 (comment)