Open
Description
With the reference to this documentation, I've some problem implementing Reiforcement Learning using stable-baseline3. I've trained the model for 1 million time steps and once I load the model for rendering, the drone moves a little around its initial position, but it doesn't reach any waypoint. Is it possible that there is a problem with how the reward function is defined?
Metadata
Metadata
Assignees
Labels
No labels