Open
Description
I saw that the loss is implemented in its closed formula form. I wonder whether it is feasible to compute the loss directly with torch.autograd operations instead, if I define an energy function myself, such as the LogSumExp energy from the paper 'Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One'.
Lines 5 to 15 in 7f27f4a
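To make the question concrete, here is a minimal sketch of what I have in mind. It is not the repository's code: the toy `classifier`, the helper names `energy` and `score`, and the final placeholder objective are all assumptions made up for illustration. It only shows that the gradient of a LogSumExp energy with respect to the input can be taken with `torch.autograd.grad`, and that `create_graph=True` keeps that gradient differentiable so a loss built on it can be backpropagated.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical toy classifier; any module that returns logits would do.
# Softplus is used so that second derivatives are not identically zero.
classifier = nn.Sequential(nn.Linear(2, 16), nn.Softplus(), nn.Linear(16, 3))


def energy(x):
    # LogSumExp energy over the class logits: E(x) = -logsumexp_y f(x)[y]
    return -torch.logsumexp(classifier(x), dim=1)


def score(x):
    # Score of the unnormalized model: d log p(x) / dx = -dE/dx.
    # create_graph=True keeps the graph of the gradient itself, so a loss
    # that depends on the score can be differentiated w.r.t. the parameters.
    x = x.detach().requires_grad_(True)
    total_energy = energy(x).sum()
    (grad_energy,) = torch.autograd.grad(total_energy, x, create_graph=True)
    return -grad_energy


x = torch.randn(8, 2)
s = score(x)

# Placeholder objective (an assumption, NOT the repository's loss): it only
# demonstrates that gradients flow through the autograd-computed score.
loss = 0.5 * (s ** 2).sum(dim=1).mean()
loss.backward()
```

If this pattern is acceptable, the closed-form terms of the loss could in principle be replaced by quantities derived from `score(x)`, at the cost of a double backward pass through the network.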