-
Notifications
You must be signed in to change notification settings - Fork 11
Initial IMM training loss #4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Hi, thanks for your effort! I am curious why you use ema weight for y_r? Have you conducted some experiments and found it worked better? |
@XinYu-Andy thank you! Using ema weights for I updated the pr with this change for now! |
Are you doing experiments on cifar10? I conducted the experiment for a few weeks but was still not able to reproduce the results reported in the paper... |
Yes, with the DDPM++ UNet, using all the same reported hyperparams. I just started today and haven't extensively experimented yet, but would like to see a stable loss before anything further. |
Sounds good!👍 |
Initial attempt at the training loss defined in the paper (assumes constant decrement in$\eta(t)$ for mapping function).