A simple noise conditional score network (NCSN) that can generate animal face images.
There are two model architectures in `models`:
- `ncst`, a DiT-like architecture.
- `unet`, a UNet-based architecture.
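
For readers new to the approach: a noise conditional score network is commonly sampled with annealed Langevin dynamics, stepping through noise levels from largest to smallest. The sketch below is illustrative only and is not taken from this repository. `toy_score`, `geometric_sigmas`, and `annealed_langevin` are hypothetical names, and an analytic 1-D Gaussian score stands in for a trained `ncst` or `unet` model. The noise range, step size, and step count are assumptions chosen to keep the toy stable.

```python
import math
import random


def toy_score(x, sigma, mu=2.0, data_std=0.5):
    """Exact score of N(mu, data_std^2) data after adding N(0, sigma^2) noise.

    Hypothetical stand-in for a trained network s_theta(x, sigma).
    """
    return -(x - mu) / (data_std ** 2 + sigma ** 2)


def geometric_sigmas(sigma_max=1.0, sigma_min=0.01, num_levels=10):
    """Noise levels in a geometric sequence, largest first (assumed schedule)."""
    ratio = (sigma_min / sigma_max) ** (1.0 / (num_levels - 1))
    return [sigma_max * ratio ** i for i in range(num_levels)]


def annealed_langevin(score_fn, sigmas, steps_per_level=100, eps=2e-5, rng=None):
    """Draw one sample by running Langevin dynamics at each noise level in turn."""
    if rng is None:
        rng = random.Random(0)
    x = rng.gauss(0.0, sigmas[0])  # initialise from broad noise
    for sigma in sigmas:  # largest noise level first
        # Step size scales with sigma^2, so it shrinks as the noise level drops.
        alpha = eps * (sigma / sigmas[-1]) ** 2
        for _ in range(steps_per_level):
            z = rng.gauss(0.0, 1.0)
            x = x + 0.5 * alpha * score_fn(x, sigma) + math.sqrt(alpha) * z
    return x


if __name__ == "__main__":
    rng = random.Random(0)
    sigmas = geometric_sigmas()
    samples = [annealed_langevin(toy_score, sigmas, rng=rng) for _ in range(200)]
    print("sample mean:", sum(samples) / len(samples))
```

With a real model, `score_fn` would be replaced by a forward pass that takes the noisy image and the noise level, and `x` would be an image tensor rather than a scalar.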
- More experiments
- Change to SDE-based NCSN
- Project description
- Add citation