My guess is that the authors of the famous Reddit post trained a ControlNet conditioned on QR codes to achieve this.
Therefore I'll need to:
- Create a function that receives an image and a URL, generates the QR code for said URL, and then overlays the QR onto the image. Let's define it as $QR(i, u)$.
- Create a dataloader based on a subset of LAION-Aesthetics:
  - For each image $i$, generate a QR code from a random URL $u$ and create $i_{qr} = QR(i, u)$.
  - Return the prompt $t$ associated with $i$, the QR code $qr$, and $i_{qr}$.
- Train a ControlNet following this (kinda) logic (a fleshed-out sketch follows the snippet):

```python
cn = ControlNet()
for prompt, qr, img_qr in dataloader:
    y = cn(prompt, qr)     # generate an image from the prompt, conditioned on the QR
    loss = sim(img_qr, y)  # push the output towards the QR-overlaid target
```
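In reality `sim` isn't a pixel-space similarity: Stable Diffusion ControlNets are trained with the usual noise-prediction MSE in latent space. Here's roughly what one training step looks like with 🤗 diffusers, in the spirit of their `train_controlnet.py` example. The base model id, the `(prompt, qr, img_qr)` batch format, and the tensor layouts are assumptions, not my actual script.

```python
import torch
import torch.nn.functional as F
from diffusers import AutoencoderKL, ControlNetModel, DDPMScheduler, UNet2DConditionModel
from transformers import CLIPTextModel, CLIPTokenizer

base = "runwayml/stable-diffusion-v1-5"  # assumed base model
tokenizer = CLIPTokenizer.from_pretrained(base, subfolder="tokenizer")
text_encoder = CLIPTextModel.from_pretrained(base, subfolder="text_encoder")
vae = AutoencoderKL.from_pretrained(base, subfolder="vae")
unet = UNet2DConditionModel.from_pretrained(base, subfolder="unet")
noise_scheduler = DDPMScheduler.from_pretrained(base, subfolder="scheduler")
controlnet = ControlNetModel.from_unet(unet)  # only this part gets trained

def training_step(prompt, qr, img_qr):
    # img_qr: target images in [-1, 1], (B, 3, H, W); qr: QR images in [0, 1]
    latents = vae.encode(img_qr).latent_dist.sample() * vae.config.scaling_factor
    noise = torch.randn_like(latents)
    timesteps = torch.randint(
        0, noise_scheduler.config.num_train_timesteps, (latents.shape[0],)
    )
    noisy_latents = noise_scheduler.add_noise(latents, noise, timesteps)

    tokens = tokenizer(prompt, padding="max_length", truncation=True,
                       max_length=tokenizer.model_max_length, return_tensors="pt")
    text_emb = text_encoder(tokens.input_ids)[0]

    # the QR image is the ControlNet conditioning signal
    down_res, mid_res = controlnet(
        noisy_latents, timesteps, encoder_hidden_states=text_emb,
        controlnet_cond=qr, return_dict=False,
    )
    noise_pred = unet(
        noisy_latents, timesteps, encoder_hidden_states=text_emb,
        down_block_additional_residuals=down_res,
        mid_block_additional_residual=mid_res,
    ).sample
    return F.mse_loss(noise_pred, noise)  # the model learns to predict the noise
```

So the components to build are: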
- QRify
- Dataloader
- Training script
- Train
Done. Here's an example of the interface to overlay a QR:

```python
from utils import overlay_qr

overlay_qr(url="google.com", image="base_img.jpg", alpha=0.4)
```
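For reference, a minimal sketch of what `overlay_qr` can look like under the hood, assuming the `qrcode` and Pillow libraries; the real `utils.overlay_qr` may differ in details:

```python
# Minimal sketch of overlay_qr, assuming the qrcode and Pillow libraries.
import qrcode
from PIL import Image

def overlay_qr(url: str, image: str, alpha: float = 0.4) -> Image.Image:
    base = Image.open(image).convert("RGB")

    # high error correction so the code stays scannable after blending
    qr = qrcode.QRCode(error_correction=qrcode.constants.ERROR_CORRECT_H)
    qr.add_data(url)
    qr.make(fit=True)
    qr_img = qr.make_image(fill_color="black", back_color="white").convert("RGB")

    # alpha blend: out = (1 - alpha) * base + alpha * qr
    return Image.blend(base, qr_img.resize(base.size), alpha)
```

Higher alpha makes the QR more scannable but the images less natural, which is exactly the trade-off below.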
I'm not sure which alpha I should use to create the final dataset, though.
The data preparation logic is ready:

```python
from data import ImprovedAestheticsDataloader

ds = ImprovedAestheticsDataloader(split="train[0:25]")
ds.prepare_data()
ds.dataset  # dataset in Hugging Face format
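```

For context, `prepare_data` boils down to something like this sketch; the Hub dataset id, the column names, and the `random_url` helper are assumptions (the real class does more bookkeeping):

```python
import random
import string

import requests
from datasets import load_dataset

from utils import overlay_qr  # the overlay interface shown above

def random_url() -> str:
    # hypothetical helper: sample a random (fake) URL to encode
    slug = "".join(random.choices(string.ascii_lowercase, k=8))
    return f"https://example.com/{slug}"

def prepare_data(split: str = "train[0:25]"):
    # assumed source: a LAION-Aesthetics subset on the Hugging Face Hub
    ds = load_dataset("ChristophSchuhmann/improved_aesthetics_6.5plus", split=split)

    def add_qr(example):
        # download the image first, since overlay_qr takes a file path
        path = "/tmp/base_img.jpg"
        with open(path, "wb") as f:
            f.write(requests.get(example["URL"], timeout=10).content)
        example["image_qr"] = overlay_qr(url=random_url(), image=path, alpha=0.4)
        return example

    return ds.map(add_qr)  # keeps TEXT (the prompt) alongside the new column
```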
I've trained various versions, but so far they all suck.
Here are some examples:
Clearly, the way I'm generating the training images is conditioning the model to just paste the QR onto the image.
I'll try to improve this in the next iterations. 😊
ps: the code sucks atm.