[Idea] Crowd-training of unreleased Flickr landscape model · Issue #17 · NVlabs/SPADE · GitHub
Open
sidyakinian opened this issue Apr 14, 2019 · 40 comments

Comments

@sidyakinian
sidyakinian commented Apr 14, 2019

As you probably know, the pre-trained Flickr landscape model could not be released in this repo. But that model can draw landscapes with unbelievable quality, much higher than that of the coco-stuff model, thanks to training on 40k Flickr images; the fact that it hasn't been released is disappointing.

Some of us probably want to train it ourselves (me for one, and @Lokiiiiii also brought this up). Typically it would cost a few thousand dollars. Thankfully, Product Hunt offers a paid subscription that includes $5,000 in AWS credits for $720: https://www.producthunt.com/ship#launch (Product Hunt Ship Pro, yearly subscription)

This gets the cost down to $720, but it's still a lot. Since a few of us are going to do the same exact thing, why don't we train the model together and share the cost? $720 split among 5 people is already $144, which is fair for such a powerful model.
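For anyone weighing whether to join, the split can be sketched in a few lines of Python. This is only an illustration of the arithmetic above: the $720 and $5,000 figures are the ones quoted in this comment, while the function name `cost_per_person` and the group sizes in the loop are made up for the example.

```python
def cost_per_person(total_cost: float, contributors: int) -> float:
    """Split a fixed cost evenly among contributors."""
    if contributors < 1:
        raise ValueError("need at least one contributor")
    return total_cost / contributors


# Figures quoted in the comment above (USD).
SUBSCRIPTION_COST = 720   # Product Hunt Ship Pro, yearly subscription
AWS_CREDITS = 5000        # AWS credits included with the subscription

# Hypothetical group sizes, purely for illustration.
for n in (1, 5, 10):
    share = cost_per_person(SUBSCRIPTION_COST, n)
    print(f"{n:>2} contributors: ${share:.2f} each")

# Fraction of the credit's face value actually paid.
print(f"paid per $1 of credit: ${SUBSCRIPTION_COST / AWS_CREDITS:.3f}")
```

With five contributors this gives the $144 share mentioned above; every additional person lowers it further.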

Once we have a few people in, we can start a crowdfunding campaign, pledge funds, train the model and share it among us.

What do you think of this?

@code-de
code-de commented Apr 14, 2019

That's a great idea! Count me in.

@ueoo
ueoo commented Apr 14, 2019

And me.

@banyet1
banyet1 commented Apr 14, 2019

Also count me in.

@rslowinski

Under what license would such a model be released?

@sidyakinian
Author

> Under what license would such a model be released?

@Ares97 Likely the same license that SPADE was released under

@sidyakinian
Author
sidyakinian commented Apr 15, 2019

Okay, looks like we have a few people. Let's form a group chat in some messenger to discuss the next steps. @code-de, @hologerry, and @banyet1, would you please email me or just comment here which few messengers among these you find convenient?

  • Messenger
  • WhatsApp
  • iMessage
  • Discord
  • Telegram

We can then choose the messenger everyone picked and host the group chat there. My email is on my profile page.

@harsh2204, @wasd96040501, if you decide to join, you're welcome to email me too!

@ueoo
ueoo commented Apr 15, 2019

Telegram would be great. Same username as Github.

@code-de
code-de commented Apr 15, 2019

Telegram, Discord, WhatsApp work well for me - emailed you my usernames in each of them. Btw, @Ares97, @Lokiiiiii, @aeti-in - you guys seem to be interested in this as well?

@genekogan

@sidyakinian i am also interested in this. please include me!

@sidyakinian
Author

> @sidyakinian i am also interested in this. please include me!

@genekogan Sure! Please remember to email me or comment some messengers so that you can join.

@banyet1
banyet1 commented Apr 15, 2019

I'm using WhatsApp. I just sent you an email, please check it out.

@sidyakinian
Author

@hologerry, @banyet1 So you guys picked different options, Telegram and WhatsApp, could one of you use the other messenger so that we can gather in one chat? Either of those two is fine with me and @code-de

@banyet1
banyet1 commented Apr 15, 2019

I'm using Telegram as well, identical to WhatsApp.

@aman-tiwari

I assume there are no segmentation masks for this dataset available, right? So those will also have to be made (or inferred using another network)

@sidyakinian
Author

@aman-tiwari Yes, we'll have to create the dataset ourselves. Thankfully, SPADE researchers used DeepLabV2 for it, which works pretty quickly. We'll just use that or something similar.

@noyoshi
noyoshi commented Apr 16, 2019

Not sure how much help I could be for actually training the network, but I would love to know if / when this happens! I made a pretty rudimentary web UI for this, and would love to be able to use the Flickr model on it. Source code and public site: http://www.smartsketch.xyz

@samuelpietri

@sidyakinian count me in as well, I just sent you an email

@sidyakinian
Author
sidyakinian commented Apr 17, 2019

@noyoshi Training will take 1-2 weeks.

As for being of help, the guys and I can go two ways: pool our own GPU resources, or buy AWS time. If we settle on the latter, anyone can help by pledging money; most likely only one of us (the most experienced one) will actually train the model.

@mingyuliutw
Contributor

As we mentioned at GTC, we are making an online demo so that everyone can play with the Flickr model (on any mobile device or desktop), and likely a standalone version for everyone with an NVIDIA GPU. Hopefully, it will not take too long.

@sidyakinian
Author

@mingyuliutw That's awesome! I've heard somewhere that it's going to be released in summer.

@Flova
Flova commented Apr 18, 2019

It's maybe a dumb question, but what's the main reason the Flickr model is missing? Is it its sheer size, or are there licensing issues?

@bitcoinmeetups

Following

@sidyakinian
Author

@bitcoinmeetups Hi! Please email me your Telegram if you wanna join the chat

@datar-ai
datar-ai commented May 7, 2019

It's awesome! Count me in as well.

@mod-cpu
mod-cpu commented May 13, 2019

Very interested. Count me in

@banyet1
banyet1 commented May 23, 2019

We finished training on the Flickr dataset (41K images) last week.

@bensnell

@banyet1 What are your plans now that the model is complete? Do you have any intention of making it available to others? I would love to test it out.

@genekogan

Hey everyone! We've finished training the model and it's available here https://drive.google.com/open?id=1QJr5HBv8PAjJuVNB9zf8EiA6IcIVCswa
Here's a video of it in action: https://twitter.com/genekogan/status/1136261959970709504

@Seth-Park

@genekogan Great work!
Are there any plans to release the collected Flickr dataset?

@taki0112

@genekogan Nice!!
Do you have any plans to release the dataset?

@mingyuliutw
Contributor

The official Flickr model is available as a web demo via https://www.nvidia.com/en-us/research/ai-playground/

Better models will likely come in the summer. Stay tuned.

@aeti-in
aeti-in commented Jun 13, 2019 via email

@prusnak
prusnak commented Jun 13, 2019

@mingyuliutw Is there a plan to release the actual model, not just the tool to play with it?

@huge123
huge123 commented Jun 17, 2019

@genekogan Thanks for your efforts. Could the Flickr dataset be shared with us?

@genekogan

i'm not sure if releasing the dataset would violate the licenses of the actual photos as they belong to other people. if we can release it, i have no problem with that.

@aviel08
aviel08 commented Jun 18, 2019 via email

@huge123
huge123 commented Jun 18, 2019

> i'm not sure if releasing the dataset would violate the licenses of the actual photos as they belong to other people. if we can release it, i have no problem with that.

It is indeed a tough problem. What method did you use to extract the semantic layout of the Flickr images? Did you annotate some samples and train a model on your own, or use an existing model?

@aviel08
aviel08 commented Jun 18, 2019

> What method did you use to extract the semantic layout of the Flickr images? Did you annotate some samples and train a model on your own, or use an existing model?

I always create my own datasets, either creating my labels or generating them from a 3D application, but I know this is a very specific scenario.

@prusnak
prusnak commented Jun 18, 2019

@genekogan Flickr contains lots of photos which are licensed under the Creative Commons license. If you pick only these for training, I am pretty sure the trained model could be published under the same license again.

@huge123
huge123 commented Jun 22, 2019

@aviel08 @genekogan Is it permissible to share some semantic labels for the Flickr images along with the dataset/dataloader script? I just want to test the pretrained model, thanks.
