r/StableDiffusion Sep 11 '22

A better (?) way of doing img2img by finding the noise which reconstructs the original image Img2Img

Post image
889 Upvotes

View all comments

11

u/i_have_chosen_a_name Sep 12 '22

Wait if it can find the latent space representation of the original image does that not mean every single combination of 512x512 pixel is present in the data set? How is that possible. Surely the latent space only contains an aproximation, no?

Also I’m blown away at the development speed of this after being open sourced. Google their Imagen and OpenAi dalle2 will never be able to compete with the open source fine tuning you can get from w couple million dev monkeys all fucking around with the code and model.

3

u/StickiStickman Sep 12 '22

Surely the latent space only contains an aproximation, no?

Obviously, that's literally what he said though?

You also seem to have a bit of a fundamental misunderstanding how it works:

Wait if it can find the latent space representation of the original image does that not mean every single combination of 512x512 pixel is present in the data set?

It wouldn't mean that at all. It's not just copy pasting images from its dataset.