r/StableDiffusion Sep 11 '22

A better (?) way of doing img2img by finding the noise which reconstructs the original image Img2Img

u/Aumanidol Sep 12 '22

Did anyone manage to get good results with the AUTOMATIC implementation? My workflow is as follows:

  • I upload a picture

  • select "img2img alternative test"

  • select Euler (not Euler a)

  • hit interrogate

  • paste the found prompt into the "original prompt" box

  • change something in the prompt (the one on top of the page) and hit generate.

Results so far have been terrible, especially with faces.

I've read that better results were attained by lowering "CFG scale" to 0.0 (this UI doesn't allow that, and I have no access to the terminal for a couple of days), but lowering it to 1 doesn't seem to help.
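For anyone wondering why 0.0 and 1 would behave differently: a minimal sketch of the standard classifier-free guidance mix, assuming the usual formula (unconditional prediction plus scale times the conditional-minus-unconditional difference). The arrays are made-up stand-ins for the model's two noise predictions, and `cfg_combine` is a hypothetical helper name, not a function from any UI.

```python
import numpy as np

def cfg_combine(eps_uncond, eps_cond, scale):
    """Standard classifier-free guidance mix of two noise predictions."""
    return eps_uncond + scale * (eps_cond - eps_uncond)

# Hypothetical stand-ins for the model's unconditional and prompt-conditioned outputs.
eps_uncond = np.array([0.0, 1.0, 2.0])
eps_cond = np.array([1.0, 1.0, 0.0])

at_zero = cfg_combine(eps_uncond, eps_cond, 0.0)   # prompt is ignored entirely
at_one = cfg_combine(eps_uncond, eps_cond, 1.0)    # plain conditional prediction
at_seven = cfg_combine(eps_uncond, eps_cond, 7.0)  # prompt direction is amplified
```

So at scale 1 the prompt still fully conditions the prediction, it just isn't amplified, while at 0.0 the prompt drops out. That may be why going down to 1 is not equivalent to the 0.0 setting people reported.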

I've messed around with the decode parameters but nothing good came out of it either.
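For context on what the decode pass is trying to do, here is a toy sketch of the idea in the thread title: run the Euler sampler backwards (low noise to high noise) to find a noise sample, then run it forwards again and check how close the reconstruction is. Everything here is an assumption for illustration: the "model" is an analytic denoiser for Gaussian data rather than Stable Diffusion, the sigma schedule is arbitrary, and `denoise` and `euler_walk` are hypothetical names. It is not the AUTOMATIC script's code.

```python
import numpy as np

S2 = 1.0  # variance of the toy "image" distribution (assumption)

def denoise(x, sigma):
    # Optimal denoiser for data ~ N(0, S2) corrupted by Gaussian noise of std sigma.
    return x * S2 / (S2 + sigma**2)

def euler_walk(x, sigmas):
    # One Euler step per consecutive sigma pair; the direction depends on the
    # order of `sigmas` (increasing = find noise, decreasing = sample).
    for s_from, s_to in zip(sigmas[:-1], sigmas[1:]):
        d = (x - denoise(x, s_from)) / s_from  # derivative estimate at the current point
        x = x + d * (s_to - s_from)
    return x

sigmas = np.linspace(0.01, 10.0, 1000)  # arbitrary toy schedule
rng = np.random.default_rng(0)
image = rng.standard_normal(16)          # stand-in for an image latent

noise = euler_walk(image, sigmas)        # "decode": image -> noise
recon = euler_walk(noise, sigmas[::-1])  # ordinary sampling: noise -> image

rel_err = np.linalg.norm(recon - image) / np.linalg.norm(image)
```

The round trip is only approximate, because each backward step evaluates the derivative at a different point than the matching forward step. Fewer steps make the mismatch larger, which is one plausible reason the decode settings matter so much.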


u/Aumanidol Sep 12 '22

Worth mentioning: the prompt produced by the interrogate button for the very same picture used above is the following: "a woman smiling and holding a cell phone in her hand and a cell phone in her other hand with a picture of a woman on it, by Adélaïde Labille-Guiard"

Am I using the wrong implementation?


u/wildgurularry Sep 13 '22

Did you wind up getting anything working? Just playing around with it now, and the results are not quite as great as I expected. Of course, if I use the image of the woman posted above I get amazing results... but any of my own pictures that I have tried are failing miserably, unless they are just a fully cropped face.