update IMG2IMG.md

This commit is contained in:
mauwii 2022-12-02 06:51:32 +01:00 committed by Lincoln Stein
parent 470b267939
commit 8cdd961ad2

View File

@ -12,21 +12,19 @@ stable diffusion to build the prompt on top of the image you provide, preserving
the original's basic shape and layout. To use it, provide the `--init_img` the original's basic shape and layout. To use it, provide the `--init_img`
option as shown here: option as shown here:
```commandline !!! example ""
tree on a hill with a river, nature photograph, national geographic -I./test-pictures/tree-and-river-sketch.png -f 0.85
```
This will take the original image shown here: ```commandline
tree on a hill with a river, nature photograph, national geographic -I./test-pictures/tree-and-river-sketch.png -f 0.85
```
<figure markdown> <figure markdown>
![original-image](https://user-images.githubusercontent.com/50542132/193946000-c42a96d8-5a74-4f8a-b4c3-5213e6cadcce.png){ width=320 }
</figure>
and generate a new image based on it as shown here: | original image | generated image |
| :------------: | :-------------: |
| ![original-image](https://user-images.githubusercontent.com/50542132/193946000-c42a96d8-5a74-4f8a-b4c3-5213e6cadcce.png){ width=320 } | ![generated-image](https://user-images.githubusercontent.com/111189/194135515-53d4c060-e994-4016-8121-7c685e281ac9.png){ width=320 } |
<figure markdown> </figure>
![generated-image](https://user-images.githubusercontent.com/111189/194135515-53d4c060-e994-4016-8121-7c685e281ac9.png){ width=320 }
</figure>
The `--init_img` (`-I`) option gives the path to the seed picture. `--strength` The `--init_img` (`-I`) option gives the path to the seed picture. `--strength`
(`-f`) controls how much the original will be modified, ranging from `0.0` (keep (`-f`) controls how much the original will be modified, ranging from `0.0` (keep
@ -88,13 +86,15 @@ from a prompt. If the step count is 10, then the "latent space" (Stable
Diffusion's internal representation of the image) for the prompt "fire" with Diffusion's internal representation of the image) for the prompt "fire" with
seed `1592514025` develops something like this: seed `1592514025` develops something like this:
```bash !!! example ""
invoke> "fire" -s10 -W384 -H384 -S1592514025
```
<figure markdown> ```bash
![latent steps](../assets/img2img/000019.steps.png) invoke> "fire" -s10 -W384 -H384 -S1592514025
</figure> ```
<figure markdown>
![latent steps](../assets/img2img/000019.steps.png){ width=720 }
</figure>
Put simply: starting from a frame of fuzz/static, SD finds details in each frame Put simply: starting from a frame of fuzz/static, SD finds details in each frame
that it thinks look like "fire" and brings them a little bit more into focus, that it thinks look like "fire" and brings them a little bit more into focus,
@ -109,25 +109,23 @@ into the sequence at the appropriate point, with just the right amount of noise.
### A concrete example ### A concrete example
I want SD to draw a fire based on this hand-drawn image: !!! example "I want SD to draw a fire based on this hand-drawn image"
<figure markdown> ![drawing of a fireplace](../assets/img2img/fire-drawing.png){ align=left }
![drawing of a fireplace](../assets/img2img/fire-drawing.png)
</figure>
Let's only do 10 steps, to make it easier to see what's happening. If strength Let's only do 10 steps, to make it easier to see what's happening. If strength
is `0.7`, this is what the internal steps the algorithm has to take will look is `0.7`, this is what the internal steps the algorithm has to take will look
like: like:
<figure markdown> <figure markdown>
![gravity32](../assets/img2img/000032.steps.gravity.png) ![gravity32](../assets/img2img/000032.steps.gravity.png)
</figure> </figure>
With strength `0.4`, the steps look more like this: With strength `0.4`, the steps look more like this:
<figure markdown> <figure markdown>
![gravity30](../assets/img2img/000030.steps.gravity.png) ![gravity30](../assets/img2img/000030.steps.gravity.png)
</figure> </figure>
Notice how much more fuzzy the starting image is for strength `0.7` compared to Notice how much more fuzzy the starting image is for strength `0.7` compared to
`0.4`, and notice also how much longer the sequence is with `0.7`: `0.4`, and notice also how much longer the sequence is with `0.7`: