mirror of
https://github.com/invoke-ai/InvokeAI
synced 2024-08-30 20:32:17 +00:00
update IMG2IMG.md
This commit is contained in:
parent
470b267939
commit
8cdd961ad2
@ -12,21 +12,19 @@ stable diffusion to build the prompt on top of the image you provide, preserving
|
||||
the original's basic shape and layout. To use it, provide the `--init_img`
|
||||
option as shown here:
|
||||
|
||||
```commandline
|
||||
tree on a hill with a river, nature photograph, national geographic -I./test-pictures/tree-and-river-sketch.png -f 0.85
|
||||
```
|
||||
!!! example ""
|
||||
|
||||
This will take the original image shown here:
|
||||
```commandline
|
||||
tree on a hill with a river, nature photograph, national geographic -I./test-pictures/tree-and-river-sketch.png -f 0.85
|
||||
```
|
||||
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/c02fd/c02fddc819043556d56b4eaec86573279128f391" alt="original-image"{ width=320 }
|
||||
</figure>
|
||||
<figure markdown>
|
||||
|
||||
and generate a new image based on it as shown here:
|
||||
| original image | generated image |
|
||||
| :------------: | :-------------: |
|
||||
| data:image/s3,"s3://crabby-images/c02fd/c02fddc819043556d56b4eaec86573279128f391" alt="original-image"{ width=320 } | data:image/s3,"s3://crabby-images/9734b/9734bf025754f809520c5fef8d8f47b5bf8f5c30" alt="generated-image"{ width=320 } |
|
||||
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/9734b/9734bf025754f809520c5fef8d8f47b5bf8f5c30" alt="generated-image"{ width=320 }
|
||||
</figure>
|
||||
</figure>
|
||||
|
||||
The `--init_img` (`-I`) option gives the path to the seed picture. `--strength`
|
||||
(`-f`) controls how much the original will be modified, ranging from `0.0` (keep
|
||||
@ -88,13 +86,15 @@ from a prompt. If the step count is 10, then the "latent space" (Stable
|
||||
Diffusion's internal representation of the image) for the prompt "fire" with
|
||||
seed `1592514025` develops something like this:
|
||||
|
||||
```bash
|
||||
invoke> "fire" -s10 -W384 -H384 -S1592514025
|
||||
```
|
||||
!!! example ""
|
||||
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/8a59b/8a59bbd24053287004a4b06aeb055a3b804e480b" alt="latent steps"
|
||||
</figure>
|
||||
```bash
|
||||
invoke> "fire" -s10 -W384 -H384 -S1592514025
|
||||
```
|
||||
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/8a59b/8a59bbd24053287004a4b06aeb055a3b804e480b" alt="latent steps"{ width=720 }
|
||||
</figure>
|
||||
|
||||
Put simply: starting from a frame of fuzz/static, SD finds details in each frame
|
||||
that it thinks look like "fire" and brings them a little bit more into focus,
|
||||
@ -109,25 +109,23 @@ into the sequence at the appropriate point, with just the right amount of noise.
|
||||
|
||||
### A concrete example
|
||||
|
||||
I want SD to draw a fire based on this hand-drawn image:
|
||||
!!! example "I want SD to draw a fire based on this hand-drawn image"
|
||||
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/61915/619157b61675a459cffacca6cc2859edd13eac99" alt="drawing of a fireplace"
|
||||
</figure>
|
||||
data:image/s3,"s3://crabby-images/61915/619157b61675a459cffacca6cc2859edd13eac99" alt="drawing of a fireplace"{ align=left }
|
||||
|
||||
Let's only do 10 steps, to make it easier to see what's happening. If strength
|
||||
is `0.7`, this is what the internal steps the algorithm has to take will look
|
||||
like:
|
||||
Let's only do 10 steps, to make it easier to see what's happening. If strength
|
||||
is `0.7`, this is what the internal steps the algorithm has to take will look
|
||||
like:
|
||||
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/e1c78/e1c78f03255235872343f339c0f44e4771a65875" alt="gravity32"
|
||||
</figure>
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/e1c78/e1c78f03255235872343f339c0f44e4771a65875" alt="gravity32"
|
||||
</figure>
|
||||
|
||||
With strength `0.4`, the steps look more like this:
|
||||
With strength `0.4`, the steps look more like this:
|
||||
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/52d35/52d350fd13b7ca06f1a22ea1263c85f74b158dda" alt="gravity30"
|
||||
</figure>
|
||||
<figure markdown>
|
||||
data:image/s3,"s3://crabby-images/52d35/52d350fd13b7ca06f1a22ea1263c85f74b158dda" alt="gravity30"
|
||||
</figure>
|
||||
|
||||
Notice how much more fuzzy the starting image is for strength `0.7` compared to
|
||||
`0.4`, and notice also how much longer the sequence is with `0.7`:
|
||||
|
Loading…
Reference in New Issue
Block a user