Merge branch 'patch-2' of github.com:spezialspezial/stable-diffusion into spezialspezial-patch-2

2024-08-30 20:32:17 +00:00 · 2022-09-26 01:13:39 -04:00
parent 66896dcbbe d16d0d3726
commit f368ebea00
2 changed files with 113 additions and 120 deletions
--- a/README.md
+++ b/README.md
@ -2,6 +2,15 @@

 # InvokeAI: A Stable Diffusion Toolkit

+_Note: This fork is rapidly evolving. Please use the
+[Issues](https://github.com/invoke-ai/InvokeAI/issues) tab to
+report bugs and make feature requests. Be sure to use the provided
+templates. They will help aid diagnose issues faster._
+
+_This repository was formally known as lstein/stable-diffusion_
+
+# **Table of Contents**
+
 ![project logo](docs/assets/logo.png)

 [![discord badge]][discord link]
--- a/docs/features/CLI.md
+++ b/docs/features/CLI.md
@ -103,157 +103,141 @@ overridden on a per-prompt basis (see [List of prompt arguments]

 These arguments are deprecated but still work:

-<figure markdown>

-| Argument           | Shortcut | Default | Description                                                     |
-| ------------------ | -------- | ------- | --------------------------------------------------------------- |
-| `--weights <path>` |          | `None`  | Pth to weights file; use `--model stable-diffusion-1.4` instead |
-| `--laion400m`      | `-l`     | `False` | Use older LAION400m weights; use `--model=laion400m` instead    |
+| Argument           |  Shortcut  |  Default            |  Description |
+|--------------------|------------|---------------------|--------------|
+| --weights <path>   |            | None                | Pth to weights file; use `--model stable-diffusion-1.4` instead |
+| --laion400m        | -l         | False               | Use older LAION400m weights; use `--model=laion400m` instead |

-</figure>
+**A note on path names:** On Windows systems, you may run into
+  problems when passing the dream script standard backslashed path
+  names because the Python interpreter treats "\" as an escape.
+  You can either double your slashes (ick): C:\\\\path\\\\to\\\\my\\\\file, or
+  use Linux/Mac style forward slashes (better): C:/path/to/my/file.

-!!! note
+## List of prompt arguments

-    On Windows systems, you may run into problems when passing the dream script standard backslashed
-    path names because the Python interpreter treats `\` as an escape. You can either double your
-    slashes (ick): `C:\\path\\to\\my\\file`, or use Linux/Mac style forward slashes (better):
-    `C:/path/to/my/file`.
+After the dream.py script initializes, it will present you with a
+**dream>** prompt. Here you can enter information to generate images
+from text (txt2img), to embellish an existing image or sketch
+(img2img), or to selectively alter chosen regions of the image
+(inpainting).

-### List of prompt arguments
+### This is an example of txt2img:

-After the `dream.py` script initializes, it will present you with a **`dream>`**
-prompt. Here you can enter information to generate images from text (txt2img),
-to embellish an existing image or sketch (img2img), or to selectively alter
-chosen regions of the image (inpainting).
+~~~~
+dream> waterfall and rainbow -W640 -H480
+~~~~

-#### txt2img
+This will create the requested image with the dimensions 640 (width)
+and 480 (height).

-!!! example
+Here are the dream> command that apply to txt2img:

-    ```bash
-    dream> "waterfall and rainbow" -W640 -H480
-    ```
+| Argument           |  Shortcut  |  Default            |  Description |
+|--------------------|------------|---------------------|--------------|
+| "my prompt"        |            |                    | Text prompt to use. The quotation marks are optional. |
+| --width <int>      | -W<int>   | 512                 | Width of generated image |
+| --height <int>     | -H<int>   | 512                 | Height of generated image |
+| --iterations <int> | -n<int>   | 1                   | How many images to generate from this prompt |
+| --steps <int>      | -s<int>   | 50                  | How many steps of refinement to apply |
+| --cfg_scale <float>| -C<float> | 7.5                 | How hard to try to match the prompt to the generated image; any number greater than 1.0 works, but the useful range is roughly 5.0 to 20.0 |
+| --seed <int>       | -S<int>   | None                | Set the random seed for the next series of images. This can be used to recreate an image generated previously.|
+| --sampler <sampler>| -A<sampler>| k_lms              | Sampler to use. Use -h to get list of available samplers. |
+| --grid             | -g        | False               | Turn on grid mode to return a single image combining all the images generated by this prompt |
+| --individual       | -i        | True                | Turn off grid mode (deprecated; leave off --grid instead) |
+| --outdir <path>    |  -o<path> | outputs/img_samples  | Temporarily change the location of these images |
+| --seamless         |           | False               | Activate seamless tiling for interesting effects |
+| --log_tokenization | -t        | False               | Display a color-coded list of the parsed tokens derived from the prompt |
+| --skip_normalization| -x       | False               | Weighted subprompts will not be normalized. See [Weighted Prompts](./OTHER.md#weighted-prompts) |
+| --upscale <int> <float> | -U <int> <float> | -U 1 0.75| Upscale image by magnification factor (2, 4), and set strength of upscaling (0.0-1.0). If strength not set, will default to 0.75. |
+| --gfpgan_strength <float>  | -G <float> | -G0        | Fix faces using the GFPGAN algorithm; argument indicates how hard the algorithm should try (0.0-1.0) |
+| --save_original    | -save_orig| False               | When upscaling or fixing faces, this will cause the original image to be saved rather than replaced. |
+| --variation <float>  |-v<float>| 0.0                 | Add a bit of noise (0.0=none, 1.0=high) to the image in order to generate a series of variations. Usually used in combination with -S<seed> and -n<int> to generate a series a riffs on a starting image. See [Variations](./VARIATIONS.md). |
+| --with_variations <pattern> | -V<pattern>| None      | Combine two or more variations. See [Variations](./VARIATIONS.md) for now to use this. |

-    This will create the requested image with the dimensions 640 (width) and 480 (height).
+Note that the width and height of the image must be multiples of
+64. You can provide different values, but they will be rounded down to
+the nearest multiple of 64.

-Those are the `dream` commands that apply to txt2img:

-| Argument <img width="680" align="right"/> | Shortcut <img width="420" align="right"/> | Default <img width="480" align="right"/> | Description                                                                                                                                                                                                                                      |
-| ----------------------------------------- | ----------------------------------------- | ---------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
-| `"my prompt"`                             |                                           |                                          | Text prompt to use. The quotation marks are optional.                                                                                                                                                                                            |
-| `--width <int>`                           | `-W<int>`                                 | `512`                                    | Width of generated image                                                                                                                                                                                                                         |
-| `--height <int>`                          | `-H<int>`                                 | `512`                                    | Height of generated image                                                                                                                                                                                                                        |
-| `--iterations <int>`                      | `-n<int>`                                 | `1`                                      | How many images to generate from this prompt                                                                                                                                                                                                     |
-| `--steps <int>`                           | `-s<int>`                                 | `50`                                     | How many steps of refinement to apply                                                                                                                                                                                                            |
-| `--cfg_scale <float>`                     | `-C<float>`                               | `7.5`                                    | How hard to try to match the prompt to the generated image; any number greater than 0.0 works, but the useful range is roughly 5.0 to 20.0                                                                                                       |
-| `--seed <int>`                            | `-S<int>`                                 | `None`                                   | Set the random seed for the next series of images. This can be used to recreate an image generated previously.                                                                                                                                   |
-| `--sampler <sampler>`                     | `-A<sampler>`                             | `k_lms`                                  | Sampler to use. Use `-h` to get list of available samplers.                                                                                                                                                                                      |
-| `--grid`                                  | `-g`                                      | `False`                                  | Turn on grid mode to return a single image combining all the images generated by this prompt                                                                                                                                                     |
-| `--individual`                            | `-i`                                      | `True`                                   | Turn off grid mode (deprecated; leave off `--grid` instead)                                                                                                                                                                                      |
-| `--outdir <path>`                         | `-o<path>`                                | `outputs/img_samples`                    | Temporarily change the location of these images                                                                                                                                                                                                  |
-| `--seamless`                              |                                           | `False`                                  | Activate seamless tiling for interesting effects                                                                                                                                                                                                 |
-| `--log_tokenization`                      | `-t`                                      | `False`                                  | Display a color-coded list of the parsed tokens derived from the prompt                                                                                                                                                                          |
-| `--skip_normalization`                    | `-x`                                      | `False`                                  | Weighted subprompts will not be normalized. See [Weighted Prompts](./OTHER.md#weighted-prompts)                                                                                                                                                  |
-| `--upscale <int> <float>`                 | `-U <int> <float>`                        | `-U 1 0.75`                              | Upscale image by magnification factor (2, 4), and set strength of upscaling (0.0-1.0). If strength not set, will default to 0.75.                                                                                                                |
-| `--gfpgan_strength <float>`               | `-G <float>`                              | `-G0`                                    | Fix faces using the GFPGAN algorithm; argument indicates how hard the algorithm should try (0.0-1.0)                                                                                                                                             |
-| `--save_original`                         | `-save_orig`                              | `False`                                  | When upscaling or fixing faces, this will cause the original image to be saved rather than replaced.                                                                                                                                             |
-| `--variation <float>`                     | `-v<float>`                               | `0.0`                                    | Add a bit of noise (0.0=none, 1.0=high) to the image in order to generate a series of variations. Usually used in combination with `-S<seed>` and `-n<int>` to generate a series a riffs on a starting image. See [Variations](./VARIATIONS.md). |
-| `--with_variations <pattern>`             | `-V<pattern>`                             | `None`                                   | Combine two or more variations. See [Variations](./VARIATIONS.md) for now to use this.                                                                                                                                                           |
+### This is an example of img2img:	

-!!! note
+~~~~
+dream> waterfall and rainbow -I./vacation-photo.png -W640 -H480 --fit
+~~~~

-    The width and height of the image must be multiples of 64. You can provide different
-    values, but they will be rounded down to the nearest multiple of 64.
+This will modify the indicated vacation photograph by making it more
+like the prompt. Results will vary greatly depending on what is in the
+image. We also ask to --fit the image into a box no bigger than
+640x480. Otherwise the image size will be identical to the provided
+photo and you may run out of memory if it is large.

-#### img2img
+In addition to the command-line options recognized by txt2img, img2img
+accepts additional options:

-!!! example
+| Argument           |  Shortcut  |  Default            |  Description |
+|--------------------|------------|---------------------|--------------|
+| --init_img <path>  | -I<path>   | None                | Path to the initialization image |
+| --fit              | -F         | False               | Scale the image to fit into the specified -H and -W dimensions |
+| --strength <float> | -s<float>  | 0.75                | How hard to try to match the prompt to the initial image. Ranges from 0.0-0.99, with higher values replacing the initial image completely.|

-    ```bash
-    dream> "waterfall and rainbow" -I./vacation-photo.png -W640 -H480 --fit
-    ```
+### This is an example of inpainting:

-    This will modify the indicated vacation photograph by making it more like the prompt. Results will
-    vary greatly depending on what is in the image. We also ask to --fit the image into a box no bigger
-    than 640x480. Otherwise the image size will be identical to the provided photo and you may run out
-    of memory if it is large.
+~~~~
+dream> waterfall and rainbow -I./vacation-photo.png -M./vacation-mask.png -W640 -H480 --fit
+~~~~

-Repeated chaining of img2img on an image can result in significant color shifts
-in the output, especially if run with lower strength. Color correction can be
-run against a reference image to fix this issue. Use the original input image to
-the chain as the the reference image for each step in the chain.
+This will do the same thing as img2img, but image alterations will
+only occur within transparent areas defined by the mask file specified
+by -M. You may also supply just a single initial image with the areas
+to overpaint made transparent, but you must be careful not to destroy
+the pixels underneath when you create the transparent areas. See
+[Inpainting](./INPAINTING.md) for details.

-In addition to the command-line options recognized by txt2img, img2img accepts
-additional options:
+inpainting accepts all the arguments used for txt2img and img2img, as
+well as the --mask (-M) argument:

-| Argument <img width="160" align="right"/> | Shortcut    | Default | Description                                                                                                                                |
-| ----------------------------------------- | ----------- | ------- | ------------------------------------------------------------------------------------------------------------------------------------------ |
-| `--init_img <path>`                       | `-I<path>`  | `None`  | Path to the initialization image                                                                                                           |
-| `--init_color <path>`                     |             | `None`  | Path to reference image for color correction                                                                                               |
-| `--fit`                                   | `-F`        | `False` | Scale the image to fit into the specified -H and -W dimensions                                                                             |
-| `--strength <float>`                      | `-f<float>` | `0.75`  | How hard to try to match the prompt to the initial image. Ranges from 0.0-0.99, with higher values replacing the initial image completely. |
+| Argument           |  Shortcut  |  Default            |  Description |
+|--------------------|------------|---------------------|--------------|
+| --init_mask <path> | -M<path>   | None                |Path to an image the same size as the initial_image, with areas for inpainting made transparent.|

-#### Inpainting

-!!! example
+# Command-line editing and completion

-    ```bash
-    dream> "waterfall and rainbow" -I./vacation-photo.png -M./vacation-mask.png -W640 -H480 --fit
-    ```
+If you are on a Macintosh or Linux machine, the command-line offers
+convenient history tracking, editing, and command completion.

-    This will do the same thing as img2img, but image alterations will only occur within transparent
-    areas defined by the mask file specified by `-M`. You may also supply just a single initial image with
-    the areas to overpaint made transparent, but you must be careful not to destroy the pixels
-    underneath when you create the transparent areas. See [Inpainting](./INPAINTING.md) for details.
+- To scroll through previous commands and potentially edit/reuse them, use the up and down cursor keys.
+- To edit the current command, use the left and right cursor keys to position the cursor, and then backspace, delete or insert characters.
+- To move to the very beginning of the command, type CTRL-A (or command-A on the Mac)
+- To move to the end of the command, type CTRL-E.
+- To cut a section of the command, position the cursor where you want to start cutting and type CTRL-K.
+- To paste a cut section back in, position the cursor where you want to paste, and type CTRL-Y

-Inpainting accepts all the arguments used for txt2img and img2img, as well as
-the `--mask` (`-M`) argument:
+Windows users can get similar, but more limited, functionality if they
+launch dream.py with the "winpty" program:

-| Argument <img width="100" align="right"/> | Shortcut   | Default | Description                                                                                      |
-| ----------------------------------------- | ---------- | ------- | ------------------------------------------------------------------------------------------------ |
-| `--init_mask <path>`                      | `-M<path>` | `None`  | Path to an image the same size as the initial_image, with areas for inpainting made transparent. |
+~~~
+> winpty python scripts\dream.py
+~~~

-## Command-line editing and completion
+On the Mac and Linux platforms, when you exit dream.py, the last 1000
+lines of your command-line history will be saved. When you restart
+dream.py, you can access the saved history using the up-arrow key.

-If you are on a Macintosh or Linux machine, the command-line offers convenient
-history tracking, editing, and command completion.
+In addition, limited command-line completion is installed. In various
+contexts, you can start typing your command and press tab. A list of
+potential completions will be presented to you. You can then type a
+little more, hit tab again, and eventually autocomplete what you want.

- To scroll through previous commands and potentially edit/reuse them, use the
-  ++up++ and ++down++ cursor keys.
- To edit the current command, use the ++left++ and ++right++ cursor keys to
-  position the cursor, and then ++backspace++, ++delete++ or ++insert++
-  characters.
- To move to the very beginning of the command, type ++ctrl+a++ (or
-  ++command+a++ on the Mac)
- To move to the end of the command, type ++ctrl+e++.
- To cut a section of the command, position the cursor where you want to start
-  cutting and type ++ctrl+k++.
- To paste a cut section back in, position the cursor where you want to paste,
-  and type ++ctrl+y++
+When specifying file paths using the one-letter shortcuts, the CLI
+will attempt to complete pathnames for you. This is most handy for the
+-I (init image) and -M (init mask) paths. To initiate completion, start
+the path with a slash ("/") or "./". For example:

-Windows users can get similar, but more limited, functionality if they launch
-`dream.py` with the "winpty" program:
-
-```batch
-winpty python scripts\dream.py
-```
-
-On the Mac and Linux platforms, when you exit `dream.py`, the last 1000 lines of
-your command-line history will be saved. When you restart `dream.py`, you can
-access the saved history using the ++up++ key.
-
-In addition, limited command-line completion is installed. In various contexts,
-you can start typing your command and press tab. A list of potential completions
-will be presented to you. You can then type a little more, hit tab again, and
-eventually autocomplete what you want.
-
-When specifying file paths using the one-letter shortcuts, the CLI will attempt
-to complete pathnames for you. This is most handy for the `-I` (init image) and
-`-M` (init mask) paths. To initiate completion, start the path with a slash `/`
-or `./`, for example:
-
-```bash
-dream> "zebra with a mustache" -I./test-pictures<TAB>
+~~~
+dream> zebra with a mustache -I./test-pictures<TAB>
 -I./test-pictures/Lincoln-and-Parrot.png  -I./test-pictures/zebra.jpg        -I./test-pictures/madonna.png
 -I./test-pictures/bad-sketch.png          -I./test-pictures/man_with_eagle/
 ```