diff --git a/README.md b/README.md index cb8b3ac430..e952e72648 100644 --- a/README.md +++ b/README.md @@ -165,7 +165,7 @@ you can try starting `invoke.py` with the `--precision=float32` flag: - Deprecated `--full_precision` / `-F`. Simply omit it and `invoke.py` will auto configure. To switch away from auto use the new flag like `--precision=float32`. -For older changelogs, please visit the **[CHANGELOG](https://invoke-ai.github.io/InvokeAI/CHANGELOG/)**. +For older changelogs, please visit the **[CHANGELOG](https://invoke-ai.github.io/InvokeAI/CHANGELOG#v114-11-september-2022)**. ### Troubleshooting diff --git a/docs/CHANGELOG.md b/docs/CHANGELOG.md index 556a9d9ace..9b4dfe1cfc 100644 --- a/docs/CHANGELOG.md +++ b/docs/CHANGELOG.md @@ -6,64 +6,64 @@ title: Changelog ## v2.0.1 (13 October 2022) - - fix noisy images at high step count when using k* samplers - - dream.py script now calls invoke.py module directly rather than +- fix noisy images at high step count when using k* samplers +- dream.py script now calls invoke.py module directly rather than via a new python process (which could break the environment) ## v2.0.0 (9 October 2022) - - `dream.py` script renamed `invoke.py`. A `dream.py` script wrapper remains +- `dream.py` script renamed `invoke.py`. A `dream.py` script wrapper remains for backward compatibility. 
- - Completely new WebGUI - launch with `python3 scripts/invoke.py --web` - - Support for inpainting and outpainting - - img2img runs on all k* samplers - - Support for negative prompts - - Support for CodeFormer face reconstruction - - Support for Textual Inversion on Macintoshes - - Support in both WebGUI and CLI for post-processing of previously-generated images +- Completely new WebGUI - launch with `python3 scripts/invoke.py --web` +- Support for [inpainting](features/INPAINTING.md) and [outpainting](features/OUTPAINTING.md) +- img2img runs on all k* samplers +- Support for [negative prompts](features/PROMPTS.md#negative-and-unconditioned-prompts) +- Support for CodeFormer face reconstruction +- Support for Textual Inversion on Macintoshes +- Support in both WebGUI and CLI for [post-processing of previously-generated images](features/POSTPROCESS.md) using facial reconstruction, ESRGAN upscaling, outcropping (similar to DALL-E infinite canvas), and "embiggen" upscaling. See the `!fix` command. - - New `--hires` option on `invoke>` line allows larger images to be created without duplicating elements, at the cost of some performance. - - New `--perlin` and `--threshold` options allow you to add and control variation - during image generation (see Thresholding and Perlin Noise Initialization - - Extensive metadata now written into PNG files, allowing reliable regeneration of images +- New `--hires` option on `invoke>` line allows [larger images to be created without duplicating elements](features/CLI.md#this-is-an-example-of-txt2img), at the cost of some performance. +- New `--perlin` and `--threshold` options allow you to add and control variation + during image generation (see [Thresholding and Perlin Noise Initialization](features/OTHER.md#thresholding-and-perlin-noise-initialization-options)) +- Extensive metadata now written into PNG files, allowing reliable regeneration of images and tweaking of previous settings. 
- - Command-line completion in `invoke.py` now works on Windows, Linux and Mac platforms. - - Improved command-line completion behavior. +- Command-line completion in `invoke.py` now works on Windows, Linux and Mac platforms. +- Improved [command-line completion behavior](features/CLI.md) New commands added: - * List command-line history with `!history` - * Search command-line history with `!search` - * Clear history with `!clear` - - Deprecated `--full_precision` / `-F`. Simply omit it and `invoke.py` will auto + - List command-line history with `!history` + - Search command-line history with `!search` + - Clear history with `!clear` +- Deprecated `--full_precision` / `-F`. Simply omit it and `invoke.py` will auto configure. To switch away from auto use the new flag like `--precision=float32`. ## v1.14 (11 September 2022) - - Memory optimizations for small-RAM cards. 512x512 now possible on 4 GB GPUs. - - Full support for Apple hardware with M1 or M2 chips. - - Add "seamless mode" for circular tiling of image. Generates beautiful effects. +- Memory optimizations for small-RAM cards. 512x512 now possible on 4 GB GPUs. +- Full support for Apple hardware with M1 or M2 chips. +- Add "seamless mode" for circular tiling of image. Generates beautiful effects. ([prixt](https://github.com/prixt)). - - Inpainting support. - - Improved web server GUI. - - Lots of code and documentation cleanups. +- Inpainting support. +- Improved web server GUI. +- Lots of code and documentation cleanups. 
## v1.13 (3 September 2022) - - Support image variations (see [VARIATIONS](features/VARIATIONS.md) +- Support image variations (see [VARIATIONS](features/VARIATIONS.md) ([Kevin Gibbons](https://github.com/bakkot) and many contributors and reviewers) - - Supports a Google Colab notebook for a standalone server running on Google hardware +- Supports a Google Colab notebook for a standalone server running on Google hardware [Arturo Mendivil](https://github.com/artmen1516) - - WebUI supports GFPGAN/ESRGAN facial reconstruction and upscaling +- WebUI supports GFPGAN/ESRGAN facial reconstruction and upscaling [Kevin Gibbons](https://github.com/bakkot) - - WebUI supports incremental display of in-progress images during generation +- WebUI supports incremental display of in-progress images during generation [Kevin Gibbons](https://github.com/bakkot) - - A new configuration file scheme that allows new models (including upcoming +- A new configuration file scheme that allows new models (including upcoming stable-diffusion-v1.5) to be added without altering the code. ([David Wager](https://github.com/maddavid12)) - - Can specify --grid on invoke.py command line as the default. - - Miscellaneous internal bug and stability fixes. - - Works on M1 Apple hardware. - - Multiple bug fixes. +- Can specify --grid on invoke.py command line as the default. +- Miscellaneous internal bug and stability fixes. +- Works on M1 Apple hardware. +- Multiple bug fixes. --- @@ -88,7 +88,7 @@ title: Changelog Seed memory only extends back to the previous command, but will work on all images generated with the -n# switch. - Variant generation support temporarily disabled pending more general solution. - Created a feature branch named **yunsaki-morphing-invoke** which adds experimental support for - iteratively modifying the prompt and its parameters. Please see[ Pull Request #86](https://github.com/lstein/stable-diffusion/pull/86) + iteratively modifying the prompt and its parameters. 
Please see[Pull Request #86](https://github.com/lstein/stable-diffusion/pull/86) for a synopsis of how this works. Note that when this feature is eventually added to the main branch, it will may be modified significantly. diff --git a/docs/features/CHANGELOG.md b/docs/features/CHANGELOG.md deleted file mode 100644 index 80ec5cf3a2..0000000000 --- a/docs/features/CHANGELOG.md +++ /dev/null @@ -1,143 +0,0 @@ ---- -title: Changelog ---- - -# :octicons-log-16: Changelog - -## v1.13 - -- Supports a Google Colab notebook for a standalone server running on Google - hardware [Arturo Mendivil](https://github.com/artmen1516) -- WebUI supports GFPGAN/ESRGAN facial reconstruction and upscaling - [Kevin Gibbons](https://github.com/bakkot) -- WebUI supports incremental display of in-progress images during generation - [Kevin Gibbons](https://github.com/bakkot) -- Output directory can be specified on the invoke> command line. -- The grid was displaying duplicated images when not enough images to fill the - final row [Muhammad Usama](https://github.com/SMUsamaShah) -- Can specify --grid on invoke.py command line as the default. -- Miscellaneous internal bug and stability fixes. - ---- - -## v1.12 (28 August 2022) - -- Improved file handling, including ability to read prompts from standard input. - (kudos to [Yunsaki](https://github.com/yunsaki) -- The web server is now integrated with the invoke.py script. Invoke by adding - --web to the invoke.py command arguments. -- Face restoration and upscaling via GFPGAN and Real-ESGAN are now automatically - enabled if the GFPGAN directory is located as a sibling to Stable Diffusion. - VRAM requirements are modestly reduced. Thanks to both - [Blessedcoolant](https://github.com/blessedcoolant) and - [Oceanswave](https://github.com/oceanswave) for their work on this. -- You can now swap samplers on the invoke> command line. 
- [Blessedcoolant](https://github.com/blessedcoolant) - ---- - -## v1.11 (26 August 2022) - -- NEW FEATURE: Support upscaling and face enhancement using the GFPGAN module. - (kudos to [Oceanswave](https://github.com/Oceanswave)) -- You now can specify a seed of -1 to use the previous image's seed, -2 to use - the seed for the image generated before that, etc. Seed memory only extends - back to the previous command, but will work on all images generated with the - -n# switch. -- Variant generation support temporarily disabled pending more general solution. -- Created a feature branch named **yunsaki-morphing-invoke** which adds - experimental support for iteratively modifying the prompt and its parameters. - Please - see[ Pull Request #86](https://github.com/lstein/stable-diffusion/pull/86) for - a synopsis of how this works. Note that when this feature is eventually added - to the main branch, it will may be modified significantly. - ---- - -## v1.10 (25 August 2022) - -- A barebones but fully functional interactive web server for online generation - of txt2img and img2img. - ---- - -## v1.09 (24 August 2022) - -- A new -v option allows you to generate multiple variants of an initial image - in img2img mode. (kudos to [Oceanswave](https://github.com/Oceanswave). -- [See this discussion in the PR for examples and details on use](https://github.com/lstein/stable-diffusion/pull/71#issuecomment-1226700810)) -- Added ability to personalize text to image generation (kudos to - [Oceanswave](https://github.com/Oceanswave) and - [nicolai256](https://github.com/nicolai256)) -- Enabled all of the samplers from k_diffusion - ---- - -## v1.08 (24 August 2022) - -- Escape single quotes on the invoke> command before trying to parse. This avoids - parse errors. -- Removed instruction to get Python3.8 as first step in Windows install. - Anaconda3 does it for you. -- Added bounds checks for numeric arguments that could cause crashes. 
-- Cleaned up the copyright and license agreement files. - ---- - -## v1.07 (23 August 2022) - -- Image filenames will now never fill gaps in the sequence, but will be assigned - the next higher name in the chosen directory. This ensures that the alphabetic - and chronological sort orders are the same. - ---- - -## v1.06 (23 August 2022) - -- Added weighted prompt support contributed by - [xraxra](https://github.com/xraxra) -- Example of using weighted prompts to tweak a demonic figure contributed by - [bmaltais](https://github.com/bmaltais) - ---- - -## v1.05 (22 August 2022 - after the drop) - -- Filenames now use the following formats: 000010.95183149.png -- Two files - produced by the same command (e.g. -n2), 000010.26742632.png -- distinguished - by a different seed. - 000011.455191342.01.png -- Two files produced by the same command using - 000011.455191342.02.png -- a batch size>1 (e.g. -b2). They have the same seed. - 000011.4160627868.grid#1-4.png -- a grid of four images (-g); the whole grid - can be regenerated with the indicated key - -- It should no longer be possible for one image to overwrite another -- You can use the "cd" and "pwd" commands at the invoke> prompt to set and - retrieve the path of the output directory. - -## v1.04 (22 August 2022 - after the drop) - -- Updated README to reflect installation of the released weights. -- Suppressed very noisy and inconsequential warning when loading the frozen CLIP - tokenizer. - -## v1.03 (22 August 2022) - -- The original txt2img and img2img scripts from the CompViz repository have been - moved into a subfolder named "orig_scripts", to reduce confusion. - -## v1.02 (21 August 2022) - -- A copy of the prompt and all of its switches and options is now stored in the - corresponding image in a tEXt metadata field named "Dream". You can read the - prompt using scripts/images2prompt.py, or an image editor that allows you to - explore the full metadata. 
**Please run "conda env update -f environment.yaml" - to load the k_lms dependencies!!** - -## v1.01 (21 August 2022) - -- added k_lms sampling. **Please run "conda env update -f environment.yaml" to - load the k_lms dependencies!!** -- use half precision arithmetic by default, resulting in faster execution and - lower memory requirements Pass argument --full_precision to invoke.py to get - slower but more accurate image generation diff --git a/docs/features/CLI.md b/docs/features/CLI.md index 583af95ca8..bbd147a450 100644 --- a/docs/features/CLI.md +++ b/docs/features/CLI.md @@ -100,9 +100,7 @@ overridden on a per-prompt basis (see [List of prompt arguments](#list-of-prompt | `--free_gpu_mem` | | `False` | Free GPU memory after sampling, to allow image decoding and saving in low VRAM conditions | | `--precision` | | `auto` | Set model precision, default is selected by device. Options: auto, float32, float16, autocast | -!!! warning deprecated - - These arguments are deprecated but still work: +!!! warning "These arguments are deprecated but still work"
@@ -131,7 +129,7 @@ from text ([txt2img](#txt2img)), to embellish an existing image or sketch ### txt2img -!!! example +!!! example "" ```bash invoke> waterfall and rainbow -W640 -H480 @@ -173,7 +171,7 @@ Here are the invoke> command that apply to txt2img: ### img2img -!!! example +!!! example "" ```bash invoke> waterfall and rainbow -I./vacation-photo.png -W640 -H480 --fit @@ -196,7 +194,7 @@ accepts additional options: ### inpainting -!!! example +!!! example "" ```bash invoke> waterfall and rainbow -I./vacation-photo.png -M./vacation-mask.png -W640 -H480 --fit diff --git a/docs/features/IMG2IMG.md b/docs/features/IMG2IMG.md index 0ce0c9d539..b77a0c4b11 100644 --- a/docs/features/IMG2IMG.md +++ b/docs/features/IMG2IMG.md @@ -17,15 +17,15 @@ tree on a hill with a river, nature photograph, national geographic -I./test-pic This will take the original image shown here: -
+
-
+ and generate a new image based on it as shown here: -
+
-
+ The `--init_img` (`-I`) option gives the path to the seed picture. `--strength` (`-f`) controls how much the original will be modified, ranging from `0.0` (keep the original intact), to `1.0` (ignore the @@ -41,11 +41,10 @@ interesting variants. Note that the prompt makes a big difference. For example, this slight variation on the prompt produces a very different image: -`photograph of a tree on a hill with a river` - -
+
-
+photograph of a tree on a hill with a river + !!! tip @@ -82,9 +81,9 @@ gaussian noise and progressively refines it over the requested number of steps, invoke> "fire" -s10 -W384 -H384 -S1592514025 ``` -
+
![latent steps](../assets/img2img/000019.steps.png) -
+ Put simply: starting from a frame of fuzz/static, SD finds details in each frame that it thinks look like "fire" and brings them a little bit more into focus, gradually scrubbing out the fuzz until a clear image remains. @@ -94,21 +93,21 @@ Put simply: starting from a frame of fuzz/static, SD finds details in each frame I want SD to draw a fire based on this hand-drawn image: -
+
![drawing of a fireplace](../assets/img2img/fire-drawing.png) -
+ Let's only do 10 steps, to make it easier to see what's happening. If strength is `0.7`, this is what the internal steps the algorithm has to take will look like: -
+
![gravity32](../assets/img2img/000032.steps.gravity.png) -
+ With strength `0.4`, the steps look more like this: -
+
![gravity30](../assets/img2img/000030.steps.gravity.png) -
+ Notice how much more fuzzy the starting image is for strength `0.7` compared to `0.4`, and notice also how much longer the sequence is with `0.7`: @@ -140,9 +139,9 @@ Here's strength `0.4` (note step count `50`, which is `20 ÷ 0.4` to make sure S invoke> "fire" -s50 -W384 -H384 -S1592514025 -I /tmp/fire-drawing.png -f 0.4 ``` -
+
![000035.1592514025](../assets/img2img/000035.1592514025.png) -
+ and here is strength `0.7` (note step count `30`, which is roughly `20 ÷ 0.7` to make sure SD does `20` steps from my image): @@ -150,29 +149,38 @@ and here is strength `0.7` (note step count `30`, which is roughly `20 ÷ 0.7` t invoke> "fire" -s30 -W384 -H384 -S1592514025 -I /tmp/fire-drawing.png -f 0.7 ``` -
+
![000046.1592514025](../assets/img2img/000046.1592514025.png) -
+ In both cases the image is nice and clean and "finished", but because at strength `0.7` Stable Diffusion has been given so much more freedom to improve on my badly-drawn flames, they've come out looking much better. You can really see the difference when looking at the latent steps. There's more noise on the first image with strength `0.7`: +
![gravity46](../assets/img2img/000046.steps.gravity.png) +
than there is for strength `0.4`: +
![gravity35](../assets/img2img/000035.steps.gravity.png) +
and that extra noise gives the algorithm more choices when it is evaluating how to denoise any particular pixel in the image. Unfortunately, it seems that `img2img` is very sensitive to the step count. Here's strength `0.7` with a step count of `29` (SD did 19 steps from my image): -
+
![gravity45](../assets/img2img/000045.1592514025.png) -
+ By comparing the latents we can sort of see that something got interpreted differently enough on the third or fourth step to lead to a rather different interpretation of the flames. +
![gravity46](../assets/img2img/000046.steps.gravity.png) +
+ +
![gravity45](../assets/img2img/000045.steps.gravity.png) +
This is the result of a difference in the de-noising "schedule" - basically the noise has to be cleaned by a certain degree each step or the model won't "converge" on the image properly (see [stable diffusion blog](https://huggingface.co/blog/stable_diffusion) for more about that). A different step count means a different schedule, which means things get interpreted slightly differently at every step. diff --git a/docs/features/INPAINTING.md b/docs/features/INPAINTING.md index d71e897fa4..0fcc62874b 100644 --- a/docs/features/INPAINTING.md +++ b/docs/features/INPAINTING.md @@ -50,28 +50,40 @@ We are hoping to get rid of the need for this workaround in an upcoming release. 1. Open image in Photoshop - ![step1](../assets/step1.png) +
+ ![step1](../assets/step1.png) +
2. Use any of the selection tools (Marquee, Lasso, or Wand) to select the area you desire to inpaint. +
![step2](../assets/step2.png) +
+ 3. Because we'll be applying a mask over the area we want to preserve, you should now select the inverse by using the ++shift+ctrl+i++ shortcut, or right clicking and using the "Select Inverse" option. 4. You'll now create a mask by selecting the image layer, and Masking the selection. Make sure that you don't delete any of the underlying image, or your inpainting results will be dramatically impacted. +
![step4](../assets/step4.png) +
5. Make sure to hide any background layers that are present. You should see the mask applied to your image layer, and the image on your canvas should display the checkered background. +
![step5](../assets/step5.png) +
6. Save the image as a transparent PNG by using `File`-->`Save a Copy` from the menu bar, or by using the keyboard shortcut ++alt+ctrl+s++ +
![step6](../assets/step6.png) +
7. After following the inpainting instructions above (either through the CLI or the Web UI), marvel at your newfound ability to selectively invoke. Lookin' good! +
![step7](../assets/step7.png) +
8. In the export dialogue, Make sure the "Save colour values from transparent pixels" checkbox is selected. diff --git a/docs/features/OUTPAINTING.md b/docs/features/OUTPAINTING.md index a9688150a7..bae0fdc70f 100644 --- a/docs/features/OUTPAINTING.md +++ b/docs/features/OUTPAINTING.md @@ -25,7 +25,9 @@ implementations. Consider this image: +
![curly_woman](../assets/outpainting/curly.png) +
Pretty nice, but it's annoying that the top of her head is cut off. She's also a bit off center. Let's fix that! @@ -42,9 +44,9 @@ specify any number of pixels to extend. You can also abbreviate The result looks like this: -
+
![curly_woman_outcrop](../assets/outpainting/curly-outcrop.png) -
+ The new image is actually slightly larger than the original (576x576, because 64 pixels were added to the top and right sides.) @@ -76,7 +78,9 @@ invoke> !fix images/curly.png --out_direction top 64 The result is shown here: +
![curly_woman_outpaint](../assets/outpainting/curly-outpaint.png) +
Although the effect is similar, there are significant differences from outcropping: diff --git a/docs/features/PROMPTS.md b/docs/features/PROMPTS.md index b5ef26858b..4d4c82ed6c 100644 --- a/docs/features/PROMPTS.md +++ b/docs/features/PROMPTS.md @@ -47,33 +47,33 @@ original prompt: `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180` -
+
![step1](../assets/negative_prompt_walkthru/step1.png) -
+ That image has a woman, so if we want the horse without a rider, we can influence the image not to have a woman by putting [woman] in the prompt, like this: `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve [woman]" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180` -
+
![step2](../assets/negative_prompt_walkthru/step2.png) -
+ That's nice - but say we also don't want the image to be quite so blue. We can add "blue" to the list of negative prompts, so it's now [woman blue]: `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve [woman blue]" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180` -
+
![step3](../assets/negative_prompt_walkthru/step3.png) -
+ Getting close - but there's no sense in having a saddle when our horse doesn't have a rider, so we'll add one more negative prompt: [woman blue saddle]. `#!bash "A fantastical translucent poney made of water and foam, ethereal, radiant, hyperalism, scottish folklore, digital painting, artstation, concept art, smooth, 8 k frostbite 3 engine, ultra detailed, art by artgerm and greg rutkowski and magali villeneuve [woman blue saddle]" -s 20 -W 512 -H 768 -C 7.5 -A k_euler_a -S 1654590180` -
+
![step4](../assets/negative_prompt_walkthru/step4.png) -
+ !!! notes "Notes about this feature:" @@ -112,56 +112,56 @@ different results each time you run them. --- -
+
### "blue sphere, red cube, hybrid" -
+ This example doesn't use melding at all and represents the default way of mixing concepts. -
+
![blue-sphere-red-cube-hyprid](../assets/prompt-blending/blue-sphere-red-cube-hybrid.png) -
+ It's interesting to see how the AI expressed the concept of "cube" as the four quadrants of the enclosing frame. If you look closely, there is depth there, so the enclosing frame is actually a cube. -
+
### "blue sphere:0.25 red cube:0.75 hybrid" ![blue-sphere-25-red-cube-75](../assets/prompt-blending/blue-sphere-0.25-red-cube-0.75-hybrid.png) -
+ Now that's interesting. We get neither a blue sphere nor a red cube, but a red sphere embedded in a brick wall, which represents a melding of concepts within the AI's "latent space" of semantic representations. Where is Ludwig Wittgenstein when you need him? -
+
### "blue sphere:0.75 red cube:0.25 hybrid" ![blue-sphere-75-red-cube-25](../assets/prompt-blending/blue-sphere-0.75-red-cube-0.25-hybrid.png) -
+ Definitely more blue-spherey. The cube is gone entirely, but it's really cool abstract art. -
+
### "blue sphere:0.5 red cube:0.5 hybrid" ![blue-sphere-5-red-cube-5-hybrid](../assets/prompt-blending/blue-sphere-0.5-red-cube-0.5-hybrid.png) -
+ Whoa...! I see blue and red, but no spheres or cubes. Is the word "hybrid" summoning up the concept of some sort of scifi creature? Let's find out. -
+
### "blue sphere:0.5 red cube:0.5" ![blue-sphere-5-red-cube-5](../assets/prompt-blending/blue-sphere-0.5-red-cube-0.5.png) -
+ Indeed, removing the word "hybrid" produces an image that is more like what we'd expect. diff --git a/docs/index.md b/docs/index.md index e0dabf00cb..f4c4d7e19d 100644 --- a/docs/index.md +++ b/docs/index.md @@ -98,62 +98,43 @@ You wil need one of the following: ```bash (invokeai) ~/InvokeAI$ python scripts/invoke.py --full_precision ``` + ## :octicons-log-16: Latest Changes +### v2.0.1 (13 October 2022) + +- fix noisy images at high step count when using k* samplers +- dream.py script now calls invoke.py module directly rather than + via a new python process (which could break the environment) + ### v2.0.0 (9 October 2022) + - `dream.py` script renamed `invoke.py`. A `dream.py` script wrapper remains - for backward compatibility. + for backward compatibility. - Completely new WebGUI - launch with `python3 scripts/invoke.py --web` -- Support for inpainting and outpainting +- Support for inpainting and outpainting - img2img runs on all k* samplers -- Support for negative prompts +- Support for negative prompts - Support for CodeFormer face reconstruction - Support for Textual Inversion on Macintoshes -- Support in both WebGUI and CLI for post-processing of previously-generated images - using facial reconstruction, ESRGAN upscaling, outcropping (similar to DALL-E infinite canvas), - and "embiggen" upscaling. See the `!fix` command. -- New `--hires` option on `invoke>` line allows larger images to be created without duplicating elements, at the cost of some performance. +- Support in both WebGUI and CLI for post-processing of previously-generated images + using facial reconstruction, ESRGAN upscaling, outcropping (similar to DALL-E infinite canvas), + and "embiggen" upscaling. See the `!fix` command. +- New `--hires` option on `invoke>` line allows larger images to be created without duplicating elements, at the cost of some performance. 
- New `--perlin` and `--threshold` options allow you to add and control variation - during image generation (see Thresholding and Perlin Noise Initialization + during image generation (see Thresholding and Perlin Noise Initialization - Extensive metadata now written into PNG files, allowing reliable regeneration of images - and tweaking of previous settings. + and tweaking of previous settings. - Command-line completion in `invoke.py` now works on Windows, Linux and Mac platforms. -- Improved command-line completion behavior. - New commands added: - * List command-line history with `!history` - * Search command-line history with `!search` - * Clear history with `!clear` +- Improved command-line completion behavior. + New commands added: + - List command-line history with `!history` + - Search command-line history with `!search` + - Clear history with `!clear` - Deprecated `--full_precision` / `-F`. Simply omit it and `invoke.py` will auto - configure. To switch away from auto use the new flag like `--precision=float32`. + configure. To switch away from auto use the new flag like `--precision=float32`. -### v1.14 (11 September 2022) - -- Memory optimizations for small-RAM cards. 512x512 now possible on 4 GB GPUs. -- Full support for Apple hardware with M1 or M2 chips. -- Add "seamless mode" for circular tiling of image. Generates beautiful effects. - ([prixt](https://github.com/prixt)). -- Inpainting support. -- Improved web server GUI. -- Lots of code and documentation cleanups. 
- -### v1.13 (3 September 2022 - -- Support image variations (see [VARIATIONS](features/VARIATIONS.md) - ([Kevin Gibbons](https://github.com/bakkot) and many contributors and reviewers) -- Supports a Google Colab notebook for a standalone server running on Google hardware - [Arturo Mendivil](https://github.com/artmen1516) -- WebUI supports GFPGAN/ESRGAN facial reconstruction and upscaling - [Kevin Gibbons](https://github.com/bakkot) -- WebUI supports incremental display of in-progress images during generation - [Kevin Gibbons](https://github.com/bakkot) -- A new configuration file scheme that allows new models (including upcoming stable-diffusion-v1.5) - to be added without altering the code. ([David Wager](https://github.com/maddavid12)) -- Can specify --grid on invoke.py command line as the default. -- Miscellaneous internal bug and stability fixes. -- Works on M1 Apple hardware. -- Multiple bug fixes. - -For older changelogs, please visit the **[CHANGELOG](features/CHANGELOG.md)**. +For older changelogs, please visit the **[CHANGELOG](CHANGELOG.md#v114-11-september-2022)**. ## :material-target: Troubleshooting