Last PR needed for v2.3.1 (#2788)

- Add curated set of starter models based on team discussion. The final list of starter models can be found in `invokeai/configs/INITIAL_MODELS.yaml` - To test model installation, I selected and installed all the models on the list. This led to my discovering that when there are no more starter models to display, the console front end crashes. So I made a fix to this in which the entire starter model selection is no longer shown. - Update model table in 050_INSTALL_MODELS.md - Add guide to dealing with low-memory situations - Version is now `v2.3.1`
2024-08-30 20:32:17 +00:00 · 2023-02-24 10:31:38 -05:00
parent ec2890c19b 4e446130d8
commit bd85e00530
6 changed files with 193 additions and 82 deletions
--- a/docs/installation/010_INSTALL_AUTOMATED.md
+++ b/docs/installation/010_INSTALL_AUTOMATED.md
@ -221,7 +221,10 @@ experimental versions later.

    - ***NSFW checker***
      If checked, InvokeAI will test images for potential sexual content
-      and blur them out if found.
+      and blur them out if found. Note that the NSFW checker consumes
+      an additional 0.6 GB of VRAM on top of the 2-3 GB of VRAM used
+      by most image models. If you have a low VRAM GPU (4-6 GB), you
+      can reduce out of memory errors by disabling the checker.

    - ***HuggingFace Access Token***
      InvokeAI has the ability to download embedded styles and subjects
@ -440,6 +443,52 @@ the [InvokeAI Issues](https://github.com/invoke-ai/InvokeAI/issues) section, or
 visit our [Discord Server](https://discord.gg/ZmtBAhwWhy) for interactive
 assistance.

+### Out of Memory Issues
+
+The models are large, VRAM is expensive, and you may find yourself
+faced with Out of Memory errors when generating images. Here are some
+tips to reduce the problem:
+
+* **4 GB of VRAM**
+
+This should be adequate for 512x512 pixel images using Stable Diffusion 1.5
+and derived models, provided that you **disable** the NSFW checker. To
+disable the filter, do one of the following:
+
+   * Select option (6) "_change InvokeAI startup options_" from the
+     launcher. This will bring up the console-based startup settings
+     dialogue and allow you to unselect the "NSFW Checker" option.
+   * Start the startup settings dialogue directly by running
+     `invokeai-configure --skip-sd-weights --skip-support-models`
+     from the command line.
+   * Find the `invokeai.init` initialization file in the InvokeAI root
+     directory, open it in a text editor, and change `--nsfw_checker`
+     to `--no-nsfw_checker`
+
+If you are on a CUDA system, you can realize significant memory
+savings by activating the `xformers` library as described above. The
+downside is `xformers` introduces non-deterministic behavior, such
+that images generated with exactly the same prompt and settings will
+be slightly different from each other. See above for more information.
+
+* **6 GB of VRAM**
+
+This is a border case. Using the SD 1.5 series you should be able to
+generate images up to 640x640 with the NSFW checker enabled, and up to
+1024x1024 with it disabled and `xformers` activated. 
+
+If you run into persistent memory issues there are a series of
+environment variables that you can set before launching InvokeAI that
+alter how the PyTorch machine learning library manages memory.  See
+https://pytorch.org/docs/stable/notes/cuda.html#memory-management for
+a list of these tweaks.
+
+* **12 GB of VRAM**
+
+This should be sufficient to generate larger images up to about
+1280x1280. If you wish to push further, consider activating
+`xformers`.
+
 ### Other Problems

 If you run into problems during or after installation, the InvokeAI team is
--- a/docs/installation/050_INSTALLING_MODELS.md
+++ b/docs/installation/050_INSTALLING_MODELS.md
@ -43,25 +43,31 @@ InvokeAI comes with support for a good set of starter models. You'll
 find them listed in the master models file
 `configs/INITIAL_MODELS.yaml` in the InvokeAI root directory. The
 subset that are currently installed are found in
-`configs/models.yaml`. The current list is:
+`configs/models.yaml`. As of v2.3.1, the list of starter models is:

-| Model                | HuggingFace Repo ID               | Description                                                | URL
-| -------------------- | --------------------------------- | ---------------------------------------------------------- | -------------------------------------------------------------- |
-| stable-diffusion-1.5 | runwayml/stable-diffusion-v1-5    | Most recent version of base Stable Diffusion model         | https://huggingface.co/runwayml/stable-diffusion-v1-5          |
-| stable-diffusion-1.4 | runwayml/stable-diffusion-v1-4    | Previous version of base Stable Diffusion model         | https://huggingface.co/runwayml/stable-diffusion-v1-4          |
-| inpainting-1.5       | runwayml/stable-diffusion-inpainting  | Stable diffusion 1.5 optimized for inpainting          | https://huggingface.co/runwayml/stable-diffusion-inpainting    |
-| stable-diffusion-2.1-base |stabilityai/stable-diffusion-2-1-base | Stable Diffusion version 2.1 trained on 512 pixel images | https://huggingface.co/stabilityai/stable-diffusion-2-1-base |
-| stable-diffusion-2.1-768 |stabilityai/stable-diffusion-2-1 | Stable Diffusion version 2.1 trained on 768 pixel images | https://huggingface.co/stabilityai/stable-diffusion-2-1        |
-| dreamlike-diffusion-1.0  | dreamlike-art/dreamlike-diffusion-1.0  | An SD 1.5 model finetuned on high quality art    | https://huggingface.co/dreamlike-art/dreamlike-diffusion-1.0   |
-| dreamlike-photoreal-2.0  | dreamlike-art/dreamlike-photoreal-2.0  | A photorealistic model trained on 768 pixel images| https://huggingface.co/dreamlike-art/dreamlike-photoreal-2.0   |
-| openjourney-4.0      | prompthero/openjourney              | An SD 1.5 model finetuned on Midjourney images prompt with "mdjrny-v4 style"   | https://huggingface.co/prompthero/openjourney  |
-| nitro-diffusion-1.0  | nitrosocke/Nitro-Diffusion          | An SD 1.5 model finetuned on three styles, prompt with "archer style", "arcane style" or "modern disney style" | https://huggingface.co/nitrosocke/Nitro-Diffusion|
-| trinart-2.0          | naclbit/trinart_stable_diffusion_v2 | An SD 1.5 model finetuned with ~40,000 assorted high resolution manga/anime-style pictures | https://huggingface.co/naclbit/trinart_stable_diffusion_v2|
-| trinart-characters-2_0 | naclbit/trinart_derrida_characters_v2_stable_diffusion | An SD 1.5 model finetuned with 19.2M manga/anime-style pictures | https://huggingface.co/naclbit/trinart_derrida_characters_v2_stable_diffusion|
+|Model Name | HuggingFace Repo ID | Description | URL |
+|---------- | ---------- | ----------- | --- |
+|stable-diffusion-1.5|runwayml/stable-diffusion-v1-5|Stable Diffusion version 1.5 diffusers model (4.27 GB)|https://huggingface.co/runwayml/stable-diffusion-v1-5 |
+|sd-inpainting-1.5|runwayml/stable-diffusion-inpainting|RunwayML SD 1.5 model optimized for inpainting, diffusers version (4.27 GB)|https://huggingface.co/runwayml/stable-diffusion-inpainting |
+|stable-diffusion-2.1|stabilityai/stable-diffusion-2-1|Stable Diffusion version 2.1 diffusers model, trained on 768 pixel images (5.21 GB)|https://huggingface.co/stabilityai/stable-diffusion-2-1 |
+|sd-inpainting-2.0|stabilityai/stable-diffusion-2-1|Stable Diffusion version 2.0 inpainting model (5.21 GB)|https://huggingface.co/stabilityai/stable-diffusion-2-1 |
+|analog-diffusion-1.0|wavymulder/Analog-Diffusion|An SD-1.5 model trained on diverse analog photographs (2.13 GB)|https://huggingface.co/wavymulder/Analog-Diffusion |
+|deliberate-1.0|XpucT/Deliberate|Versatile model that produces detailed images up to 768px (4.27 GB)|https://huggingface.co/XpucT/Deliberate |
+|d&d-diffusion-1.0|0xJustin/Dungeons-and-Diffusion|Dungeons & Dragons characters (2.13 GB)|https://huggingface.co/0xJustin/Dungeons-and-Diffusion |
+|dreamlike-photoreal-2.0|dreamlike-art/dreamlike-photoreal-2.0|A photorealistic model trained on 768 pixel images based on SD 1.5 (2.13 GB)|https://huggingface.co/dreamlike-art/dreamlike-photoreal-2.0 |
+|inkpunk-1.0|Envvi/Inkpunk-Diffusion|Stylized illustrations inspired by Gorillaz, FLCL and Shinkawa; prompt with "nvinkpunk" (4.27 GB)|https://huggingface.co/Envvi/Inkpunk-Diffusion |
+|openjourney-4.0|prompthero/openjourney|An SD 1.5 model fine tuned on Midjourney; prompt with "mdjrny-v4 style" (2.13 GB)|https://huggingface.co/prompthero/openjourney |
+|portrait-plus-1.0|wavymulder/portraitplus|An SD-1.5 model trained on close range portraits of people; prompt with "portrait+" (2.13 GB)|https://huggingface.co/wavymulder/portraitplus |
+|seek-art-mega-1.0|coreco/seek.art_MEGA|A general use SD-1.5 "anything" model that supports multiple styles (2.1 GB)|https://huggingface.co/coreco/seek.art_MEGA |
+|trinart-2.0|naclbit/trinart_stable_diffusion_v2|An SD-1.5 model finetuned with ~40K assorted high resolution manga/anime-style images (2.13 GB)|https://huggingface.co/naclbit/trinart_stable_diffusion_v2 |
+|waifu-diffusion-1.4|hakurei/waifu-diffusion|An SD-1.5 model trained on 680k anime/manga-style images (2.13 GB)|https://huggingface.co/hakurei/waifu-diffusion |

-Note that these files are covered by an "Ethical AI" license which forbids
-certain uses. When you initially download them, you are asked to
-accept the license terms.
+Note that these files are covered by an "Ethical AI" license which
+forbids certain uses. When you initially download them, you are asked
+to accept the license terms. In addition, some of these models carry
+additional license terms that limit their use in commercial
+applications or on public servers. Be sure to familiarize yourself
+with the model terms by visiting the URLs in the table above.

 ## Community-Contributed Models

--- a/invokeai/configs/INITIAL_MODELS.yaml
+++ b/invokeai/configs/INITIAL_MODELS.yaml
@ -6,53 +6,78 @@ stable-diffusion-1.5:
     repo_id: stabilityai/sd-vae-ft-mse
   recommended: True
   default: True
-inpainting-1.5:
+sd-inpainting-1.5:
  description: RunwayML SD 1.5 model optimized for inpainting, diffusers version (4.27 GB)
  repo_id: runwayml/stable-diffusion-inpainting
  format: diffusers
  vae:
    repo_id: stabilityai/sd-vae-ft-mse
  recommended: True
-dreamlike-diffusion-1.0:
-  description: An SD 1.5 model fine tuned on high quality art by dreamlike.art, diffusers version (2.13 BG)
-  format: diffusers
-  repo_id: dreamlike-art/dreamlike-diffusion-1.0
-  vae:
-     repo_id: stabilityai/sd-vae-ft-mse
-  recommended: True
-dreamlike-photoreal-2.0:
-  description: A photorealistic model trained on 768 pixel images based on SD 1.5 (2.13 GB)
-  format: diffusers
-  repo_id: dreamlike-art/dreamlike-photoreal-2.0
-  recommended: False
-stable-diffusion-2.1-768:
+stable-diffusion-2.1:
   description: Stable Diffusion version 2.1 diffusers model, trained on 768 pixel images (5.21 GB)
   repo_id: stabilityai/stable-diffusion-2-1
   format: diffusers
   recommended: True
-stable-diffusion-2.1-base:
-   description: Stable Diffusion version 2.1 diffusers base model, trained on 512 pixel images (5.21 GB)
-   repo_id: stabilityai/stable-diffusion-2-1-base
+sd-inpainting-2.0:
+   description: Stable Diffusion version 2.0 inpainting model (5.21 GB)
+   repo_id: stabilityai/stable-diffusion-2-1
   format: diffusers
   recommended: False
+analog-diffusion-1.0:
+   description: An SD-1.5 model trained on diverse analog photographs (2.13 GB)
+   repo_id: wavymulder/Analog-Diffusion
+   format: diffusers
+   recommended: false
+deliberate-1.0:
+   description: Versatile model that produces detailed images up to 768px (4.27 GB)
+   format: diffusers
+   repo_id: XpucT/Deliberate
+   recommended: False
+d&d-diffusion-1.0:
+   description: Dungeons & Dragons characters (2.13 GB)
+   format: diffusers
+   repo_id: 0xJustin/Dungeons-and-Diffusion
+   recommended: False
+dreamlike-photoreal-2.0:
+   description: A photorealistic model trained on 768 pixel images based on SD 1.5 (2.13 GB)
+   format: diffusers
+   repo_id: dreamlike-art/dreamlike-photoreal-2.0
+   recommended: False
+inkpunk-1.0:
+   description: Stylized illustrations inspired by Gorillaz, FLCL and Shinkawa; prompt with "nvinkpunk" (4.27 GB)
+   format: diffusers
+   repo_id: Envvi/Inkpunk-Diffusion
+   recommended: False
 openjourney-4.0:
-  description: An SD 1.5 model fine tuned on Midjourney images by PromptHero - include "mdjrny-v4 style" in your prompts (2.13 GB)
-  format: diffusers
-  repo_id: prompthero/openjourney
-  vae:
+   description: An SD 1.5 model fine tuned on Midjourney; prompt with "mdjrny-v4 style" (2.13 GB)
+   format: diffusers
+   repo_id: prompthero/openjourney
+   vae:
     repo_id: stabilityai/sd-vae-ft-mse
-  recommended: False
-nitro-diffusion-1.0:
-   description: A SD 1.5 model trained on three artstyles - prompt with "archer style", "arcane style" and/or "modern disney style" (2.13 GB)
-   repo_id: nitrosocke/Nitro-Diffusion
+   recommended: False
+portrait-plus-1.0:
+   description: An SD-1.5 model trained on close range portraits of people; prompt with "portrait+" (2.13 GB)
+   format: diffusers
+   repo_id: wavymulder/portraitplus
+   recommended: False
+seek-art-mega-1.0:
+   description: A general use SD-1.5 "anything" model that supports multiple styles (2.1 GB)
+   repo_id: coreco/seek.art_MEGA
   format: diffusers
   vae:
     repo_id: stabilityai/sd-vae-ft-mse
   recommended: False
 trinart-2.0:
-   description: An SD model finetuned with ~40,000 assorted high resolution manga/anime-style pictures, diffusers version (2.13 GB)
+   description: An SD-1.5 model finetuned with ~40K assorted high resolution manga/anime-style images (2.13 GB)
   repo_id: naclbit/trinart_stable_diffusion_v2
   format: diffusers
   vae:
     repo_id: stabilityai/sd-vae-ft-mse
   recommended: False
+waifu-diffusion-1.4:
+   description: An SD-1.5 model trained on 680k anime/manga-style images (2.13 GB)
+   repo_id: hakurei/waifu-diffusion
+   format: diffusers
+   vae:
+     repo_id: stabilityai/sd-vae-ft-mse
+   recommended: False
--- a/ldm/invoke/_version.py
+++ b/ldm/invoke/_version.py
@ -1 +1 @@
-__version__='2.3.1-rc4'
+__version__='2.3.1'
--- a/ldm/invoke/config/model_install.py
+++ b/ldm/invoke/config/model_install.py
@ -114,37 +114,37 @@ class addModelsForm(npyscreen.FormMultiPage):
                relx=4,
            )
        self.nextrely += 1
-        self.add_widget_intelligent(
-            CenteredTitleText,
-            name="== STARTER MODELS (recommended ones selected) ==",
-            editable=False,
-            color="CONTROL",
-        )
-        self.nextrely -= 1
-        self.add_widget_intelligent(
-            CenteredTitleText,
-            name="Select from a starter set of Stable Diffusion models from HuggingFace:",
-            editable=False,
-            labelColor="CAUTION",
-        )
-
-        self.nextrely -= 1
-        # if user has already installed some initial models, then don't patronize them
-        # by showing more recommendations
-        show_recommended = not self.existing_models
-        self.models_selected = self.add_widget_intelligent(
-            npyscreen.MultiSelect,
-            name="Install Starter Models",
-            values=starter_model_labels,
-            value=[
-                self.starter_model_list.index(x)
-                for x in self.starter_model_list
-                if show_recommended and x in recommended_models
-            ],
-            max_height=len(starter_model_labels) + 1,
-            relx=4,
-            scroll_exit=True,
-        )
+        if len(self.starter_model_list) > 0:
+            self.add_widget_intelligent(
+                CenteredTitleText,
+                name="== STARTER MODELS (recommended ones selected) ==",
+                editable=False,
+                color="CONTROL",
+            )
+            self.nextrely -= 1
+            self.add_widget_intelligent(
+                CenteredTitleText,
+                name="Select from a starter set of Stable Diffusion models from HuggingFace.",
+                editable=False,
+                labelColor="CAUTION",
+            )
+            self.nextrely -= 1
+            # if user has already installed some initial models, then don't patronize them
+            # by showing more recommendations
+            show_recommended = not self.existing_models
+            self.models_selected = self.add_widget_intelligent(
+                npyscreen.MultiSelect,
+                name="Install Starter Models",
+                values=starter_model_labels,
+                value=[
+                    self.starter_model_list.index(x)
+                    for x in self.starter_model_list
+                    if show_recommended and x in recommended_models
+                ],
+                max_height=len(starter_model_labels) + 1,
+                relx=4,
+                scroll_exit=True,
+            )
        self.add_widget_intelligent(
            CenteredTitleText,
            name='== IMPORT LOCAL AND REMOTE MODELS ==',
@ -166,7 +166,11 @@ class addModelsForm(npyscreen.FormMultiPage):
            )
            self.nextrely -= 1
        self.import_model_paths = self.add_widget_intelligent(
-            TextBox, max_height=5, scroll_exit=True, editable=True, relx=4
+            TextBox,
+            max_height=7,
+            scroll_exit=True,
+            editable=True,
+            relx=4
        )
        self.nextrely += 1
        self.show_directory_fields = self.add_widget_intelligent(
@ -241,7 +245,8 @@ class addModelsForm(npyscreen.FormMultiPage):

    def resize(self):
        super().resize()
-        self.models_selected.values = self._get_starter_model_labels()
+        if hasattr(self,'models_selected'):
+            self.models_selected.values = self._get_starter_model_labels()

    def _clear_scan_directory(self):
        if not self.show_directory_fields.value:
@ -320,11 +325,14 @@ class addModelsForm(npyscreen.FormMultiPage):
        selections = self.parentApp.user_selections

        # starter models to install/remove
-        starter_models = dict(
-            map(
-                lambda x: (self.starter_model_list[x], True), self.models_selected.value
+        if hasattr(self,'models_selected'):
+            starter_models = dict(
+                map(
+                    lambda x: (self.starter_model_list[x], True), self.models_selected.value
+                )
            )
-        )
+        else:
+            starter_models = dict()
        selections.purge_deleted_models = False
        if hasattr(self, "previously_installed_models"):
            unchecked = [
--- a/scripts/make_models_markdown_table.py
+++ b/scripts/make_models_markdown_table.py
@ -0,0 +1,23 @@
+#!/usr/bin/env python
+
+'''
+This script is used at release time to generate a markdown table describing the
+starter models. This text is then manually copied into 050_INSTALL_MODELS.md.
+'''
+
+from omegaconf import OmegaConf
+from pathlib import Path
+
+
+def main():
+    initial_models_file = Path(__file__).parent / '../invokeai/configs/INITIAL_MODELS.yaml'
+    models = OmegaConf.load(initial_models_file)
+    print('|Model Name | HuggingFace Repo ID | Description | URL |')
+    print('|---------- | ---------- | ----------- | --- |')
+    for model in models:
+        repo_id = models[model].repo_id
+        url = f'https://huggingface.co/{repo_id}'
+        print(f'|{model}|{repo_id}|{models[model].description}|{url} |')
+
+if __name__ == '__main__':
+    main()