InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
blessedcoolant	50135b726e	fix: git stash	2023-06-12 08:53:09 +12:00
Lincoln Stein	085ab54124	remove modified models.py and migrate code to models/base.py	2023-06-11 16:10:15 -04:00
Lincoln Stein	8e1a56875e	remove defunct code	2023-06-11 12:57:06 -04:00
Lincoln Stein	000626ab2e	move all installation code out of model_manager	2023-06-11 12:51:50 -04:00
Sergey Borisov	694fd0c92f	Fixes, first runable version	2023-06-11 16:42:40 +03:00
user1	fd715026a7	First pass at ControlNet "guess mode" implementation.	2023-06-11 02:00:39 -07:00
Gregg Helt	c647056287	Feat/easy param (#3504 ) * Testing change to LatentsToText to allow setting different cfg_scale values per diffusion step. * Adding first attempt at float param easing node, using Penner easing functions. * Core implementation of ControlNet and MultiControlNet. * Added support for ControlNet and MultiControlNet to legacy non-nodal Txt2Img in backend/generator. Although backend/generator will likely disappear by v3.x, right now they are very useful for testing core ControlNet and MultiControlNet functionality while node codebase is rapidly evolving. * Added example of using ControlNet with legacy Txt2Img generator * Resolving rebase conflict * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Resolving conflicts in rebase to origin/main * Refactored ControlNet nodes so they subclass from PreprocessedControlInvocation, and only need to override run_processor(image) (instead of reimplementing invoke()) * changes to base class for controlnet nodes * Added HED, LineArt, and OpenPose ControlNet nodes * Added an additional "raw_processed_image" output port to controlnets, mainly so could route ImageField to a ShowImage node * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * More rebase repair. * Added support for using multiple control nets. Unfortunately this breaks direct usage of Control node output port ==> TextToLatent control input port -- passing through a Collect node is now required. Working on fixing this... * Fixed use of ControlNet control_weight parameter * Fixed lint-ish formatting error * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Refactored controlnet node to output ControlField that bundles control info. * changes to base class for controlnet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Cleaning up TextToLatent arg testing * Cleaning up mistakes after rebase. * Removed last bits of dtype and and device hardwiring from controlnet section * Refactored ControNet support to consolidate multiple parameters into data struct. Also redid how multiple controlnets are handled. * Added support for specifying which step iteration to start using each ControlNet, and which step to end using each controlnet (specified as fraction of total steps) * Cleaning up prior to submitting ControlNet PR. Mostly turning off diagnostic printing. Also fixed error when there is no controlnet input. * Added dependency on controlnet-aux v0.0.3 * Commented out ZoeDetector. Will re-instate once there's a controlnet-aux release that supports it. * Switched CotrolNet node modelname input from free text to default list of popular ControlNet model names. * Fix to work with current stable release of controlnet_aux (v0.0.3). Turned of pre-processor params that were added post v0.0.3. Also change defaults for shuffle. * Refactored most of controlnet code into its own method to declutter TextToLatents.invoke(), and make upcoming integration with LatentsToLatents easier. * Cleaning up after ControlNet refactor in TextToLatentsInvocation * Extended node-based ControlNet support to LatentsToLatentsInvocation. * chore(ui): regen api client * fix(ui): add value to conditioning field * fix(ui): add control field type * fix(ui): fix node ui type hints * fix(nodes): controlnet input accepts list or single controlnet * Moved to controlnet_aux v0.0.4, reinstated Zoe controlnet preprocessor. Also in pyproject.toml had to specify downgrade of timm to 0.6.13 _after_ controlnet-aux installs timm >= 0.9.2, because timm >0.6.13 breaks Zoe preprocessor. * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Switching to ControlField for output from controlnet nodes. * Resolving conflicts in rebase to origin/main * Refactored ControlNet nodes so they subclass from PreprocessedControlInvocation, and only need to override run_processor(image) (instead of reimplementing invoke()) * changes to base class for controlnet nodes * Added HED, LineArt, and OpenPose ControlNet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Added support for using multiple control nets. Unfortunately this breaks direct usage of Control node output port ==> TextToLatent control input port -- passing through a Collect node is now required. Working on fixing this... * Fixed use of ControlNet control_weight parameter * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Refactored controlnet node to output ControlField that bundles control info. * changes to base class for controlnet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Cleaning up TextToLatent arg testing * Cleaning up mistakes after rebase. * Removed last bits of dtype and and device hardwiring from controlnet section * Refactored ControNet support to consolidate multiple parameters into data struct. Also redid how multiple controlnets are handled. * Added support for specifying which step iteration to start using each ControlNet, and which step to end using each controlnet (specified as fraction of total steps) * Cleaning up prior to submitting ControlNet PR. Mostly turning off diagnostic printing. Also fixed error when there is no controlnet input. * Commented out ZoeDetector. Will re-instate once there's a controlnet-aux release that supports it. * Switched CotrolNet node modelname input from free text to default list of popular ControlNet model names. * Fix to work with current stable release of controlnet_aux (v0.0.3). Turned of pre-processor params that were added post v0.0.3. Also change defaults for shuffle. * Refactored most of controlnet code into its own method to declutter TextToLatents.invoke(), and make upcoming integration with LatentsToLatents easier. * Cleaning up after ControlNet refactor in TextToLatentsInvocation * Extended node-based ControlNet support to LatentsToLatentsInvocation. * chore(ui): regen api client * fix(ui): fix node ui type hints * fix(nodes): controlnet input accepts list or single controlnet * Added Mediapipe image processor for use as ControlNet preprocessor. Also hacked in ability to specify HF subfolder when loading ControlNet models from string. * Fixed bug where MediapipFaceProcessorInvocation was ignoring max_faces and min_confidence params. * Added nodes for float params: ParamFloatInvocation and FloatCollectionOutput. Also added FloatOutput. * Added mediapipe install requirement. Should be able to remove once controlnet_aux package adds mediapipe to its requirements. * Added float to FIELD_TYPE_MAP ins constants.ts * Progress toward improvement in fieldTemplateBuilder.ts getFieldType() * Fixed controlnet preprocessors and controlnet handling in TextToLatents to work with revised Image services. * Cleaning up from merge, re-adding cfg_scale to FIELD_TYPE_MAP * Making sure cfg_scale of type list[float] can be used in image metadata, to support param easing for cfg_scale * Fixed math for per-step param easing. * Added option to show plot of param value at each step * Just cleaning up after adding param easing plot option, removing vestigial code. * Modified control_weight ControlNet param to be polistmorphic -- can now be either a single float weight applied for all steps, or a list of floats of size total_steps, that specifies weight for each step. * Added more informative error message when _validat_edge() throws an error. * Just improving parm easing bar chart title to include easing type. * Added requirement for easing-functions package * Taking out some diagnostic prints. * Added option to use both easing function and mirror of easing function together. * Fixed recently introduced problem (when pulled in main), triggered by num_steps in StepParamEasingInvocation not having a default value -- just added default. --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-06-11 16:27:44 +10:00
Sergey Borisov	738ba40f51	Fixes	2023-06-11 06:12:21 +03:00
Sergey Borisov	3ce3a7ee72	Rewrite model configs, separate models	2023-06-11 04:49:09 +03:00
Lincoln Stein	74b43c9bdf	fix incorrect variable/typenames in model_cache	2023-06-10 10:41:48 -04:00
Lincoln Stein	3d2ff7755e	resolve conflicts	2023-06-10 10:13:54 -04:00
Lincoln Stein	a87d52a389	resolve conflicts between lstein & sttalker changes	2023-06-10 09:59:19 -04:00
Lincoln Stein	959e64c9b3	start removing repo_id support	2023-06-10 09:57:23 -04:00
Sergey Borisov	2c056ead42	New models structure draft	2023-06-10 03:14:10 +03:00
Lincoln Stein	30f20b55d5	fix logger behavior so that it is initialized after command line parsed (#3509 ) In some cases the command-line was getting parsed before the logger was initialized, causing the logger not to pick up custom logging instructions from `--log_handlers`. This PR fixes the issue.	2023-06-09 08:24:47 -07:00
Lincoln Stein	1bca32ed16	Merge branch 'main' into lstein/fix-logger-reconfiguration	2023-06-09 06:27:26 -07:00
psychedelicious	7f91139e21	fix(ui): fix crash when using dropdown on certain device resolutions	2023-06-09 22:19:30 +10:00
blessedcoolant	c53b7c7389	ui: misc fixes (#3525 ) [fix(ui): blur tab on click](`93f3658a4a`) Fixes issue where after clicking a tab, using the arrow keys changes tab instead of changing selected image [fix(ui): fix canvas not filling screen on first load](`68be95acbb`) [feat(ui): remove clear temp folder canvas button](`813f79f0f9`) This button is nonfunctional. Soon we will introduce a different way to handle clearing out intermediate images (likely automated).	2023-06-09 23:44:21 +12:00
psychedelicious	93f3658a4a	fix(ui): blur tab on click Fixes issue where after clicking a tab, using the arrow keys changes tab instead of changing selected image	2023-06-09 18:20:52 +10:00
psychedelicious	68be95acbb	fix(ui): fix canvas not filling screen on first load	2023-06-09 17:55:11 +10:00
psychedelicious	813f79f0f9	feat(ui): remove clear temp folder canvas button This button is nonfunctional. Soon we will introduce a different way to handle clearing out intermediate images (likely automated).	2023-06-09 17:33:17 +10:00
blessedcoolant	c3ec86bc70	feat(ui): enhance IAICustomSelect (#3523 ) Now accepts an array of strings or array of `IAICustomSelectOption`s. This supports custom labels and tooltips within the select component.	2023-06-09 18:26:20 +12:00
psychedelicious	05a19753c6	feat(ui): remove controlnet model descriptions These are not yet exposed on the UI - somebody who understands what they do better can add them when we have a place to expose them	2023-06-09 16:20:30 +10:00
psychedelicious	a33327c651	feat(ui): enhance IAICustomSelect Now accepts an array of strings or array of `IAICustomSelectOption`s. This supports custom labels and tooltips within the select component.	2023-06-09 16:00:17 +10:00
blessedcoolant	6ad7cc4f2a	feat(ui): decrease delay on dnd to 150ms (#3522 )	2023-06-09 17:54:24 +12:00
psychedelicious	c506355b8b	feat(ui): decrease delay on dnd to 150ms	2023-06-09 15:53:17 +10:00
psychedelicious	d54168b8fb	feat(nodes): add tests for depth-first execution	2023-06-09 14:53:45 +10:00
psychedelicious	c91b071c47	fix(nodes): use DFS with preorder traversal	2023-06-09 14:53:45 +10:00
psychedelicious	9c57b18008	fix(nodes): update `Invoker.invoke()` docstring	2023-06-09 14:53:45 +10:00
psychedelicious	69539a0472	feat(nodes): depth-first execution There was an issue where for graphs w/ iterations, your images were output all at once, at the very end of processing. So if you canceled halfway through an execution of 10 nodes, you wouldn't get any images - even though you'd completed 5 images' worth of inference. ## Cause Because graphs executed breadth-first (i.e. depth-by-depth), leaf nodes were necessarily processed last. For image generation graphs, your `LatentsToImage` will be leaf nodes, and be the last depth to be executed. For example, a `TextToLatents` graph w/ 3 iterations would execute all 3 `TextToLatents` nodes fully before moving to the next depth, where the `LatentsToImage` nodes produce output images, resulting in a node execution order like this: 1. TextToLatents 2. TextToLatents 3. TextToLatents 4. LatentsToImage 5. LatentsToImage 6. LatentsToImage ## Solution This PR makes a two changes to graph execution to execute as deeply as it can along each branch of the graph. ### Eager node preparation We now prepare as many nodes as possible, instead of just a single node at a time. We also need to change the conditions in which nodes are prepared. Previously, nodes were prepared only when all of their direct ancestors were executed. The updated logic prepares nodes that: - are not `Iterate` nodes whose inputs have not been executed - do not have any unexecuted `Iterate` ancestor nodes This results in graphs always being maximally prepared. ### Always execute the deepest prepared node We now choose the next node to execute by traversing from the bottom of the graph instead of the top, choosing the first node whose inputs are all executed. This means we always execute the deepest node possible. ## Result Graphs now execute depth-first, so instead of an execution order like this: 1. TextToLatents 2. TextToLatents 3. TextToLatents 4. LatentsToImage 5. LatentsToImage 6. LatentsToImage ... we get an execution order like this: 1. TextToLatents 2. LatentsToImage 3. TextToLatents 4. LatentsToImage 5. TextToLatents 6. LatentsToImage Immediately after inference, the image is decoded and sent to the gallery. fixes #3400	2023-06-09 14:53:45 +10:00
blessedcoolant	7bce455d16	Merge branch 'main' into diffusers-upgrade	2023-06-09 16:27:52 +12:00
blessedcoolant	3f45294c61	feat(ui): restore reset button for init image (#3521 )	2023-06-09 16:02:26 +12:00
psychedelicious	fd03c7eebe	feat(ui): restore reset button for init image	2023-06-09 14:00:23 +10:00
blessedcoolant	07c49a5726	feat(ui): skip resize on img2img if not needed (#3520 )	2023-06-09 15:56:22 +12:00
psychedelicious	8c688f8e29	feat(ui): skip resize on img2img if not needed	2023-06-09 13:54:23 +10:00
Lincoln Stein	887576d217	add directory scanning for loras, controlnets and textual_inversions	2023-06-08 23:11:53 -04:00
Lincoln Stein	6652f3405b	merge with main	2023-06-08 21:08:43 -04:00
Lincoln Stein	3d13167d32	Merge branch 'main' into lstein/fix-logger-reconfiguration	2023-06-08 13:41:24 -07:00
Lincoln Stein	f2bb507ebb	allow logger to be reconfigured after startup	2023-06-08 09:23:11 -04:00
blessedcoolant	fe8f3381fc	create databases directory on startup (#3518 ) This PR creates the databases directory at app startup time. It also removes a couple of debugging statements that were inadvertently left in the model manager.	2023-06-08 23:40:32 +12:00
Lincoln Stein	2a6d11e645	create databases directory on startup	2023-06-08 07:17:54 -04:00
Lincoln Stein	01f46d3c7d	Merge branch 'main' into lstein/fix-logger-reconfiguration	2023-06-07 19:50:44 -07:00
Lincoln Stein	5f76b62553	Update installer support for main (#3448 ) # Make InvokeAI package installable by mere mortals This commit makes InvokeAI 3.0 to be installable via PyPi.org and/or the installer script. The install process is now pretty much identical to the 2.3 process, including creating launcher scripts `invoke.sh` and `invoke.bat`. Main changes: 1. Moved static web pages into `invokeai/frontend/web` and modified the API to look for them there. This allows pip to copy the files into the distribution directory so that user no longer has to be in repo root to launch, and enables PyPi installations with `pip install invokeai` 2. Update invoke.sh and invoke.bat to launch the new web application properly. This also changes the wording for launching the CLI from "generate images" to "explore the InvokeAI node system," since I would not recommend using the CLI to generate images routinely. 3. Fix a bug in the checkpoint converter script that was identified during testing. 4. Better error reporting when checkpoint converter fails. 5. Rebuild front end. # Major improvements to the model installer. 1. The text user interface for `invokeai-model-install` has been expanded to allow the user to install controlnet, LoRA, textual inversion, diffusers and checkpoint models. The user can install interactively (without leaving the TUI), or in batch mode after exiting the application. ![image](https://github.com/invoke-ai/InvokeAI/assets/111189/f8f7ac23-3e18-4973-b7fe-729864c703a0) 2. The `invokeai-model-install` command now lets you list, add and delete models from the command line: ## Listing models ``` $ invokeai-model-install --list diffusers Diffuser models: analog-diffusion-1.0 not loaded diffusers An SD-1.5 model trained on diverse analog photographs (2.13 GB) d&d-diffusion-1.0 not loaded diffusers Dungeons & Dragons characters (2.13 GB) deliberate-1.0 not loaded diffusers Versatile model that produces detailed images up to 768px (4.27 GB) DreamShaper not loaded diffusers Imported diffusers model DreamShaper sd-inpainting-1.5 not loaded diffusers RunwayML SD 1.5 model optimized for inpainting, diffusers version (4.27 GB) sd-inpainting-2.0 not loaded diffusers Stable Diffusion version 2.0 inpainting model (5.21 GB) stable-diffusion-1.5 not loaded diffusers Stable Diffusion version 1.5 diffusers model (4.27 GB) stable-diffusion-2.1 not loaded diffusers Stable Diffusion version 2.1 diffusers model, trained on 768 pixel images (5.21 GB) ``` ``` $ invokeai-model-install --list tis Loading Python libraries... Installed Textual Inversion Embeddings: EasyNegative ahx-beta-453407d ``` ## Installing models (this example shows correct handling of a server side error at Civitai) ``` $ invokeai-model-install --diffusers https://civitai.com/api/download/models/46259 Linaqruf/anything-v3.0 Loading Python libraries... [2023-06-05 22:17:23,556]::[InvokeAI]::INFO --> INSTALLING EXTERNAL MODELS [2023-06-05 22:17:23,557]::[InvokeAI]::INFO --> Probing https://civitai.com/api/download/models/46259 for import [2023-06-05 22:17:23,557]::[InvokeAI]::INFO --> https://civitai.com/api/download/models/46259 appears to be a URL [2023-06-05 22:17:23,763]::[InvokeAI]::ERROR --> An error occurred during downloading /home/lstein/invokeai-test/models/ldm/stable-diffusion-v1/46259: Internal Server Error [2023-06-05 22:17:23,763]::[InvokeAI]::ERROR --> ERROR DOWNLOADING https://civitai.com/api/download/models/46259: {"error":"Invalid database operation","cause":{"clientVersion":"4.12.0"}} [2023-06-05 22:17:23,764]::[InvokeAI]::INFO --> Probing Linaqruf/anything-v3.0 for import [2023-06-05 22:17:23,764]::[InvokeAI]::DEBUG --> Linaqruf/anything-v3.0 appears to be a HuggingFace diffusers repo_id [2023-06-05 22:17:23,768]::[InvokeAI]::INFO --> Loading diffusers model from Linaqruf/anything-v3.0 [2023-06-05 22:17:23,769]::[InvokeAI]::DEBUG --> Using faster float16 precision [2023-06-05 22:17:23,883]::[InvokeAI]::ERROR --> An unexpected error occurred while downloading the model: 404 Client Error. (Request ID: Root=1-647e9733-1b0ee3af67d6ac3456b1ebfc) Revision Not Found for url: https://huggingface.co/Linaqruf/anything-v3.0/resolve/fp16/model_index.json. Invalid rev id: fp16) Downloading (…)ain/model_index.json: 100%\|██████████████████████████████████████████████████████████████████████████████████████████████\| 511/511 [00:00<00:00, 2.57MB/s] Downloading (…)cial_tokens_map.json: 100%\|██████████████████████████████████████████████████████████████████████████████████████████████\| 472/472 [00:00<00:00, 6.13MB/s] Downloading (…)cheduler_config.json: 100%\|██████████████████████████████████████████████████████████████████████████████████████████████\| 341/341 [00:00<00:00, 3.30MB/s] Downloading (…)okenizer_config.json: 100%\|██████████████████████████████████████████████████████████████████████████████████████████████\| 807/807 [00:00<00:00, 11.3MB/s] ``` ## Deleting models ``` invokeai-model-install --delete --diffusers anything-v3 Loading Python libraries... [2023-06-05 22:19:45,927]::[InvokeAI]::INFO --> Processing requested deletions [2023-06-05 22:19:45,927]::[InvokeAI]::INFO --> anything-v3... [2023-06-05 22:19:45,927]::[InvokeAI]::INFO --> Deleting the cached model directory for Linaqruf/anything-v3.0 [2023-06-05 22:19:45,948]::[InvokeAI]::WARNING --> Deletion of this model is expected to free 4.3G ```	2023-06-07 19:25:07 -07:00
Lincoln Stein	4bbe3b0d00	Merge branch 'main' into release/make-web-dist-startable	2023-06-07 19:21:01 -07:00
Lincoln Stein	9ed86a08f1	multiple small fixes 1. Contents of autoscan directory field are restored after doing an installation. 2. Activate dialogue to choose V2 parameterization when importing from a directory. 3. Remove autoscan directory from init file when its checkbox is unselected. 4. Add widget cycling behavior to install models form.	2023-06-07 17:32:00 -04:00
blessedcoolant	68405910ba	Upgrade to Diffusers 0.17.0	2023-06-08 04:42:52 +12:00
blessedcoolant	0a50e2638c	fix(ui): default controlnet autoprocess to true (#3513 ) I had accidentally defaulted it to false	2023-06-08 01:56:53 +12:00
psychedelicious	fc7c5da4dd	fix(ui): default controlnet autoprocess to true I had accidentally defaulted it to false	2023-06-07 23:55:24 +10:00
Lincoln Stein	a3357e073c	refactor exception handling	2023-06-07 07:35:34 -04:00
Lincoln Stein	d114833a12	pause after printing exception	2023-06-07 07:26:14 -04:00

... 4 5 6 7 8 ...

5080 Commits