InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
Gregg Helt	c647056287	Feat/easy param (#3504 ) * Testing change to LatentsToText to allow setting different cfg_scale values per diffusion step. * Adding first attempt at float param easing node, using Penner easing functions. * Core implementation of ControlNet and MultiControlNet. * Added support for ControlNet and MultiControlNet to legacy non-nodal Txt2Img in backend/generator. Although backend/generator will likely disappear by v3.x, right now they are very useful for testing core ControlNet and MultiControlNet functionality while node codebase is rapidly evolving. * Added example of using ControlNet with legacy Txt2Img generator * Resolving rebase conflict * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Resolving conflicts in rebase to origin/main * Refactored ControlNet nodes so they subclass from PreprocessedControlInvocation, and only need to override run_processor(image) (instead of reimplementing invoke()) * changes to base class for controlnet nodes * Added HED, LineArt, and OpenPose ControlNet nodes * Added an additional "raw_processed_image" output port to controlnets, mainly so could route ImageField to a ShowImage node * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * More rebase repair. * Added support for using multiple control nets. Unfortunately this breaks direct usage of Control node output port ==> TextToLatent control input port -- passing through a Collect node is now required. Working on fixing this... * Fixed use of ControlNet control_weight parameter * Fixed lint-ish formatting error * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Refactored controlnet node to output ControlField that bundles control info. * changes to base class for controlnet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Cleaning up TextToLatent arg testing * Cleaning up mistakes after rebase. * Removed last bits of dtype and and device hardwiring from controlnet section * Refactored ControNet support to consolidate multiple parameters into data struct. Also redid how multiple controlnets are handled. * Added support for specifying which step iteration to start using each ControlNet, and which step to end using each controlnet (specified as fraction of total steps) * Cleaning up prior to submitting ControlNet PR. Mostly turning off diagnostic printing. Also fixed error when there is no controlnet input. * Added dependency on controlnet-aux v0.0.3 * Commented out ZoeDetector. Will re-instate once there's a controlnet-aux release that supports it. * Switched CotrolNet node modelname input from free text to default list of popular ControlNet model names. * Fix to work with current stable release of controlnet_aux (v0.0.3). Turned of pre-processor params that were added post v0.0.3. Also change defaults for shuffle. * Refactored most of controlnet code into its own method to declutter TextToLatents.invoke(), and make upcoming integration with LatentsToLatents easier. * Cleaning up after ControlNet refactor in TextToLatentsInvocation * Extended node-based ControlNet support to LatentsToLatentsInvocation. * chore(ui): regen api client * fix(ui): add value to conditioning field * fix(ui): add control field type * fix(ui): fix node ui type hints * fix(nodes): controlnet input accepts list or single controlnet * Moved to controlnet_aux v0.0.4, reinstated Zoe controlnet preprocessor. Also in pyproject.toml had to specify downgrade of timm to 0.6.13 _after_ controlnet-aux installs timm >= 0.9.2, because timm >0.6.13 breaks Zoe preprocessor. * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Switching to ControlField for output from controlnet nodes. * Resolving conflicts in rebase to origin/main * Refactored ControlNet nodes so they subclass from PreprocessedControlInvocation, and only need to override run_processor(image) (instead of reimplementing invoke()) * changes to base class for controlnet nodes * Added HED, LineArt, and OpenPose ControlNet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Added support for using multiple control nets. Unfortunately this breaks direct usage of Control node output port ==> TextToLatent control input port -- passing through a Collect node is now required. Working on fixing this... * Fixed use of ControlNet control_weight parameter * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Refactored controlnet node to output ControlField that bundles control info. * changes to base class for controlnet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff form image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Cleaning up TextToLatent arg testing * Cleaning up mistakes after rebase. * Removed last bits of dtype and and device hardwiring from controlnet section * Refactored ControNet support to consolidate multiple parameters into data struct. Also redid how multiple controlnets are handled. * Added support for specifying which step iteration to start using each ControlNet, and which step to end using each controlnet (specified as fraction of total steps) * Cleaning up prior to submitting ControlNet PR. Mostly turning off diagnostic printing. Also fixed error when there is no controlnet input. * Commented out ZoeDetector. Will re-instate once there's a controlnet-aux release that supports it. * Switched CotrolNet node modelname input from free text to default list of popular ControlNet model names. * Fix to work with current stable release of controlnet_aux (v0.0.3). Turned of pre-processor params that were added post v0.0.3. Also change defaults for shuffle. * Refactored most of controlnet code into its own method to declutter TextToLatents.invoke(), and make upcoming integration with LatentsToLatents easier. * Cleaning up after ControlNet refactor in TextToLatentsInvocation * Extended node-based ControlNet support to LatentsToLatentsInvocation. * chore(ui): regen api client * fix(ui): fix node ui type hints * fix(nodes): controlnet input accepts list or single controlnet * Added Mediapipe image processor for use as ControlNet preprocessor. Also hacked in ability to specify HF subfolder when loading ControlNet models from string. * Fixed bug where MediapipFaceProcessorInvocation was ignoring max_faces and min_confidence params. * Added nodes for float params: ParamFloatInvocation and FloatCollectionOutput. Also added FloatOutput. * Added mediapipe install requirement. Should be able to remove once controlnet_aux package adds mediapipe to its requirements. * Added float to FIELD_TYPE_MAP ins constants.ts * Progress toward improvement in fieldTemplateBuilder.ts getFieldType() * Fixed controlnet preprocessors and controlnet handling in TextToLatents to work with revised Image services. * Cleaning up from merge, re-adding cfg_scale to FIELD_TYPE_MAP * Making sure cfg_scale of type list[float] can be used in image metadata, to support param easing for cfg_scale * Fixed math for per-step param easing. * Added option to show plot of param value at each step * Just cleaning up after adding param easing plot option, removing vestigial code. * Modified control_weight ControlNet param to be polistmorphic -- can now be either a single float weight applied for all steps, or a list of floats of size total_steps, that specifies weight for each step. * Added more informative error message when _validat_edge() throws an error. * Just improving parm easing bar chart title to include easing type. * Added requirement for easing-functions package * Taking out some diagnostic prints. * Added option to use both easing function and mirror of easing function together. * Fixed recently introduced problem (when pulled in main), triggered by num_steps in StepParamEasingInvocation not having a default value -- just added default. --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-06-11 16:27:44 +10:00
psychedelicious	c91b071c47	fix(nodes): use DFS with preorder traversal	2023-06-09 14:53:45 +10:00
psychedelicious	9c57b18008	fix(nodes): update `Invoker.invoke()` docstring	2023-06-09 14:53:45 +10:00
psychedelicious	69539a0472	feat(nodes): depth-first execution There was an issue where for graphs w/ iterations, your images were output all at once, at the very end of processing. So if you canceled halfway through an execution of 10 nodes, you wouldn't get any images - even though you'd completed 5 images' worth of inference. ## Cause Because graphs executed breadth-first (i.e. depth-by-depth), leaf nodes were necessarily processed last. For image generation graphs, your `LatentsToImage` will be leaf nodes, and be the last depth to be executed. For example, a `TextToLatents` graph w/ 3 iterations would execute all 3 `TextToLatents` nodes fully before moving to the next depth, where the `LatentsToImage` nodes produce output images, resulting in a node execution order like this: 1. TextToLatents 2. TextToLatents 3. TextToLatents 4. LatentsToImage 5. LatentsToImage 6. LatentsToImage ## Solution This PR makes a two changes to graph execution to execute as deeply as it can along each branch of the graph. ### Eager node preparation We now prepare as many nodes as possible, instead of just a single node at a time. We also need to change the conditions in which nodes are prepared. Previously, nodes were prepared only when all of their direct ancestors were executed. The updated logic prepares nodes that: - are not `Iterate` nodes whose inputs have not been executed - do not have any unexecuted `Iterate` ancestor nodes This results in graphs always being maximally prepared. ### Always execute the deepest prepared node We now choose the next node to execute by traversing from the bottom of the graph instead of the top, choosing the first node whose inputs are all executed. This means we always execute the deepest node possible. ## Result Graphs now execute depth-first, so instead of an execution order like this: 1. TextToLatents 2. TextToLatents 3. TextToLatents 4. LatentsToImage 5. LatentsToImage 6. LatentsToImage ... we get an execution order like this: 1. TextToLatents 2. LatentsToImage 3. TextToLatents 4. LatentsToImage 5. TextToLatents 6. LatentsToImage Immediately after inference, the image is decoded and sent to the gallery. fixes #3400	2023-06-09 14:53:45 +10:00
Lincoln Stein	90333c0074	merge with main	2023-06-05 22:03:44 -04:00
Lincoln Stein	9e31b1f387	Merge branch 'main' into lstein/config-management-fixes	2023-06-04 18:17:43 -04:00
Lincoln Stein	31e97ead2a	move invokeai.db to ~/invokeai/databases - The invokeai.db database file has now been moved into `INVOKEAIROOT/databases`. Using plural here for possible future with more than one database file. - Removed a few dangling debug messages that appeared during testing. - Rebuilt frontend to test web.	2023-06-03 20:25:34 -04:00
Lincoln Stein	0b49995659	merge with main	2023-06-03 20:06:27 -04:00
Lincoln Stein	31281d7181	Merge branch 'main' into lstein/logging-improvements	2023-06-02 22:56:13 -04:00
Lincoln Stein	1632ac6b9f	add controlnet model downloading	2023-05-30 13:49:43 -04:00
Lincoln Stein	c9ee42450e	added controlnet models to frontend; backend needs to be done	2023-05-30 00:38:37 -04:00
Lincoln Stein	10fe31c2a1	Merge branch 'main' into lstein/config-management-fixes	2023-05-29 21:03:03 -04:00
psychedelicious	3e3dd39ae4	fix(nodes): fix images service update() for `is_intermediate`	2023-05-28 20:19:56 -04:00
psychedelicious	f31e62afad	feat(nodes): make list images route use offset pagination Because we dynamically insert images into the DB and UI's images state, `page`/`per_page` pagination makes loading the images awkward. Using `offset`/`limit` pagination lets us query for images with an offset equal to the number of images already loaded (which match the query parameters). The result is that we always get the correct next page of images when loading more.	2023-05-28 20:19:56 -04:00
psychedelicious	160267c71a	feat(nodes): refactor image types - Remove `ImageType` entirely, it is confusing - Create `ResourceOrigin`, may be `internal` or `external` - Revamp `ImageCategory`, may be `general`, `mask`, `control`, `user`, `other`. Expect to add more as time goes on - Update images `list` route to accept `include_categories` OR `exclude_categories` query parameters to afford finer-grained querying. All services are updated to accomodate this change. The new setup should account for our types of images, including the combinations we couldn't really handle until now: - Canvas init and masks - Canvas when saved-to-gallery or merged	2023-05-28 20:19:56 -04:00
psychedelicious	fd47e70c92	feat(nodes): use higher precision timestamps in db	2023-05-28 20:19:56 -04:00
psychedelicious	9317b42e5f	feat(nodes, ui): wip image types	2023-05-28 20:19:56 -04:00
psychedelicious	ee0225f4ba	fix(nodes): handle intermediates during `images.get_many()`	2023-05-28 20:19:56 -04:00
psychedelicious	33a0af4637	feat(nodes): add nameservice Currenly only used to make names for images, but when latents, conditioning, etc are managed in DB, will do the same for them. Intended to eventually support custom naming schemes.	2023-05-28 20:19:56 -04:00
psychedelicious	020f3ccf07	fix(nodes): controlnet input accepts list or single controlnet	2023-05-26 21:44:00 -04:00
Lincoln Stein	5c0f0d1808	Merge branch 'main' into lstein/logging-improvements	2023-05-26 08:57:17 -04:00
Lincoln Stein	951900a86a	Merge branch 'main' into lstein/config-management-fixes	2023-05-26 08:56:41 -04:00
psychedelicious	d2c8a53c55	feat(nodes): change intermediates handling - `ImageType` is now restricted to `results` and `uploads`. - Add a reserved `meta` field to nodes to hold the `is_intermediate` boolean. We can extend it in the future to support other node `meta`. - Add a `is_intermediate` column to the `images` table to hold this. (When `latents`, `conditioning` etc are added to the DB, they will also have this column.) - All nodes default to `not intermediate`. Nodes must explicitly be marked `intermediate` for their outputs to be `intermediate`. - When building a graph, you can set `node.meta.is_intermediate=True` and it will be handled as an intermediate. - Add a new `update()` method to the `ImageService`, and a route to call it. Updates have a strict model, currently only `session_id` and `image_category` may be updated. - Add a new `update()` method to the `ImageRecordStorageService` to update the image record using the model.	2023-05-25 22:17:14 -04:00
Lincoln Stein	e56965ad76	documentation tweaks; fixed initialization in a couple more places	2023-05-25 21:10:00 -04:00
Lincoln Stein	2273b3a8c8	fix potential race condition in config system	2023-05-25 20:41:26 -04:00
Lincoln Stein	34f567abd4	Merge branch 'main' into lstein/logging-improvements	2023-05-25 08:48:47 -04:00
Lincoln Stein	b87f3043ae	add logging configuration	2023-05-24 23:57:15 -04:00
psychedelicious	3829ffbe66	fix(tests): add `--use_memory_db` flag; use it in tests	2023-05-25 12:12:31 +10:00
psychedelicious	3000436121	chore(nodes): remove unused imports	2023-05-25 12:12:31 +10:00
psychedelicious	37cdd91f5d	fix(nodes): use forward declarations for InvocationServices Also use `TYPE_CHECKING` to get IDE hints.	2023-05-25 12:12:31 +10:00
psychedelicious	ff6b345d45	fix(nodes): rebase fixes	2023-05-24 11:30:47 -04:00
psychedelicious	d2c223de8f	feat(nodes): move fully* to new images service * except i haven't rebuilt inpaint in latents	2023-05-24 11:30:47 -04:00
psychedelicious	23d9d58c08	fix(nodes): fix bugs with serving images When returning a `FileResponse`, we must provide a valid path, else an exception is raised outside the route handler. Add the `validate_path` method back to the service so we can validate paths before returning the file. I don't like this but apparently this is just how `starlette` and `fastapi` works with `FileResponse`.	2023-05-24 11:30:47 -04:00
psychedelicious	035425ef24	feat(nodes): address feedback - Address database feedback: - Remove all the extraneous tables. Only an `images` table now: - `image_type` and `image_category` are unrestricted strings. When creating images, the provided values are checked to ensure they are a valid type and category. - Add `updated_at` and `deleted_at` columns. `deleted_at` is currently unused. - Use SQLite's built-in timestamp features to populate these. Add a trigger to update `updated_at` when the row is updated. Currently no way to update a row. - Rename the `id` column in `images` to `image_name` - Rename `ImageCategory.IMAGE` to `ImageCategory.GENERAL` - Move all exceptions outside their base classes to make them more portable. - Add `width` and `height` columns to the database. These store the actual dimensions of the image file, whereas the metadata's `width` and `height` refer to the respective generation parameters and are nullable. - Make `deserialize_image_record` take a `dict` instead of `sqlite3.Row` - Improve comments throughout - Tidy up unused code/files and some minor organisation	2023-05-24 11:30:47 -04:00
psychedelicious	021e5a2aa3	feat(nodes): improve metadata service comments	2023-05-24 11:30:47 -04:00
psychedelicious	c31ff364ab	fix(nodes): tidy images service	2023-05-24 11:30:47 -04:00
psychedelicious	5a7e611e0a	fix(nodes): fix image url	2023-05-24 11:30:47 -04:00
psychedelicious	5de3c41d19	feat(nodes): add metadata handling	2023-05-24 11:30:47 -04:00
psychedelicious	b9375186a5	feat(nodes): consolidate image routers	2023-05-24 11:30:47 -04:00
psychedelicious	11bd932cba	feat(nodes): revert invocation_complete url hack	2023-05-24 11:30:47 -04:00
psychedelicious	60d25f105f	fix(nodes): restore metadata traverser	2023-05-24 11:30:47 -04:00
psychedelicious	52c9e6ec91	feat(nodes): organise/tidy	2023-05-24 11:30:47 -04:00
psychedelicious	c0f132e41a	hack(nodes): hack to get image urls in the invocation complete event	2023-05-24 11:30:47 -04:00
psychedelicious	cc1160a43a	feat(nodes): streamline urlservice	2023-05-24 11:30:47 -04:00
psychedelicious	5bf9891553	feat(nodes): it works	2023-05-24 11:30:47 -04:00
psychedelicious	22c34c343a	feat(nodes): fix types for InvocationServices	2023-05-24 11:30:47 -04:00
psychedelicious	f7804f6126	feat(nodes): add logger to images service	2023-05-24 11:30:47 -04:00
psychedelicious	d14b02e93f	feat(logger): fix logger type issues	2023-05-24 11:30:47 -04:00
psychedelicious	1b75d899ae	feat(nodes): wip image storage implementation	2023-05-24 11:30:47 -04:00
psychedelicious	d4aa79acd7	fix(nodes): use `save` instead of `set` `set` is a python builtin	2023-05-24 11:30:47 -04:00

1 2 3

124 Commits