InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
user1	909f538fb5	Switching over to controlnet_utils prepare_control_image(), with added resize_mode.	2023-07-20 00:41:49 -07:00
Martin Kristiansen	07c48b2fd1	Moving detected precision to DEFAULT_PRECISION constant	2023-07-19 11:55:37 -04:00
Martin Kristiansen	fface339ae	Same fix for ImageToLatentsInvocation	2023-07-19 11:38:13 -04:00
Martin Kristiansen	2ec9dab595	Changing ImageToLatentsInvocation node to default to detected precision instead of fp16	2023-07-19 11:16:00 -04:00
mickr777	d8db618de0	import choose_torch_device from ...backend.util.devices	2023-07-19 16:43:02 +10:00
mickr777	19d67b29e7	Remove not needed text	2023-07-19 15:20:40 +10:00
mickr777	52e7e0b31b	Missing def choose_torch_device	2023-07-19 15:15:55 +10:00
Brandon Rising	ee7b36cea5	Merge branch 'main' into onnx-testing	2023-07-18 22:56:41 -04:00
blessedcoolant	3f1d5000c0	Merge branch 'main' into nodes-stuff	2023-07-19 02:37:50 +12:00
blessedcoolant	0c18c5d603	feat: Add titles and tags to all Nodes	2023-07-19 02:26:45 +12:00
StAlKeR7779	889b77d3d6	Merge branch 'main' into save_vram	2023-07-18 16:55:48 +03:00
Sergey Borisov	fbbc4b3f69	Fixes	2023-07-18 16:51:16 +03:00
Sergey Borisov	bc11296a5e	Disable lazy offloading on disabled vram cache, move resulted tensors to cpu(to not stack vram tensors in cache), fix - text encoder not freed(detach)	2023-07-18 16:20:25 +03:00
blessedcoolant	13da881953	Merge branch 'main' into sdxl-support	2023-07-18 13:34:07 +12:00
Sergey Borisov	fe78a08e37	Fix sd1/2 models conditionings	2023-07-16 06:24:24 +03:00
Sergey Borisov	c9c2229917	Separate prompt to sdxl and sdxl-refiner, add denoising start-end fields, add l2l node(supports both sdxl and sdxl-refiner), add fp32 to vae encode	2023-07-16 06:00:37 +03:00
Lincoln Stein	ccbfa5d862	resolve conflicts	2023-07-15 19:47:50 -04:00
psychedelicious	7b6159f8d6	feat(nodes): emit model loading events - remove dependency on having access to a `node` during emits, would need a bit of additional args passed through the system and I don't think it's necessary at this point. this also allowed us to drop an extraneous fetching/parsing of the session from db. - provide the invocation context to all `get_model()` calls, so the events are able to be emitted - test all model loading events in the app and confirm socket events are received	2023-07-16 02:12:01 +10:00
psychedelicious	29b2e59e65	fix(nodes): fix ref to ctx mgr service, missing import	2023-07-15 19:56:44 +10:00
psychedelicious	788dcbde70	fix(nodes): add missing import	2023-07-15 19:56:44 +10:00
Sergey Borisov	6ab9a5e108	Draft	2023-07-15 19:56:44 +10:00
psychedelicious	23c1a6b9d5	fix(nodes): make ResizeLatents w/h optional now you can connect to them in node editor	2023-07-14 21:42:42 +10:00
Brandon Rising	524888bf3b	Merge branch 'main' into feat/onnx	2023-07-13 14:23:57 -04:00
psychedelicious	50bef87da7	feat(db,nodes,api): refactor metadata Metadata for the Linear UI is now sneakily provided via a `MetadataAccumulator` node, which the client populates / hooks up while building the graph. Additionally, we provide the unexpanded graph with the metadata API response. Both of these are embedded into the PNGs. - Remove `metadata` from `ImageDTO` - Split up the `images/` routes to accommodate this; metadata is only retrieved per-image - `images/{image_name}` now gets the DTO - `images/{image_name}/metadata` gets the new metadata - `images/{image_name}/full` gets the full-sized image file - Remove old metadata service - Add `MetadataAccumulator` node, `CoreMetadataField`, hook up to `LatentsToImage` node - Add `get_raw()` method to `ItemStorage`, retrieves the row from DB as a string, no pydantic parsing - Update `images`-related services to handle storing and retrieving the new metadata - Add `get_metadata_graph_from_raw_session` which extracts the `graph` from `session` without needing to hydrate the session in pydantic, in preparation for providing it as metadata; also removes all references to the `MetadataAccumulator` node	2023-07-13 15:40:05 +10:00
Sergey Borisov	358ced6bab	SDXL Prompt and t2l nodes draft, add fp32 to vae decode	2023-07-11 18:19:36 +03:00
Lincoln Stein	9edf78dd2e	merge with main	2023-07-05 09:12:54 -04:00
blessedcoolant	639d88afd6	revert: inference_mode to no_grad	2023-07-05 16:39:15 +12:00
blessedcoolant	c0501ed5c2	fix: Slow loading of Loras Co-Authored-By: StAlKeR7779 <7768370+StAlKeR7779@users.noreply.github.com>	2023-07-05 12:47:34 +10:00
Lincoln Stein	ed86d0b708	Union[foo, None]=>Optional[foo]	2023-07-03 12:17:45 -04:00
StAlKeR7779	ac46b129bf	Merge branch 'main' into feat/lora_model_patch	2023-06-28 22:43:58 +03:00
psychedelicious	2e14528e4c	feat(nodes): default to CPU noise	2023-06-27 13:57:31 +10:00
Sergey Borisov	5cebf67ee4	Apply lora by patching lora instead of hooks	2023-06-26 03:57:33 +03:00
user1	c5faffc18b	Merge branch 'main' of github.com:invoke-ai/InvokeAI into feat/controlnet-control-modes Only "real" conflicts were in: invokeai/frontend/web/src/features/controlNet/components/ControlNet.tsx invokeai/frontend/web/src/features/controlNet/store/controlNetSlice.ts	2023-06-24 17:05:57 -07:00
psychedelicious	bab3a9504e	fix(nodes): fix LatentsToImage not using `is_intermediate` when creating images Appears this was removed during a merge conflict resolution.	2023-06-24 17:57:39 +10:00
Sergey Borisov	7759b3f75a	Small refactor	2023-06-21 04:24:25 +03:00
Sergey Borisov	4d337f6abc	ONNX Model/runtime first implementation	2023-06-21 02:12:21 +03:00
Sergey Borisov	9b32407744	Provide generator to all schedulers step function to make both ancestral and sde schedulers reproducible	2023-06-19 00:34:01 +03:00
Sergey Borisov	f3d9797ebe	Add dpmpp_sde and dpmpp_2m_sde schedulers(with karras)	2023-06-18 23:38:15 +03:00
blessedcoolant	6b8e88ad7f	Merge branch 'main' into feat/controlnet-control-modes	2023-06-15 03:18:41 +12:00
StAlKeR7779	d0ee3558d1	Merge branch 'main' into lstein/new-model-manager	2023-06-14 17:29:01 +03:00
psychedelicious	a1773197e9	feat(nodes): remove `image_origin` from most places - remove `image_origin` from most places where we interact with images - consolidate image file storage into a single `images/` dir Images have an `image_origin` attribute but it is not actually used when retrieving images, nor will it ever be. It is still used when creating images and helps to differentiate between internally generated images and uploads. It was included in eg API routes and image service methods as a holdover from the previous app implementation where images were not managed in a database. Now that we have images in a db, we can do away with this and simplify basically everything that touches images. The one potentially controversial change is to no longer separate internal and external images on disk. If we retain this separation, we have to keep `image_origin` around in a number of spots and it makes getting image paths on disk painful. So, I have gotten rid of this organisation. Images are now all stored in `images`, regardless of their origin. As we improve the image management features, this change will hopefully become transparent.	2023-06-14 23:08:27 +10:00
user1	de3e6cdb02	Switched over to ControlNet control_mode with 4 options: balanced, more_prompt, more_control, even_more_control. Based on True/False combinations of internal booleans cfg_injection and soft_injection	2023-06-13 21:08:34 -07:00
Sergey Borisov	26090011c4	Fix conflict resolve, add model configs to type annotation	2023-06-14 00:26:37 +03:00
StAlKeR7779	c9ae26a176	Merge branch 'main' into lstein/new-model-manager	2023-06-13 23:37:52 +03:00
user1	8495764d45	Moving from ControlNet guess_mode to separate booleans for cfg_injection and soft_injection for testing control modes	2023-06-13 00:41:36 -07:00
user1	8b7fac75ed	First pass at ControlNet "guess mode" implementation.	2023-06-13 00:41:36 -07:00
blessedcoolant	2a814d886b	Merge branch 'main' into diffusers-upgrade	2023-06-13 05:29:15 +12:00
Gregg Helt	c647056287	Feat/easy param (#3504) * Testing change to LatentsToText to allow setting different cfg_scale values per diffusion step. * Adding first attempt at float param easing node, using Penner easing functions. * Core implementation of ControlNet and MultiControlNet. * Added support for ControlNet and MultiControlNet to legacy non-nodal Txt2Img in backend/generator. Although backend/generator will likely disappear by v3.x, right now they are very useful for testing core ControlNet and MultiControlNet functionality while node codebase is rapidly evolving. * Added example of using ControlNet with legacy Txt2Img generator * Resolving rebase conflict * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Resolving conflicts in rebase to origin/main * Refactored ControlNet nodes so they subclass from PreprocessedControlInvocation, and only need to override run_processor(image) (instead of reimplementing invoke()) * changes to base class for controlnet nodes * Added HED, LineArt, and OpenPose ControlNet nodes * Added an additional "raw_processed_image" output port to controlnets, mainly so could route ImageField to a ShowImage node * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff from image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * More rebase repair. * Added support for using multiple control nets.
Unfortunately this breaks direct usage of Control node output port ==> TextToLatent control input port -- passing through a Collect node is now required. Working on fixing this... * Fixed use of ControlNet control_weight parameter * Fixed lint-ish formatting error * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Refactored controlnet node to output ControlField that bundles control info. * changes to base class for controlnet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff from image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Cleaning up TextToLatent arg testing * Cleaning up mistakes after rebase. * Removed last bits of dtype and device hardwiring from controlnet section * Refactored ControlNet support to consolidate multiple parameters into data struct. Also redid how multiple controlnets are handled. * Added support for specifying which step iteration to start using each ControlNet, and which step to end using each controlnet (specified as fraction of total steps) * Cleaning up prior to submitting ControlNet PR. Mostly turning off diagnostic printing. Also fixed error when there is no controlnet input. * Added dependency on controlnet-aux v0.0.3 * Commented out ZoeDetector. Will re-instate once there's a controlnet-aux release that supports it.
* Switched ControlNet node modelname input from free text to default list of popular ControlNet model names. * Fix to work with current stable release of controlnet_aux (v0.0.3). Turned off pre-processor params that were added post v0.0.3. Also change defaults for shuffle. * Refactored most of controlnet code into its own method to declutter TextToLatents.invoke(), and make upcoming integration with LatentsToLatents easier. * Cleaning up after ControlNet refactor in TextToLatentsInvocation * Extended node-based ControlNet support to LatentsToLatentsInvocation. * chore(ui): regen api client * fix(ui): add value to conditioning field * fix(ui): add control field type * fix(ui): fix node ui type hints * fix(nodes): controlnet input accepts list or single controlnet * Moved to controlnet_aux v0.0.4, reinstated Zoe controlnet preprocessor. Also in pyproject.toml had to specify downgrade of timm to 0.6.13 _after_ controlnet-aux installs timm >= 0.9.2, because timm >0.6.13 breaks Zoe preprocessor. * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Switching to ControlField for output from controlnet nodes. * Resolving conflicts in rebase to origin/main * Refactored ControlNet nodes so they subclass from PreprocessedControlInvocation, and only need to override run_processor(image) (instead of reimplementing invoke()) * changes to base class for controlnet nodes * Added HED, LineArt, and OpenPose ControlNet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff from image processing/analysis nodes. * Added resizing of controlnet image based on noise latent.
Fixes a tensor mismatch issue. * Added support for using multiple control nets. Unfortunately this breaks direct usage of Control node output port ==> TextToLatent control input port -- passing through a Collect node is now required. Working on fixing this... * Fixed use of ControlNet control_weight parameter * Core implementation of ControlNet and MultiControlNet. * Added first controlnet preprocessor node for canny edge detection. * Initial port of controlnet node support from generator-based TextToImageInvocation node to latent-based TextToLatentsInvocation node * Switching to ControlField for output from controlnet nodes. * Refactored controlnet node to output ControlField that bundles control info. * changes to base class for controlnet nodes * Added more preprocessor nodes for: MidasDepth ZoeDepth MLSD NormalBae Pidi LineartAnime ContentShuffle Removed pil_output options, ControlNet preprocessors should always output as PIL. Removed diagnostics and other general cleanup. * Prep for splitting pre-processor and controlnet nodes * Refactored controlnet nodes: split out controlnet stuff into separate node, stripped controlnet stuff from image processing/analysis nodes. * Added resizing of controlnet image based on noise latent. Fixes a tensor mismatch issue. * Cleaning up TextToLatent arg testing * Cleaning up mistakes after rebase. * Removed last bits of dtype and device hardwiring from controlnet section * Refactored ControlNet support to consolidate multiple parameters into data struct. Also redid how multiple controlnets are handled. * Added support for specifying which step iteration to start using each ControlNet, and which step to end using each controlnet (specified as fraction of total steps) * Cleaning up prior to submitting ControlNet PR. Mostly turning off diagnostic printing. Also fixed error when there is no controlnet input. * Commented out ZoeDetector. Will re-instate once there's a controlnet-aux release that supports it.
* Switched ControlNet node modelname input from free text to default list of popular ControlNet model names. * Fix to work with current stable release of controlnet_aux (v0.0.3). Turned off pre-processor params that were added post v0.0.3. Also change defaults for shuffle. * Refactored most of controlnet code into its own method to declutter TextToLatents.invoke(), and make upcoming integration with LatentsToLatents easier. * Cleaning up after ControlNet refactor in TextToLatentsInvocation * Extended node-based ControlNet support to LatentsToLatentsInvocation. * chore(ui): regen api client * fix(ui): fix node ui type hints * fix(nodes): controlnet input accepts list or single controlnet * Added Mediapipe image processor for use as ControlNet preprocessor. Also hacked in ability to specify HF subfolder when loading ControlNet models from string. * Fixed bug where MediapipFaceProcessorInvocation was ignoring max_faces and min_confidence params. * Added nodes for float params: ParamFloatInvocation and FloatCollectionOutput. Also added FloatOutput. * Added mediapipe install requirement. Should be able to remove once controlnet_aux package adds mediapipe to its requirements. * Added float to FIELD_TYPE_MAP in constants.ts * Progress toward improvement in fieldTemplateBuilder.ts getFieldType() * Fixed controlnet preprocessors and controlnet handling in TextToLatents to work with revised Image services. * Cleaning up from merge, re-adding cfg_scale to FIELD_TYPE_MAP * Making sure cfg_scale of type list[float] can be used in image metadata, to support param easing for cfg_scale * Fixed math for per-step param easing. * Added option to show plot of param value at each step * Just cleaning up after adding param easing plot option, removing vestigial code. * Modified control_weight ControlNet param to be polymorphic -- can now be either a single float weight applied for all steps, or a list of floats of size total_steps, that specifies weight for each step.
* Added more informative error message when _validat_edge() throws an error. * Just improving param easing bar chart title to include easing type. * Added requirement for easing-functions package * Taking out some diagnostic prints. * Added option to use both easing function and mirror of easing function together. * Fixed recently introduced problem (when pulled in main), triggered by num_steps in StepParamEasingInvocation not having a default value -- just added default. --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-06-11 16:27:44 +10:00
Sergey Borisov	2c056ead42	New models structure draft	2023-06-10 03:14:10 +03:00
blessedcoolant	7bce455d16	Merge branch 'main' into diffusers-upgrade	2023-06-09 16:27:52 +12:00

312 Commits