InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
psychedelicious	f465a956a3	feat(ui): remove "images can be restored" messages	2024-08-04 09:43:20 +10:00
psychedelicious	9edb02d7ef	build: remove send2trash dependency	2024-08-04 09:43:20 +10:00
psychedelicious	6c4cf58a31	feat(app): delete model_images instead of using send2trash	2024-08-04 09:43:20 +10:00
psychedelicious	08993c0d29	feat(app): delete images instead of using send2trash Closes #6709	2024-08-04 09:43:20 +10:00
Mary Hipp	571ba87e13	fix(ui): include upscale metadata for SDXL multidiffusion	2024-08-01 21:30:42 -04:00
Ryan Dick	f27b6e2b44	Add Grounded SAM support (text prompt image segmentation) (#6701) ## Summary This PR enables Grounded SAM workflows (https://arxiv.org/pdf/2401.14159) via the following: - `GroundingDinoInvocation` for running a Grounding DINO model. - `SegmentAnythingModelInvocation` for running a SAM model. - `MaskTensorToImageInvocation` for convenient visualization. Other notes: - Uses the transformers implementation of Grounding DINO and SAM. - The new models are treated as 'utility models' meaning that they are not visible in the Models tab, and are downloaded automatically the first time that they are used. <img width="874" alt="image" src="https://github.com/user-attachments/assets/1cbaa97d-0e27-4943-86b1-dc7327ba8675"> ## Example Input image ![be10ec0c-20a8-4ac7-840e-d1a05fffdb6a](https://github.com/user-attachments/assets/bf21572c-635d-4703-b4ab-7aba658a9671) Prompt: "wheels", all other configs default Result: ![2221c44e-64e6-4b18-b4cb-610514b7a554](https://github.com/user-attachments/assets/344b91f4-7f4a-4b70-8e2e-3b4a0e55176d) ## Related Issues / Discussions Thanks to @blessedcoolant for the initial draft here: https://github.com/invoke-ai/InvokeAI/pull/6678 ## QA Instructions Manual tests: - [ ] Test that default settings work well. - [ ] Test with / without apply_polygon_refinement - [ ] Test mask_filter options - [ ] Test detection_threshold values - [ ] Test RGB input image - [ ] Test RGBA input image - [ ] Test grayscale input image - [ ] Smoke test that an empty mask is returned when 0 objects are detected - [ ] Test on CPU - [ ] Test on MPS (Works on Mac OS, but had to force both models to run on CPU instead of MPS) Performance: - Peak GPU memory utilization with both Grounding DINO and SAM models loaded is ~4.5GB. (The models do not need to be loaded at the same time, so could be offloaded by the MM if needed.) - On an RTX4090, with the models already cached, node execution takes ~0.6 secs. - On my CPU, with the models cached, node execution takes ~10 secs. ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-08-01 20:40:18 +02:00
Ryan Dick	981475a624	Merge branch 'main' into ryan/grounded-sam	2024-08-01 20:30:35 +02:00
Ryan Dick	27ac61a4fb	Expose all model options in the GroundingDinoInvocation and the SegmentAnythingInvocation.	2024-08-01 14:23:32 -04:00
Ryan Dick	675ffc2757	Remove BoundingBoxInvocation field name overrides.	2024-08-01 14:05:44 -04:00
Ryan Dick	44b21f10f1	Add a pydantic model_validator to BoundingBoxField to check the validity of the coords.	2024-08-01 14:00:57 -04:00
Ryan Dick	c6d49e8b1f	Shorten SegmentAnythingInvocation and GroundingDinoInvocation docstrings, since they are used as the invocation descriptions in the UI.	2024-08-01 10:17:42 -04:00
Ryan Dick	e6a512aa86	(minor) Tweak order of mask operations.	2024-08-01 10:12:24 -04:00
Ryan Dick	c3a6a6fb22	Rename SegmentAnythingModelInvocation -> SegmentAnythingInvocation.	2024-08-01 10:00:36 -04:00
Ryan Dick	b9dc3460ba	Rename SegmentAnythingModel -> SegmentAnythingPipeline.	2024-08-01 09:57:47 -04:00
Ryan Dick	63581ec980	(minor) Add None check to fix static type checking error.	2024-08-01 09:51:53 -04:00
chainchompa	08b1feeed7	add base prop for destination to direct users to different tabs on initial load (#6706) ## Summary - We want a way to load the studio while being directed to a specific tab; this PR introduces a destination prop to achieve that. ## Related Issues / Discussions ## QA Instructions ## Merge Plan ## Checklist - [ ] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 19:25:36 -04:00
blessedcoolant	f5cfdcf32d	feat: Add BoundingBox Primitive Node	2024-08-01 04:09:08 +05:30
chainchompa	e78fb428f0	simplify destination prop handling	2024-07-31 18:06:22 -04:00
chainchompa	31e270e32c	add base prop for destination to direct users to different tabs	2024-07-31 17:20:51 -04:00
Ryan Dick	b5832768dc	Return a MaskOutput from SegmentAnythingModelInvocation. And add a MaskTensorToImageInvocation.	2024-07-31 17:16:14 -04:00
Ryan Dick	4ce64b69cb	Modular backend - LoRA/LyCORIS (#6667) ## Summary Code for LoRA patching from #6577. Additionally, a LoRA can now patch not only `weight` but also `bias`, because some LoRAs do this. ## Related Issues / Discussions #6606 https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d ## QA Instructions Run with and without the `USE_MODULAR_DENOISE` environment variable set. ## Merge Plan Replace the old LoRA patcher with the new one after review is done. If you think there should be some kind of tests, feel free to add them. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 21:31:31 +02:00
Ryan Dick	5a9173f766	Merge branch 'main' into stalker-modular_lora	2024-07-31 15:13:22 -04:00
Ryan Dick	0bb7ed44f6	Add some docs to OriginalWeightsStorage and fix type hints.	2024-07-31 15:08:24 -04:00
Ryan Dick	fca119773b	Split invokeai/backend/image_util/segment_anything/ dir into grounding_dino/ and segment_anything/	2024-07-31 12:28:47 -04:00
Ryan Dick	0193267a53	Split GroundedSamInvocation into GroundingDinoInvocation and SegmentAnythingModelInvocation.	2024-07-31 12:20:23 -04:00
Ryan Dick	73386826d6	Make GroundingDinoPipeline and SegmentAnythingModel subclasses of RawModel for type checking purposes.	2024-07-31 10:25:34 -04:00
Ryan Dick	9f448fecb7	Move invokeai/backend/grounded_sam -> invokeai/backend/image_util/grounded_sam	2024-07-31 10:00:30 -04:00
Ryan Dick	bcd1483a14	Re-order GroundedSAMInvocation._to_numpy_masks(...) to do slightly more work on the GPU.	2024-07-31 09:51:14 -04:00
Ryan Dick	e206890e25	Use staticmethods rather than inner functions for the Grounding DINO and SAM model loaders.	2024-07-31 09:28:52 -04:00
Ryan Dick	0a7048f650	(minor) Simplify GroundedSAMInvocation._merge_masks(...).	2024-07-31 08:58:51 -04:00
Ryan Dick	e8ecf5e155	(minor) Move apply_polygon_refinement condition up a layer.	2024-07-31 08:50:56 -04:00
Ryan Dick	33e8604b57	Make Grounding DINO DetectionResult a Pydantic model.	2024-07-31 08:47:00 -04:00
Ryan Dick	cec7399366	(minor) Use a new variable name to satisfy type checks.	2024-07-31 08:27:01 -04:00
Ryan Dick	bdae81e429	(minor) Simplify GroundedSAMInvocation._filter_detections()	2024-07-31 08:25:19 -04:00
Ryan Dick	67c32f3d6c	Fix typo: zip(..., strict=True)	2024-07-31 08:15:28 -04:00
blessedcoolant	94d64b8a78	Fix gradient mask values range (#6688) ## Summary The gradient mask node outputs a mask tensor with values in the range [-1, 1], which is an unexpected range for a mask. The denoise node handles this by translating it to a [0, 2] mask, which looks even more wrong. From discussion with @dunkeroni, I understand that he expected negative values to be treated the same as 0, so clamping the values does not change the intended node logic. ## Related Issues / Discussions #6643 ## QA Instructions \- ## Merge Plan \- ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 06:37:32 +05:30
blessedcoolant	fa3c0c81b3	Merge branch 'main' into stalker7779/fix_gradient_mask	2024-07-31 06:30:44 +05:30
blessedcoolant	66547b99c1	Add more karras schedulers (#6695) ## Summary Add karras variants of the `deis`, `unipc`, `kdpm2` and `kdpm_2_a` schedulers. Also added `dpmpp_3` schedulers, but `dpmpp_3s` is currently bugged, so only 3m was added: https://github.com/huggingface/diffusers/issues/9007 ## Related Issues / Discussions \- ## QA Instructions \- ## Merge Plan ~@psychedelicious We need to decide what to do with schedulers order, as it looks a bit broken:~ ![image](https://github.com/user-attachments/assets/e41674af-d87c-4432-8014-c90bd86965a6) ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 06:09:26 +05:30
blessedcoolant	328e58be4c	Merge branch 'main' into stalker7779/new_karras_schedulers	2024-07-31 05:56:13 +05:30
Ryan Dick	5701c79fab	Prevent Grounding DINO and Segment Anything from being moved to MPS - they don't work on MPS devices.	2024-07-30 23:04:15 +02:00
Ryan Dick	2da9f913f3	Add detection_result.py - was forgotten in a prior commit	2024-07-30 16:04:29 -04:00
Ryan Dick	6b10b59abe	Make GroundedSAMInvocation work with any input image mode (RGB, RGBA, grayscale).	2024-07-30 15:55:57 -04:00
Ryan Dick	918f77bce0	Move some logic from GroundedSAMInvocation to the backend classes.	2024-07-30 15:34:33 -04:00
Ryan Dick	aca2a2fa13	Add mask_filter and detection_threshold options to the GroundedSAMInvocation.	2024-07-30 14:22:40 -04:00
Ryan Dick	ff6398f7d8	Add a GroundedSamInvocation for image segmentation from a text prompt (Grounding DINO + Segment Anything Model).	2024-07-30 11:12:26 -04:00
Sergey Borisov	cf996472b9	Suggested changes Co-Authored-By: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2024-07-30 04:50:56 +03:00
Sergey Borisov	156d14c349	Run api regen	2024-07-30 04:05:21 +03:00
Sergey Borisov	86f705bf48	Optimize weights handling	2024-07-30 03:39:01 +03:00
Sergey Borisov	1fd9631f2d	Comments fix Co-Authored-By: Ryan Dick <14897797+RyanJDick@users.noreply.github.com>	2024-07-30 00:39:50 +03:00
Sergey Borisov	2227a2357f	Suggested changes + simplify weights logic in patching Co-Authored-By: Ryan Dick <14897797+RyanJDick@users.noreply.github.com>	2024-07-30 00:34:37 +03:00

12355 Commits