InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
Mary Hipp	e05cc62e5f	add style presets API layer to UI	2024-08-05 13:37:07 -04:00
psychedelicious	4d4f921a4e	build: exclude matplotlib 3.9.1. There was a problem with this release on Windows, and the builds were pulled from PyPI. When installing Invoke on Windows, pip attempts to build from source, but most (all?) systems won't have the prerequisites for this, so installs fail. This also affects GH actions. The simple fix is to exclude version 3.9.1 from our deps. For more information, see https://github.com/matplotlib/matplotlib/issues/28551	2024-08-05 08:38:44 +10:00
psychedelicious	98db8f395b	feat(app): clean up DiskImageStorage types	2024-08-04 09:43:20 +10:00
psychedelicious	f465a956a3	feat(ui): remove "images can be restored" messages	2024-08-04 09:43:20 +10:00
psychedelicious	9edb02d7ef	build: remove send2trash dependency	2024-08-04 09:43:20 +10:00
psychedelicious	6c4cf58a31	feat(app): delete model_images instead of using send2trash	2024-08-04 09:43:20 +10:00
psychedelicious	08993c0d29	feat(app): delete images instead of using send2trash. Closes #6709	2024-08-04 09:43:20 +10:00
blessedcoolant	4f8a4b0f22	Merge branch 'main' into depth_anything_v2	2024-08-03 00:38:57 +05:30
blessedcoolant	a743f3c9b5	fix: implement model to func for depth anything	2024-08-03 00:37:17 +05:30
Mary Hipp	217fe40d99	feat(api): add style_presets router, make sure all CRUD is working, add is_default	2024-08-02 12:29:54 -04:00
Mary Hipp	b76bf50b93	feat(db,api): create new table for style presets, build out record storage service for style presets	2024-08-01 22:20:11 -04:00
Mary Hipp	571ba87e13	fix(ui): include upscale metadata for SDXL multidiffusion	2024-08-01 21:30:42 -04:00
Ryan Dick	f27b6e2b44	Add Grounded SAM support (text prompt image segmentation) (#6701) ## Summary This PR enables Grounded SAM workflows (https://arxiv.org/pdf/2401.14159) via the following: - `GroundingDinoInvocation` for running a Grounding DINO model. - `SegmentAnythingModelInvocation` for running a SAM model. - `MaskTensorToImageInvocation` for convenient visualization. Other notes: - Uses the transformers implementation of Grounding DINO and SAM. - The new models are treated as 'utility models' meaning that they are not visible in the Models tab, and are downloaded automatically the first time that they are used. <img width="874" alt="image" src="https://github.com/user-attachments/assets/1cbaa97d-0e27-4943-86b1-dc7327ba8675"> ## Example Input image ![be10ec0c-20a8-4ac7-840e-d1a05fffdb6a](https://github.com/user-attachments/assets/bf21572c-635d-4703-b4ab-7aba658a9671) Prompt: "wheels", all other configs default Result: ![2221c44e-64e6-4b18-b4cb-610514b7a554](https://github.com/user-attachments/assets/344b91f4-7f4a-4b70-8e2e-3b4a0e55176d) ## Related Issues / Discussions Thanks to @blessedcoolant for the initial draft here: https://github.com/invoke-ai/InvokeAI/pull/6678 ## QA Instructions Manual tests: - [ ] Test that default settings work well. - [ ] Test with / without apply_polygon_refinement - [ ] Test mask_filter options - [ ] Test detection_threshold values - [ ] Test RGB input image - [ ] Test RGBA input image - [ ] Test grayscale input image - [ ] Smoke test that an empty mask is returned when 0 objects are detected - [ ] Test on CPU - [ ] Test on MPS (Works on Mac OS, but had to force both models to run on CPU instead of MPS) Performance: - Peak GPU memory utilization with both Grounding DINO and SAM models loaded is ~4.5GB. (The models do not need to be loaded at the same time, so could be offloaded by the MM if needed.) - On an RTX4090, with the models already cached, node execution takes ~0.6 secs. - On my CPU, with the models cached, node execution takes ~10 secs. ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-08-01 20:40:18 +02:00
Ryan Dick	981475a624	Merge branch 'main' into ryan/grounded-sam	2024-08-01 20:30:35 +02:00
Ryan Dick	27ac61a4fb	Expose all model options in the GroundingDinoInvocation and the SegmentAnythingInvocation.	2024-08-01 14:23:32 -04:00
Ryan Dick	675ffc2757	Remove BoundingBoxInvocation field name overrides.	2024-08-01 14:05:44 -04:00
Ryan Dick	44b21f10f1	Add a pydantic model_validator to BoundingBoxField to check the validity of the coords.	2024-08-01 14:00:57 -04:00
Ryan Dick	c6d49e8b1f	Shorten SegmentAnythingInvocation and GroundingDinoInvocation docstrings, since they are used as the invocation descriptions in the UI.	2024-08-01 10:17:42 -04:00
Ryan Dick	e6a512aa86	(minor) Tweak order of mask operations.	2024-08-01 10:12:24 -04:00
Ryan Dick	c3a6a6fb22	Rename SegmentAnythingModelInvocation -> SegmentAnythingInvocation.	2024-08-01 10:00:36 -04:00
Ryan Dick	b9dc3460ba	Rename SegmentAnythingModel -> SegmentAnythingPipeline.	2024-08-01 09:57:47 -04:00
Ryan Dick	63581ec980	(minor) Add None check to fix static type checking error.	2024-08-01 09:51:53 -04:00
chainchompa	08b1feeed7	add base prop for destination to direct users to different tabs on initial load (#6706) ## Summary - we want a way to load the studio directly into a specific tab, so this introduces a destination prop to achieve that ## Related Issues / Discussions ## QA Instructions ## Merge Plan ## Checklist - [ ] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 19:25:36 -04:00
blessedcoolant	f5cfdcf32d	feat: Add BoundingBox Primitive Node	2024-08-01 04:09:08 +05:30
chainchompa	e78fb428f0	simplify destination prop handling	2024-07-31 18:06:22 -04:00
chainchompa	31e270e32c	add base prop for destination to direct users to different tabs	2024-07-31 17:20:51 -04:00
Ryan Dick	b5832768dc	Return a MaskOutput from SegmentAnythingModelInvocation. And add a MaskTensorToImageInvocation.	2024-07-31 17:16:14 -04:00
Ryan Dick	4ce64b69cb	Modular backend - LoRA/LyCORIS (#6667) ## Summary Code for LoRA patching from #6577. Additionally, a LoRA can now patch not only `weight` but also `bias`, because some LoRAs do this. ## Related Issues / Discussions #6606 https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d ## QA Instructions Run with and without the `USE_MODULAR_DENOISE` environment variable set. ## Merge Plan Replace the old LoRA patcher with the new one after review is done. If you think there should be some kind of tests, feel free to add them. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 21:31:31 +02:00
Ryan Dick	5a9173f766	Merge branch 'main' into stalker-modular_lora	2024-07-31 15:13:22 -04:00
Ryan Dick	0bb7ed44f6	Add some docs to OriginalWeightsStorage and fix type hints.	2024-07-31 15:08:24 -04:00
blessedcoolant	332bc9da5b	fix: Update depth anything node default to v2	2024-07-31 23:52:29 +05:30
blessedcoolant	08def3da95	fix: Update canvas depth anything processor default to v2	2024-07-31 23:50:13 +05:30
blessedcoolant	daf899f9c4	fix: Move the manual image resizing out of the depth anything pipeline	2024-07-31 23:38:12 +05:30
blessedcoolant	13fb2d1f49	fix: Add Depth Anything V2 as a new option. It is also now the default in the UI, replacing Depth Anything V1 small.	2024-07-31 23:29:43 +05:30
blessedcoolant	95dde802ea	fix: assert the return depth map to be a PIL image	2024-07-31 23:22:01 +05:30
Ryan Dick	fca119773b	Split invokeai/backend/image_util/segment_anything/ dir into grounding_dino/ and segment_anything/	2024-07-31 12:28:47 -04:00
Ryan Dick	0193267a53	Split GroundedSamInvocation into GroundingDinoInvocation and SegmentAnythingModelInvocation.	2024-07-31 12:20:23 -04:00
blessedcoolant	b4cf78a95d	fix: make DA Pipeline a subclass of RawModel	2024-07-31 21:14:49 +05:30
Ryan Dick	73386826d6	Make GroundingDinoPipeline and SegmentAnythingModel subclasses of RawModel for type checking purposes.	2024-07-31 10:25:34 -04:00
Ryan Dick	9f448fecb7	Move invokeai/backend/grounded_sam -> invokeai/backend/image_util/grounded_sam	2024-07-31 10:00:30 -04:00
Ryan Dick	bcd1483a14	Re-order GroundedSAMInvocation._to_numpy_masks(...) to do slightly more work on the GPU.	2024-07-31 09:51:14 -04:00
Ryan Dick	e206890e25	Use staticmethods rather than inner functions for the Grounding DINO and SAM model loaders.	2024-07-31 09:28:52 -04:00
Ryan Dick	0a7048f650	(minor) Simplify GroundedSAMInvocation._merge_masks(...).	2024-07-31 08:58:51 -04:00
Ryan Dick	e8ecf5e155	(minor) Move apply_polygon_refinement condition up a layer.	2024-07-31 08:50:56 -04:00
Ryan Dick	33e8604b57	Make Grounding DINO DetectionResult a Pydantic model.	2024-07-31 08:47:00 -04:00
Ryan Dick	cec7399366	(minor) Use a new variable name to satisfy type checks.	2024-07-31 08:27:01 -04:00
Ryan Dick	bdae81e429	(minor) Simplify GroundedSAMInvocation._filter_detections()	2024-07-31 08:25:19 -04:00
Ryan Dick	67c32f3d6c	Fix typo: zip(..., strict=True)	2024-07-31 08:15:28 -04:00
blessedcoolant	94d64b8a78	Fix gradient mask values range (#6688) ## Summary The gradient mask node outputs a mask tensor with values in the range [-1, 1], which is an unexpected range for a mask. The denoise node handles it in a way that translates it to a [0, 2] mask, which looks even more wrong. From discussion with @dunkeroni, I understand that he expected negative values to be treated the same as 0, so clamping the values does not change the intended node logic. ## Related Issues / Discussions #6643 ## QA Instructions - ## Merge Plan - ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 06:37:32 +05:30
blessedcoolant	fa3c0c81b3	Merge branch 'main' into stalker7779/fix_gradient_mask	2024-07-31 06:30:44 +05:30
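
The value-range problem described in 94d64b8a78 can be illustrated with a minimal sketch. It assumes, purely for illustration, that the downstream step inverts the mask as `1 - v`; the commit message does not say how the [0, 2] range arises. The function names and the list-of-floats representation are hypothetical and are not taken from the InvokeAI code, which operates on tensors.

```python
def clamp_mask(values, low=0.0, high=1.0):
    """Clamp every mask value into the closed interval [low, high]."""
    return [min(max(v, low), high) for v in values]


def invert_mask(values):
    """Hypothetical downstream step (an assumption): invert the mask as 1 - v."""
    return [1.0 - v for v in values]


# A mask whose values span [-1, 1], as described in the commit message.
raw_mask = [-1.0, -0.5, 0.0, 0.5, 1.0]

# Without clamping, the inverted mask spans [0, 2].
unclamped = invert_mask(raw_mask)

# Clamping first treats negative values the same as 0,
# so the inverted mask stays within [0, 1].
clamped = invert_mask(clamp_mask(raw_mask))

print(unclamped)
print(clamped)
```

Clamping before any further mask arithmetic keeps the rest of the pipeline working with the conventional [0, 1] range, which matches the intent stated in the commit message that negative values behave like 0.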


12574 Commits