InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2025-07-25 21:05:37 +00:00

Author	SHA1	Message	Date
chainchompa	af159acbdf	cleanup	2024-08-14 08:58:38 -04:00
chainchompa	471719bbbe	add base prop for selectedWorkflow to allow loading a workflow on launch	2024-08-14 08:47:02 -04:00
psychedelicious	f6b8970bd1	fix(app): create reference to events task to prevent accidental GC This wasn't a problem, but it's advised in the official docs so I've done it.	2024-08-12 07:49:58 +10:00
psychedelicious	29325a7214	fix(app): use asyncio queue and existing event loop for events Around the time we (I) implemented pydantic events, I noticed a short pause between progress images every 4 or 5 steps when generating with SDXL. It didn't happen with SD1.5, but I did notice that with SD1.5, we'd get 4 or 5 progress events simultaneously. I'd expect one event every ~25ms, matching my it/s with SD1.5. Mysterious! Digging in, I found an issue is related to our use of a synchronous queue for events. When the event queue is empty, we must call `asyncio.sleep` before checking again. We were sleeping for 100ms. Said another way, every time we clear the event queue, we have to wait 100ms before another event can be dispatched, even if it is put on the queue immediately after we start waiting. In practice, this means our events get buffered into batches, dispatched once every 100ms. This explains why I was getting batches of 4 or 5 SD1.5 progress events at once, but not the intermittent SDXL delay. But this 100ms wait has another effect when the events are put on the queue in intervals that don't perfectly line up with the 100ms wait. This is most noticeable when the time between events is >100ms, and can add up to 100ms delay before the event is dispatched. For example, say the queue is empty and we start a 100ms wait. Then, immediately after - like 0.01ms later - we push an event on to the queue. We still need to wait another 99.9ms before that event will be dispatched. That's the SDXL delay. The easy fix is to reduce the sleep to something like 0.01 seconds, but this feels kinda dirty. Can't we just wait on the queue and dispatch every event immediately? Not with the normal synchronous queue - but we can with `asyncio.Queue`. I switched the events queue to use `asyncio.Queue` (as seen in this commit), which lets us asynchronous wait on the queue in a loop. Unfortunately, I ran into another issue - events now felt like their timing was inconsistent, but in a different way than with the 100ms sleep. The time between pushing events on the queue and dispatching them was not consistently ~0ms as I'd expect - it was highly variable from ~0ms up to ~100ms. This is resolved by passing the asyncio loop directly into the events service and using its methods to create the task and interact with the queue. I don't fully understand why this resolved the issue, because either way we are interacting with the same event loop (as shown by `asyncio.get_running_loop()`). I suppose there's some scheduling magic happening.	2024-08-12 07:49:58 +10:00
psychedelicious	8ecf72838d	fix(api): image downloads with correct filename Closes #6730	2024-08-10 09:53:56 -04:00
psychedelicious	c3ab8a6aa8	chore(ui): bump rest of deps	2024-08-10 07:45:23 -04:00
psychedelicious	1931aa3e70	chore(ui): typegen	2024-08-10 07:45:23 -04:00
psychedelicious	d3d8055055	feat(ui): update typegen script	2024-08-10 07:45:23 -04:00
psychedelicious	476b0a0403	chore(ui): bump openapi-typescript	2024-08-10 07:45:23 -04:00
psychedelicious	f66584713c	fix(api): sort OpenAPI schema properties for InvocationOutputMap This makes the schema output deterministic!	2024-08-10 07:45:23 -04:00
psychedelicious	33624fc2fa	fix(api): duplicate operation id for get_image_full There's a FastAPI bug that results in the OpenAPI spec outputting the same operation id for each operation when specifying multiple HTTP methods. - Discussion: https://github.com/tiangolo/fastapi/discussions/8449 - Pending PR to fix: https://github.com/tiangolo/fastapi/pull/10694 In our case, we have a `get_image_full` endpoint that handles GET and HEAD. This results in an invalid OpenAPI schema. A workaround is to use two route decorators for the operation handler. This works as expected - HEAD requests get the header, and GET requests get the resource. And the OpenAPI schema is valid.	2024-08-10 07:45:23 -04:00
Mary Hipp	09d1e190e7	show warning for maxUpscaleDimension if model tab is disabled	2024-08-09 14:07:55 -04:00
Sergey Borisov	17ff8196cb	Remove tmp code	2024-08-07 22:06:05 -04:00
Sergey Borisov	68f993998a	Add support for norm layer	2024-08-07 22:06:05 -04:00
Sergey Borisov	7da6120b39	Fix LoKR refactor bug	2024-08-07 22:06:05 -04:00
blessedcoolant	6cd40965c4	Depth Anything V2 (#6674 ) - Updated the previous DepthAnything manual implementation to use the `transformers` implementation instead. So we can get upstream features. - Plugged in the DepthAnything models to be handled by Invoke's Model Manager. - `small_v2` model will use DepthAnythingV2. This has been added as a new model option and is now also the default in the Linear UI. ![opera_TxRhmbFole](https://github.com/user-attachments/assets/2a25abe3-ba0b-4f97-b75a-2ce5fd6246e6) # Merge Review and merge.	2024-08-07 20:26:58 +05:30
Kent Keirsey	408a1d6dbb	Merge branch 'main' into depth_anything_v2	2024-08-07 10:45:56 -04:00
Hosted Weblate	140670d00e	translationBot(ui): update translation files Updated by "Cleanup translation files" hook in Weblate. translationBot(ui): update translation files Updated by "Cleanup translation files" hook in Weblate. Co-authored-by: Hosted Weblate <hosted@weblate.org> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
Phrixus2023	70233fae5d	translationBot(ui): update translation (Chinese (Simplified)) Currently translated at 98.1% (1296 of 1321 strings) Co-authored-by: Phrixus2023 <920414016@qq.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/zh_Hans/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
Alexander Eichhorn	6f457a6c4c	translationBot(ui): update translation (German) Currently translated at 65.1% (860 of 1321 strings) Co-authored-by: Alexander Eichhorn <pfannkuchensack@einfach-doof.de> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/de/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
B N	5c319f5356	translationBot(ui): update translation (German) Currently translated at 64.8% (857 of 1321 strings) Co-authored-by: B N <berndnieschalk@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/de/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
Riccardo Giovanetti	991a04f090	translationBot(ui): update translation (Italian) Currently translated at 98.6% (1303 of 1321 strings) translationBot(ui): update translation (Italian) Currently translated at 98.6% (1302 of 1320 strings) translationBot(ui): update translation (Italian) Currently translated at 98.6% (1294 of 1312 strings) Co-authored-by: Riccardo Giovanetti <riccardo.giovanetti@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/it/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
psychedelicious	c39fa75113	docs(ui): add comment in useIsTooLargeToUpscale	2024-08-06 11:49:35 +10:00
psychedelicious	f7863e17ce	docs(ui): add docstring for maxUpscaleDimension	2024-08-06 11:49:35 +10:00
psychedelicious	7c526390ed	fix(ui): compare upscaledPixels vs square of max dimension	2024-08-06 11:49:35 +10:00
Mary Hipp	2cff20f87a	update translations, change config value to be dimension instead of total pixels	2024-08-06 11:49:35 +10:00
Mary Hipp	90ec757802	lint	2024-08-06 11:49:35 +10:00
Mary Hipp	4b85dfcefe	(ui): restore optioanl limit on upcsale output resolution	2024-08-06 11:49:35 +10:00
Mary Hipp	21deefdc41	(ui): add image resolution badge to initial upscale image	2024-08-06 11:49:35 +10:00
psychedelicious	4d4f921a4e	build: exclude matplotlib 3.9.1 There was a problem w/ this release on windows and the builds were pulled from pypi. When installing invoke on windows, pip attempts to build from source, but most (all?) systems won't have the prerequisites for this and installs fail. This also affects GH actions. The simple fix is to exclude version 3.9.1 from our deps. For more information, see https://github.com/matplotlib/matplotlib/issues/28551	2024-08-05 08:38:44 +10:00
psychedelicious	98db8f395b	feat(app): clean up DiskImageStorage types	2024-08-04 09:43:20 +10:00
psychedelicious	f465a956a3	feat(ui): remove "images can be restored" messages	2024-08-04 09:43:20 +10:00
psychedelicious	9edb02d7ef	build: remove send2trash dependency	2024-08-04 09:43:20 +10:00
psychedelicious	6c4cf58a31	feat(app): delete model_images instead of using send2trash	2024-08-04 09:43:20 +10:00
psychedelicious	08993c0d29	feat(app): delete images instead of using send2trash Closes #6709	2024-08-04 09:43:20 +10:00
blessedcoolant	4f8a4b0f22	Merge branch 'main' into depth_anything_v2	2024-08-03 00:38:57 +05:30
blessedcoolant	a743f3c9b5	fix: implement model to func for depth anything	2024-08-03 00:37:17 +05:30
Mary Hipp	571ba87e13	fix(ui): include upscale metadata for SDXL multidiffusion	2024-08-01 21:30:42 -04:00
Ryan Dick	f27b6e2b44	Add Grounded SAM support (text prompt image segmentation) (#6701 ) ## Summary This PR enables Grounded SAM workflows (https://arxiv.org/pdf/2401.14159) via the following: - `GroundingDinoInvocation` for running a Grounding DINO model. - `SegmentAnythingModelInvocation` for running a SAM model. - `MaskTensorToImageInvocation` for convenient visualization. Other notes: - Uses the transformers implementation of Grounding DINO and SAM. - The new models are treated as 'utility models' meaning that they are not visible in the Models tab, and are downloaded automatically the first time that they are used. <img width="874" alt="image" src="https://github.com/user-attachments/assets/1cbaa97d-0e27-4943-86b1-dc7327ba8675"> ## Example Input image ![be10ec0c-20a8-4ac7-840e-d1a05fffdb6a](https://github.com/user-attachments/assets/bf21572c-635d-4703-b4ab-7aba658a9671) Prompt: "wheels", all other configs default Result: ![2221c44e-64e6-4b18-b4cb-610514b7a554](https://github.com/user-attachments/assets/344b91f4-7f4a-4b70-8e2e-3b4a0e55176d) ## Related Issues / Discussions Thanks to @blessedcoolant for the initial draft here: https://github.com/invoke-ai/InvokeAI/pull/6678 ## QA Instructions Manual tests: - [ ] Test that default settings work well. - [ ] Test with / without apply_polygon_refinement - [ ] Test mask_filter options - [ ] Test detection_threshold values - [ ] Test RGB input image - [ ] Test RGBA input image - [ ] Test grayscale input image - [ ] Smoke test that an empty mask is returned when 0 objects are detected - [ ] Test on CPU - [ ] Test on MPS (Works on Mac OS, but had to force both models to run on CPU instead of MPS) Performance: - Peak GPU memory utilization with both Grounding DINO and SAM models loaded is ~4.5GB. (The models do not need to be loaded at the same time, so could be offloaded by the MM if needed.) - On an RTX4090, with the models already cached, node execution takes ~0.6 secs. - On my CPU, with the models cached, node execution takes ~10secs. ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-08-01 20:40:18 +02:00
Ryan Dick	981475a624	Merge branch 'main' into ryan/grounded-sam	2024-08-01 20:30:35 +02:00
Ryan Dick	27ac61a4fb	Expose all model options in the GroundingDinoInvocation and the SegmentAnythingInvocation.	2024-08-01 14:23:32 -04:00
Ryan Dick	675ffc2757	Remove BoundingBoxInvocation field name overrides.	2024-08-01 14:05:44 -04:00
Ryan Dick	44b21f10f1	Add a pydantic model_validator to BoundingBoxField to check the validity of the coords.	2024-08-01 14:00:57 -04:00
Ryan Dick	c6d49e8b1f	Shorten SegmentAnythingInvocation and GroundingDinoInvocatino docstrings, since they are used as the invocation descriptions in the UI.	2024-08-01 10:17:42 -04:00
Ryan Dick	e6a512aa86	(minor) Tweak order of mask operations.	2024-08-01 10:12:24 -04:00
Ryan Dick	c3a6a6fb22	Rename SegmentAnythingModelInvocation -> SegmentAnythingInvocation.	2024-08-01 10:00:36 -04:00
Ryan Dick	b9dc3460ba	Rename SegmentAnythingModel -> SegmentAnythingPipeline.	2024-08-01 09:57:47 -04:00
Ryan Dick	63581ec980	(minor) Add None check to fix static type checking error.	2024-08-01 09:51:53 -04:00
chainchompa	08b1feeed7	add base prop for destination to direct users to different tabs on initial load (#6706 ) ## Summary - we want a way to load the studio while being directed to a specific tab, introduced a destination prop to achieve that <!--A description of the changes in this PR. Include the kind of change (fix, feature, docs, etc), the "why" and the "how". Screenshots or videos are useful for frontend changes.--> ## Related Issues / Discussions <!--WHEN APPLICABLE: List any related issues or discussions on github or discord. If this PR closes an issue, please use the "Closes #1234" format, so that the issue will be automatically closed when the PR merges.--> ## QA Instructions <!--WHEN APPLICABLE: Describe how you have tested the changes in this PR. Provide enough detail that a reviewer can reproduce your tests.--> ## Merge Plan <!--WHEN APPLICABLE: Large PRs, or PRs that touch sensitive things like DB schemas, may need some care when merging. For example, a careful rebase by the change author, timing to not interfere with a pending release, or a message to contributors on discord after merging.--> ## Checklist - [ ] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [ ] _Documentation added / updated (if applicable)_	2024-07-31 19:25:36 -04:00
blessedcoolant	f5cfdcf32d	feat: Add BoundingBox Primitive Node	2024-08-01 04:09:08 +05:30

1 2 3 4 5 ...

12400 Commits