InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
Mary Hipp	e381e021e9	knip lint	2024-08-08 14:00:17 -04:00
Mary Hipp	641af64f93	regnerate schema	2024-08-08 13:58:25 -04:00
Mary Hipp	a7b83c8b5b	Merge remote-tracking branch 'origin/main' into maryhipp/style-presets	2024-08-08 13:56:59 -04:00
Mary Hipp	4cc41e0188	translations and lint fix	2024-08-08 13:56:37 -04:00
Mary Hipp	442fc02429	resize images to 100x100 for style preset images	2024-08-08 12:56:55 -04:00
Mary Hipp	9a4d075074	fix path for style_preset_images, fix png type when converting blobs to files, built view mode components	2024-08-08 12:31:20 -04:00
Sergey Borisov	17ff8196cb	Remove tmp code	2024-08-07 22:06:05 -04:00
Sergey Borisov	68f993998a	Add support for norm layer	2024-08-07 22:06:05 -04:00
Sergey Borisov	7da6120b39	Fix LoKR refactor bug	2024-08-07 22:06:05 -04:00
blessedcoolant	6cd40965c4	Depth Anything V2 (#6674 ) - Updated the previous DepthAnything manual implementation to use the `transformers` implementation instead. So we can get upstream features. - Plugged in the DepthAnything models to be handled by Invoke's Model Manager. - `small_v2` model will use DepthAnythingV2. This has been added as a new model option and is now also the default in the Linear UI. ![opera_TxRhmbFole](https://github.com/user-attachments/assets/2a25abe3-ba0b-4f97-b75a-2ce5fd6246e6) # Merge Review and merge.	2024-08-07 20:26:58 +05:30
Kent Keirsey	408a1d6dbb	Merge branch 'main' into depth_anything_v2	2024-08-07 10:45:56 -04:00
Mary Hipp	0b0abfbe8f	clean up image implementation	2024-08-07 10:36:38 -04:00
Mary Hipp	cc96dcf0ed	style preset images	2024-08-07 09:58:27 -04:00
Mary Hipp	2604fd9fde	a whole bunch of stuff	2024-08-06 15:31:13 -04:00
Hosted Weblate	140670d00e	translationBot(ui): update translation files Updated by "Cleanup translation files" hook in Weblate. translationBot(ui): update translation files Updated by "Cleanup translation files" hook in Weblate. Co-authored-by: Hosted Weblate <hosted@weblate.org> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
Phrixus2023	70233fae5d	translationBot(ui): update translation (Chinese (Simplified)) Currently translated at 98.1% (1296 of 1321 strings) Co-authored-by: Phrixus2023 <920414016@qq.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/zh_Hans/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
Alexander Eichhorn	6f457a6c4c	translationBot(ui): update translation (German) Currently translated at 65.1% (860 of 1321 strings) Co-authored-by: Alexander Eichhorn <pfannkuchensack@einfach-doof.de> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/de/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
B N	5c319f5356	translationBot(ui): update translation (German) Currently translated at 64.8% (857 of 1321 strings) Co-authored-by: B N <berndnieschalk@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/de/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
Riccardo Giovanetti	991a04f090	translationBot(ui): update translation (Italian) Currently translated at 98.6% (1303 of 1321 strings) translationBot(ui): update translation (Italian) Currently translated at 98.6% (1302 of 1320 strings) translationBot(ui): update translation (Italian) Currently translated at 98.6% (1294 of 1312 strings) Co-authored-by: Riccardo Giovanetti <riccardo.giovanetti@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/it/ Translation: InvokeAI/Web UI	2024-08-06 17:54:47 +10:00
psychedelicious	c39fa75113	docs(ui): add comment in useIsTooLargeToUpscale	2024-08-06 11:49:35 +10:00
psychedelicious	f7863e17ce	docs(ui): add docstring for maxUpscaleDimension	2024-08-06 11:49:35 +10:00
psychedelicious	7c526390ed	fix(ui): compare upscaledPixels vs square of max dimension	2024-08-06 11:49:35 +10:00
Mary Hipp	2cff20f87a	update translations, change config value to be dimension instead of total pixels	2024-08-06 11:49:35 +10:00
Mary Hipp	90ec757802	lint	2024-08-06 11:49:35 +10:00
Mary Hipp	4b85dfcefe	(ui): restore optioanl limit on upcsale output resolution	2024-08-06 11:49:35 +10:00
Mary Hipp	21deefdc41	(ui): add image resolution badge to initial upscale image	2024-08-06 11:49:35 +10:00
Mary Hipp	857d74bbfe	wip apply and calculate prompt with interpolation	2024-08-05 19:11:48 -04:00
Mary Hipp	fd7a635777	(ui) the most basic crud ui: view list of presets, create a new preset, edit/delete existing presets	2024-08-05 15:48:23 -04:00
Mary Hipp	af9110e964	fix prompt concat logic	2024-08-05 13:42:28 -04:00
Mary Hipp	a61209206b	remove custom SDXL prompts component	2024-08-05 13:40:46 -04:00
Mary Hipp	e05cc62e5f	add style presets API layer to UI	2024-08-05 13:37:07 -04:00
psychedelicious	4d4f921a4e	build: exclude matplotlib 3.9.1 There was a problem w/ this release on windows and the builds were pulled from pypi. When installing invoke on windows, pip attempts to build from source, but most (all?) systems won't have the prerequisites for this and installs fail. This also affects GH actions. The simple fix is to exclude version 3.9.1 from our deps. For more information, see https://github.com/matplotlib/matplotlib/issues/28551	2024-08-05 08:38:44 +10:00
psychedelicious	98db8f395b	feat(app): clean up DiskImageStorage types	2024-08-04 09:43:20 +10:00
psychedelicious	f465a956a3	feat(ui): remove "images can be restored" messages	2024-08-04 09:43:20 +10:00
psychedelicious	9edb02d7ef	build: remove send2trash dependency	2024-08-04 09:43:20 +10:00
psychedelicious	6c4cf58a31	feat(app): delete model_images instead of using send2trash	2024-08-04 09:43:20 +10:00
psychedelicious	08993c0d29	feat(app): delete images instead of using send2trash Closes #6709	2024-08-04 09:43:20 +10:00
blessedcoolant	4f8a4b0f22	Merge branch 'main' into depth_anything_v2	2024-08-03 00:38:57 +05:30
blessedcoolant	a743f3c9b5	fix: implement model to func for depth anything	2024-08-03 00:37:17 +05:30
Mary Hipp	217fe40d99	feat(api): add style_presets router, make sure all CRUD is working, add is_default	2024-08-02 12:29:54 -04:00
Mary Hipp	b76bf50b93	feat(db,api): create new table for style presets, build out record storage service for style presets	2024-08-01 22:20:11 -04:00
Mary Hipp	571ba87e13	fix(ui): include upscale metadata for SDXL multidiffusion	2024-08-01 21:30:42 -04:00
Ryan Dick	f27b6e2b44	Add Grounded SAM support (text prompt image segmentation) (#6701 ) ## Summary This PR enables Grounded SAM workflows (https://arxiv.org/pdf/2401.14159) via the following: - `GroundingDinoInvocation` for running a Grounding DINO model. - `SegmentAnythingModelInvocation` for running a SAM model. - `MaskTensorToImageInvocation` for convenient visualization. Other notes: - Uses the transformers implementation of Grounding DINO and SAM. - The new models are treated as 'utility models' meaning that they are not visible in the Models tab, and are downloaded automatically the first time that they are used. <img width="874" alt="image" src="https://github.com/user-attachments/assets/1cbaa97d-0e27-4943-86b1-dc7327ba8675"> ## Example Input image ![be10ec0c-20a8-4ac7-840e-d1a05fffdb6a](https://github.com/user-attachments/assets/bf21572c-635d-4703-b4ab-7aba658a9671) Prompt: "wheels", all other configs default Result: ![2221c44e-64e6-4b18-b4cb-610514b7a554](https://github.com/user-attachments/assets/344b91f4-7f4a-4b70-8e2e-3b4a0e55176d) ## Related Issues / Discussions Thanks to @blessedcoolant for the initial draft here: https://github.com/invoke-ai/InvokeAI/pull/6678 ## QA Instructions Manual tests: - [ ] Test that default settings work well. - [ ] Test with / without apply_polygon_refinement - [ ] Test mask_filter options - [ ] Test detection_threshold values - [ ] Test RGB input image - [ ] Test RGBA input image - [ ] Test grayscale input image - [ ] Smoke test that an empty mask is returned when 0 objects are detected - [ ] Test on CPU - [ ] Test on MPS (Works on Mac OS, but had to force both models to run on CPU instead of MPS) Performance: - Peak GPU memory utilization with both Grounding DINO and SAM models loaded is ~4.5GB. (The models do not need to be loaded at the same time, so could be offloaded by the MM if needed.) - On an RTX4090, with the models already cached, node execution takes ~0.6 secs. - On my CPU, with the models cached, node execution takes ~10secs. ## Merge Plan No special instructions. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [ ] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_	2024-08-01 20:40:18 +02:00
Ryan Dick	981475a624	Merge branch 'main' into ryan/grounded-sam	2024-08-01 20:30:35 +02:00
Ryan Dick	27ac61a4fb	Expose all model options in the GroundingDinoInvocation and the SegmentAnythingInvocation.	2024-08-01 14:23:32 -04:00
Ryan Dick	675ffc2757	Remove BoundingBoxInvocation field name overrides.	2024-08-01 14:05:44 -04:00
Ryan Dick	44b21f10f1	Add a pydantic model_validator to BoundingBoxField to check the validity of the coords.	2024-08-01 14:00:57 -04:00
Ryan Dick	c6d49e8b1f	Shorten SegmentAnythingInvocation and GroundingDinoInvocatino docstrings, since they are used as the invocation descriptions in the UI.	2024-08-01 10:17:42 -04:00
Ryan Dick	e6a512aa86	(minor) Tweak order of mask operations.	2024-08-01 10:12:24 -04:00
Ryan Dick	c3a6a6fb22	Rename SegmentAnythingModelInvocation -> SegmentAnythingInvocation.	2024-08-01 10:00:36 -04:00

... 3 4 5 6 7 ...

12604 Commits