## What type of PR is this? (check all applicable)
- [ ] Refactor
- [ ] Feature
- [ ] Bug Fix
- [x] Optimization
- [ ] Documentation Update
- [ ] Community Node Submission
## Description
@Millu pointed out this safetensors PR a few weeks ago, which claimed to
offer a performance benefit:
https://github.com/huggingface/safetensors/pull/362 . It was superseded
by https://github.com/huggingface/safetensors/pull/363 and included in
the latest [safetensors 0.4.0
release](https://github.com/huggingface/safetensors/releases/tag/v0.4.0).
Here are the results from my local performance comparison:
```
Before(0.3.1) / After(0.4.0)
sdxl:main:tokenizer from disk to cpu in 0.46s / 0.46s
sdxl:main:text_encoder from disk to cpu in 2.12s / 2.32s
embroidered_style_v1_sdxl.safetensors:sdxl:lora from disk to cpu in 0.67s / 0.36s
VoxelXL_v1.safetensors:sdxl:lora from disk to cpu in 1.64s / 0.60s
ryan_db_sdxl_epoch640.safetensors:sdxl:lora from disk to cpu in 2.46s / 1.40s
sdxl:main:tokenizer_2 from disk to cpu in 0.37s / 0.39s
sdxl:main:text_encoder_2 from disk to cpu in 3.78s / 4.70s
sdxl:main:unet from disk to cpu in 4.66s / 3.08s
sdxl:main:scheduler from disk to cpu in 0.34s / 0.33s
sdxl:main:vae from disk to cpu in 0.66s / 0.51s
TOTAL GRAPH EXECUTION TIME: 56.489s / 53.416s
```
The benefit was marginal on my system (maybe even within measurement
error), but I figured we might as well pull it.
Add support for FreeU. See:
- https://huggingface.co/docs/diffusers/main/en/using-diffusers/freeu
- https://github.com/ChenyangSi/FreeU
Implementation:
- `ModelPatcher.apply_freeu()` handles enabling FreeU (which is very simple with diffusers; see the sketch below).
- `FreeUConfig` model added to hold the hyperparameters.
- `freeu_config` added as an optional sub-field on `UNetField`.
- `FreeUInvocation` added; it works like LoRA - chain it to add the FreeU config to the UNet.
- No support for model-dependent presets; this will be a future workflow editor enhancement.
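For illustration, a minimal sketch of what `apply_freeu()` amounts to with diffusers (the context-manager shape and the `FreeUConfig` field names `b1`/`b2`/`s1`/`s2` are assumptions here, not a copy of the actual implementation):
```python
from contextlib import contextmanager

@contextmanager
def apply_freeu(unet, freeu_config=None):
    """Enable FreeU on a diffusers UNet2DConditionModel for the duration of
    the context, then turn it back off."""
    applied = False
    try:
        if freeu_config is not None:
            # diffusers exposes enable_freeu()/disable_freeu() on the UNet.
            unet.enable_freeu(
                b1=freeu_config.b1,
                b2=freeu_config.b2,
                s1=freeu_config.s1,
                s2=freeu_config.s2,
            )
            applied = True
        yield
    finally:
        if applied:
            unet.disable_freeu()
```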
Closes #4845
## What type of PR is this? (check all applicable)
- [ ] Refactor
- [ ] Feature
- [ ] Bug Fix
- [x] Optimization
- [ ] Documentation Update
- [ ] Community Node Submission
## Have you updated all relevant documentation?
- [x] Yes
- [ ] No
## Description
This PR reduces the time required to load models from disk.
In my local testing, SDXL text_encoder_2 models saw the greatest
improvement:
- Before change, load time (disk to cpu): 14 secs
- After change, load time (disk to cpu): 4 secs
See the in-code documentation for an explanation of how this speedup is
achieved.
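For reviewers who want the gist without opening the diff: the approach is roughly the sketch below, a context manager that skips torch's default weight initialization for layers whose weights are about to be overwritten by checkpoint values anyway (the helper name and the exact set of patched layer types are assumptions; the in-code documentation is authoritative).
```python
from contextlib import contextmanager

import torch

@contextmanager
def skip_torch_weight_init():
    layer_types = [torch.nn.Linear, torch.nn.Conv1d, torch.nn.Conv2d, torch.nn.Embedding]
    saved = [t.reset_parameters for t in layer_types]
    try:
        for t in layer_types:
            # Replace the default initializer with a no-op; the real weights
            # are loaded from the checkpoint immediately afterwards.
            t.reset_parameters = lambda self: None
        yield
    finally:
        for t, fn in zip(layer_types, saved):
            t.reset_parameters = fn
```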
## Related Tickets & Documents
This change was previously proposed on the HF transformers repo, but did
not get any traction:
https://github.com/huggingface/transformers/issues/18505#issue-1330728188
## QA Instructions, Screenshots, Recordings
I don't expect any adverse effects, but the new context manager is
applied while loading **all** models, so it would make sense to exercise
everything.
## Added/updated tests?
- [x] Yes
- [ ] No
The canvas needs to be set to staging mode as soon as a canvas-destined batch is enqueued. If the batch is fully canceled before an image is generated, we need to remove that batch from the canvas `batchIds` watchlist, or else the canvas gets stuck in staging mode with no way to exit.
The changes here allow the batch status to be tracked, and once a batch has all of its items completed, we can remove it from the `batchIds` watchlist. The `batchIds` watchlist now accurately represents *incomplete* canvas batches, fixing this cause of soft lock.
The UI will always re-fetch the queue and batch status on receiving this event, so we may as well just include that data in the event and save the extra network round trips.
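As a rough sketch of the idea (the event name, payload shape, and the `get_batch_status`/`get_queue_status` helpers are placeholders for illustration, not the exact InvokeAI identifiers):
```python
# Hypothetical emitter: bundle batch and queue status into the status-changed
# event so the UI doesn't need follow-up HTTP requests for the same data.
async def emit_queue_item_status_changed(sio, queue, queue_item) -> None:
    await sio.emit(
        "queue_item_status_changed",
        {
            "queue_item": queue_item.model_dump(),
            "batch_status": queue.get_batch_status(queue_item.batch_id).model_dump(),
            "queue_status": queue.get_queue_status().model_dump(),
        },
    )
```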
## What type of PR is this? (check all applicable)
- [X] Feature
## Have you discussed this change with the InvokeAI team?
- [X] No, because: Non-controversial
## Have you updated all relevant documentation?
- [ ] Yes
- [X] N/A
## Description
This adds a list of T2I adapters to the “starter models” offered by the
TUI installer. None of the models is selected by default; this can be
done easily if requested. The models offered to the user are:
```
TencentARC/t2iadapter_canny_sd15v2
TencentARC/t2iadapter_sketch_sd15v2
TencentARC/t2iadapter_depth_sd15v2
TencentARC/t2iadapter_zoedepth_sd15v1
TencentARC/t2i-adapter-canny-sdxl-1.0
TencentARC/t2i-adapter-depth-zoe-sdxl-1.0
TencentARC/t2i-adapter-lineart-sdxl-1.0
TencentARC/t2i-adapter-sketch-sdxl-1.0
```
## Related Tickets & Documents
PR #4612
## QA Instructions, Screenshots, Recordings
The revised installer has a new IP-ADAPTERS tab that looks like this:
![IMG_0255](https://github.com/invoke-ai/InvokeAI/assets/111189/0e01b1f6-7191-49a1-ac63-2c913826d299)
## Added/updated tests?
- [ ] Yes
- [X] No: It would be good to have a suite of model download tests, but that isn't set up yet.
- Update backend metadata for t2i adapter
- Fix typo in `T2IAdapterInvocation`: `ip_adapter_model` -> `t2i_adapter_model`
- Update linear graphs to use t2i adapter
- Add client metadata recall for t2i adapter
- Fix bug with controlnet metadata recall - processor should be set to 'none' when recalling a control adapter
Control adapter logic/state/UI is now generalized to hold `controlnet`, `ip_adapter`, and `t2i_adapter`. Other control adapter types can be added in the future.
TODO:
- Limit IP adapter to 1
- Add T2I adapter to linear graphs
- Fix autoprocess
- T2I metadata saving & recall
- Improve on control adapters UI
## What type of PR is this? (check all applicable)
- [ ] Refactor
- [x] Feature
- [ ] Bug Fix
- [ ] Optimization
- [ ] Documentation Update
- [ ] Community Node Submission
## Have you discussed this change with the InvokeAI team?
- [x] Yes
- [ ] No, because:
## Have you updated all relevant documentation?
- [x] Yes
- [ ] No
## Description
This PR adds support for slow unit tests that depend on models. It
includes:
- Documentation explaining the handling of fast vs. slow unit tests (see the sketch after this list).
- Utilities to assist with writing tests that depend on models.
- A sample test that loads and runs an IP-Adapter model. This is far
from complete test coverage of IP-Adapter - it's just intended as a
first example of how to write tests with models.
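For context, the usual pytest pattern for this kind of split looks roughly like the following `conftest.py` sketch (the marker and flag names are illustrative assumptions; docs/contributing/TESTS.md describes the conventions this PR actually adopts):
```python
import pytest

def pytest_addoption(parser):
    parser.addoption("--run-slow", action="store_true", default=False,
                     help="run tests that download/load real model weights")

def pytest_configure(config):
    config.addinivalue_line("markers", "slow: test depends on real model weights")

def pytest_collection_modifyitems(config, items):
    if config.getoption("--run-slow"):
        return  # slow tests explicitly requested; run everything
    skip_slow = pytest.mark.skip(reason="needs --run-slow")
    for item in items:
        if "slow" in item.keywords:
            item.add_marker(skip_slow)
```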
**Suggestion for reviewers**: Start with docs/contributing/TESTS.md
## QA Instructions, Screenshots, Recordings
I've tested it all, but it would make sense for others to try running
both the fast tests and the slow tests.
## Added/updated tests?
- [x] Yes
- [ ] No
## What type of PR is this? (check all applicable)
- [ ] Refactor
- [ ] Feature
- [ ] Bug Fix
- [x] Optimization
- [ ] Documentation Update
- [ ] Community Node Submission
## Have you discussed this change with the InvokeAI team?
- [x] Yes
- [ ] No, because:
## Have you updated all relevant documentation?
- [x] Yes
- [ ] No
## Description
This PR adds detailed debug logging to the model cache in order to give
more visibility into the model cache's memory utilization. **This PR
does not make any functional changes to the model cache.**
Every time a model is moved from disk to CPU, or between CPU/CUDA, a log
like this is emitted:
```bash
[2023-10-03 15:17:20,599]::[InvokeAI]::DEBUG --> Moved model '/home/ryan/invokeai/models/.cache/63742ed45b499e55620c402d6df26a20:sdxl:main:unet' from cpu to cuda in 1.23s.
Estimated model size: 4.782 GB.
Process RAM (-4.722): 6.987GB -> 2.265GB
libc mmap allocated (-4.722): 6.030GB -> 1.308GB
libc arena used (-0.061): 0.402GB -> 0.341GB
libc arena free (+0.061): 0.006GB -> 0.067GB
libc total allocated (-4.722): 6.439GB -> 1.717GB
libc total used (-4.783): 6.433GB -> 1.649GB
VRAM (+4.881): 1.538GB -> 6.418GB
```
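The RAM/VRAM lines above boil down to taking a snapshot before and after each move and logging the delta, roughly like this sketch (helper names are illustrative; the glibc malloc statistics in the real logs come from additional libc introspection not shown here):
```python
import psutil
import torch

GB = 1024**3

def snapshot() -> dict:
    snap = {"process_ram": psutil.Process().memory_info().rss}
    if torch.cuda.is_available():
        snap["vram"] = torch.cuda.memory_allocated()
    return snap

def format_delta(label: str, before: int, after: int) -> str:
    return f"{label} ({(after - before) / GB:+.3f}): {before / GB:.3f}GB -> {after / GB:.3f}GB"
```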
## Related Tickets & Documents
https://github.com/invoke-ai/InvokeAI/pull/4694 contains related fixes
to some known memory issues.
## QA Instructions, Screenshots, Recordings
Make sure debug logging is enabled, then confirm that the new log entries appear whenever models are loaded or moved between devices.
We should test each of the following environments:
- [x] Linux
- [x] Mac OS + MPS
- [x] Windows
## Added/updated tests?
- [x] Yes
- [ ] No
Added unit tests for the new utilities. Test coverage is still low for
the ModelCache, but no worse than before.