InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
psychedelicious	38343917f8	fix(backend): revert non-blocking device transfer In #6490 we enabled non-blocking torch device transfers throughout the model manager's memory management code. When using this torch feature, torch attempts to wait until the tensor transfer has completed before allowing any access to the tensor. Theoretically, that should make this a safe feature to use. This provides a small performance improvement but causes race conditions in some situations. Specific platforms/systems are affected, and complicated data dependencies can make this unsafe. - Intermittent black images on MPS devices - reported on discord and #6545, fixed with special handling in #6549. - Intermittent OOMs and black images on a P4000 GPU on Windows - reported in #6613, fixed in this commit. On my system, I haven't experienced any issues with generation, but targeted testing of non-blocking ops did expose a race condition when moving tensors from CUDA to CPU. One workaround is to use torch streams with manual sync points. Our application logic is complicated enough that this would be a lot of work and feels ripe for edge cases and missed spots. Much safer is to fully revert non-blocking - which is what this change does.	2024-07-16 08:59:42 +10:00
Ryan Dick	1d449097cc	Apply ruff rule to disallow all relative imports.	2024-07-04 09:35:37 -04:00
Ryan Dick	9da5925287	Add ruff rule to disallow relative parent imports.	2024-07-04 09:35:37 -04:00
Ryan Dick	414750a45d	Update calc_model_size_by_data(...) to handle all expected model types, and to log an error if an unexpected model type is received.	2024-07-04 09:08:25 -04:00
Lincoln Stein	a3cb5da130	Improve RAM<->VRAM memory copy performance in LoRA patching and elsewhere (#6490) * allow model patcher to optimize away the unpatching step when feasible * remove lazy_offloading functionality * allow model patcher to optimize away the unpatching step when feasible * remove lazy_offloading functionality * do not save original weights if there is a CPU copy of state dict * Update invokeai/backend/model_manager/load/load_base.py Co-authored-by: Ryan Dick <ryanjdick3@gmail.com> * documentation fixes requested during penultimate review * add non-blocking=True parameters to several torch.nn.Module.to() calls, for slight performance increases * fix ruff errors * prevent crash on non-cuda-enabled systems --------- Co-authored-by: Lincoln Stein <lstein@gmail.com> Co-authored-by: Kent Keirsey <31807370+hipsterusername@users.noreply.github.com> Co-authored-by: Ryan Dick <ryanjdick3@gmail.com>	2024-06-13 17:10:03 +00:00
Ryan Dick	7ca677578e	Create a UNetAttentionPatcher for patching UNet models with CustomAttnProcessor2_0 modules.	2024-04-09 08:12:12 -04:00
blessedcoolant	be574cb764	fix: incorrect suffix check in ip adapter checkpoint file	2024-04-03 22:38:28 +05:30
blessedcoolant	e574815413	chore: clean up merge conflicts	2024-04-03 20:28:00 +05:30
blessedcoolant	fb293dcd84	Merge branch 'checkpoint-ip-adapter' of https://github.com/blessedcoolant/InvokeAI into checkpoint-ip-adapter	2024-04-03 20:23:07 +05:30
blessedcoolant	414851f2f0	fix: raise and present the runtime error from the exception	2024-04-03 20:21:50 +05:30
blessedcoolant	2dcbb7223b	fix: use Path for ip_adapter_ckpt_path instead of str	2024-04-03 20:21:03 +05:30
blessedcoolant	14a9f74b17	cleanup: use load_file of safetensors directly for loading ip adapters	2024-04-03 12:40:13 +05:30
blessedcoolant	1372ef15b3	fix: Fail when unexpected keys are found in IP Adapter models	2024-04-03 12:40:11 +05:30
blessedcoolant	be1212de9a	fix: Raise a better error when incorrect CLIP Vision model is used	2024-04-03 12:40:10 +05:30
blessedcoolant	936b99bd3c	chore: improve types in ip_adapter backend file	2024-04-03 12:40:02 +05:30
blessedcoolant	67afb1763e	wip: Initial implementation of safetensor support for IP Adapter	2024-04-03 12:39:52 +05:30
blessedcoolant	23390f1516	cleanup: use load_file of safetensors directly for loading ip adapters	2024-04-01 06:37:38 +05:30
blessedcoolant	6e4c2d3685	fix: Fail when unexpected keys are found in IP Adapter models	2024-03-29 12:34:56 +05:30
blessedcoolant	cd078b1865	fix: Raise a better error when incorrect CLIP Vision model is used	2024-03-29 11:58:10 +05:30
blessedcoolant	688a0f30bb	chore: improve types in ip_adapter backend file	2024-03-27 22:08:23 +05:30
blessedcoolant	b013d0e064	wip: Initial implementation of safetensor support for IP Adapter	2024-03-27 22:08:14 +05:30
psychedelicious	5a3195f757	final tidying before marking PR as ready for review - Replace AnyModelLoader with ModelLoaderRegistry - Fix type check errors in multiple files - Remove apparently unneeded `get_model_config_enum()` method from model manager - Remove last vestiges of old model manager - Updated tests and documentation resolve conflict with seamless.py	2024-03-01 10:42:33 +11:00
Lincoln Stein	5d612ec095	Tidy names and locations of modules - Rename old "model_management" directory to "model_management_OLD" in order to catch dangling references to original model manager. - Caught and fixed most dangling references (still checking) - Rename lora, textual_inversion and model_patcher modules - Introduce a RawModel base class to simplify the Union returned by the model loaders. - Tidy up the model manager 2-related tests. Add useful fixtures, and a finalizer to the queue and installer fixtures that will stop the services and release threads.	2024-03-01 10:42:33 +11:00
Lincoln Stein	a23dedd2ee	make model manager v2 ready for PR review - Replace legacy model manager service with the v2 manager. - Update invocations to use new load interface. - Fixed many but not all type checking errors in the invocations. Most were unrelated to model manager - Updated routes. All the new routes live under the route tag `model_manager_v2`. To avoid confusion with the old routes, they have the URL prefix `/api/v2/models`. The old routes have been de-registered. - Added a pytest for the loader. - Updated documentation in contributing/MODEL_MANAGER.md	2024-03-01 10:42:33 +11:00
Ryan Dick	693c6cf5e4	Add support for IPAdapterFull models. The changes are based on this upstream PR: https://github.com/tencent-ailab/IP-Adapter/pull/139 .	2023-11-29 15:07:21 -08:00
psychedelicious	6494e8e551	chore: ruff format	2023-11-11 10:55:40 +11:00
psychedelicious	513fceac82	chore: ruff check - fix pycodestyle	2023-11-11 10:55:33 +11:00
psychedelicious	99a8ebe3a0	chore: ruff check - fix flake8-bugbear	2023-11-11 10:55:28 +11:00
Ryan Dick	3079c75a60	(minor) Make it more clear that shape annotations are just comments and not commented lines of code.	2023-10-16 08:35:32 -04:00
Ryan Dick	8464450a53	Add support for multi-image IP-Adapter.	2023-10-14 12:50:33 -04:00
Ryan Dick	971ccfb081	Refactor multi-IP-Adapter to clean up the interface around changing scales.	2023-10-06 20:43:43 -04:00
Ryan Dick	4df1cdb34d	Tidy _prepare_attention_processors(...) logic.	2023-10-06 20:43:43 -04:00
Ryan Dick	d8d0c9af09	Fix handling of scales with multiple IP-Adapters.	2023-10-06 20:43:43 -04:00
Ryan Dick	26b91a538a	Fixes to get IP-Adapter tests working with new multi-IP-Adapter support.	2023-10-06 20:43:43 -04:00
Ryan Dick	7ca456d674	Update IP-Adapter model to enable running multiple IP-Adapters at once. (Not tested yet.)	2023-10-06 20:43:43 -04:00
Ryan Dick	fbe6452c45	Add support for IPAdapterPlusXL based on `6219530507`.	2023-10-04 22:35:17 -04:00
Ryan Dick	399ebe443e	Fix IP-Adapter calculation of memory footprint.	2023-09-25 18:28:10 -04:00
Ryan Dick	bf62553150	(minor) Update documentation to reflect that a bug was fixed in InvokeAI/ip_adapter_sdxl_vit_h by `e178288fb6`	2023-09-20 20:18:33 -04:00
Ryan Dick	92b49e45bb	Address flake8 error.	2023-09-18 16:33:16 -04:00
Ryan Dick	b05b8ef677	Switch to using torch 2.0 attention for IP-Adapter (more memory-efficient).	2023-09-18 16:30:53 -04:00
user1	ced297ed21	Initial implementation of IP-Adapter "begin_step_percent" and "end_step_percent" for controlling on which steps IP-Adapter is applied in the denoising loop.	2023-09-16 08:24:12 -07:00
Ryan Dick	343df03a92	isort	2023-09-15 13:18:00 -04:00
Ryan Dick	c104807201	Update list of supported IP-Adapters.	2023-09-14 23:43:19 -04:00
Ryan Dick	990ce9a1da	Lookup IP-Adapter linked image encoder from disk instead of storing in model config metadata.	2023-09-14 23:06:57 -04:00
Ryan Dick	c2f074dc2f	Fix python static checks.	2023-09-14 16:48:47 -04:00
Ryan Dick	fca60862e2	Add README.md describing IP-Adapter model formats.	2023-09-14 16:02:07 -04:00
Ryan Dick	94c186bb4c	Fix bug in IPAdapter.to(...).	2023-09-14 15:45:25 -04:00
Ryan Dick	a22c8cb3a1	Improve robustness of check for IPAdapter vs IPAdapterPlus.	2023-09-14 15:25:41 -04:00
Ryan Dick	781e8521d5	Eliminate the need for IPAdapter.initialize().	2023-09-14 15:02:59 -04:00
Ryan Dick	d114d0ba95	Remove need for the image_encoder param in IPAdapter.initialize().	2023-09-14 14:14:35 -04:00

66 Commits