In #6490 we enabled non-blocking torch device transfers throughout the model manager's memory management code. When this feature is used, torch attempts to wait until a tensor transfer has completed before allowing any access to the tensor, so in theory it should be safe to use.
In practice it provides a small performance improvement but causes race conditions in some situations: specific platforms/systems are affected, and complicated data dependencies can make it unsafe. Known issues:
- Intermittent black images on MPS devices - reported on Discord and in #6545, fixed with special handling in #6549.
- Intermittent OOMs and black images on a P4000 GPU on Windows - reported in #6613, fixed in this commit.
On my system, I haven't experienced any issues with generation, but targeted testing of non-blocking ops did expose a race condition when moving tensors from CUDA to CPU.
One workaround is to use torch streams with manual sync points. Our application logic is complicated enough that this would be a lot of work and feels ripe for edge cases and missed spots.
Much safer is to fully revert non-blocking - which is what this change does.
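For reference, the failure mode is roughly the following. This is a minimal, hypothetical sketch and not the actual model manager code; the hazard is clearest when the destination is pinned host memory, where the device-to-host copy is genuinely asynchronous.

```python
import torch

# Hypothetical illustration of the CUDA -> CPU race described above.
gpu = torch.randn(1024, 1024, device="cuda")
host = torch.empty(gpu.shape, dtype=gpu.dtype, device="cpu", pin_memory=True)

host.copy_(gpu, non_blocking=True)
# Reading `host` at this point races with the in-flight copy on the current
# CUDA stream and may observe stale or partially written memory.

torch.cuda.synchronize()  # explicit sync point
total = host.sum()        # safe: the copy is guaranteed to have completed
```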
We can get black outputs when moving tensors from CPU to MPS. It appears MPS to CPU is fine. See:
- https://github.com/pytorch/pytorch/issues/107455
- https://discuss.pytorch.org/t/should-we-set-non-blocking-to-true/38234/28
Changes:
- Add properties for each device on `TorchDevice` as a convenience.
- Add a `get_non_blocking` static method on `TorchDevice`. This utility takes a torch device and returns the value to pass as `non_blocking` when moving a tensor to that device (see the sketch below).
- Update model patching and caching APIs to use this new utility.
Fixes: #6545
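A minimal sketch of what such a utility could look like, assuming the behaviour described above; the real class layout, property names, and signature in the codebase may differ.

```python
import torch


class TorchDevice:
    """Sketch only: names and layout are assumptions, not the exact project API."""

    # Convenience device properties (class-level constants in this sketch).
    CPU = torch.device("cpu")
    MPS = torch.device("mps")

    @staticmethod
    def get_non_blocking(to_device: torch.device) -> bool:
        """Return the value to pass as `non_blocking` when moving a tensor to
        `to_device`. Non-blocking transfers to MPS can produce black images
        (see the linked issues), so they are only enabled for other devices."""
        return to_device.type != "mps"


# Usage sketch:
# tensor = tensor.to(device, non_blocking=TorchDevice.get_non_blocking(device))
```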
* introduce new abstraction layer for GPU devices
* add unit test for device abstraction
* fix ruff
* convert TorchDeviceSelect into a stateless class
* move logic to select context-specific execution device into context API
* add mock hardware environments to pytest
* remove dangling mocker fixture
* fix unit test for running on non-CUDA systems
* remove unimplemented get_execution_device() call
* remove autocast precision
* Multiple changes:
1. Removed TorchDeviceSelect.get_execution_device(), as well as calls to
context.models.get_execution_device().
2. Renamed TorchDeviceSelect to TorchDevice.
3. Added back the legacy public API defined in `invocation_api`, including
choose_precision().
4. Added a config file migration script to accommodate the removal of
precision=autocast (a rough sketch of such a migration follows below).
* add deprecation warnings to choose_torch_device() and choose_precision()
* fix test crash
* remove app_config argument from choose_torch_device() and choose_torch_dtype()
---------
Co-authored-by: Lincoln Stein <lstein@gmail.com>
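A rough, hypothetical sketch of the kind of migration mentioned in point 4. The `precision` field name is taken from the text; the replacement value `auto` and the function name are assumptions.

```python
from typing import Any, Dict


def migrate_precision_setting(config: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical sketch: `autocast` is no longer a valid precision value,
    so map it to `auto` (assumed replacement) and leave other settings alone."""
    if config.get("precision") == "autocast":
        config["precision"] = "auto"
    return config


# Usage sketch:
# settings = migrate_precision_setting(loaded_config_dict)
```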
- Allow user-defined precision on MPS.
- Use more explicit logic to handle all possible cases.
- Add comments.
- Remove the app_config args (they were effectively unused; the config is now obtained via the singleton getter util instead).
- Cache stat collection enabled.
- Implemented ONNX loading.
- Add ability to specify the repo version variant in installer CLI.
- If the caller asks for a repo version that doesn't exist, fall back to the empty version rather than raising an error (see the sketch below).
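A hypothetical sketch of that fallback behaviour; the names are illustrative and not the installer's actual API.

```python
from typing import Optional, Sequence


def resolve_repo_variant(requested: Optional[str], available: Sequence[str]) -> Optional[str]:
    """Hypothetical helper: return the requested repo version variant if it
    exists, otherwise fall back to the empty (un-suffixed) variant instead of
    raising an error."""
    if requested is not None and requested in available:
        return requested
    return None  # empty variant


# Usage sketch:
# resolve_repo_variant("fp16", ["fp16", "onnx"])  -> "fp16"
# resolve_repo_variant("fp32", ["fp16", "onnx"])  -> None (empty variant)
```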
* feat: allow bfloat16 to be configurable in invoke.yaml
* fix: `torch_dtype()` util
- Use `choose_precision` to get the precision string
- Do not reference the deprecated `config.full_precision` flag (why does this still exist?); if a user had it enabled, it would override their actual precision setting and potentially cause a lot of confusion. A sketch of the updated helper follows below.
---------
Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>
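A minimal sketch of the updated helper, assuming `choose_precision()` returns a precision string such as "float16", "bfloat16", or "float32"; the import path is also an assumption.

```python
import torch

# Import path assumed for illustration; choose_precision() is the helper
# referenced above that returns a precision string for the given device.
from invokeai.backend.util.devices import choose_precision


def torch_dtype(device: torch.device) -> torch.dtype:
    """Derive the dtype from the precision string instead of from the
    deprecated `config.full_precision` flag."""
    precision = choose_precision(device)
    if precision == "float16":
        return torch.float16
    if precision == "bfloat16":
        return torch.bfloat16
    return torch.float32
```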
* remove MacOS Sonoma check in devices.py
As of pytorch 2.1.0, float16 works with our MPS fixes on Sonoma, so the check is no longer needed.
* remove unused platform import
This PR is to allow FP16 precision to work on Macs with MPS. In
addition, it centralizes the torch fixes/workarounds required for MPS
into a new backend utility `mps_fixes.py`. This is conditionally
imported in `api_app.py`/`cli_app.py`.
Many MANY thanks to @StAlKeR7779 for patiently working to debug and fix
these issues.
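The conditional import is roughly of the following shape; the module path `invokeai.backend.util.mps_fixes` is an assumption for illustration.

```python
import torch

# At startup (e.g. in api_app.py / cli_app.py): only pull in the MPS
# workarounds when an MPS device is actually available.
if torch.backends.mps.is_available():
    import invokeai.backend.util.mps_fixes  # noqa: F401  (import applies the fixes)
```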
- Make environment variable settings case InSenSiTive:
INVOKEAI_MAX_LOADED_MODELS and InvokeAI_Max_Loaded_Models
environment variables will both set `max_loaded_models`
- Updated realesrgan to use new config system.
- Updated textual_inversion_training to use new config system.
- Discovered a race condition when InvokeAIAppConfig is created
at module load time, which makes it impossible to customize
or replace the help message produced with --help on the command
line. To fix this, moved all instances of get_invokeai_config()
from module load time to object initialization time (a before/after
sketch follows this list). Makes the code cleaner, too.
- Added `--from_file` argument to `invokeai-node-cli` and changed
github action to match. CI tests will hopefully work now.
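A before/after sketch of the race-condition fix described above. `get_invokeai_config()` is the getter named in the text; the import path and the surrounding class are assumptions.

```python
# Import path assumed for illustration.
from invokeai.app.services.config import get_invokeai_config

# Before: the config was resolved at module load time, so later customization
# (e.g. of the --help output) came too late.
#
#   config = get_invokeai_config()  # runs on import
#
#   class Trainer:
#       def __init__(self):
#           self.max_loaded_models = config.max_loaded_models


# After: the config is resolved at object initialization time.
class Trainer:
    def __init__(self) -> None:
        config = get_invokeai_config()  # runs when the object is created
        self.max_loaded_models = config.max_loaded_models
```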