InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
Lincoln Stein	0c8f0e3386	add max_load_models parameter for model cache control - ldm.generate.Generator() now takes an argument named `max_load_models`. This is an integer that limits the model cache size. When the cache reaches the limit, it will start purging older models from cache. - CLI takes an argument --max_load_models, default to 2. This will keep one model in GPU and the other in CPU and switch back and forth quickly. - To not cache models at all, pass --max_load_models=1	2022-10-31 08:53:16 -04:00
Lincoln Stein	fbfffe028f	add --no-interactive mode	2022-10-30 00:33:48 -04:00
Lincoln Stein	19b6c671a6	further improvements to preload_models script - User can choose to download just recommended models, customize list to download, or skip downloading altogether. - Does direct download to models directory instead of to HuggingFace cache - Able to resume interrupted downloads	2022-10-30 00:17:05 -04:00
Lincoln Stein	b532e6dd17	wording and formatting tweaks	2022-10-29 11:28:17 -04:00
Lincoln Stein	3caa95ced9	add more step-by-step documentation and links	2022-10-29 09:18:48 -04:00
Lincoln Stein	ef68a419f1	preload_models.py script downloads the weight files - user can select which weight files to download using huggingface cache - user must log in to huggingface, generate an access token, and accept license terms the very first time this is run. After that, everything works automatically. - added placeholder for docs for installing models - also got rid of unused config files. hopefully they weren't needed for textual inversion, but I don't think so.	2022-10-29 01:02:45 -04:00
Lincoln Stein	fe7ab6e480	fix crash in !del_model command	2022-10-28 11:20:04 -04:00
Lincoln Stein	aa785c3ef1	ready for merge after documentation added	2022-10-27 11:55:00 -04:00
Lincoln Stein	799dc6d0df	acceptable integration of new prompting system and inpainting This was a difficult merge because both PR #1108 and #1243 made changes to obscure parts of the diffusion code. - prompt weighting, merging and cross-attention working - cross-attention does not work with runwayML inpainting model, but weighting and merging are tested and working - CLI command parsing code rewritten in order to get embedded quotes right - --hires now works with runwayML inpainting - --embiggen does not work with runwayML and will give an error - Added an --invert option to invert masks applied to inpainting - Updated documentation	2022-10-27 01:51:35 -04:00
Lincoln Stein	8d5a225011	allow for empty prompts (useful for inpaint removal)	2022-10-25 17:26:00 -04:00
Lincoln Stein	99d23c4d81	fix merge conflicts	2022-10-25 07:30:26 -04:00
Lincoln Stein	9bef643bf5	fix a few more metadata bugs - facetool and upscale arguments now written into metadata - cleaned up handling of !fetch command	2022-10-25 00:31:43 -04:00
Lincoln Stein	f6b31d51e0	fix incorrect handling of single quotes in prompts	2022-10-25 00:31:43 -04:00
Lincoln Stein	b159b2fe42	add support for safety checker (NSFW filter) Now you can activate the Hugging Face `diffusers` library safety check for NSFW and other potentially disturbing imagery. To turn on the safety check, pass --safety_checker at the command line. For developers, the flag is `safety_checker=True` passed to ldm.generate.Generate(). Once the safety checker is turned on, it cannot be turned off unless you reinitialize a new Generate object. When the safety checker is active, suspect images will be blurred and a warning icon is added. There is also a warning message printed in the CLI, but it can be a little hard to see because of its positioning in the output stream. There is a slight but noticeable delay when the safety checker runs. Note that invisible watermarking is not currently implemented. The watermark code distributed by the CompViz distribution uses a library that does not seem to be able to retrieve the watermarks it creates, and it does not appear that Hugging Face `diffusers` or other SD distributions are doing any watermarking.	2022-10-23 22:26:18 -04:00
Lincoln Stein	f25c1f900f	add support for loading VAE autoencoders To add a VAE autoencoder to an existing model: 1. Download the appropriate autoencoder and put it into models/ldm/stable-diffusion Note that you MUST use a VAE that was written for the original CompViz Stable Diffusion codebase. For v1.4, that would be the file named vae-ft-mse-840000-ema-pruned.ckpt that you can download from https://huggingface.co/stabilityai/sd-vae-ft-mse-original 2. Edit config/models.yaml to contain the following stanza, modifying `weights` and `vae` as required to match the weights and vae model file names. There is no requirement to rename the VAE file. ~~~ stable-diffusion-1.4: weights: models/ldm/stable-diffusion-v1/sd-v1-4.ckpt description: Stable Diffusion v1.4 config: configs/stable-diffusion/v1-inference.yaml vae: models/ldm/stable-diffusion-v1/vae-ft-mse-840000-ema-pruned.ckpt width: 512 height: 512 ~~~ 3. Alternatively from within the `invoke.py` CLI, you may use the command `!editmodel stable-diffusion-1.4` to bring up a simple editor that will allow you to add the path to the VAE. 4. If you are just installing InvokeAI for the first time, you can also use `!import_model models/ldm/stable-diffusion/sd-v1.4.ckpt` instead to create the configuration from scratch. 5. That's it!	2022-10-23 09:33:15 -04:00
Lincoln Stein	ce6d618e3b	outcropping improvements - catch syntax errors in the outcrop coordinates - work (after a fashion) on non-Invoke generated images	2022-10-23 09:33:00 -04:00
Lincoln Stein	b2bf2b08ff	Merge branch 'model-switching' into development	2022-10-21 21:27:59 -04:00
Lincoln Stein	c9f9eed04e	resolve numerous small merge bugs - This merges PR #882 Coauthor: ArDiouscuros	2022-10-21 12:57:15 -04:00
Lincoln Stein	be7de4849c	Merge branch 'development' into model-switching	2022-10-21 00:55:52 -04:00
Lincoln Stein	83e6ab08aa	further improvements to model loading - code for committing config changes to models.yaml now in module rather than in invoke script - model marked "default" is now loaded if model not specified on command line - uncache changed models when edited, so that they reload properly - removed liaon from models.yaml and added stable-diffusion-1.5	2022-10-21 00:28:54 -04:00
psychedelicious	899ba975a6	Improves logic to determine if clipseg weights should be downloaded	2022-10-20 06:56:50 -04:00
psychedelicious	bfa65560eb	Fixes torch.load() for MPS/CPU	2022-10-20 06:56:50 -04:00
psychedelicious	ed9307f469	Fix typo	2022-10-20 06:56:50 -04:00
Lincoln Stein	a357bf4f19	add !mask command to view output of clipseg - The !mask command takes an image path, a text prompt, and (optionally) a masking threshold. It creates a mask over the region indicated by the prompt, and outputs several files that show which regions will be masked by the chosen prompt and threshold. - The mask images should not be passed directly to img2img because they are designed for visualization only. Instead, use the --text_mask option to pass the selected prompt and threshold. - See docs/features/INPAINTING.md for details.	2022-10-20 06:56:50 -04:00
Lincoln Stein	2ca4242f5f	fix clipseg loading problems - The directory "models" in the main InvokeAI directory was conflicting with loading "models.clipseg". To fix this issue, I have renamed the models.clipseg to clipseg_models.clipseg, and applied this change to the 'models-rename' branch of invoke-ai's fork of clipseg.	2022-10-20 06:56:50 -04:00
Lincoln Stein	62dd3b7d7d	resolve models.clipseg vs clipseg ambiguity	2022-10-18 23:09:26 -04:00
Lincoln Stein	66f6ef1b35	fix syntax errors in preload	2022-10-18 19:25:18 -04:00
Lincoln Stein	a488b14373	prevent preload warning message	2022-10-18 17:09:17 -04:00
Lincoln Stein	32122e0312	clipseg library and environment in place	2022-10-18 08:27:48 -04:00
Lincoln Stein	c974c95e2b	Merge branch 'development' of github.com:invoke-ai/InvokeAI into development	2022-10-17 23:14:55 -04:00
Lincoln Stein	3b2590243c	^C at invoke> cmd line exits gracefully	2022-10-17 23:14:32 -04:00
Lincoln Stein	0cf11ce488	add option to CLI and pngwriter that allows user to set PNG compression level - In CLI: the argument is --png_compression <0..9> (-z<0..9>) - In API, pass `compress_level` to PngWriter.save_image_and_prompt_to_png() Compression ranges from 0 (no compression) to 9 (maximum compression). Default value is 6 (as specified by Pillow package). This addresses an issue first raised in #652.	2022-10-17 22:27:47 -04:00
Lincoln Stein	ef2058824a	add a strength value to `inpaint_replace` - --inpaint_replace 0.X will cause inpainting to ignore what is under the masked region with a strength ranging from 0 (don't ignore at all) to 1.0 (ignore completely) - sync with upstream development - update docs	2022-10-16 10:06:47 -04:00
Lincoln Stein	a705a5a0aa	enhance support for model switching and editing - Error checks for invalid model - Add !del_model command to invoke.py - Add del_model() method to model_cache - Autocompleter kept in sync with model addition/subtraction.	2022-10-15 15:46:29 -04:00
Lincoln Stein	c4fb8e304b	fix noisy images at high step counts At step counts greater than ~75, the ksamplers start producing noisy images when using the Karras noise schedule. This PR reverts to using the model's own noise schedule, which eliminates the problem at the cost of slowing convergence at lower step counts. This PR also introduces a new CLI `--save_intermediates <n>' argument, which will save every nth intermediate image into a subdirectory named `intermediates/<image_prefix>'. Addresses issue #1083.	2022-10-14 16:19:45 -04:00
Lincoln Stein	fe2a2cfc8b	Merge branch 'development' into model-switching	2022-10-14 13:18:59 -04:00
db3000	1ea541baa6	Reword deprecation warning for dream.py - this plus previous commit closes #1087	2022-10-14 07:33:10 -04:00
db3000	82b7c118c4	Forward dream.py to invoke.py using the same interpreter, add deprecation warning	2022-10-14 07:31:35 -04:00
Lincoln Stein	1c501333e8	minor doc fixes	2022-10-14 07:30:26 -04:00
db3000	ce5e57d828	Generalize facetool strength argument	2022-10-14 00:03:06 -04:00
Lincoln Stein	e98fe9c22d	fix noisy images at high step counts At step counts greater than ~75, the ksamplers start producing noisy images when using the Karras noise schedule. This PR reverts to using the model's own noise schedule, which eliminates the problem at the cost of slowing convergence at lower step counts. This PR also introduces a new CLI `--save_intermediates <n>' argument, which will save every nth intermediate image into a subdirectory named `intermediates/<image_prefix>'. Addresses issue #1083.	2022-10-14 00:01:59 -04:00
Lincoln Stein	6afc0f9b38	add ability to import and edit alternative models online - !import_model <path/to/model/weights> will import a new model, prompt the user for its name and description, write it to the models.yaml file, and load it. - !edit_model <model_name> will bring up a previously-defined model and prompt the user to edit its descriptive fields. Example of !import_model <pre> invoke> <b>!import_model models/ldm/stable-diffusion-v1/model-epoch08-float16.ckpt</b> >> Model import in process. Please enter the values needed to configure this model: Name for this model: <b>waifu-diffusion</b> Description of this model: <b>Waifu Diffusion v1.3</b> Configuration file for this model: <b>configs/stable-diffusion/v1-inference.yaml</b> Default image width: <b>512</b> Default image height: <b>512</b> >> New configuration: waifu-diffusion: config: configs/stable-diffusion/v1-inference.yaml description: Waifu Diffusion v1.3 height: 512 weights: models/ldm/stable-diffusion-v1/model-epoch08-float16.ckpt width: 512 OK to import [n]? <b>y</b> >> Caching model stable-diffusion-1.4 in system RAM >> Loading waifu-diffusion from models/ldm/stable-diffusion-v1/model-epoch08-float16.ckpt \| LatentDiffusion: Running in eps-prediction mode \| DiffusionWrapper has 859.52 M params. \| Making attention of type 'vanilla' with 512 in_channels \| Working with z of shape (1, 4, 32, 32) = 4096 dimensions. \| Making attention of type 'vanilla' with 512 in_channels \| Using faster float16 precision </pre> Example of !edit_model <pre> invoke> <b>!edit_model waifu-diffusion</b> >> Editing model waifu-diffusion from configuration file ./configs/models.yaml description: <b>Waifu diffusion v1.4beta</b> weights: models/ldm/stable-diffusion-v1/<b>model-epoch10-float16.ckpt</b> config: configs/stable-diffusion/v1-inference.yaml width: 512 height: 512 >> New configuration: waifu-diffusion: config: configs/stable-diffusion/v1-inference.yaml description: Waifu diffusion v1.4beta weights: models/ldm/stable-diffusion-v1/model-epoch10-float16.ckpt height: 512 width: 512 OK to import [n]? y >> Caching model stable-diffusion-1.4 in system RAM >> Loading waifu-diffusion from models/ldm/stable-diffusion-v1/model-epoch10-float16.ckpt ... </pre>	2022-10-13 23:48:07 -04:00
db3000	7f491fd2d2	Reword deprecation warning for dream.py	2022-10-13 12:12:05 -04:00
db3000	203a6d8a00	Forward dream.py to invoke.py using the same interpreter, add deprecation warning	2022-10-13 12:12:05 -04:00
CapableWeb	6c0dd9b5ef	Add back old `dream.py` as `legacy_api.py` This commit "reverts" the new API changes by extracting the old functionality into new files. The work is based on the commit `803a51d5adca7e6e28491fc414fd3937bee7cb79` PngWriter regained PromptFormatter as old server used that. `server_legacy.py` is the old server that `dream.py` used. Finally `legacy_api.py` is what `dream.py` used to be at the mentioned commit. One manually run test has been added in order to be able to test compatibility with the old API, currently just testing that the API endpoint works the same way + the image hash is the same as it used to be before.	2022-10-12 20:35:56 -04:00
Lincoln Stein	aa6aa68753	proposed fix to work on mps systems	2022-10-12 11:08:27 -04:00
Lincoln Stein	488334710b	enable fast switching between models in invoke.py - This PR enables two new commands in the invoke.py script !models -- list the available models and their cache status !switch <model> -- switch to the indicated model Example: invoke> !models laion400m not loaded Latent Diffusion LAION400M model stable-diffusion-1.4 active Stable Diffusion inference model version 1.4 waifu-1.3 cached Waifu anime model version 1.3 invoke> !switch waifu-1.3 >> Caching model stable-diffusion-1.4 in system RAM >> Retrieving model waifu-1.3 from system RAM cache The name and descriptions of the models are taken from `config/models.yaml`. A future enhancement to `model_cache.py` will be to enable new model stanzas to be added to the file programmatically. This will be useful for the WebGUI. More details: - Use fast switching algorithm described in PR #948 - Models are selected using their configuration stanza name given in models.yaml. - To avoid filling up CPU RAM with cached models, this PR implements an LRU cache that monitors available CPU RAM. - The caching code allows the minimum value of available RAM to be adjusted, but invoke.py does not currently have a command-line argument that allows you to set it. The minimum free RAM is arbitrarily set to 2 GB. - Add optional description field to configs/models.yaml Unrelated fixes: - Added ">>" to CompViz model loading messages in order to make user experience more consistent. - When generating an image greater than defaults, will only warn about possible VRAM filling the first time. - Fixed bug that was causing help message to be printed twice. This involved moving the import line for the web backend into the section where it is called. Coauthored by: @ArDiouscuros	2022-10-12 02:37:42 -04:00
Lincoln Stein	2b1aaf4ee7	rename all modules from ldm.dream to ldm.invoke - scripts and documentation updated to match - ran preflight checks on both web and CLI and seems to be working	2022-10-08 11:37:23 -04:00
Lincoln Stein	98fe044dee	rebrand CLI from "dream" to "invoke" - rename dream.py to invoke.py - create a compatibility script named dream.py that execs() invoke.py - redo documentation - change help message in args - this does not rename the libraries, which are still ldm.dream.util, etc	2022-10-08 09:32:06 -04:00
plucked	6499b99dad	revert accidental edit	2022-10-07 10:26:14 +00:00

1 2 3 4 5 ...

297 Commits