InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
Lincoln Stein	8731b498c0	fix merge conflicts	2022-09-20 23:55:57 -04:00
Lincoln Stein	f408ef2e6c	resolved multiple conflicts between PR #683 and subsequent PRs	2022-09-20 23:41:43 -04:00
Lincoln Stein	283a0d72c7	tweaks to make postprocess fixing work better - modify strength of embiggen to reduce tiling ghosts - normalize naming of postprocessed files (could improve more to avoid name collisions) - move restoration modules under ldm.dream	2022-09-20 23:38:04 -04:00
blessedcoolant	1b5013ab72	GFPGAN and Real ESRGAN Implementation Refactor	2022-09-20 23:37:19 -04:00
Lincoln Stein	e8bb39370c	add ability to post-process images from the CLI - supports gfpgan, esrgan, codeformer and embiggen - To use: dream> !fix ./outputs/img-samples/000056.292144555.png -ft gfpgan -U2 -G0.8 dream> !fix ./outputs/img-samples/000056.292144555.png -ft codeformer -G 0.8 dream> !fix ./outputs/img-samples/000056.29214455.png -U4 dream> !fix ./outputs/img-samples/000056.292144555.png -embiggen 1.5 The first example invokes gfpgan to fix faces and esrgan to upscale. The second example invokes codeformer to fix faces, no upscaling The third example uses esrgan to upscale 4X The four example runs embiggen to enlarge 1.5X - This is very preliminary work. There are some anomalies to note: 1. The syntax is non-obvious. I would prefer something like: !fix esrgan,gfpgan !fix esrgan !fix embiggen,codeformer However, this will require refactoring the gfpgan and embiggen code. 2. Images generated using gfpgan, esrgan or codeformer all are named "xxxxxx.xxxxxx.postprocessed.png" and the original is saved. However, the prefix is a new one that is not related to the original. 3. Images generated using embiggen are named "xxxxx.xxxxxxx.png", and once again the prefix is new. I'm not sure whether the prefix should be aligned with the original file's prefix or not. Probably not, but opinions welcome.	2022-09-20 23:33:20 -04:00
Lincoln Stein	fc61ddab3c	Merge branch 'development' into postprocessing-commands	2022-09-20 18:48:42 -04:00
Mihail Dumitrescu	d176fb07cd	Replace --full_precision with --precision that works even if not specified Allowed values are 'auto', 'float32', 'autocast', 'float16'. If not specified or 'auto' a working precision is automatically selected based on the torch device. Context: #526 Deprecated --full_precision / -F Tested on both cuda and cpu by calling scripts/dream.py without arguments and checked the auto configuration worked. With --precision=auto/float32/autocast/float16 it performs as expected, either working or failing with a reasonable error. Also checked Img2Img.	2022-09-20 17:08:00 -04:00
Lincoln Stein	23af057e5c	tweaks to make postprocess fixing work better - modify strength of embiggen to reduce tiling ghosts - normalize naming of postprocessed files (could improve more to avoid name collisions) - move restoration modules under ldm.dream	2022-09-19 14:54:52 -04:00
Lincoln Stein	c14bdcb8fd	combine PRs #690 and #683	2022-09-19 13:59:43 -04:00
Lincoln Stein	f816526d0d	add ability to post-process images from the CLI - supports gfpgan, esrgan, codeformer and embiggen - To use: dream> !fix ./outputs/img-samples/000056.292144555.png -ft gfpgan -U2 -G0.8 dream> !fix ./outputs/img-samples/000056.292144555.png -ft codeformer -G 0.8 dream> !fix ./outputs/img-samples/000056.29214455.png -U4 dream> !fix ./outputs/img-samples/000056.292144555.png -embiggen 1.5 The first example invokes gfpgan to fix faces and esrgan to upscale. The second example invokes codeformer to fix faces, no upscaling The third example uses esrgan to upscale 4X The four example runs embiggen to enlarge 1.5X - This is very preliminary work. There are some anomalies to note: 1. The syntax is non-obvious. I would prefer something like: !fix esrgan,gfpgan !fix esrgan !fix embiggen,codeformer However, this will require refactoring the gfpgan and embiggen code. 2. Images generated using gfpgan, esrgan or codeformer all are named "xxxxxx.xxxxxx.postprocessed.png" and the original is saved. However, the prefix is a new one that is not related to the original. 3. Images generated using embiggen are named "xxxxx.xxxxxxx.png", and once again the prefix is new. I'm not sure whether the prefix should be aligned with the original file's prefix or not. Probably not, but opinions welcome.	2022-09-19 12:38:14 -04:00
blessedcoolant	7b0cbb34d6	GFPGAN and Real ESRGAN Implementation Refactor	2022-09-19 23:38:56 +12:00
blessedcoolant	f3292a6953	Implement CodeFormer Face Restoration (#669 ) * Implement CodeFormer Face Restoration * fix codeformer model destination path Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2022-09-18 15:01:05 -04:00
Danny Beer	045aa7a9a3	Support color correction for img2img and inpainting (#613 ) * Support color correction for img2img and inpainting, avoiding the shift to magenta seen when running images through img2img repeatedly. * Fix docs for color correction * add --init_color to prompt reconstruction * For best results, the --init_color option should point to the very first image used in the sequence of img2img operations. Otherwise color correction will skew towards cyan. Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2022-09-18 09:47:57 -04:00
Lincoln Stein	cbac95b02a	Merge with PR #602 - New and improved web api - Author: @Kyle0654	2022-09-16 16:35:34 -04:00
Lincoln Stein	403d02d94f	implementation of RFC #266 (#587 ) * Feature complete for #266 with exception of several small deviations: 1. initial image and model weight hashes use full sha256 hash rather than first 8 digits 2. Initialization parameters for post-processing steps not provided 3. Uses top-level "images" tags for both a single image and a grid of images. This change was suggested in a comment. * Added scripts/sd_metadata.py to retrieve and print metadata from PNG files * New ldm.dream.args.Args class is a namespace like object which holds all defaults and can be modified during exection to hold current settings. * Modified dream.py and server.py to accommodate Args class.	2022-09-16 13:09:04 -04:00
Any-Winter-4079	60b731e7ab	Update dream.py. k_euler_a and k_dpm_2_a M1 fix (#579 ) * Update dream.py. k_euler_a and k_dpm_2_a M1 fix Make results reproducible (so runs with the same seed produce the same result). Implements fix by @wbowling referenced in https://github.com/lstein/stable-diffusion/issues/397#issuecomment-1240679294 * Update dream.py. Remove import torch from dream.py * generate.py: k_euler_a and k_dpm_2_a M1 fix #579 Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>	2022-09-15 11:02:17 -04:00
Lincoln Stein	e6179af46a	Refactor generate.py and dream.py (#534 ) * revert inadvertent change of conda env name (#528) * Refactor generate.py and dream.py * config file path (models.yaml) is parsed inside Generate() to simplify API * Better handling of keyboard interrupts in file loading mode vs interactive * Removed oodles of unused variables. * move nonfunctional inpainting out of the scripts directory * fix ugly ddim tqdm formatting	2022-09-14 07:02:31 -04:00
Mihai	0bc6779361	Disable autocast for cpu to fix error. Remove unused precision arg. (#518 ) When running on just cpu (intel), a call to torch.layer_norm would error with RuntimeError: expected scalar type BFloat16 but found Float Fix buggy device handling in model.py. Tested with scripts/dream.py --full_precision on just cpu on intel laptop. Works but slow at ~10s/it.	2022-09-12 16:55:21 -04:00
Travco	dbf2c63c90	Add Embiggen automation to upscale-cut-img2img-stitch and achieve high res without extra VRAM (#437 ) * Add Embiggen automation * Make embiggen_tiles masking more intelligent and count from one (at least for the user), rewrite sections of Embiggen README, fix various typos throughout README * drop duplicate log message	2022-09-12 15:37:26 -04:00
Lincoln Stein	839e30e4b8	improve CUDA VRAM monitoring extra check that device==cuda before getting VRAM stats	2022-09-11 10:10:24 -04:00
Lincoln Stein	5c43988862	reduce VRAM memory usage by half during model loading * This moves the call to half() before model.to(device) to avoid GPU copy of full model. Improves speed and reduces memory usage dramatically * This fix contributed by @mh-dm (Mihai)	2022-09-10 10:02:43 -04:00
Lincoln Stein	723d074442	Allow ctrl c when using --from_file (#472 ) * added ansi escapes to highlight key parts of CLI session * adjust exception handling so that ^C will abort when reading prompts from a file	2022-09-09 18:49:51 -04:00
Lincoln Stein	10db192cc4	changes to dogettx optimizations to run on m1 * Author @any-winter-4079 * Author @dogettx Thanks to many individuals who contributed time and hardware to benchmarking and debugging these changes.	2022-09-09 09:51:41 -04:00
Lincoln Stein	7996a30e3a	add auto-creation of mask for inpainting (#438 ) * now use a single init image for both image and mask * turn on debugging for now to write out mask and image * add back -M option as a fallback	2022-09-08 07:34:03 -04:00
Lincoln Stein	dd2aedacaf	report VRAM usage stats during initial model loading (#419 )	2022-09-07 13:23:53 -04:00
Lincoln Stein	720e5cd651	Refactoring simplet2i (#387 ) * start refactoring -not yet functional * first phase of refactor done - not sure weighted prompts working * Second phase of refactoring. Everything mostly working. * The refactoring has moved all the hard-core inference work into ldm.dream.generator., where there are submodules for txt2img and img2img. inpaint will go in there as well. Some additional refactoring will be done soon, but relatively minor work. * fix -save_orig flag to actually work * add @neonsecret attention.py memory optimization * remove unneeded imports * move token logging into conditioning.py * add placeholder version of inpaint; porting in progress * fix crash in img2img * inpainting working; not tested on variations * fix crashes in img2img * ported attention.py memory optimization #117 from basujindal branch * added @torch_no_grad() decorators to img2img, txt2img, inpaint closures * Final commit prior to PR against development * fixup crash when generating intermediate images in web UI * rename ldm.simplet2i to ldm.generate * add backward-compatibility simplet2i shell with deprecation warning * add back in mps exception, addresses @vargol comment in #354 * replaced Conditioning class with exported functions * fix wrong type of with_variations attribute during intialization * changed "image_iterator()" to "get_make_image()" * raise NotImplementedError for calling get_make_image() in parent class * Update ldm/generate.py better error message Co-authored-by: Kevin Gibbons <bakkot@gmail.com> * minor stylistic fixes and assertion checks from code review * moved get_noise() method into img2img class * break get_noise() into two methods, one for txt2img and the other for img2img * inpainting works on non-square images now * make get_noise() an abstract method in base class * much improved inpainting Co-authored-by: Kevin Gibbons <bakkot@gmail.com>	2022-09-05 20:40:10 -04:00

26 Commits