From 15952906a1dfb9189aa0f51b8216ada6d869e61a Mon Sep 17 00:00:00 2001 From: Abdullah Barhoum Date: Wed, 31 Aug 2022 18:17:44 +0200 Subject: [PATCH] Add Textual inversion --- README.md | 17 +++++++++++++++-- build/Dockerfile | 11 ++++++----- 2 files changed, 21 insertions(+), 7 deletions(-) diff --git a/README.md b/README.md index 53cf5fd..67376d5 100644 --- a/README.md +++ b/README.md @@ -4,6 +4,13 @@ Run Stable Diffusion on your machine with a nice UI without any hassle! This repository provides the [WebUI](https://github.com/hlky/stable-diffusion-webui) as docker for easy setup and deployment. Please note that this repo delivers all cutting-edge unstable changes from the WebUI, so expect some bugs. +### Features +* Interactive UI with many features, and more on the way! +* Support for 6GB GPU cards. +* GFPGAN for face reconstruction, RealESRGAN for super-sampling. +* [Textual Inversion](https://github.com/hlky/sd-enable-textual-inversion) +* many more! + ## Setup make sure you have docker installed and up to date. Download this repo and run: @@ -15,8 +22,8 @@ docker compose build you can let it build in the background while you download the different models - [Stable Diffusion v1.4 (4GB)](https://www.googleapis.com/storage/v1/b/aai-blog-files/o/sd-v1-4.ckpt?alt=media), rename to `model.ckpt` -- (Optional) [GFPGANv1.3.pth (333MB)](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth) to improve generated faces. -- (Optional) [RealESRGAN_x4plus.pth (64MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth) and [RealESRGAN_x4plus_anime_6B.pth (18MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.2.4/RealESRGAN_x4plus_anime_6B.pth) for super-sampling. +- (Optional) [GFPGANv1.3.pth (333MB)](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth). +- (Optional) [RealESRGAN_x4plus.pth (64MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth) and [RealESRGAN_x4plus_anime_6B.pth (18MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.2.4/RealESRGAN_x4plus_anime_6B.pth). Put all of the downloaded files in the `models` folder, it should look something like this: @@ -46,6 +53,12 @@ in the `docker-compose.yml` you can change the `CLI_ARGS` variable contains all By default: `--extra-models-cpu --optimized-turbo` are given, which allow you to use this model on a 6GB GPU. +## FAQ + +- Enable [Textual Inversion](https://github.com/hlky/sd-enable-textual-inversion) remove `--optimize` and `--optimize-turbo` flags and add `--no-half`, [more info here](https://github.com/AbdBarho/stable-diffusion-webui-docker/issues/6). +- [The Application Freezes When Trying Batch Sizes Higher Than One](https://github.com/AbdBarho/stable-diffusion-webui-docker/issues/12) +- [Output is a green imagee](https://github.com/hlky/stable-diffusion-webui/issues/34) + # Disclaimer The authors of this project are not responsible for any content generated using this interface. diff --git a/build/Dockerfile b/build/Dockerfile index c6d82b7..2e6fbf5 100644 --- a/build/Dockerfile +++ b/build/Dockerfile @@ -9,7 +9,7 @@ RUN git clone https://github.com/hlky/stable-diffusion.git && cd stable-diffusio RUN conda env update --file stable-diffusion/environment.yaml --name base && conda clean -a -y # fonts for generating the grid -RUN apt-get update && apt install fonts-dejavu-core && apt-get clean +RUN apt-get update && apt install fonts-dejavu-core rsync -y && apt-get clean # Note: don't update the sha of previous versions because the install will take forever # instead, update the repo state in a later step @@ -29,10 +29,11 @@ RUN apt-get update && apt install fonts-dejavu-core && apt-get clean # EOF # Textual-inversion: -# RUN <