Add Textual inversion

2022-08-31 18:17:44 +02:00
parent 4aaf38970a
commit 15952906a1
2 changed files with 21 additions and 7 deletions
--- a/README.md
+++ b/README.md
@ -4,6 +4,13 @@ Run Stable Diffusion on your machine with a nice UI without any hassle!

 This repository provides the [WebUI](https://github.com/hlky/stable-diffusion-webui) as docker for easy setup and deployment. Please note that this repo delivers all cutting-edge unstable changes from the WebUI, so expect some bugs.

+### Features
+* Interactive UI with many features, and more on the way!
+* Support for 6GB GPU cards.
+* GFPGAN for face reconstruction, RealESRGAN for super-sampling.
+* [Textual Inversion](https://github.com/hlky/sd-enable-textual-inversion)
+* many more!
+
 ## Setup

 make sure you have docker installed and up to date. Download this repo and run:
@ -15,8 +22,8 @@ docker compose build
 you can let it build in the background while you download the different models

 - [Stable Diffusion v1.4 (4GB)](https://www.googleapis.com/storage/v1/b/aai-blog-files/o/sd-v1-4.ckpt?alt=media), rename to `model.ckpt`
- (Optional) [GFPGANv1.3.pth (333MB)](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth) to improve generated faces.
- (Optional) [RealESRGAN_x4plus.pth (64MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth) and [RealESRGAN_x4plus_anime_6B.pth (18MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.2.4/RealESRGAN_x4plus_anime_6B.pth) for super-sampling.
+- (Optional) [GFPGANv1.3.pth (333MB)](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth).
+- (Optional) [RealESRGAN_x4plus.pth (64MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth) and [RealESRGAN_x4plus_anime_6B.pth (18MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.2.4/RealESRGAN_x4plus_anime_6B.pth).

 Put all of the downloaded files in the `models` folder, it should look something like this:

@ -46,6 +53,12 @@ in the `docker-compose.yml` you can change the `CLI_ARGS` variable contains all

 By default: `--extra-models-cpu --optimized-turbo` are given, which allow you to use this model on a 6GB GPU.

+## FAQ
+
+- Enable [Textual Inversion](https://github.com/hlky/sd-enable-textual-inversion) remove `--optimize` and `--optimize-turbo` flags and add `--no-half`, [more info here](https://github.com/AbdBarho/stable-diffusion-webui-docker/issues/6).
+- [The Application Freezes When Trying Batch Sizes Higher Than One](https://github.com/AbdBarho/stable-diffusion-webui-docker/issues/12)
+- [Output is a green imagee](https://github.com/hlky/stable-diffusion-webui/issues/34)
+
 # Disclaimer

 The authors of this project are not responsible for any content generated using this interface.
--- a/build/Dockerfile
+++ b/build/Dockerfile
@ -9,7 +9,7 @@ RUN git clone https://github.com/hlky/stable-diffusion.git && cd stable-diffusio
 RUN conda env update --file stable-diffusion/environment.yaml --name base && conda clean -a -y

 # fonts for generating the grid
-RUN apt-get update && apt install fonts-dejavu-core && apt-get clean
+RUN apt-get update && apt install fonts-dejavu-core rsync -y && apt-get clean

 # Note: don't update the sha of previous versions because the install will take forever
 # instead, update the repo state in a later step
@ -29,10 +29,11 @@ RUN apt-get update && apt install fonts-dejavu-core && apt-get clean
 # EOF

 # Textual-inversion:
-# RUN <<EOF
-# git clone https://github.com/hlky/sd-enable-textual-inversion.git
-# cp -rf sd-enable-textual-inversion /stable-diffusion
-# EOF
+RUN <<EOF
+git clone https://github.com/hlky/sd-enable-textual-inversion.git &&
+cd /sd-enable-textual-inversion && git reset --hard 08f9b5046552d17cf7327b30a98410222741b070 &&
+rsync -a /sd-enable-textual-inversion/ /stable-diffusion/
+EOF

 # add info
 COPY info.py /info.py