Add Textual inversion

Abdullah Barhoum 2022-08-31 18:17:44 +02:00
parent 4aaf38970a
commit 15952906a1
2 changed files with 21 additions and 7 deletions


@@ -4,6 +4,13 @@ Run Stable Diffusion on your machine with a nice UI without any hassle!
This repository packages the [WebUI](https://github.com/hlky/stable-diffusion-webui) with Docker for easy setup and deployment. Please note that this repo delivers all cutting-edge, unstable changes from the WebUI, so expect some bugs.
### Features
* Interactive UI with many features, and more on the way!
* Support for 6GB GPU cards.
* GFPGAN for face reconstruction, RealESRGAN for super-sampling.
* [Textual Inversion](https://github.com/hlky/sd-enable-textual-inversion)
* Many more!
## Setup
Make sure you have Docker installed and up to date. Download this repo and run:
@@ -15,8 +22,8 @@ docker compose build
You can let it build in the background while you download the different models:
- [Stable Diffusion v1.4 (4GB)](https://www.googleapis.com/storage/v1/b/aai-blog-files/o/sd-v1-4.ckpt?alt=media), rename to `model.ckpt`
- (Optional) [GFPGANv1.3.pth (333MB)](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth) to improve generated faces.
- (Optional) [RealESRGAN_x4plus.pth (64MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth) and [RealESRGAN_x4plus_anime_6B.pth (18MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.2.4/RealESRGAN_x4plus_anime_6B.pth) for super-sampling.
- (Optional) [GFPGANv1.3.pth (333MB)](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth).
- (Optional) [RealESRGAN_x4plus.pth (64MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth) and [RealESRGAN_x4plus_anime_6B.pth (18MB)](https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.2.4/RealESRGAN_x4plus_anime_6B.pth).
Put all of the downloaded files in the `models` folder. It should look something like this:
@@ -46,6 +53,12 @@ in the `docker-compose.yml` you can change the `CLI_ARGS` variable contains all
By default, `--extra-models-cpu --optimized-turbo` are passed, which allows the model to run on a 6GB GPU.
## FAQ
- To enable [Textual Inversion](https://github.com/hlky/sd-enable-textual-inversion), remove the `--optimized` and `--optimized-turbo` flags and add `--no-half`; [more info here](https://github.com/AbdBarho/stable-diffusion-webui-docker/issues/6).
- [The Application Freezes When Trying Batch Sizes Higher Than One](https://github.com/AbdBarho/stable-diffusion-webui-docker/issues/12)
- [Output is a green image](https://github.com/hlky/stable-diffusion-webui/issues/34)
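
As a rough sketch of the flag change for Textual Inversion, the snippet below starts from the default `CLI_ARGS` value quoted in this README, drops the optimization flags, and appends `--no-half`. It assumes `CLI_ARGS` is a plain space-separated string; the loop is only an illustration and is not part of this repo.

```shell
#!/bin/sh
# Default value as quoted in this README.
CLI_ARGS="--extra-models-cpu --optimized-turbo"

# Rebuild the argument string without the optimization flags.
# Both spellings of the flags are matched to be safe.
new_args=""
for arg in $CLI_ARGS; do
  case "$arg" in
    --optimize|--optimized|--optimize-turbo|--optimized-turbo)
      ;;  # skip: the FAQ entry says to remove these for Textual Inversion
    *)
      new_args="${new_args:+$new_args }$arg"
      ;;
  esac
done

# Append --no-half, as the FAQ entry suggests.
CLI_ARGS="${new_args:+$new_args }--no-half"
echo "$CLI_ARGS"
```

The printed string is what you would then set as `CLI_ARGS` in `docker-compose.yml`.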
# Disclaimer
The authors of this project are not responsible for any content generated using this interface.


@@ -9,7 +9,7 @@ RUN git clone https://github.com/hlky/stable-diffusion.git && cd stable-diffusio
RUN conda env update --file stable-diffusion/environment.yaml --name base && conda clean -a -y
# fonts for generating the grid
RUN apt-get update && apt install fonts-dejavu-core && apt-get clean
RUN apt-get update && apt install fonts-dejavu-core rsync -y && apt-get clean
# Note: don't update the sha of previous versions because the install will take forever
# instead, update the repo state in a later step
@@ -29,10 +29,11 @@ RUN apt-get update && apt install fonts-dejavu-core && apt-get clean
# EOF
# Textual-inversion:
# RUN <<EOF
# git clone https://github.com/hlky/sd-enable-textual-inversion.git
# cp -rf sd-enable-textual-inversion /stable-diffusion
# EOF
RUN <<EOF
git clone https://github.com/hlky/sd-enable-textual-inversion.git &&
cd /sd-enable-textual-inversion && git reset --hard 08f9b5046552d17cf7327b30a98410222741b070 &&
rsync -a /sd-enable-textual-inversion/ /stable-diffusion/
EOF
# add info
COPY info.py /info.py
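
Note on the switch from `cp -rf` to `rsync -a` in the Textual-inversion step: when the destination directory already exists, `cp -rf <dir> <dest>` places the source directory inside the destination, while `rsync -a <dir>/ <dest>/` (with trailing slashes) merges the contents over it. The sketch below reproduces the difference in a temporary directory. The directory and file names (`patch`, `app`, `ldm/util.py`, `webui.py`) are hypothetical stand-ins, not the real repo layout, and the `cp -a` fallback is only there so the sketch runs where `rsync` is not installed.

```shell
#!/bin/sh
# Hypothetical stand-ins: "patch" plays sd-enable-textual-inversion,
# "app" plays the existing /stable-diffusion checkout.
work=$(mktemp -d)
mkdir -p "$work/patch/ldm" "$work/app/ldm"
echo "patched"   > "$work/patch/ldm/util.py"
echo "original"  > "$work/app/ldm/util.py"
echo "untouched" > "$work/app/webui.py"

# Old (commented-out) approach: the destination exists, so cp nests
# the source directory inside it instead of overwriting files.
cp -r "$work/app" "$work/app_cp"
cp -rf "$work/patch" "$work/app_cp"
ls "$work/app_cp"          # lists ldm, patch, webui.py

# New approach: trailing slashes make rsync merge the contents
# of patch/ into the destination, overwriting matching files.
cp -r "$work/app" "$work/app_rsync"
if command -v rsync >/dev/null 2>&1; then
  rsync -a "$work/patch/" "$work/app_rsync/"
else
  # Fallback for this sketch only: "src/." also copies contents.
  cp -a "$work/patch/." "$work/app_rsync/"
fi
cat "$work/app_rsync/ldm/util.py"
```

With the merge variant, files that only exist in the destination (here `webui.py`) are left in place, which is presumably what the overlay repo expects.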