mirror of
https://github.com/invoke-ai/InvokeAI
synced 2024-08-30 20:32:17 +00:00
* Squashed commit of the following: commit 82d9c25d9a87ea5f6fbdb0445313e48cd8ae3035 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 19:29:11 2022 +0200 fix branch name in mkdocs-flow commit 2e276cecc1942050ae8b22f4a417c576c49763a1 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 19:28:35 2022 +0200 fix theme name commit 2eb77c1173598e161e8a5a9812b021b831363183 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 19:14:42 2022 +0200 fixed some links and formating in main README commit 66a7152e48e8c66c9286f47a6b088920a8fcdcd8 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 08:58:58 2022 +0200 trigger mkdocs deployment on main commit 897cc373ce4f9d18d8cb2837b5931c4b674c836b Merge: 89da371 3b5a830 Author: Matthias Wild <40327258+mauwii@users.noreply.github.com> Date: Wed Sep 14 07:51:23 2022 +0200 Merge pull request #1 from mauwii/mkdocs Mkdocs commit 3b5a8308eb6b7ee572ff07f78829f64bcf55b4d9 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 07:42:56 2022 +0200 huge update I was pretty busy trying to make the Readmes / docs look good in MkDocs commit 0b4f5a926ff8aad1460e0be8093ee02d719a576d Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 07:41:45 2022 +0200 update mkdocs config commit 872172ea7008aa1217c1259c7ecab16ddafd1a0e Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 07:33:49 2022 +0200 added the mkdocs-git-revision-date-plugin commit eac81bf87588ec88b65f3415aa838c330fbd621b Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 06:46:43 2022 +0200 add prettier config remove markdownlint move and rename requirements-mkdocs.txt commit b36d4cc088a49af56026aa2dd2e85d6bfb181bd1 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 02:06:39 2022 +0200 add dark theme commit a14f18fede6febccf1c117cba50a1aff1eac80a9 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 01:38:02 2022 +0200 update mkdocs flow and config commit 2764b48693ed382bc01a2cb7aab282dbf95cd9ee Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 01:15:33 2022 +0200 add mkdocs workflow commit 1bd22523b1a52ac24bc5f950debb450f903c2d51 Author: mauwii <Mauwii@outlook.de> Date: Wed Sep 14 00:57:37 2022 +0200 I already begun with formating / revising the sites * change repository in mkdocs config to lstein * adapt changes from repos main README.md Co-authored-by: Lincoln Stein <lincoln.stein@gmail.com>
92 lines
2.9 KiB
Markdown
92 lines
2.9 KiB
Markdown
---
|
|
title: TEXTUAL_INVERSION
|
|
---
|
|
|
|
## **Personalizing Text-to-Image Generation**
|
|
|
|
You may personalize the generated images to provide your own styles or objects
|
|
by training a new LDM checkpoint and introducing a new vocabulary to the fixed
|
|
model as a (.pt) embeddings file. Alternatively, you may use or train
|
|
HuggingFace Concepts embeddings files (.bin) from
|
|
<https://huggingface.co/sd-concepts-library> and its associated notebooks.
|
|
|
|
## **Training**
|
|
|
|
To train, prepare a folder that contains images sized at 512x512 and execute the
|
|
following:
|
|
|
|
### WINDOWS
|
|
|
|
As the default backend is not available on Windows, if you're using that
|
|
platform, set the environment variable `PL_TORCH_DISTRIBUTED_BACKEND` to `gloo`
|
|
|
|
```bash
|
|
python3 ./main.py --base ./configs/stable-diffusion/v1-finetune.yaml \
|
|
--actual_resume ./models/ldm/stable-diffusion-v1/model.ckpt \
|
|
-t \
|
|
-n my_cat \
|
|
--gpus 0 \
|
|
--data_root D:/textual-inversion/my_cat \
|
|
--init_word 'cat'
|
|
```
|
|
|
|
During the training process, files will be created in
|
|
`/logs/[project][time][project]/` where you can see the process.
|
|
|
|
Conditioning contains the training prompts inputs, reconstruction the input
|
|
images for the training epoch samples, samples scaled for a sample of the prompt
|
|
and one with the init word provided.
|
|
|
|
On a RTX3090, the process for SD will take ~1h @1.6 iterations/sec.
|
|
|
|
!!! Info _Note_
|
|
|
|
According to the associated paper, the optimal number of
|
|
images is 3-5. Your model may not converge if you use more images than
|
|
that.
|
|
|
|
Training will run indefinitely, but you may wish to stop it (with ctrl-c) before
|
|
the heat death of the universe, when you find a low loss epoch or around ~5000
|
|
iterations. Note that you can set a fixed limit on the number of training steps
|
|
by decreasing the "max_steps" option in
|
|
configs/stable_diffusion/v1-finetune.yaml (currently set to 4000000)
|
|
|
|
## **Run the Model**
|
|
|
|
Once the model is trained, specify the trained .pt or .bin file when starting
|
|
dream using
|
|
|
|
```bash
|
|
python3 ./scripts/dream.py \
|
|
--embedding_path /path/to/embedding.pt \
|
|
--full_precision
|
|
```
|
|
|
|
Then, to utilize your subject at the dream prompt
|
|
|
|
```bash
|
|
dream> "a photo of *"
|
|
```
|
|
|
|
This also works with image2image
|
|
|
|
```bash
|
|
dream> "waterfall and rainbow in the style of *" --init_img=./init-images/crude_drawing.png --strength=0.5 -s100 -n4
|
|
```
|
|
|
|
For .pt files it's also possible to train multiple tokens (modify the
|
|
placeholder string in `configs/stable-diffusion/v1-finetune.yaml`) and combine
|
|
LDM checkpoints using:
|
|
|
|
```bash
|
|
python3 ./scripts/merge_embeddings.py \
|
|
--manager_ckpts /path/to/first/embedding.pt \
|
|
[</path/to/second/embedding.pt>,[...]] \
|
|
--output_path /path/to/output/embedding.pt
|
|
```
|
|
|
|
Credit goes to rinongal and the repository
|
|
|
|
Please see [the repository](https://github.com/rinongal/textual_inversion) and
|
|
associated paper for details and limitations.
|