maryhipp
|
09d1f75fe9
|
add FLUX schnell starter models and submodels as dependenices or adhoc download options
|
2024-08-21 14:27:35 -04:00 |
|
maryhipp
|
c095af65fb
|
add case for clip embed models in probe
|
2024-08-21 14:27:35 -04:00 |
|
maryhipp
|
2c72295b1c
|
update flux_model_loader node to take a T5 encoder from node field instead of hardcoded list, assume all models have been downloaded
|
2024-08-21 14:27:34 -04:00 |
|
Ryan Dick
|
e41025ddc7
|
Move requantize.py to the quatnization/ dir.
|
2024-08-21 18:21:44 +00:00 |
|
Ryan Dick
|
38c2e7801f
|
Add docs to the requantize(...) function explaining why it was copied from optimum-quanto.
|
2024-08-21 18:19:47 +00:00 |
|
Ryan Dick
|
d11dc6ddd0
|
Remove duplicate log_time(...) function.
|
2024-08-21 18:10:24 +00:00 |
|
Brandon Rising
|
8b0b496c2d
|
More flux loader cleanup
|
2024-08-21 12:37:25 -04:00 |
|
Brandon Rising
|
ada483f65e
|
Various styling and exception type updates
|
2024-08-21 11:59:04 -04:00 |
|
Brandon Rising
|
0913d062d8
|
Switch inheritance class of flux model loaders
|
2024-08-21 11:30:16 -04:00 |
|
Brandon Rising
|
d4872253a1
|
Update doc string for import_local_model and remove access_token since it's only usable for local file paths
|
2024-08-21 11:18:07 -04:00 |
|
Ryan Dick
|
e680cf76f6
|
Address minor review comments.
|
2024-08-21 13:45:22 +00:00 |
|
Ryan Dick
|
253b2b1dc6
|
Rename t5Encoder -> t5_encoder.
|
2024-08-21 13:27:54 +00:00 |
|
Mary Hipp
|
5edec7f105
|
add default workflow for flux t2i
|
2024-08-21 09:11:17 -04:00 |
|
Mary Hipp
|
ae9a1549ae
|
exclude flux models from main model dropdown
|
2024-08-21 09:11:17 -04:00 |
|
Brandon Rising
|
c819da8859
|
Some cleanup of the tags and description of flux nodes
|
2024-08-21 09:11:15 -04:00 |
|
Brandon Rising
|
e19f079cb2
|
Add t5 encoders and clip embeds to the model manager
|
2024-08-21 09:10:53 -04:00 |
|
Brandon Rising
|
dd24f83d43
|
Fix styling/lint
|
2024-08-21 09:10:22 -04:00 |
|
Brandon Rising
|
da766f5a7e
|
Fix support for 8b quantized t5 encoders, update exception messages in flux loaders
|
2024-08-21 09:10:22 -04:00 |
|
Ryan Dick
|
120e1cf1e9
|
Add tqdm progress bar to FLUX denoising.
|
2024-08-21 09:10:22 -04:00 |
|
Ryan Dick
|
5e2351f3bf
|
Fix FLUX output image clamping. And a few other minor fixes to make inference work with the full bfloat16 FLUX transformer model.
|
2024-08-21 09:10:22 -04:00 |
|
Brandon Rising
|
d705c3cf0e
|
Select dev/schnell based on state dict, use correct max seq len based on dev/schnell, and shift in inference, separate vae flux params into separate config
|
2024-08-21 09:10:20 -04:00 |
|
Brandon Rising
|
115f350f6f
|
Install sub directories with folders correctly, ensure consistent dtype of tensors in flux pipeline and vae
|
2024-08-21 09:09:39 -04:00 |
|
Brandon Rising
|
be6cb2c07c
|
Working inference node with quantized bnb nf4 checkpoint
|
2024-08-21 09:09:39 -04:00 |
|
Brandon Rising
|
4fb5529493
|
Remove unused param on _run_vae_decoding in flux text to image
|
2024-08-21 09:09:39 -04:00 |
|
Brandon Rising
|
b43ee0b837
|
Add nf4 bnb quantized format
|
2024-08-21 09:09:39 -04:00 |
|
Brandon Rising
|
3312fe8fc4
|
Run ruff, setup initial text to image node
|
2024-08-21 09:09:39 -04:00 |
|
Brandon Rising
|
01a2449dae
|
Add backend functions and classes for Flux implementation, Update the way flux encoders/tokenizers are loaded for prompt encoding, Update way flux vae is loaded
|
2024-08-21 09:09:37 -04:00 |
|
Brandon Rising
|
cfe9d0ce0a
|
Some UI cleanup, regenerate schema
|
2024-08-21 09:08:22 -04:00 |
|
Brandon Rising
|
46b6314482
|
Run Ruff
|
2024-08-21 09:06:38 -04:00 |
|
Brandon Rising
|
46d5107ff1
|
Run Ruff
|
2024-08-21 09:06:38 -04:00 |
|
Brandon Rising
|
6ea1278d22
|
Manage quantization of models within the loader
|
2024-08-21 09:06:34 -04:00 |
|
Brandon Rising
|
4556b57382
|
Remove changes to v1 workflow
|
2024-08-21 09:04:40 -04:00 |
|
Brandon Rising
|
f425d3aa3c
|
Setup flux model loading in the UI
|
2024-08-21 09:04:37 -04:00 |
|
Ryan Dick
|
d7a39a4d67
|
WIP on moving from diffusers to FLUX
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
3e8a550fab
|
More improvements for LLM.int8() - not fully tested.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
0e96794c6e
|
LLM.int8() quantization is working, but still some rough edges to solve.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
23a7328a66
|
Clean up NF4 implementation.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
c3cf8c3b6b
|
NF4 inference working
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
110d58d107
|
NF4 loading working... I think.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
3480e06688
|
wip
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
3ba60e1656
|
Split a FluxTextEncoderInvocation out from the FluxTextToImageInvocation. This has the advantage that we benfit from automatic caching when the prompt isn't changed.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
cdd47b657b
|
Make quantized loading fast for both T5XXL and FLUX transformer.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
68c712d254
|
Make quantized loading fast.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
44d7a74b88
|
WIP - experimentation
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
e8fb8f4d12
|
Make float16 inference work with FLUX on 24GB GPU.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
9381211508
|
Add support for 8-bit quantizatino of the FLUX T5XXL text encoder.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
8cce4a40d4
|
Make 8-bit quantization save/reload work for the FLUX transformer. Reload is still very slow with the current optimum.quanto implementation.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
4833746698
|
Minor improvements to FLUX workflow.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
8b9bf55bba
|
Got FLUX schnell working with 8-bit quantization. Still lots of rough edges to clean up.
|
2024-08-21 08:59:19 -04:00 |
|
Ryan Dick
|
7b199fed4f
|
Use the FluxPipeline.encode_prompt() api rather than trying to run the two text encoders separately.
|
2024-08-21 08:59:18 -04:00 |
|