InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
Ryan Dick	eeabb7ebe5	Make quantized loading fast for both T5XXL and FLUX transformer.	2024-08-26 20:17:50 -04:00
Ryan Dick	3cf0365a35	Make float16 inference work with FLUX on 24GB GPU.	2024-08-26 20:17:50 -04:00
Ryan Dick	5870742bb9	Add support for 8-bit quantizatino of the FLUX T5XXL text encoder.	2024-08-26 20:17:50 -04:00
Ryan Dick	01d8c62c57	Make 8-bit quantization save/reload work for the FLUX transformer. Reload is still very slow with the current optimum.quanto implementation.	2024-08-26 20:17:50 -04:00
Ryan Dick	55a242b2d6	Minor improvements to FLUX workflow.	2024-08-26 20:17:50 -04:00
Ryan Dick	45263b339f	Got FLUX schnell working with 8-bit quantization. Still lots of rough edges to clean up.	2024-08-26 20:17:50 -04:00
Ryan Dick	3319491861	Use the FluxPipeline.encode_prompt() api rather than trying to run the two text encoders separately.	2024-08-26 20:17:50 -04:00
Ryan Dick	b39031ea53	First draft of FluxTextToImageInvocation.	2024-08-26 20:17:50 -04:00