Add comment about incorrect T5 Tokenizer size calculation.

This commit is contained in:
Ryan Dick 2024-08-22 16:09:46 +00:00 committed by Brandon
parent d7c22b3bf7
commit 1c1f2c6664


@ -57,6 +57,9 @@ def calc_model_size_by_data(logger: logging.Logger, model: AnyModel) -> int:
T5Tokenizer,
),
):
# HACK(ryand): len(model) just returns the vocabulary size, so this is blatantly wrong. The tokenizer should be
# small relative to the text encoder that it's used with, so this shouldn't matter too much, but we should fix it at
# some point.
return len(model)
else:
# TODO(ryand): Promote this from a log to an exception once we are confident that we are handling all of the
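The issue flagged by the HACK comment can be illustrated with a minimal sketch. The `FakeTokenizer` class below is hypothetical, standing in for `T5Tokenizer` (whose `__len__` reports vocabulary size, not memory footprint); the byte estimate via `sys.getsizeof` is one possible rough fix, not the project's actual approach:

```python
import sys


class FakeTokenizer:
    """Hypothetical stand-in for T5Tokenizer: len() reports vocabulary size."""

    def __init__(self, vocab: dict[str, int]):
        self.vocab = vocab

    def __len__(self) -> int:
        # Mirrors transformers tokenizers: __len__ is the vocab size,
        # not the in-memory footprint in bytes.
        return len(self.vocab)


# T5's base vocabulary is 32,100 tokens.
tok = FakeTokenizer({f"token_{i}": i for i in range(32100)})

# What calc_model_size_by_data currently returns for a tokenizer:
naive_size = len(tok)  # a token count, not a byte count

# One rough byte estimate (assumption: sum container plus entry sizes):
rough_bytes = sys.getsizeof(tok.vocab) + sum(
    sys.getsizeof(k) + sys.getsizeof(v) for k, v in tok.vocab.items()
)
```

Even this crude estimate is far larger than the vocab-size value, which shows why `len(model)` undercounts; in practice the tokenizer remains small next to the text encoder, so the error's impact is limited, as the comment notes.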