Lincoln Stein
|
05a27bda5e
|
generalize model loading support, include loras/embeds
|
2023-05-06 15:58:44 -04:00 |
|
Lincoln Stein
|
e0214a32bc
|
mostly ported to new manager API; needs testing
|
2023-05-06 00:44:12 -04:00 |
|
Lincoln Stein
|
af8c7c7d29
|
model manager rewritten to use model_cache; API changed!
|
2023-05-05 19:32:28 -04:00 |
|
Lincoln Stein
|
a4e36bc02a
|
when model is forcibly moved into RAM update loaded_models set
|
2023-05-04 23:28:03 -04:00 |
|
Lincoln Stein
|
68bc0112fa
|
implement lazy GPU offloading and ref counting
|
2023-05-04 23:15:32 -04:00 |
|
Lincoln Stein
|
e1fed52c66
|
work on model cache and its regression test finished
|
2023-05-03 12:38:18 -04:00 |
|
Lincoln Stein
|
bb959448c1
|
implement hashing for local & remote models
|
2023-05-02 16:52:27 -04:00 |
|
Lincoln Stein
|
2e2abf6ea6
|
caching of subparts working
|
2023-05-01 22:57:30 -04:00 |
|
Lincoln Stein
|
956ad6bcf5
|
add redesigned model cache for diffusers & transformers
|
2023-04-28 00:41:52 -04:00 |
|