InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2025-07-26 05:17:55 +00:00

Author	SHA1	Message	Date
Emmanuel Ferdman	c80ad90f72	Migrate to modern logger interface Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>	2025-06-13 13:07:09 +10:00
Lucian Hardy	a4cddfa47d	feat(ui): model relationship management Adds full support for managing model-to-model relationships in the UI and backend. Introduces RelatedModels subpanel for linking and unlinking models in model management. - Adds REST API routes for adding, removing, and retrieving model relationships. - New database migration: creates model_relationships table for bidirectional links. - New service layer (model_relationships) for relationship management. - Updated frontend: Related models float to top of LoRA/Main grouped model comboboxes for quick access. - Added 'Show Only Related' toggle badge to MainModelPicker filter bar Amended commit to remove changes to ParamMainModelSelect.tsx and MainModelPicker.tsx to avoid conflict with upstream deletion/ rewrite	2025-05-19 10:29:07 +10:00
Ryan Dick	1e2c7c51b5	Move load_custom_nodes() to run_app() entrypoint.	2025-02-28 20:54:26 +00:00
Ryan Dick	68d14de3ee	Split run_app.py and api_app.py so that api_app.py is more narrowly responsible for just initializing the FastAPI app. This also gives clearer control over the order of the initialization steps, which will be important as we add planned torch configurations that must be applied before torch is imported.	2025-02-28 20:10:24 +00:00
Ryan Dick	38991ffc35	Add register_mime_types() startup util.	2025-02-28 20:10:24 +00:00
Ryan Dick	f345c0fabc	Create an apply_monkeypatches() start util.	2025-02-28 20:10:24 +00:00
Ryan Dick	ca23b5337e	Simplify port selection logic to avoid the need for a global port variable.	2025-02-28 20:10:19 +00:00
Ryan Dick	35910d3952	Move check_cudnn() and jurigged setup to startup_utils.py.	2025-02-28 20:08:53 +00:00
Ryan Dick	6f1dcf385b	Move find_port() util to its own file.	2025-02-28 20:08:53 +00:00
psychedelicious	d40f2fa37c	feat(app): improved custom load loading ordering Previously, custom node loading occurred _during module imports_. A consequence of this is that when a custom node import fails (e.g. its type clobbers an existing node), the app fails to start up. In fact, any time we import basically anything from the app, we trigger custom node imports! Not good. This logic is now in its own function, called as the API app starts up. If a custom node load fails for any reason, it no longer prevents the app from starting up. One other bonus we get from this is that we can now ensure custom nodes are loaded _after_ core nodes. Any clobbering that may occur while loading custom nodes is now guaranteed to be a custom node clobbering a core node's type - and not the other way round.	2025-02-27 12:39:37 +11:00
psychedelicious	858bf9cf8c	feat(api): less verbose uvicorn logs Uvicorn's logging is rather verbose. This change adds a `log_level_network` config setting to independently control uvicorn's log outputs. The setting defaults to warning. The change hides the helpful startup message that says the host and port we are running on. For example: `Uvicorn running on http://0.0.0.0:9090 (Press CTRL+C to quit` The ASGI lifespan handler is updated to log an equivalent message on startup, regardless of log level settings. Besides being helpful, the launcher relies on a message like this to launch the app. So, previously, if the user set their log level to anything above info (e.g. warning or error), the launcher would fail to open the app. This change prevents that edge case.	2024-12-20 09:19:04 +11:00
psychedelicious	21017edcde	fix(api): UI crash with `TypeError: i.map is not a function` This pops up every now and then and I could never figure it out. A user figured it out in #6936. The cause is appending a query string to the app URL. For example: ```sh http://127.0.0.1:9090/?__theme=dark ``` The query string breaking the static file serving, which prevents our translations from loading correctly. Instead of the JSON translations, FastAPI sends the index HTML page. The UI then errors when attempting to parse the translation JSON. The query string ?__theme=dark is used by Gradio to force dark mode. I believe the users with this issue are doing the same thing the user in #6936 did (just change the port number on an existing bookmark) or their browser history/bookmark includes the query string. Though this is technically a user-caused problem (we cannot prevent the user from using a malformed URL), we can work around it. When query string is used on the root path, we can redirect the browser to the root path without the query string. This is done via very simple middleware. Closes #6696 Closes #6817 Closes #6828 Closes #6936 Closes #6983	2024-09-30 13:15:57 +10:00
Mary Hipp	9c732ac3b1	Merge remote-tracking branch 'origin/main' into maryhipp/style-presets	2024-08-12 14:53:45 -04:00
psychedelicious	29325a7214	fix(app): use asyncio queue and existing event loop for events Around the time we (I) implemented pydantic events, I noticed a short pause between progress images every 4 or 5 steps when generating with SDXL. It didn't happen with SD1.5, but I did notice that with SD1.5, we'd get 4 or 5 progress events simultaneously. I'd expect one event every ~25ms, matching my it/s with SD1.5. Mysterious! Digging in, I found an issue is related to our use of a synchronous queue for events. When the event queue is empty, we must call `asyncio.sleep` before checking again. We were sleeping for 100ms. Said another way, every time we clear the event queue, we have to wait 100ms before another event can be dispatched, even if it is put on the queue immediately after we start waiting. In practice, this means our events get buffered into batches, dispatched once every 100ms. This explains why I was getting batches of 4 or 5 SD1.5 progress events at once, but not the intermittent SDXL delay. But this 100ms wait has another effect when the events are put on the queue in intervals that don't perfectly line up with the 100ms wait. This is most noticeable when the time between events is >100ms, and can add up to 100ms delay before the event is dispatched. For example, say the queue is empty and we start a 100ms wait. Then, immediately after - like 0.01ms later - we push an event on to the queue. We still need to wait another 99.9ms before that event will be dispatched. That's the SDXL delay. The easy fix is to reduce the sleep to something like 0.01 seconds, but this feels kinda dirty. Can't we just wait on the queue and dispatch every event immediately? Not with the normal synchronous queue - but we can with `asyncio.Queue`. I switched the events queue to use `asyncio.Queue` (as seen in this commit), which lets us asynchronous wait on the queue in a loop. Unfortunately, I ran into another issue - events now felt like their timing was inconsistent, but in a different way than with the 100ms sleep. The time between pushing events on the queue and dispatching them was not consistently ~0ms as I'd expect - it was highly variable from ~0ms up to ~100ms. This is resolved by passing the asyncio loop directly into the events service and using its methods to create the task and interact with the queue. I don't fully understand why this resolved the issue, because either way we are interacting with the same event loop (as shown by `asyncio.get_running_loop()`). I suppose there's some scheduling magic happening.	2024-08-12 07:49:58 +10:00
Mary Hipp	581029ebaa	ruff	2024-08-08 14:21:37 -04:00
Mary Hipp	217fe40d99	feat(api): add style_presets router, make sure all CRUD is working, add is_default	2024-08-02 12:29:54 -04:00
psychedelicious@windows	2c1a91241e	fix(app): windows indefinite hang while finding port For some reason, I started getting this indefinite hang when the app checks if port 9090 is available. After some fiddling around, I found that adding a timeout resolves the issue. I confirmed that the util still works by starting the app on 9090, then starting a second instance. The second instance correctly saw 9090 in use and moved to 9091.	2024-07-13 14:46:41 +10:00
Ryan Dick	1d449097cc	Apply ruff rule to disallow all relative imports.	2024-07-04 09:35:37 -04:00
Ryan Dick	9da5925287	Add ruff rule to disallow relative parent imports.	2024-07-04 09:35:37 -04:00
psychedelicious	2f9ebdec69	fix(app): openapi schema generation Some tech debt related to dynamic pydantic schemas for invocations became problematic. Including the invocations and results in the event schemas was breaking pydantic's handling of ref schemas. I don't really understand why - I think it's a pydantic bug in a remote edge case that we are hitting. After many failed attempts I landed on this implementation, which is actually much tidier than what was in there before. - Create pydantic-enabled types for `AnyInvocation` and `AnyInvocationOutput` and use these in place of the janky dynamic unions. Actually, they are kinda the same, but better encapsulated. Use these in `Graph`, `GraphExecutionState`, `InvocationEventBase` and `InvocationCompleteEvent`. - Revise the custom openapi function to work with the new models. - Split out the custom openapi function to a separate file. Add a `post_transform` callback so consumers can customize the output schema. - Update makefile scripts.	2024-05-30 12:03:03 +10:00
psychedelicious	f82df2661a	docs: clarify comment in api_app	2024-05-27 09:06:02 +10:00
psychedelicious	b3a051250f	feat(api): sort socket event names for openapi schema Deterministic ordering prevents extraneous, non-functional changes to the autogenerated types	2024-05-27 09:06:02 +10:00
psychedelicious	d97186dfc8	feat(events): remove payload registry, add method to get event classes We don't need to use the payload schema registry. All our events are dispatched as pydantic models, which are already validated on instantiation. We do want to add all events to the OpenAPI schema, and we referred to the payload schema registry for this. To get all events, add a simple helper to EventBase. This is functionally identical to using the schema registry.	2024-05-27 09:06:02 +10:00
psychedelicious	9bd78823a3	refactor(events): use pydantic schemas for events Our events handling and implementation has a couple pain points: - Adding or removing data from event payloads requires changes wherever the events are dispatched from. - We have no type safety for events and need to rely on string matching and dict access when interacting with events. - Frontend types for socket events must be manually typed. This has caused several bugs. `fastapi-events` has a neat feature where you can create a pydantic model as an event payload, give it an `__event_name__` attr, and then dispatch the model directly. This allows us to eliminate a layer of indirection and some unpleasant complexity: - Event handler callbacks get type hints for their event payloads, and can use `isinstance` on them if needed. - Event payload construction is now the responsibility of the event itself (a pydantic model), not the service. Every event model has a `build` class method, encapsulating this logic. The build methods are provided as few args as possible. For example, `InvocationStartedEvent.build()` gets the invocation instance and queue item, and can choose the data it wants to include in the event payload. - Frontend event types may be autogenerated from the OpenAPI schema. We use the payload registry feature of `fastapi-events` to collect all payload models into one place, making it trivial to keep our schema and frontend types in sync. This commit moves the backend over to this improved event handling setup.	2024-05-27 09:06:02 +10:00
psychedelicious	18b0977a31	feat(api): add InvocationOutputMap to OpenAPI schema This dynamically generated schema object maps node types to their pydantic schemas. This makes it much simpler to infer node types in the UI.	2024-05-15 14:09:44 +10:00
Lincoln Stein	e93f4d632d	[util] Add generic torch device class (#6174 ) * introduce new abstraction layer for GPU devices * add unit test for device abstraction * fix ruff * convert TorchDeviceSelect into a stateless class * move logic to select context-specific execution device into context API * add mock hardware environments to pytest * remove dangling mocker fixture * fix unit test for running on non-CUDA systems * remove unimplemented get_execution_device() call * remove autocast precision * Multiple changes: 1. Remove TorchDeviceSelect.get_execution_device(), as well as calls to context.models.get_execution_device(). 2. Rename TorchDeviceSelect to TorchDevice 3. Added back the legacy public API defined in `invocation_api`, including choose_precision(). 4. Added a config file migration script to accommodate removal of precision=autocast. * add deprecation warnings to choose_torch_device() and choose_precision() * fix test crash * remove app_config argument from choose_torch_device() and choose_torch_dtype() --------- Co-authored-by: Lincoln Stein <lstein@gmail.com>	2024-04-15 13:12:49 +00:00
Ryan Dick	86d536755d	Check for cuDNN version compatibility issues on startup. Prior to this check, the app would silently run with ~50% performance degradation caused by a cuDNN version mismatch.	2024-03-28 07:32:06 +11:00
psychedelicious	a291a42abc	feat: display torch device on startup This functionality disappeared at some point.	2024-03-27 08:16:27 -04:00
psychedelicious	6af6673a4f	feat: move all config-related initialization to app HF login, legacy yaml confs, and default init file are all handled during app setup. All directories are created as they are needed by the app. No need to check for a valid root dir - we will make it if it doesn't exist.	2024-03-20 15:05:25 +11:00
psychedelicious	1cb1b60b4c	tidy: "check_root.py" -> "check_directories.py"	2024-03-19 09:24:28 +11:00
psychedelicious	1d4517d00d	tidy: "validate_root" -> "validate_directories"	2024-03-19 09:24:28 +11:00
psychedelicious	ce9aeeece3	feat: single app entrypoint with CLI arg parsing We have two problems with how argparse is being utilized: - We parse CLI args as the `api_app.py` file is read. This causes a problem pytest, which has an incompatible set of CLI args. Some tests import the FastAPI app, which triggers the config to parse CLI args, which receives the pytest args and fails. - We've repeatedly had problems when something that uses the config is imported before the CLI args are parsed. When this happens, the root dir may not be set correctly, so we attempt to operate on incorrect paths. To resolve these issues, we need to lift CLI arg parsing outside of the application code, but still let the application access the CLI args. We can create a external app entrypoint to do this. - `InvokeAIArgs` is a simple helper class that parses CLI args and stores the result. - `run_app()` is the new entrypoint. It first parses CLI args, then runs `invoke_api` to start the app. The `invokeai-web` project script and `invokeai-web.py` dev script now call `run_app()` instead of `invoke_api()`. The first time `get_config()` is called to get the singleton config object, it retrieves the args from `InvokeAIArgs`, sets the root dir if provided, then merges settings in from `invokeai.yaml`. CLI arg parsing is now safely insulated from application code, but still accessible. And we don't need to worry about import order having an impact on anything, because by the time the app is running, we have already parsed CLI args. Whew!	2024-03-19 09:24:28 +11:00
psychedelicious	f69938c6a8	fix(config): revised config methods - `write_file` requires an destination file path - `read_config` -> `merge_from_file`, if no path is provided, reads from `self.init_file_path` - update app, tests to use new methods - fix configurator, was overwriting config file data unexpectedly	2024-03-19 09:24:28 +11:00
psychedelicious	b9884a6166	feat(config): split out `parse_args` and `read_config` logic from `get_config` Having this all in the `get_config` function makes testing hard. Move these two functions to their own methods, and call them on app startup explicitly.	2024-03-19 09:24:28 +11:00
psychedelicious	7ca447ded1	fix(config): use new config setup in api_app.py	2024-03-19 09:24:28 +11:00
psychedelicious	daeb766468	feat(api): add ModelIdentifierField to openapi schema - Also add `ProgressImage`	2024-03-10 11:03:38 +11:00
psychedelicious	99c0662e3f	fix(nodes): load config before doing anything else This was preventing custom nodes from loading if a custom nodes dir was specified Closes #5862	2024-03-07 10:36:27 +11:00
Brandon Rising	39725e9560	Next: Remove deprecated app.on_event usage in api runner	2024-03-01 10:42:33 +11:00
psychedelicious	725c03cf87	refactor(nodes): merge processors Consolidate graph processing logic into session processor. With graphs as the unit of work, and the session queue distributing graphs, we no longer need the invocation queue or processor. Instead, the session processor dequeues the next session and processes it in a simple loop, greatly simplifying the app. - Remove `graph_execution_manager` service. - Remove `queue` (invocation queue) service. - Remove `processor` (invocation processor) service. - Remove queue-related logic from `Invoker`. It now only starts and stops the services, providing them with access to other services. - Remove unused `invocation_retrieval_error` and `session_retrieval_error` events, these are no longer needed. - Clean up stats service now that it is less coupled to the rest of the app. - Refactor cancellation logic - cancellations now originate from session queue (i.e. HTTP cancel endpoint) and are emitted as events. Processor gets the events and sets the canceled event. Access to this event is provided to the invocation context for e.g. the step callback. - Remove `sessions` router; it provided access to `graph_executions` but that no longer exists.	2024-03-01 10:42:33 +11:00
psychedelicious	b79ae3a101	fix(nodes): fix OpenAPI schema generation The change to `Graph.nodes` and `GraphExecutionState.results` validation requires some fanagling to get the OpenAPI schema generation to work. See new comments for a details.	2024-03-01 10:42:33 +11:00
psychedelicious	5a3195f757	final tidying before marking PR as ready for review - Replace AnyModelLoader with ModelLoaderRegistry - Fix type check errors in multiple files - Remove apparently unneeded `get_model_config_enum()` method from model manager - Remove last vestiges of old model manager - Updated tests and documentation resolve conflict with seamless.py	2024-03-01 10:42:33 +11:00
Lincoln Stein	4027e845d4	add back the `heuristic_import()` method and extend repo_ids to arbitrary file paths	2024-03-01 10:42:33 +11:00
Lincoln Stein	a23dedd2ee	make model manager v2 ready for PR review - Replace legacy model manager service with the v2 manager. - Update invocations to use new load interface. - Fixed many but not all type checking errors in the invocations. Most were unrelated to model manager - Updated routes. All the new routes live under the route tag `model_manager_v2`. To avoid confusion with the old routes, they have the URL prefix `/api/v2/models`. The old routes have been de-registered. - Added a pytest for the loader. - Updated documentation in contributing/MODEL_MANAGER.md	2024-03-01 10:42:33 +11:00
psychedelicious	992b02aa65	tidy(nodes): move all field things to fields.py Unfortunately, this is necessary to prevent circular imports at runtime.	2024-03-01 10:42:33 +11:00
psychedelicious	5fa13fba36	chore: ruff	2024-01-22 16:10:25 +11:00
psychedelicious	f28f761436	fix(api): add `NoCacheStaticFiles` to prevent all caching The previous method wasn't totally foolproof, and locales/assets were cached. To solve this once and for all (famous last words, I know), we can subclass `StaticFiles` and use maximally strict no-caching headers to disable caching on all static files.	2024-01-22 16:10:25 +11:00
psychedelicious	98a44d7fa1	feat(ui): update assets - Add various brand images, organise images - Create favicon for docs pages (light blue version of key logo) - Rename app title to `Invoke - Community Edition`	2024-01-12 08:02:59 +11:00
Lincoln Stein	fbede84405	[feature] Download Queue (#5225 ) * add base definition of download manager * basic functionality working * add unit tests for download queue * add documentation and FastAPI route * fix docs * add missing test dependency; fix import ordering * fix file path length checking on windows * fix ruff check error * move release() into the __del__ method * disable testing of stderr messages due to issues with pytest capsys fixture * fix unsorted imports * harmonized implementation of start() and stop() calls in download and & install modules * Update invokeai/app/services/download/download_base.py Co-authored-by: Ryan Dick <ryanjdick3@gmail.com> * replace test datadir fixture with tmp_path * replace DownloadJobBase->DownloadJob in download manager documentation * make source and dest arguments to download_queue.download() an AnyHttpURL and Path respectively * fix pydantic typecheck errors in the download unit test * ruff formatting * add "job cancelled" as an event rather than an exception * fix ruff errors * Update invokeai/app/services/download/download_default.py Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com> * use threading.Event to stop service worker threads; handle unfinished job edge cases * remove dangling STOP job definition * fix ruff complaint * fix ruff check again * avoid race condition when start() and stop() are called simultaneously from different threads * avoid race condition in stop() when a job becomes active while shutting down --------- Co-authored-by: Lincoln Stein <lstein@gmail.com> Co-authored-by: Ryan Dick <ryanjdick3@gmail.com> Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com> Co-authored-by: Kent Keirsey <31807370+hipsterusername@users.noreply.github.com>	2023-12-22 12:35:57 -05:00
Kevin Turner	fd4e041e7c	feat: serve HTTPS when configured with `ssl_certfile`	2023-12-12 16:01:43 +11:00
psychedelicious	daf00efa4d	fix(api): only attempt to serve UI build if it exists	2023-12-11 12:30:13 +11:00

1 2 3

133 Commits