InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
psychedelicious	2dd3a85ade	feat(processor): update enriched errors & `fail_queue_item()`	2024-05-24 20:02:24 +10:00
psychedelicious	3c41c67d13	fix(processor): restore missing update of session	2024-05-24 20:02:24 +10:00
psychedelicious	6c79be7dc3	chore: ruff	2024-05-24 20:02:24 +10:00
psychedelicious	097619ef51	feat(processor): get user/project from queue item w/ fallback	2024-05-24 20:02:24 +10:00
psychedelicious	a1f7a9cd6f	fix(app): fix logging of error classes instead of class names	2024-05-24 20:02:24 +10:00
psychedelicious	25b9c19eed	feat(app): handle preparation errors as node errors We were not handling node preparation errors as node errors before. Here's the explanation, copied from a comment that is no longer required: --- TODO(psyche): Sessions only support errors on nodes, not on the session itself. When an error occurs outside node execution, it bubbles up to the processor where it is treated as a queue item error. Nodes are pydantic models. When we prepare a node in `session.next()`, we set its inputs. This can cause a pydantic validation error. For example, consider a resize image node which has a constraint on its `width` input field - it must be greater than zero. During preparation, if the width is set to zero, pydantic will raise a validation error. When this happens, it breaks the flow before `invocation` is set. We can't set an error on the invocation because we didn't get far enough to get it - we don't know its id. Hence, we just set it as a queue item error. --- This change wraps the node preparation step with exception handling. A new `NodeInputError` exception is raised when there is a validation error. This error has the node (in the state it was in just prior to the error) and an identifier of the input that failed. This allows us to mark the node that failed preparation as errored, correctly making such errors _node_ errors and not _processor_ errors. It's much easier to diagnose these situations. The error messages look like this: > Node b5ac87c6-0678-4b8c-96b9-d215aee12175 has invalid incoming input for height Some of the exception handling logic is cleaned up.	2024-05-24 20:02:24 +10:00
psychedelicious	cc2d877699	docs(app): explain why errors are handled poorly	2024-05-24 20:02:24 +10:00
psychedelicious	be82404759	tidy(app): "outputs" -> "output"	2024-05-24 20:02:24 +10:00
psychedelicious	33f9fe2c86	tidy(app): rearrange proccessor	2024-05-24 20:02:24 +10:00
psychedelicious	1d973f92ff	feat(app): support multiple processor lifecycle callbacks	2024-05-24 20:02:24 +10:00
psychedelicious	7f70cde038	feat(app): make things in session runner private	2024-05-24 20:02:24 +10:00
psychedelicious	47722528a3	feat(app): iterate on processor split 2 - Use protocol to define callbacks, this allows them to have kwargs - Shuffle the profiler around a bit - Move `thread_limit` and `polling_interval` to `__init__`; `start` is called programmatically and will never get these args in practice	2024-05-24 20:02:24 +10:00
psychedelicious	be41c84305	feat(app): iterate on processor split - Add `OnNodeError` and `OnNonFatalProcessorError` callbacks - Move all session/node callbacks to `SessionRunner` - this ensures we dump perf stats before resetting them and generally makes sense to me - Remove `complete` event from `SessionRunner`, it's essentially the same as `OnAfterRunSession` - Remove extraneous `next_invocation` block, which would treat a processor error as a node error - Simplify loops - Add some callbacks for testing, to be removed before merge	2024-05-24 20:02:24 +10:00
brandonrising	82b4298b03	Fix next node calling logic	2024-05-24 20:02:24 +10:00
brandonrising	fa6c7badd6	Run ruff	2024-05-24 20:02:24 +10:00
brandonrising	45d2504c1e	Break apart session processor and the running of each session into separate classes	2024-05-24 20:02:24 +10:00
psychedelicious	93e4c3dbc2	feat(app): update queue item's session on session completion The session is never updated in the queue after it is first enqueued. As a result, the queue detail view in the frontend never never updates and the session itself doesn't show outputs, execution graph, etc. We need a new method on the queue service to update a queue item's session, then call it before updating the queue item's status. Queue item status may be updated via a session-type event _or_ queue-type event. Adding the updated session to all these events is a hairy - simpler to just update the session before we do anything that could trigger a queue item status change event: - Before calling `emit_session_complete` in the processor (handles session error, completed and cancel events and the corresponding queue events) - Before calling `cancel_queue_item` in the processor (handles another way queue items can be canceled, outside the session execution loop) When serializing the session, both in the new service method and the `get_queue_item` endpoint, we need to use `exclude_none=True` to prevent unexpected validation errors.	2024-05-24 08:59:49 +10:00
psychedelicious	17e1fc5254	chore(app): ruff	2024-05-18 09:21:45 +10:00
maryhipp	84e031edc2	add nulable project also	2024-05-18 09:21:45 +10:00
maryhipp	b6b7e737e0	ruff	2024-05-18 09:21:45 +10:00
maryhipp	5f3e7afd45	add nullable user to invocation error events	2024-05-18 09:21:45 +10:00
psychedelicious	b18442ded4	fix(queue): poll queue on finished queue item When a queue item is finished (completed, canceled, failed), immediately poll the queue for the next queue item. Closes #6189	2024-04-12 07:31:47 +10:00
psychedelicious	f83edcf990	feat(nodes): simplify processor loop with an early continue Prefer an early return/continue to reduce the indentation of the processor loop. Easier to read. There are other ways to improve its structure but at first glance, they seem to involve changing the logic in scarier ways.	2024-04-01 08:39:25 +11:00
psychedelicious	a6dd50aeaf	fix(nodes): 100% cpu usage when processor paused Should be waiting on the resume event instead of checking it in a loop	2024-04-01 08:39:25 +11:00
Lincoln Stein	1badf0f32f	refactor if/else logic slightly	2024-03-31 12:42:39 -04:00
Lincoln Stein	3c9c58e0fa	fix 100% CPU load in `session_processor_default._process()`	2024-03-31 12:42:39 -04:00
psychedelicious	9a1b35fa37	fix(queue): pause & resume This must not have been tested after the processors were unified. Needed to shift the logic around so the resume event is handled correctly. Clear and easy fix.	2024-03-30 08:25:33 -04:00
brandonrising	43bcedee10	Run ruff	2024-03-29 08:45:34 +11:00
brandonrising	98cc9b963c	Only cancel session processor if current generating queue item is cancelled	2024-03-29 08:45:34 +11:00
Ryan Dick	f6028a4c61	Log a stack trace for invocation errors.	2024-03-04 23:01:56 +11:00
psychedelicious	89fa36a818	chore(nodes): update TODO comment	2024-03-01 10:42:33 +11:00
psychedelicious	e3f9da29ba	tidy(nodes): clean up profiler/stats in processor, better comments	2024-03-01 10:42:33 +11:00
psychedelicious	0b0cb0ccc6	feat(nodes): making invocation class var in processor	2024-03-01 10:42:33 +11:00
psychedelicious	fa39523b11	feat(nodes): improved error messages in processor	2024-03-01 10:42:33 +11:00
psychedelicious	16676feea8	feat(nodes): make processor thread limit and polling interval configurable	2024-03-01 10:42:33 +11:00
psychedelicious	ccfe6b6bef	chore(nodes): "context_data" -> "data" Changed within InvocationContext, for brevity.	2024-03-01 10:42:33 +11:00
psychedelicious	18adcc1dd2	feat(nodes): add whole queue_item to InvocationContextData No reason to not have the whole thing in there.	2024-03-01 10:42:33 +11:00
psychedelicious	3cfac8b843	feat(nodes): better invocation error messages	2024-03-01 10:42:33 +11:00
psychedelicious	0788b6ecee	chore(nodes): add comments for cancel state	2024-03-01 10:42:33 +11:00
psychedelicious	725c03cf87	refactor(nodes): merge processors Consolidate graph processing logic into session processor. With graphs as the unit of work, and the session queue distributing graphs, we no longer need the invocation queue or processor. Instead, the session processor dequeues the next session and processes it in a simple loop, greatly simplifying the app. - Remove `graph_execution_manager` service. - Remove `queue` (invocation queue) service. - Remove `processor` (invocation processor) service. - Remove queue-related logic from `Invoker`. It now only starts and stops the services, providing them with access to other services. - Remove unused `invocation_retrieval_error` and `session_retrieval_error` events, these are no longer needed. - Clean up stats service now that it is less coupled to the rest of the app. - Refactor cancellation logic - cancellations now originate from session queue (i.e. HTTP cancel endpoint) and are emitted as events. Processor gets the events and sets the canceled event. Access to this event is provided to the invocation context for e.g. the step callback. - Remove `sessions` router; it provided access to `graph_executions` but that no longer exists.	2024-03-01 10:42:33 +11:00
psychedelicious	c42d692ea6	feat: workflow library (#5148 ) * chore: bump pydantic to 2.5.2 This release fixes pydantic/pydantic#8175 and allows us to use `JsonValue` * fix(ui): exclude public/en.json from prettier config * fix(workflow_records): fix SQLite workflow insertion to ignore duplicates * feat(backend): update workflows handling Update workflows handling for Workflow Library. Updated Workflow Storage "Embedded Workflows" are workflows associated with images, and are now only stored in the image files. "Library Workflows" are not associated with images, and are stored only in DB. This works out nicely. We have always saved workflows to files, but recently began saving them to the DB in addition to in image files. When that happened, we stopped reading workflows from files, so all the workflows that only existed in images were inaccessible. With this change, access to those workflows is restored, and no workflows are lost. Updated Workflow Handling in Nodes Prior to this change, workflows were embedded in images by passing the whole workflow JSON to a special workflow field on a node. In the node's `invoke()` function, the node was able to access this workflow and save it with the image. This (inaccurately) models workflows as a property of an image and is rather awkward technically. A workflow is now a property of a batch/session queue item. It is available in the InvocationContext and therefore available to all nodes during `invoke()`. Database Migrations Added a `SQLiteMigrator` class to handle database migrations. Migrations were needed to accomodate the DB-related changes in this PR. See the code for details. The `images`, `workflows` and `session_queue` tables required migrations for this PR, and are using the new migrator. Other tables/services are still creating tables themselves. A followup PR will adapt them to use the migrator. Other/Support Changes - Add a `has_workflow` column to `images` table to indicate that the image has an embedded workflow. - Add handling for retrieving the workflow from an image in python. The image file must be fetched, the workflow extracted, and then sent to client, avoiding needing the browser to parse the image file. With the `has_workflow` column, the UI knows if there is a workflow to be fetched, and only fetches when the user requests to load the workflow. - Add route to get the workflow from an image - Add CRUD service/routes for the library workflows - `workflow_images` table and services removed (no longer needed now that embedded workflows are not in the DB) * feat(ui): updated workflow handling (WIP) Clientside updates for the backend workflow changes. Includes roughed-out workflow library UI. * feat: revert SQLiteMigrator class Will pursue this in a separate PR. * feat(nodes): do not overwrite custom node module names Use a different, simpler method to detect if a node is custom. * feat(nodes): restore WithWorkflow as no-op class This class is deprecated and no longer needed. Set its workflow attr value to None (meaning it is now a no-op), and issue a warning when an invocation subclasses it. * fix(nodes): fix get_workflow from queue item dict func * feat(backend): add WorkflowRecordListItemDTO This is the id, name, description, created at and updated at workflow columns/attrs. Used to display lists of workflowsl * chore(ui): typegen * feat(ui): add workflow loading, deleting to workflow library UI * feat(ui): workflow library pagination button styles * wip * feat: workflow library WIP - Save to library - Duplicate - Filter/sort - UI/queries * feat: workflow library - system graphs - wip * feat(backend): sync system workflows to db * fix: merge conflicts * feat: simplify default workflows - Rename "system" -> "default" - Simplify syncing logic - Update UI to match * feat(workflows): update default workflows - Update TextToImage_SD15 - Add TextToImage_SDXL - Add README * feat(ui): refine workflow list UI * fix(workflow_records): typo * fix(tests): fix tests * feat(ui): clean up workflow library hooks * fix(db): fix mis-ordered db cleanup step It was happening before pruning queue items - should happen afterwards, else you have to restart the app again to free disk space made available by the pruning. * feat(ui): tweak reset workflow editor translations * feat(ui): split out workflow redux state The `nodes` slice is a rather complicated slice. Removing `workflow` makes it a bit more reasonable. Also helps to flatten state out a bit. * docs: update default workflows README * fix: tidy up unused files, unrelated changes * fix(backend): revert unrelated service organisational changes * feat(backend): workflow_records.get_many arg "filter_text" -> "query" * feat(ui): use custom hook in current image buttons Already in use elsewhere, forgot to use it here. * fix(ui): remove commented out property * fix(ui): fix workflow loading - Different handling for loading from library vs external - Fix bug where only nodes and edges loaded * fix(ui): fix save/save-as workflow naming * fix(ui): fix circular dependency * fix(db): fix bug with releasing without lock in db.clean() * fix(db): remove extraneous lock * chore: bump ruff * fix(workflow_records): default `category` to `WorkflowCategory.User` This allows old workflows to validate when reading them from the db or image files. * hide workflow library buttons if feature is disabled --------- Co-authored-by: Mary Hipp <maryhipp@Marys-MacBook-Air.local>	2023-12-09 09:48:38 +11:00
psychedelicious	da443973cb	chore: ruff	2023-11-21 20:22:27 +11:00
psychedelicious	6494e8e551	chore: ruff format	2023-11-11 10:55:40 +11:00
psychedelicious	3a136420d5	chore: ruff check - fix flake8-comprensions	2023-11-11 10:55:23 +11:00
psychedelicious	d2fb29cf0d	fix(app): remove errant logger line	2023-10-12 12:15:06 -04:00
psychedelicious	402cf9b0ee	feat: refactor services folder/module structure Refactor services folder/module structure. Motivation While working on our services I've repeatedly encountered circular imports and a general lack of clarity regarding where to put things. The structure introduced goes a long way towards resolving those issues, setting us up for a clean structure going forward. Services Services are now in their own folder with a few files: - `services/{service_name}/__init__.py`: init as needed, mostly empty now - `services/{service_name}/{service_name}_base.py`: the base class for the service - `services/{service_name}/{service_name}_{impl_type}.py`: the default concrete implementation of the service - typically one of `sqlite`, `default`, or `memory` - `services/{service_name}/{service_name}_common.py`: any common items - models, exceptions, utilities, etc Though it's a bit verbose to have the service name both as the folder name and the prefix for files, I found it is _extremely_ confusing to have all of the base classes just be named `base.py`. So, at the cost of some verbosity when importing things, I've included the service name in the filename. There are some minor logic changes. For example, in `InvocationProcessor`, instead of assigning the model manager service to a variable to be used later in the file, the service is used directly via the `Invoker`. Shared Things that are used across disparate services are in `services/shared/`: - `default_graphs.py`: previously in `services/` - `graphs.py`: previously in `services/` - `paginatation`: generic pagination models used in a few services - `sqlite`: the `SqliteDatabase` class, other sqlite-specific things	2023-10-12 12:15:06 -04:00
psychedelicious	88e16ce051	fix(nodes): mark session queue items failed on processor error When the processor has an error and it has a queue item, mark that item failed. This addresses processor errors resulting in `in_progress` queue items, which create a soft lock of the processor, requiring the user to cancel the `in_progress` item before anything else processes.	2023-10-05 09:32:29 +11:00
Lincoln Stein	f37ffda966	replace case statements with if/else to support python 3.9	2023-09-25 18:33:39 +10:00
Brandon Rising	6cc7b55ec5	Add wait on exception	2023-09-21 11:18:57 -04:00
Brandon Rising	883e9973ec	When an exception happens within the session processor loop, record and move on	2023-09-21 11:10:25 -04:00

1 2

52 Commits