InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI synced 2024-08-30 20:32:17 +00:00

Author	SHA1	Message	Date
Ryan Dick	1d449097cc	Apply ruff rule to disallow all relative imports.	2024-07-04 09:35:37 -04:00
Ryan Dick	9da5925287	Add ruff rule to disallow relative parent imports.	2024-07-04 09:35:37 -04:00
psychedelicious	dfad37a262	docs: update comments & docstrings	2024-05-27 09:06:02 +10:00
psychedelicious	084cf26ed6	refactor: remove all session events There's no longer any need for session-scoped events now that we have the session queue. Session started/completed/canceled map 1-to-1 to queue item status events, but queue item status events also have an event for failed state. We can simplify queue and processor handling substantially by removing session events and instead using queue item events. - Remove the session-scoped events entirely. - Remove all event handling from session queue. The processor still needs to respond to some events from the queue: `QueueClearedEvent`, `BatchEnqueuedEvent` and `QueueItemStatusChangedEvent`. - Pass an `is_canceled` callback to the invocation context instead of the cancel event - Update processor logic to ensure the local instance of the current queue item is synced with the instance in the database. This prevents race conditions and ensures lifecycle callbacks do not get stale callbacks. - Update docstrings and comments - Add `complete_queue_item` method to session queue service as an explicit way to mark a queue item as successfully completed. Previously, the queue listened for session complete events to do this. Closes #6442	2024-05-27 09:06:02 +10:00
psychedelicious	368127bd25	feat(events): register_events supports single event	2024-05-27 09:06:02 +10:00
psychedelicious	575943d0ad	fix(processor): move session started event to session runner	2024-05-27 09:06:02 +10:00
psychedelicious	25d1d2b591	tidy(processor): use separate handlers for each event type Just a bit clearer without needing `isinstance` checks.	2024-05-27 09:06:02 +10:00
psychedelicious	2dc752ea83	feat(events): simplify event classes - Remove ABCs, they do not work well with pydantic - Remove the event type classvar - unused - Remove clever logic to require an event name - we already get validation for this during schema registration. - Rename event bases to all end in "Base"	2024-05-27 09:06:02 +10:00
psychedelicious	9bd78823a3	refactor(events): use pydantic schemas for events Our events handling and implementation has a couple pain points: - Adding or removing data from event payloads requires changes wherever the events are dispatched from. - We have no type safety for events and need to rely on string matching and dict access when interacting with events. - Frontend types for socket events must be manually typed. This has caused several bugs. `fastapi-events` has a neat feature where you can create a pydantic model as an event payload, give it an `__event_name__` attr, and then dispatch the model directly. This allows us to eliminate a layer of indirection and some unpleasant complexity: - Event handler callbacks get type hints for their event payloads, and can use `isinstance` on them if needed. - Event payload construction is now the responsibility of the event itself (a pydantic model), not the service. Every event model has a `build` class method, encapsulating this logic. The build methods are provided as few args as possible. For example, `InvocationStartedEvent.build()` gets the invocation instance and queue item, and can choose the data it wants to include in the event payload. - Frontend event types may be autogenerated from the OpenAPI schema. We use the payload registry feature of `fastapi-events` to collect all payload models into one place, making it trivial to keep our schema and frontend types in sync. This commit moves the backend over to this improved event handling setup.	2024-05-27 09:06:02 +10:00
psychedelicious	50dd569411	fix(processor): race condition that could result in node errors not getting reported I had set the cancel event at some point during troubleshooting an unrelated issue. It seemed logical that it should be set there, and didn't seem to break anything. However, this is not correct. The cancel event should not be set in response to a queue status change event. Doing so can cause a race condition when nodes are executed very quickly. It's possible that a previously-executed session's queue item status change event is handled after the next session starts executing. The cancel event is set and the session runner sees it, aborting the session run early. In hindsight, it doesn't make sense to set the cancel event here either. It should be set in response to user action, e.g. the user cancelled the session or cleared the queue (which implicitly cancels the current session). These events actually trigger the queue item status changed event, so if we set the cancel event here, we'd be setting it twice per cancellation.	2024-05-24 20:02:24 +10:00
psychedelicious	9c926f249f	feat(processor): add debug log stmts to session running callbacks	2024-05-24 20:02:24 +10:00
psychedelicious	80faeac913	fix(processor): fix race condition related to clearing the queue	2024-05-24 20:02:24 +10:00
psychedelicious	e365d35c93	docs(processor): update docstrings, comments	2024-05-24 20:02:24 +10:00
psychedelicious	2dd3a85ade	feat(processor): update enriched errors & `fail_queue_item()`	2024-05-24 20:02:24 +10:00
psychedelicious	3c41c67d13	fix(processor): restore missing update of session	2024-05-24 20:02:24 +10:00
psychedelicious	6c79be7dc3	chore: ruff	2024-05-24 20:02:24 +10:00
psychedelicious	097619ef51	feat(processor): get user/project from queue item w/ fallback	2024-05-24 20:02:24 +10:00
psychedelicious	a1f7a9cd6f	fix(app): fix logging of error classes instead of class names	2024-05-24 20:02:24 +10:00
psychedelicious	25b9c19eed	feat(app): handle preparation errors as node errors We were not handling node preparation errors as node errors before. Here's the explanation, copied from a comment that is no longer required: --- TODO(psyche): Sessions only support errors on nodes, not on the session itself. When an error occurs outside node execution, it bubbles up to the processor where it is treated as a queue item error. Nodes are pydantic models. When we prepare a node in `session.next()`, we set its inputs. This can cause a pydantic validation error. For example, consider a resize image node which has a constraint on its `width` input field - it must be greater than zero. During preparation, if the width is set to zero, pydantic will raise a validation error. When this happens, it breaks the flow before `invocation` is set. We can't set an error on the invocation because we didn't get far enough to get it - we don't know its id. Hence, we just set it as a queue item error. --- This change wraps the node preparation step with exception handling. A new `NodeInputError` exception is raised when there is a validation error. This error has the node (in the state it was in just prior to the error) and an identifier of the input that failed. This allows us to mark the node that failed preparation as errored, correctly making such errors _node_ errors and not _processor_ errors. It's much easier to diagnose these situations. The error messages look like this: > Node b5ac87c6-0678-4b8c-96b9-d215aee12175 has invalid incoming input for height Some of the exception handling logic is cleaned up.	2024-05-24 20:02:24 +10:00
psychedelicious	cc2d877699	docs(app): explain why errors are handled poorly	2024-05-24 20:02:24 +10:00
psychedelicious	be82404759	tidy(app): "outputs" -> "output"	2024-05-24 20:02:24 +10:00
psychedelicious	33f9fe2c86	tidy(app): rearrange processor	2024-05-24 20:02:24 +10:00
psychedelicious	1d973f92ff	feat(app): support multiple processor lifecycle callbacks	2024-05-24 20:02:24 +10:00
psychedelicious	7f70cde038	feat(app): make things in session runner private	2024-05-24 20:02:24 +10:00
psychedelicious	47722528a3	feat(app): iterate on processor split 2 - Use protocol to define callbacks, this allows them to have kwargs - Shuffle the profiler around a bit - Move `thread_limit` and `polling_interval` to `__init__`; `start` is called programmatically and will never get these args in practice	2024-05-24 20:02:24 +10:00
psychedelicious	be41c84305	feat(app): iterate on processor split - Add `OnNodeError` and `OnNonFatalProcessorError` callbacks - Move all session/node callbacks to `SessionRunner` - this ensures we dump perf stats before resetting them and generally makes sense to me - Remove `complete` event from `SessionRunner`, it's essentially the same as `OnAfterRunSession` - Remove extraneous `next_invocation` block, which would treat a processor error as a node error - Simplify loops - Add some callbacks for testing, to be removed before merge	2024-05-24 20:02:24 +10:00
brandonrising	82b4298b03	Fix next node calling logic	2024-05-24 20:02:24 +10:00
brandonrising	fa6c7badd6	Run ruff	2024-05-24 20:02:24 +10:00
brandonrising	45d2504c1e	Break apart session processor and the running of each session into separate classes	2024-05-24 20:02:24 +10:00
psychedelicious	93e4c3dbc2	feat(app): update queue item's session on session completion The session is never updated in the queue after it is first enqueued. As a result, the queue detail view in the frontend never updates and the session itself doesn't show outputs, execution graph, etc. We need a new method on the queue service to update a queue item's session, then call it before updating the queue item's status. Queue item status may be updated via a session-type event _or_ queue-type event. Adding the updated session to all these events is hairy - simpler to just update the session before we do anything that could trigger a queue item status change event: - Before calling `emit_session_complete` in the processor (handles session error, completed and cancel events and the corresponding queue events) - Before calling `cancel_queue_item` in the processor (handles another way queue items can be canceled, outside the session execution loop) When serializing the session, both in the new service method and the `get_queue_item` endpoint, we need to use `exclude_none=True` to prevent unexpected validation errors.	2024-05-24 08:59:49 +10:00
psychedelicious	17e1fc5254	chore(app): ruff	2024-05-18 09:21:45 +10:00
maryhipp	84e031edc2	add nullable project also	2024-05-18 09:21:45 +10:00
maryhipp	b6b7e737e0	ruff	2024-05-18 09:21:45 +10:00
maryhipp	5f3e7afd45	add nullable user to invocation error events	2024-05-18 09:21:45 +10:00
psychedelicious	b18442ded4	fix(queue): poll queue on finished queue item When a queue item is finished (completed, canceled, failed), immediately poll the queue for the next queue item. Closes #6189	2024-04-12 07:31:47 +10:00
psychedelicious	f83edcf990	feat(nodes): simplify processor loop with an early continue Prefer an early return/continue to reduce the indentation of the processor loop. Easier to read. There are other ways to improve its structure but at first glance, they seem to involve changing the logic in scarier ways.	2024-04-01 08:39:25 +11:00
psychedelicious	a6dd50aeaf	fix(nodes): 100% cpu usage when processor paused Should be waiting on the resume event instead of checking it in a loop	2024-04-01 08:39:25 +11:00
Lincoln Stein	1badf0f32f	refactor if/else logic slightly	2024-03-31 12:42:39 -04:00
Lincoln Stein	3c9c58e0fa	fix 100% CPU load in `session_processor_default._process()`	2024-03-31 12:42:39 -04:00
psychedelicious	9a1b35fa37	fix(queue): pause & resume This must not have been tested after the processors were unified. Needed to shift the logic around so the resume event is handled correctly. Clear and easy fix.	2024-03-30 08:25:33 -04:00
brandonrising	43bcedee10	Run ruff	2024-03-29 08:45:34 +11:00
brandonrising	98cc9b963c	Only cancel session processor if current generating queue item is cancelled	2024-03-29 08:45:34 +11:00
Ryan Dick	f6028a4c61	Log a stack trace for invocation errors.	2024-03-04 23:01:56 +11:00
psychedelicious	89fa36a818	chore(nodes): update TODO comment	2024-03-01 10:42:33 +11:00
psychedelicious	e3f9da29ba	tidy(nodes): clean up profiler/stats in processor, better comments	2024-03-01 10:42:33 +11:00
psychedelicious	0b0cb0ccc6	feat(nodes): making invocation class var in processor	2024-03-01 10:42:33 +11:00
psychedelicious	fa39523b11	feat(nodes): improved error messages in processor	2024-03-01 10:42:33 +11:00
psychedelicious	16676feea8	feat(nodes): make processor thread limit and polling interval configurable	2024-03-01 10:42:33 +11:00
psychedelicious	ccfe6b6bef	chore(nodes): "context_data" -> "data" Changed within InvocationContext, for brevity.	2024-03-01 10:42:33 +11:00
psychedelicious	18adcc1dd2	feat(nodes): add whole queue_item to InvocationContextData No reason to not have the whole thing in there.	2024-03-01 10:42:33 +11:00
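
Two of the commits above (a6dd50aeaf and f83edcf990) describe waiting on the resume event instead of checking it in a loop, and using an early continue to flatten the processor loop. The sketch below illustrates that general pattern with Python's standard `threading.Event`. It is not InvokeAI's actual code: `run_processor`, its parameters, and the list-based work queue are hypothetical names chosen for illustration.

```python
import threading


def run_processor(resume_event, stop_event, work, poll_interval=0.01):
    """Hypothetical processor loop: blocks while paused instead of spinning."""
    processed = []
    while not stop_event.is_set():
        # Event.wait() sleeps until the event is set or the timeout expires,
        # so a paused processor does not burn CPU re-checking a flag.
        if not resume_event.wait(timeout=poll_interval):
            # Still paused: early continue keeps the rest of the loop flat.
            continue
        if not work:
            # Nothing left to do in this toy example.
            break
        processed.append(work.pop(0))
    return processed


# Usage: a resumed processor drains the queue.
resume = threading.Event()
resume.set()
stop = threading.Event()
drained = run_processor(resume, stop, ["a", "b", "c"])

# Usage: a paused processor does no work and exits once stopped.
paused = threading.Event()
stop_later = threading.Event()
timer = threading.Timer(0.05, stop_later.set)
timer.start()
idle = run_processor(paused, stop_later, ["x"])
timer.join()
```

The timeout on `wait()` is a design choice for this sketch: it lets the loop notice a stop request while paused. A real processor could instead set the resume event during shutdown to wake the waiting thread.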


65 Commits