CenterPadCrop node (#3861)

* add centerpadcrop node - Allows users to add padding to or crop images from the center - Also outputs a white mask with the dimensions of the output image for use with outpainting * add CenterPadCrop to NODES.md Updates NODES.md with CenterPadCrop entry. * remove mask & output class - Remove "ImageMaskOutput" where both image and mask are output - Remove ability to output mask from node --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>
2024-08-30 20:32:17 +00:00 · 2023-11-30 11:15:59 +01:00 · 2023-11-30 11:15:59 +01:00 · 3e01c396e1
commit 3e01c396e1
parent 0beb08686c
3 changed files with 229 additions and 117 deletions
--- a/docs/nodes/defaultNodes.md
+++ b/docs/nodes/defaultNodes.md
@ -1,104 +1,106 @@
 # List of Default Nodes

-The table below contains a list of the default nodes shipped with InvokeAI and their descriptions.
+The table below contains a list of the default nodes shipped with InvokeAI and
+their descriptions.

-| Node <img width=160 align="right"> | Function                                                                              |
-|: ---------------------------------- | :--------------------------------------------------------------------------------------|
-|Add Integers 			| Adds two numbers|
-|Boolean Primitive Collection 			| A collection of boolean primitive values|
-|Boolean Primitive 			| A boolean primitive value|
-|Canny Processor 			| Canny edge detection for ControlNet|
-|CLIP Skip 			| Skip layers in clip text_encoder model.|
-|Collect 			| Collects values into a collection|
-|Color Correct 			| Shifts the colors of a target image to match the reference image, optionally using a mask to only color-correct certain regions of the target image.|
-|Color Primitive 			| A color primitive value|
-|Compel Prompt 			| Parse prompt using compel package to conditioning.|
-|Conditioning Primitive Collection 			| A collection of conditioning tensor primitive values|
-|Conditioning Primitive 			| A conditioning tensor primitive value|
-|Content Shuffle Processor 			| Applies content shuffle processing to image|
-|ControlNet 			| Collects ControlNet info to pass to other nodes|
-|Denoise Latents 			| Denoises noisy latents to decodable images|
-|Divide Integers 			| Divides two numbers|
-|Dynamic Prompt 			| Parses a prompt using adieyal/dynamicprompts' random or combinatorial generator|
-|[FaceMask](./detailedNodes/faceTools.md#facemask) | Generates masks for faces in an image to use with Inpainting|
-|[FaceIdentifier](./detailedNodes/faceTools.md#faceidentifier)             | Identifies and labels faces in an image|
-|[FaceOff](./detailedNodes/faceTools.md#faceoff)             | Creates a new image that is a scaled bounding box with a mask on the face for Inpainting|
-|Float Math             | Perform basic math operations on two floats|
-|Float Primitive Collection 			| A collection of float primitive values|
-|Float Primitive 			| A float primitive value|
-|Float Range 			| Creates a range|
-|HED (softedge) Processor 			| Applies HED edge detection to image|
-|Blur Image 			| Blurs an image|
-|Extract Image Channel 			| Gets a channel from an image.|
-|Image Primitive Collection 			| A collection of image primitive values|
-|Integer Math           | Perform basic math operations on two integers|
-|Convert Image Mode 			| Converts an image to a different mode.|
-|Crop Image 			| Crops an image to a specified box. The box can be outside of the image.|
-|Image Hue Adjustment 			| Adjusts the Hue of an image.|
-|Inverse Lerp Image 			| Inverse linear interpolation of all pixels of an image|
-|Image Primitive 			| An image primitive value|
-|Lerp Image 			| Linear interpolation of all pixels of an image|
-|Offset Image Channel 			| Add to or subtract from an image color channel by a uniform value.|
-|Multiply Image Channel 			| Multiply or Invert an image color channel by a scalar value.|
-|Multiply Images 			| Multiplies two images together using `PIL.ImageChops.multiply()`.|
-|Blur NSFW Image 			| Add blur to NSFW-flagged images|
-|Paste Image 			| Pastes an image into another image.|
-|ImageProcessor 			| Base class for invocations that preprocess images for ControlNet|
-|Resize Image 			| Resizes an image to specific dimensions|
-|Round Float            | Rounds a float to a specified number of decimal places|
-|Float to Integer       | Converts a float to an integer. Optionally rounds to an even multiple of a input number.|
-|Scale Image 			| Scales an image by a factor|
-|Image to Latents 			| Encodes an image into latents.|
-|Add Invisible Watermark 			| Add an invisible watermark to an image|
-|Solid Color Infill 			| Infills transparent areas of an image with a solid color|
-|PatchMatch Infill 			| Infills transparent areas of an image using the PatchMatch algorithm|
-|Tile Infill 			| Infills transparent areas of an image with tiles of the image|
-|Integer Primitive Collection 			| A collection of integer primitive values|
-|Integer Primitive 			| An integer primitive value|
-|Iterate 			| Iterates over a list of items|
-|Latents Primitive Collection 			| A collection of latents tensor primitive values|
-|Latents Primitive 			| A latents tensor primitive value|
-|Latents to Image 			| Generates an image from latents.|
-|Leres (Depth) Processor 			| Applies leres processing to image|
-|Lineart Anime Processor 			| Applies line art anime processing to image|
-|Lineart Processor 			| Applies line art processing to image|
-|LoRA Loader 			| Apply selected lora to unet and text_encoder.|
-|Main Model Loader 			| Loads a main model, outputting its submodels.|
-|Combine Mask 			| Combine two masks together by multiplying them using `PIL.ImageChops.multiply()`.|
-|Mask Edge 			| Applies an edge mask to an image|
-|Mask from Alpha 			| Extracts the alpha channel of an image as a mask.|
-|Mediapipe Face Processor 			| Applies mediapipe face processing to image|
-|Midas (Depth) Processor 			| Applies Midas depth processing to image|
-|MLSD Processor 			| Applies MLSD processing to image|
-|Multiply Integers 			| Multiplies two numbers|
-|Noise 			| Generates latent noise.|
-|Normal BAE Processor 			| Applies NormalBae processing to image|
-|ONNX Latents to Image 			| Generates an image from latents.|
-|ONNX Prompt (Raw) 			| A node to process inputs and produce outputs. May use dependency injection in __init__ to receive providers.|
-|ONNX Text to Latents 			| Generates latents from conditionings.|
-|ONNX Model Loader 			| Loads a main model, outputting its submodels.|
-|OpenCV Inpaint 			| Simple inpaint using opencv.|
-|Openpose Processor 			| Applies Openpose processing to image|
-|PIDI Processor 			| Applies PIDI processing to image|
-|Prompts from File 			| Loads prompts from a text file|
-|Random Integer 			| Outputs a single random integer.|
-|Random Range 			| Creates a collection of random numbers|
-|Integer Range 			| Creates a range of numbers from start to stop with step|
-|Integer Range of Size 			| Creates a range from start to start + size with step|
-|Resize Latents 			| Resizes latents to explicit width/height (in pixels). Provided dimensions are floor-divided by 8.|
-|SDXL Compel Prompt 			| Parse prompt using compel package to conditioning.|
-|SDXL LoRA Loader 			| Apply selected lora to unet and text_encoder.|
-|SDXL Main Model Loader 			| Loads an sdxl base model, outputting its submodels.|
-|SDXL Refiner Compel Prompt 			| Parse prompt using compel package to conditioning.|
-|SDXL Refiner Model Loader 			| Loads an sdxl refiner model, outputting its submodels.|
-|Scale Latents 			| Scales latents by a given factor.|
-|Segment Anything Processor 			| Applies segment anything processing to image|
-|Show Image 			| Displays a provided image, and passes it forward in the pipeline.|
-|Step Param Easing 			| Experimental per-step parameter easing for denoising steps|
-|String Primitive Collection 			| A collection of string primitive values|
-|String Primitive 			| A string primitive value|
-|Subtract Integers 			| Subtracts two numbers|
-|Tile Resample Processor 			| Tile resampler processor|
-|Upscale (RealESRGAN) 			| Upscales an image using RealESRGAN.|
-|VAE Loader 			| Loads a VAE model, outputting a VaeLoaderOutput|
-|Zoe (Depth) Processor 			| Applies Zoe depth processing to image|
+| Node <img width=160 align="right">                            | Function                                                                                                                                             |
+| :------------------------------------------------------------ | :--------------------------------------------------------------------------------------------------------------------------------------------------- |
+| Add Integers                                                  | Adds two numbers                                                                                                                                     |
+| Boolean Primitive Collection                                  | A collection of boolean primitive values                                                                                                             |
+| Boolean Primitive                                             | A boolean primitive value                                                                                                                            |
+| Canny Processor                                               | Canny edge detection for ControlNet                                                                                                                  |
+| CenterPadCrop                                                 | Pad or crop an image's sides from the center by specified pixels. Positive values are outside of the image.                                          |
+| CLIP Skip                                                     | Skip layers in clip text_encoder model.                                                                                                              |
+| Collect                                                       | Collects values into a collection                                                                                                                    |
+| Color Correct                                                 | Shifts the colors of a target image to match the reference image, optionally using a mask to only color-correct certain regions of the target image. |
+| Color Primitive                                               | A color primitive value                                                                                                                              |
+| Compel Prompt                                                 | Parse prompt using compel package to conditioning.                                                                                                   |
+| Conditioning Primitive Collection                             | A collection of conditioning tensor primitive values                                                                                                 |
+| Conditioning Primitive                                        | A conditioning tensor primitive value                                                                                                                |
+| Content Shuffle Processor                                     | Applies content shuffle processing to image                                                                                                          |
+| ControlNet                                                    | Collects ControlNet info to pass to other nodes                                                                                                      |
+| Denoise Latents                                               | Denoises noisy latents to decodable images                                                                                                           |
+| Divide Integers                                               | Divides two numbers                                                                                                                                  |
+| Dynamic Prompt                                                | Parses a prompt using adieyal/dynamicprompts' random or combinatorial generator                                                                      |
+| [FaceMask](./detailedNodes/faceTools.md#facemask)             | Generates masks for faces in an image to use with Inpainting                                                                                         |
+| [FaceIdentifier](./detailedNodes/faceTools.md#faceidentifier) | Identifies and labels faces in an image                                                                                                              |
+| [FaceOff](./detailedNodes/faceTools.md#faceoff)               | Creates a new image that is a scaled bounding box with a mask on the face for Inpainting                                                             |
+| Float Math                                                    | Perform basic math operations on two floats                                                                                                          |
+| Float Primitive Collection                                    | A collection of float primitive values                                                                                                               |
+| Float Primitive                                               | A float primitive value                                                                                                                              |
+| Float Range                                                   | Creates a range                                                                                                                                      |
+| HED (softedge) Processor                                      | Applies HED edge detection to image                                                                                                                  |
+| Blur Image                                                    | Blurs an image                                                                                                                                       |
+| Extract Image Channel                                         | Gets a channel from an image.                                                                                                                        |
+| Image Primitive Collection                                    | A collection of image primitive values                                                                                                               |
+| Integer Math                                                  | Perform basic math operations on two integers                                                                                                        |
+| Convert Image Mode                                            | Converts an image to a different mode.                                                                                                               |
+| Crop Image                                                    | Crops an image to a specified box. The box can be outside of the image.                                                                              |
+| Image Hue Adjustment                                          | Adjusts the Hue of an image.                                                                                                                         |
+| Inverse Lerp Image                                            | Inverse linear interpolation of all pixels of an image                                                                                               |
+| Image Primitive                                               | An image primitive value                                                                                                                             |
+| Lerp Image                                                    | Linear interpolation of all pixels of an image                                                                                                       |
+| Offset Image Channel                                          | Add to or subtract from an image color channel by a uniform value.                                                                                   |
+| Multiply Image Channel                                        | Multiply or Invert an image color channel by a scalar value.                                                                                         |
+| Multiply Images                                               | Multiplies two images together using `PIL.ImageChops.multiply()`.                                                                                    |
+| Blur NSFW Image                                               | Add blur to NSFW-flagged images                                                                                                                      |
+| Paste Image                                                   | Pastes an image into another image.                                                                                                                  |
+| ImageProcessor                                                | Base class for invocations that preprocess images for ControlNet                                                                                     |
+| Resize Image                                                  | Resizes an image to specific dimensions                                                                                                              |
+| Round Float                                                   | Rounds a float to a specified number of decimal places                                                                                               |
+| Float to Integer                                              | Converts a float to an integer. Optionally rounds to an even multiple of a input number.                                                             |
+| Scale Image                                                   | Scales an image by a factor                                                                                                                          |
+| Image to Latents                                              | Encodes an image into latents.                                                                                                                       |
+| Add Invisible Watermark                                       | Add an invisible watermark to an image                                                                                                               |
+| Solid Color Infill                                            | Infills transparent areas of an image with a solid color                                                                                             |
+| PatchMatch Infill                                             | Infills transparent areas of an image using the PatchMatch algorithm                                                                                 |
+| Tile Infill                                                   | Infills transparent areas of an image with tiles of the image                                                                                        |
+| Integer Primitive Collection                                  | A collection of integer primitive values                                                                                                             |
+| Integer Primitive                                             | An integer primitive value                                                                                                                           |
+| Iterate                                                       | Iterates over a list of items                                                                                                                        |
+| Latents Primitive Collection                                  | A collection of latents tensor primitive values                                                                                                      |
+| Latents Primitive                                             | A latents tensor primitive value                                                                                                                     |
+| Latents to Image                                              | Generates an image from latents.                                                                                                                     |
+| Leres (Depth) Processor                                       | Applies leres processing to image                                                                                                                    |
+| Lineart Anime Processor                                       | Applies line art anime processing to image                                                                                                           |
+| Lineart Processor                                             | Applies line art processing to image                                                                                                                 |
+| LoRA Loader                                                   | Apply selected lora to unet and text_encoder.                                                                                                        |
+| Main Model Loader                                             | Loads a main model, outputting its submodels.                                                                                                        |
+| Combine Mask                                                  | Combine two masks together by multiplying them using `PIL.ImageChops.multiply()`.                                                                    |
+| Mask Edge                                                     | Applies an edge mask to an image                                                                                                                     |
+| Mask from Alpha                                               | Extracts the alpha channel of an image as a mask.                                                                                                    |
+| Mediapipe Face Processor                                      | Applies mediapipe face processing to image                                                                                                           |
+| Midas (Depth) Processor                                       | Applies Midas depth processing to image                                                                                                              |
+| MLSD Processor                                                | Applies MLSD processing to image                                                                                                                     |
+| Multiply Integers                                             | Multiplies two numbers                                                                                                                               |
+| Noise                                                         | Generates latent noise.                                                                                                                              |
+| Normal BAE Processor                                          | Applies NormalBae processing to image                                                                                                                |
+| ONNX Latents to Image                                         | Generates an image from latents.                                                                                                                     |
+| ONNX Prompt (Raw)                                             | A node to process inputs and produce outputs. May use dependency injection in **init** to receive providers.                                         |
+| ONNX Text to Latents                                          | Generates latents from conditionings.                                                                                                                |
+| ONNX Model Loader                                             | Loads a main model, outputting its submodels.                                                                                                        |
+| OpenCV Inpaint                                                | Simple inpaint using opencv.                                                                                                                         |
+| Openpose Processor                                            | Applies Openpose processing to image                                                                                                                 |
+| PIDI Processor                                                | Applies PIDI processing to image                                                                                                                     |
+| Prompts from File                                             | Loads prompts from a text file                                                                                                                       |
+| Random Integer                                                | Outputs a single random integer.                                                                                                                     |
+| Random Range                                                  | Creates a collection of random numbers                                                                                                               |
+| Integer Range                                                 | Creates a range of numbers from start to stop with step                                                                                              |
+| Integer Range of Size                                         | Creates a range from start to start + size with step                                                                                                 |
+| Resize Latents                                                | Resizes latents to explicit width/height (in pixels). Provided dimensions are floor-divided by 8.                                                    |
+| SDXL Compel Prompt                                            | Parse prompt using compel package to conditioning.                                                                                                   |
+| SDXL LoRA Loader                                              | Apply selected lora to unet and text_encoder.                                                                                                        |
+| SDXL Main Model Loader                                        | Loads an sdxl base model, outputting its submodels.                                                                                                  |
+| SDXL Refiner Compel Prompt                                    | Parse prompt using compel package to conditioning.                                                                                                   |
+| SDXL Refiner Model Loader                                     | Loads an sdxl refiner model, outputting its submodels.                                                                                               |
+| Scale Latents                                                 | Scales latents by a given factor.                                                                                                                    |
+| Segment Anything Processor                                    | Applies segment anything processing to image                                                                                                         |
+| Show Image                                                    | Displays a provided image, and passes it forward in the pipeline.                                                                                    |
+| Step Param Easing                                             | Experimental per-step parameter easing for denoising steps                                                                                           |
+| String Primitive Collection                                   | A collection of string primitive values                                                                                                              |
+| String Primitive                                              | A string primitive value                                                                                                                             |
+| Subtract Integers                                             | Subtracts two numbers                                                                                                                                |
+| Tile Resample Processor                                       | Tile resampler processor                                                                                                                             |
+| Upscale (RealESRGAN)                                          | Upscales an image using RealESRGAN.                                                                                                                  |
+| VAE Loader                                                    | Loads a VAE model, outputting a VaeLoaderOutput                                                                                                      |
+| Zoe (Depth) Processor                                         | Applies Zoe depth processing to image                                                                                                                |
--- a/invokeai/app/invocations/image.py
+++ b/invokeai/app/invocations/image.py
@ -100,6 +100,61 @@ class ImageCropInvocation(BaseInvocation, WithWorkflow, WithMetadata):
        )


+@invocation(
+    invocation_type="img_pad_crop",
+    title="Center Pad or Crop Image",
+    category="image",
+    tags=["image", "pad", "crop"],
+    version="1.0.0",
+)
+class CenterPadCropInvocation(BaseInvocation):
+    """Pad or crop an image's sides from the center by specified pixels. Positive values are outside of the image."""
+
+    image: ImageField = InputField(description="The image to crop")
+    left: int = InputField(
+        default=0,
+        description="Number of pixels to pad/crop from the left (negative values crop inwards, positive values pad outwards)",
+    )
+    right: int = InputField(
+        default=0,
+        description="Number of pixels to pad/crop from the right (negative values crop inwards, positive values pad outwards)",
+    )
+    top: int = InputField(
+        default=0,
+        description="Number of pixels to pad/crop from the top (negative values crop inwards, positive values pad outwards)",
+    )
+    bottom: int = InputField(
+        default=0,
+        description="Number of pixels to pad/crop from the bottom (negative values crop inwards, positive values pad outwards)",
+    )
+
+    def invoke(self, context: InvocationContext) -> ImageOutput:
+        image = context.services.images.get_pil_image(self.image.image_name)
+
+        # Calculate and create new image dimensions
+        new_width = image.width + self.right + self.left
+        new_height = image.height + self.top + self.bottom
+        image_crop = Image.new(mode="RGBA", size=(new_width, new_height), color=(0, 0, 0, 0))
+
+        # Paste new image onto input
+        image_crop.paste(image, (self.left, self.top))
+
+        image_dto = context.services.images.create(
+            image=image_crop,
+            image_origin=ResourceOrigin.INTERNAL,
+            image_category=ImageCategory.GENERAL,
+            node_id=self.id,
+            session_id=context.graph_execution_state_id,
+            is_intermediate=self.is_intermediate,
+        )
+
+        return ImageOutput(
+            image=ImageField(image_name=image_dto.image_name),
+            width=image_dto.width,
+            height=image_dto.height,
+        )
+
+
@invocation("img_paste", title="Paste Image", tags=["image", "paste"], category="image", version="1.1.0")
 class ImagePasteInvocation(BaseInvocation, WithWorkflow, WithMetadata):
    """Pastes an image into another image."""
--- a/invokeai/frontend/web/src/services/api/schema.d.ts
+++ b/invokeai/frontend/web/src/services/api/schema.d.ts