Skip to content

Add new open-source model and built-in tool blueprints#13980

Merged
alexisrolland merged 20 commits into
masterfrom
blueprints-0519
May 25, 2026
Merged

Add new open-source model and built-in tool blueprints#13980
alexisrolland merged 20 commits into
masterfrom
blueprints-0519

Conversation

@comfyui-wiki

@comfyui-wiki comfyui-wiki commented May 19, 2026

Copy link
Copy Markdown
Member

New Blueprints

Open-source model blueprints

  • Stable Audio 3 Medium Base
  • SDPose: video/image to pose map
  • Anima Base
  • VOID
  • WAN2.1 VACE: Video Inpainting
  • Mediapipe: Video, Image Depth Estimation; Geometry Estimation
  • Anima Base v1.0

Built-in tool

  • Merge Videos
  • Select Per-line Text by index

Update blueprint category

@comfyui-wiki comfyui-wiki changed the title New Blueprints May 24, 2026
@comfyui-wiki comfyui-wiki marked this pull request as ready for review May 24, 2026 13:56
@coderabbitai

coderabbitai Bot commented May 24, 2026

Copy link
Copy Markdown

Review Change Stack

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

  • @coderabbitai resume to resume automatic reviews.
  • @coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

  • ▶️ Resume reviews
  • 🔍 Trigger review
📝 Walkthrough

Walkthrough

This PR adds many new ComfyUI blueprints (audio generation Stable Audio 3 base/medium with optional Qwen reprompting; Image/Video pose mapping via SDPose; Merge Videos; Split Image Grid; Select Per-Line Text; Text to Image Anima; MoGe geometry and depth estimators for image and video; Image/Video face detection via MediaPipe) and applies category metadata reclassifications to several existing blueprints. No exported code/API signatures were changed.

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Description check ✅ Passed The description directly relates to the changeset, listing the specific new model and tool blueprints added along with category updates.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.
Title check ✅ Passed The title accurately summarizes the main changes: addition of new open-source model blueprints (Stable Audio 3, SDPose, Anima, etc.) and new built-in tool blueprints (Merge Videos, Select Per-Line Text by Index), with category updates.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 3

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@blueprints/Merge` Videos.json:
- Around line 362-365: The blueprint currently hardcodes EmptyAudio via
widgets_values: [60, 44100, 2] which causes AV desync; update the switch logic
so that when drop_audio is true the CreateVideo node receives either a generated
EmptyAudio whose duration is computed from the merged output (duration =
frame_count / fps, derived from the images list and fps) or, preferably, no
audio at all by routing a None/omitted value into CreateVideo.audio; modify the
ComfySwitchNode/PrimitiveBoolean branch that selects on_true to produce either a
dynamically-sized EmptyAudio (use the same frame_count and fps inputs used by
CreateVideo to compute seconds) or change the wiring so the switch returns
null/undefined for audio when drop_audio is true, ensuring
CreateVideo.execute(..., audio: Optional[Input.Audio] = None) receives no audio.

In `@blueprints/Select` Per-Line Text by Index.json:
- Around line 295-303: The "First Group" regex (currently
(?:[^\n]*\n){index}([^\n]+)) fails on empty indexed lines; update both
occurrences of the "First Group" pattern in blueprints/Select Per-Line Text by
Index.json to an anchored, empty-tolerant version by switching the capture to
allow zero-length matches and anchoring the start (e.g. replace the existing
pattern with one that begins with ^ and uses [^\n]* for the captured group) so
an empty line at the target index is correctly returned.

In `@blueprints/Text` to Image (Anima).json:
- Line 910: Fix the typo in the JSON group's title by removing the extra closing
parenthesis in the "title" value currently set to "Image Size(1MP))"; update the
string (e.g., to "Image Size (1MP)" or "Image Size(1MP)" per project style) in
the same JSON object where the "title" key appears so the group title no longer
contains the stray ')'.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 7bfd35ad-d43d-4448-accf-b0b5948ef5fa

📥 Commits

Reviewing files that changed from the base of the PR and between 08d809d and 93da5af.

📒 Files selected for processing (13)
  • blueprints/Audio Generation (Stable Audio 3 Medium Base).json
  • blueprints/Audio Generation (Stable Audio 3 Medium).json
  • blueprints/Depth to Video (ltx 2.0).json
  • blueprints/Image to Depth Map (Lotus).json
  • blueprints/Image to Pose Map (SDPose Multi-Person).json
  • blueprints/Image to Pose Map (SDPose-OOD).json
  • blueprints/Merge Videos.json
  • blueprints/Select Per-Line Text by Index.json
  • blueprints/Split Image Grid to Tiles.json
  • blueprints/Text to Image (Anima).json
  • blueprints/Video Inpaint (VOID).json
  • blueprints/Video Inpainting (Wan2.1 VACE).json
  • blueprints/Video to Pose Map (SDPose Multi-Person).json
Comment thread blueprints/Merge Videos.json
Comment thread blueprints/Select Per-Line Text by Index.json
Comment thread blueprints/Text to Image (Anima).json Outdated
alexisrolland
alexisrolland previously approved these changes May 25, 2026

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@blueprints/Video` Face Detection (Mediapipe).json:
- Line 225: The "description" field in the Video Face Detection (Mediapipe)
blueprint currently says it detects landmarks "from an image" but the workflow
consumes video; update the description string to say it detects facial landmarks
from a video (e.g., "Detects facial landmarks from a video using MediaPipe,
outputting landmark data, face bounding boxes, and an optional face-region
mask.") so it accurately reflects the input modality referenced in the
blueprint's "description" property.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: efe99e4e-35f2-49c5-9913-90498cca4842

📥 Commits

Reviewing files that changed from the base of the PR and between 93da5af and 5b7bab5.

📒 Files selected for processing (28)
  • blueprints/Canny to Image (Z-Image-Turbo).json
  • blueprints/Canny to Video (LTX 2.0).json
  • blueprints/ControlNet (Z-Image-Turbo).json
  • blueprints/Depth to Image (Z-Image-Turbo).json
  • blueprints/Depth to Video (ltx 2.0).json
  • blueprints/First-Last-Frame to Video (LTX-2.3).json
  • blueprints/First-Last-Frame to Video.json
  • blueprints/Geometry Estimation (MoGe).json
  • blueprints/Image Captioning (gemini).json
  • blueprints/Image Depth Estimation (MoGe).json
  • blueprints/Image Face Detection (Mediapipe).json
  • blueprints/Image Segmentation (SAM3).json
  • blueprints/Image Upscale(Z-image-Turbo).json
  • blueprints/Image to Depth Map (Lotus).json
  • blueprints/Image to Pose Map (SDPose Multi-Person).json
  • blueprints/Image to Pose Map (SDPose-OOD).json
  • blueprints/Pose to Image (Z-Image-Turbo).json
  • blueprints/Pose to Video (LTX 2.0).json
  • blueprints/Prompt Enhance.json
  • blueprints/Remove Background (BiRefNet).json
  • blueprints/Video Captioning (Gemini).json
  • blueprints/Video Depth Estimation (MoGe).json
  • blueprints/Video Face Detection (Mediapipe).json
  • blueprints/Video Inpaint (VOID).json
  • blueprints/Video Inpaint(Wan2.1 VACE).json
  • blueprints/Video Segmentation (SAM3).json
  • blueprints/Video Upscale(GAN x4).json
  • blueprints/Video to Pose Map (SDPose Multi-Person).json
💤 Files with no reviewable changes (1)
  • blueprints/Video Inpaint(Wan2.1 VACE).json
✅ Files skipped from review due to trivial changes (15)
  • blueprints/Image Segmentation (SAM3).json
  • blueprints/Pose to Image (Z-Image-Turbo).json
  • blueprints/First-Last-Frame to Video (LTX-2.3).json
  • blueprints/Canny to Video (LTX 2.0).json
  • blueprints/Video Captioning (Gemini).json
  • blueprints/ControlNet (Z-Image-Turbo).json
  • blueprints/First-Last-Frame to Video.json
  • blueprints/Canny to Image (Z-Image-Turbo).json
  • blueprints/Depth to Image (Z-Image-Turbo).json
  • blueprints/Pose to Video (LTX 2.0).json
  • blueprints/Remove Background (BiRefNet).json
  • blueprints/Image Upscale(Z-image-Turbo).json
  • blueprints/Image Captioning (gemini).json
  • blueprints/Image to Depth Map (Lotus).json
  • blueprints/Depth to Video (ltx 2.0).json
Comment thread blueprints/Video Face Detection (Mediapipe).json Outdated
@comfyui-wiki comfyui-wiki changed the title Add new open-source model and new built-in tool blueprints May 25, 2026
@alexisrolland alexisrolland self-requested a review May 25, 2026 19:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants