Include workflow_id in all execution WebSocket messages (CORE-198) by christian-byrne · Pull Request #13684 · Comfy-Org/ComfyUI

christian-byrne · 2026-05-03T06:08:40Z

PR Created by the Glary-Bot Agent

Why

The frontend already stores extra_data["extra_pnginfo"]["workflow"]["id"] when queueing a prompt and surfaces it via GET /api/jobs, but none of the WebSocket events emitted during execution carry it. As a result, frontends that need to scope progress state by workflow (multi-tab UIs, the Cloud frontend, etc.) have to maintain a prompt_id → workflow_id mapping that races with execution_start and silently leaks state across workflows when terminal frames are dropped.

This PR plumbs the existing workflow id through the executor and adds it alongside prompt_id on every execution event so clients can scope state purely from the message payload.

What changes

Single source of truth for workflow id

comfy_execution/jobs.py gains a public extract_workflow_id(extra_data) helper that returns None for any missing or non-string value. The existing _extract_job_metadata (used by /api/jobs) now delegates to it so REST and WS share one extraction path.

Plumbing

PromptExecutor.execute_async extracts the id once and stores it as self.workflow_id and self.server.last_workflow_id.
The module-level execute() coroutine and _send_cached_ui() take an explicit workflow_id argument.
ProgressRegistry.__init__ and reset_progress_state() accept an optional workflow_id, which WebUIProgressHandler reads when emitting progress_state and PREVIEW_IMAGE_WITH_METADATA.
The legacy progress event in main.py and the post-completion executing reset both pull from server.last_workflow_id.

WebSocket events that now include `workflow_id`

execution_start, execution_success, execution_error, execution_interrupted, execution_cached
executing, executed
progress, progress_state
the metadata dict in PREVIEW_IMAGE_WITH_METADATA

Hygiene

self.workflow_id and self.server.last_workflow_id are cleared in the finally block of execute_async so a progress callback racing with teardown cannot attach a previous run's workflow id to a later event.
The reconnect path in server.py's /ws handler now sends prompt_id and workflow_id alongside node, matching the regular executing payload so reconnecting clients can recover both prompt- and workflow-scoped state.

Backward compatibility

This is strictly additive. Every existing field is preserved; clients that don't read workflow_id are unaffected. When a prompt was queued without an extra_pnginfo.workflow.id (programmatic API usage, older frontends), the field is sent as null rather than omitted, so consumers see a consistent shape.

Tests

tests/execution/test_jobs.py: 10 new cases covering extract_workflow_id edge cases (missing/non-dict/non-string/empty values).
tests-unit/execution_test/test_workflow_id_in_ws_messages.py (new):
- Mocked PromptServer tests assert progress_state payloads include workflow_id at both the top level and per-node, plus the PREVIEW_IMAGE_WITH_METADATA metadata.
- ProgressRegistry and reset_progress_state propagation tests.
- AST-based static guard that walks execution.py, main.py, and comfy_execution/progress.py and fails if any dict literal carrying prompt_id is missing workflow_id. This covers execution_start/_success/_error/_interrupted/_cached/executing/executed/progress without needing a GPU-backed import of execution.py and serves as a regression net for future refactors.

Test counts:

pytest tests/execution/test_jobs.py → 58 passed (was 48)
pytest tests-unit/ (excluding the GPU-only preview_method_override_test.py) → 734 passed, 0 failed
ruff check on every touched file → clean

Manual verification

Confirmed the backend boots cleanly with --cpu --disable-all-custom-nodes and /system_stats responds. Live end-to-end ws capture against a real prompt was not run because the sandbox has no checkpoints / sample workflow handy and no GPU. The AST static guard plus the mock-server payload assertions cover every event listed above deterministically.

The frontend already stores extra_data['extra_pnginfo']['workflow']['id'] when queueing a prompt and exposes it via the /api/jobs REST endpoint, but none of the WebSocket events emitted during execution carry it. That makes it impossible to scope progress state by workflow on the client without maintaining a job_id -> workflow_id mapping that races with execution_start. This adds workflow_id alongside prompt_id on every execution event: - execution_start, execution_success, execution_error, execution_interrupted, execution_cached, executing, executed - progress and progress_state - the metadata block on PREVIEW_IMAGE_WITH_METADATA A new public extract_workflow_id helper in comfy_execution/jobs.py is the single source of truth for the lookup; the existing _extract_job_metadata delegates to it. The id is plumbed through PromptExecutor (stored as self.workflow_id and on server.last_workflow_id), the module-level execute() coroutine, the _send_cached_ui helper, and ProgressRegistry / reset_progress_state so WebUIProgressHandler can include it in progress_state and preview-image metadata. The progress hook in main.py reads server.last_workflow_id to populate the legacy 'progress' event. Tests cover the helper's edge cases (missing/non-string ids, non-dict levels) and that the WebUIProgressHandler emits workflow_id on every progress_state payload via mocked PromptServer.

…rden tests - Clear self.server.last_workflow_id and self.workflow_id in the PromptExecutor finally block so a progress callback racing with teardown can no longer attach the previous run's workflow_id to a later 'progress' event. - Include prompt_id and last_workflow_id in the reconnect 'executing' message in server.py so reconnecting clients can recover both workflow- and prompt-scoped execution state, matching the regular 'executing' payload. - Add an AST-based static guard that walks execution.py, main.py, and comfy_execution/progress.py and asserts every dict literal carrying prompt_id also carries workflow_id. Also add a unit test covering PREVIEW_IMAGE_WITH_METADATA metadata. Together these regression-test every emitter (execution_start/success/error/interrupted/cached, executing, executed, progress, progress_state, preview metadata) without requiring a GPU-backed import of execution.py.

coderabbitai · 2026-05-12T08:21:22Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: faa6bf8f-28ab-42e9-9291-2d8a9c9e7bd6

📥 Commits

Reviewing files that changed from the base of the PR and between 137de03 and 56c0ca3.

📒 Files selected for processing (3)

execution.py
main.py
server.py

🚧 Files skipped from review as they are similar to previous changes (3)

server.py
main.py
execution.py

📝 Walkthrough

Walkthrough

This PR adds workflow_id tracking throughout the node execution pipeline. It extracts workflow_id from request metadata via a new extract_workflow_id() helper, propagates it through PromptExecutor, ProgressRegistry, and WebUIProgressHandler, and includes it in all relevant WebSocket/progress payloads. reset_progress_state and progress handlers accept or emit workflow_id. Execution paths and cached-UI messages are updated to pass workflow_id. Tests were added to validate extraction, propagation, payload contents, preview-image metadata, and include AST-based regression checks.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 9.43% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately and concisely captures the main change: adding workflow_id to all execution WebSocket messages, matching the primary objective of this PR.
Description check	✅ Passed	The description is comprehensive and directly related to the changeset, explaining the why, what, implementation details, backward compatibility, and test coverage.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

Generate code and open pull requests
Plan features and break down work
Investigate incidents and troubleshoot customer tickets together
Automate recurring tasks and respond to alerts with triggers
Summarize progress and report instantly

Built for teams:

Shared memory across your entire org—no repeating context
Per-thread sandboxes to safely plan and execute work
Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@main.py`:
- Line 325: The "executing" reset uses server_instance.last_workflow_id after
e.execute(...) returns, but the executor clears last_workflow_id in finally
which can emit workflow_id: None; capture the workflow id from the
prompt/context before calling e.execute (e.g., read getattr(server_instance,
'last_workflow_id', None) or the prompt's workflow id into a local variable like
workflow_id_for_prompt) and pass that local value into
server_instance.send_sync("executing", {"node": None, "prompt_id": prompt_id,
"workflow_id": workflow_id_for_prompt}, server_instance.client_id) so the
per-item workflow id is preserved even if last_workflow_id is cleared in
finally.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: d7664042-4cf3-4d37-99a9-960bc31c56e7

📥 Commits

Reviewing files that changed from the base of the PR and between f6d5068 and 1f0b137.

📒 Files selected for processing (7)

comfy_execution/jobs.py
comfy_execution/progress.py
execution.py
main.py
server.py
tests-unit/execution_test/test_workflow_id_in_ws_messages.py
tests/execution/test_jobs.py

glary-bot · 2026-05-12T08:22:46Z

Thanks for the automated walkthrough — replying re: the docstring-coverage warning.

The 9.62% is computed across the whole repo, but the surfaces this PR actually adds or modifies already match (or exceed) the local convention for each touched file:

File	Existing docstring coverage on `master`	Notes
`comfy_execution/jobs.py`	81.2% (13/16)	New `extract_workflow_id` has a full docstring, including the `extra_pnginfo.workflow.id` storage contract.
`comfy_execution/progress.py`	55.3% (21/38)	New `workflow_id` params are documented at the class/function level; signatures are self-describing.
`main.py`	13.3% (2/15)	Touched `hijack_progress` already follows the file's no-docstring convention.
`execution.py`	0.0% (0/58)	None of `PromptExecutor`, `add_message`, `execute`, etc. have docstrings on `master`. Adding them only on touched lines would be inconsistent and cargo-culted.

The 80% threshold isn't a Comfy-Org convention — the affected files (especially execution.py) deliberately rely on type hints, descriptive names, and inline comments instead. The test functions also follow the existing pytest pattern in tests/execution/test_jobs.py of using descriptive method names rather than docstrings.

I'd rather leave docstrings as-is here to keep the diff focused on the actual behavior change. Happy to revisit if a maintainer disagrees and wants this PR to add docstrings beyond its scope.

The previous commit cleared server.last_workflow_id in PromptExecutor's finally block to prevent racing progress callbacks from leaking stale ids. That fix introduced a new bug: prompt_worker in main.py reads server.last_workflow_id *after* e.execute() returns to populate the terminal 'executing' (node=None) message that tells the client a prompt is done. Because the executor has already cleared the field by then, this message always emitted workflow_id=None. Capture the workflow id from extra_data into a local variable before calling e.execute(), and use that local in the send_sync() call so the per-prompt id is preserved through the executor's finally cleanup. Adds an AST-based regression test asserting that prompt_worker extracts workflow_id locally and that the terminal 'executing' send_sync never reads server.last_workflow_id. Verified the test fails on the buggy revision and passes after the fix.

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@tests-unit/execution_test/test_workflow_id_in_ws_messages.py`:
- Around line 138-157: The tests read source files with
Path(file_path).read_text(), which relies on the current working directory and
makes the tests flaky; update the reader to resolve paths relative to the test
module by deriving a base_dir = Path(__file__).resolve().parent (or an
appropriate ancestor) and then use (base_dir / file_path).read_text(); apply
this change in _assert_workflow_id_in_every_prompt_id_dict and the other similar
helper calls (the other occurrences around the second helper/test block) so all
file reads are deterministic regardless of CWD.
- Around line 265-287: The test currently only asserts that the payload dict for
send_sync("executing", ...) does not reference server.last_workflow_id but
silently passes if no matching send call is found; modify the test loop that
walks the AST (ast.walk(worker)) and inspects ast.Call nodes invoking .send_sync
with first arg "executing" to set a boolean flag (e.g., found_executing_send)
when a match is encountered, keep the existing payload check for workflow_id vs
last_workflow_id using ast.unparse(value), and after the loop assert that
found_executing_send is True (fail the test if no matching
send_sync("executing", ...) call was found) so the regression is caught when the
send is refactored to use a local variable.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: bc33d301-89e6-4cda-8069-de3e57d6c5a0

📥 Commits

Reviewing files that changed from the base of the PR and between 1f0b137 and 7ca4457.

📒 Files selected for processing (2)

main.py
tests-unit/execution_test/test_workflow_id_in_ws_messages.py

🚧 Files skipped from review as they are similar to previous changes (1)

main.py

Two robustness fixes to the workflow_id regression tests: 1. Resolve source-file paths relative to __file__ (repo_root = Path(__file__).resolve().parents[2]) instead of bare Path("main.py"). The bare form depends on the process CWD being the repo root, which made the tests flaky when run from tests-unit/, an IDE runner, or any other directory. Verified passing from three different CWDs (repo root, tests-unit/, /tmp). 2. Track matched_terminal_executing_send and assert it after the AST walk in TestTerminalExecutingResetInMainPy. Previously, if someone refactored the send_sync("executing", {...}) call to pass a local variable instead of an inline dict literal, the loop would simply find no match and the test would pass vacuously, losing its regression-protection value. Verified by temporarily refactoring main.py to use a variable payload: the test now fails with a clear message asking to update the regression test, and passes again once the inline dict is restored.

Kosinkadink · 2026-05-14T02:49:43Z

lgtm, just needs @guill 's blessing now

glary-bot · 2026-05-14T02:50:20Z

Thanks for the review, @Kosinkadink! Will leave the draft state until @guill takes a look.

guill

LGTM

glary-bot · 2026-05-14T20:07:13Z

Thanks @guill! With both code-owner approvals in place, this is ready to land whenever a maintainer wants to merge.

guill · 2026-05-14T20:08:10Z

        self.status_messages = []
-        self.add_message("execution_start", { "prompt_id": prompt_id}, broadcast=False)
+        self.workflow_id = extract_workflow_id(extra_data)
+        self.server.last_workflow_id = self.workflow_id


I slightly question whether the server is the right place to put this data, but I don't have any other ideas. This is probably fine as-is -- we can always move it later.

Agreed — server.last_workflow_id is mirroring the existing server.last_node_id / server.last_prompt_id pattern that hijack_progress in main.py reads from, mostly because the progress callback fires from inside the executor and needs some well-known place to pick up the current workflow id. A cleaner home would be a per-execution context object (something like a RunContext passed through the progress handler chain), but that's a larger refactor of how hijack_progress and the WebUI handler share state.

Happy to do that as a follow-up if you want it scoped — keeping it on the server here mainly to avoid expanding the PR's blast radius beyond the workflow_id plumbing itself.

Adds an `extra_data.metadata` envelope on POST /prompt body. The backend (Comfy-Org/ComfyUI#13979) treats the dict as opaque and merges every key/value at the top level of dict-typed WebSocket frames emitted during the prompt's execution. For now the envelope only carries `workflow_id` (sourced from the ComfyWorkflowJSON the user is queueing). The server has no awareness of what `workflow_id` means — it's just one key in a generic tag dict, and adding new tags later requires no backend change. This restores the wire shape from the reverted Comfy-Org/ComfyUI#13684 (top-level `workflow_id` on WS frames) without coupling the core execution layer to workflow concepts. Fixes FE-745

…omfy-Org#13684)

Glary-Bot added 2 commits May 2, 2026 02:29

christian-byrne assigned guill May 3, 2026

christian-byrne mentioned this pull request May 3, 2026

fix: filter WS messages by workflow tab to prevent cross-tab state leaking Comfy-Org/ComfyUI_frontend#10689

Closed

christian-byrne marked this pull request as ready for review May 12, 2026 08:18

christian-byrne requested review from Kosinkadink, alexisrolland, comfyanonymous, guill, kijai and rattus128 as code owners May 12, 2026 08:18

coderabbitai Bot reviewed May 12, 2026

View reviewed changes

Comment thread main.py Outdated

alexisrolland changed the title ~~Include workflow_id in all execution WebSocket messages~~ May 12, 2026

coderabbitai Bot reviewed May 12, 2026

View reviewed changes

Comment thread tests-unit/execution_test/test_workflow_id_in_ws_messages.py

Comment thread tests-unit/execution_test/test_workflow_id_in_ws_messages.py

guill approved these changes May 14, 2026

View reviewed changes

guill reviewed May 14, 2026

View reviewed changes

Merge branch 'master' into glary/ws-workflow-id

56c0ca3

Kosinkadink approved these changes May 14, 2026

View reviewed changes

Kosinkadink merged commit 4f60189 into master May 14, 2026
21 checks passed

comfyanonymous deleted the glary/ws-workflow-id branch May 14, 2026 22:30

comfyanonymous mentioned this pull request May 14, 2026

Revert "Include workflow_id in all execution WebSocket messages (CORE-198)" #13901

Merged

deepme987 mentioned this pull request May 15, 2026

feat(server): per-prompt metadata envelope on WebSocket events #13905

Closed

dante01yoon mentioned this pull request May 19, 2026

Propagate workflow_id via per-prompt metadata registry (FE-745) #13979

Closed

5 tasks

dante01yoon mentioned this pull request May 20, 2026

feat(api): send workflow_id via opaque metadata dict on queuePrompt (FE-745) Comfy-Org/ComfyUI_frontend#12374

Closed

5 tasks

simonri pushed a commit to simonri/ComfyUI-flash-attention-3 that referenced this pull request May 26, 2026

Include workflow_id in all execution WebSocket messages (CORE-198) (C…

101d779

…omfy-Org#13684)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Include workflow_id in all execution WebSocket messages (CORE-198)#13684

Include workflow_id in all execution WebSocket messages (CORE-198)#13684
Kosinkadink merged 5 commits into
masterfrom
glary/ws-workflow-id

christian-byrne commented May 3, 2026

coderabbitai Bot commented May 12, 2026 •

edited

Loading

Walkthrough

❌ Failed checks (1 warning)

coderabbitai Bot left a comment

Uh oh!

glary-bot Bot commented May 12, 2026

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Kosinkadink commented May 14, 2026

glary-bot Bot commented May 14, 2026

guill left a comment

glary-bot Bot commented May 14, 2026

guill May 14, 2026

glary-bot Bot May 14, 2026

Uh oh!

Labels

4 participants

Uh oh!

Conversation

christian-byrne commented May 3, 2026

Why

What changes

Single source of truth for workflow id

Plumbing

WebSocket events that now include workflow_id

Hygiene

Backward compatibility

Tests

Manual verification

coderabbitai Bot commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

❌ Failed checks (1 warning)

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

glary-bot Bot commented May 12, 2026

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Kosinkadink commented May 14, 2026

glary-bot Bot commented May 14, 2026

guill left a comment

Choose a reason for hiding this comment

glary-bot Bot commented May 14, 2026

guill May 14, 2026

Choose a reason for hiding this comment

glary-bot Bot May 14, 2026

Choose a reason for hiding this comment

Uh oh!

Labels

4 participants

WebSocket events that now include `workflow_id`

coderabbitai Bot commented May 12, 2026 •

edited

Loading