fix: pad vace_input_frames to min spatial size to avoid 3×3 kernel underflow by livepeer-tessa · Pull Request #696 · daydreamlive/scope

livepeer-tessa · 2026-03-15T18:19:20Z

Summary

Fixes #557

Guards VaceEncodingBlock._encode_with_conditioning against inputs whose spatial dimensions are below the WAN VAE's 3×3 convolution minimum — the spatial analogue of the temporal guard in PR #674 / issue #673.

Root Cause

The WAN VAE encoder has a 3×3 spatial convolution kernel in its first layer. When vace_input_frames has height or width < 3 pixels, PyTorch raises:

RuntimeError: Calculated padded input size per channel: (2 x 513).
Kernel size: (3 x 3). Kernel size can't be greater than actual input size

Observed in Prod (2026-03-15)

Pipeline: krea-realtime-video
App: github_f1lhgmk5v76a0ev1w0u378by-scope-app--prod
Job: 5193400c-da0f-4eef-8bdd-dd0fdd26c1db
Window: 10:48–10:59 UTC (11 minutes)
Volume: ~2,372 errors at ~4 errors/second
Input shape: height=2, width=513 (extremely short frame — cf. VaceEncodingBlock fails on extremely narrow input images (kernel size > input size) #557's narrow case of height=513, width=2)

Fix

In _encode_with_conditioning, after extracting (batch, channels, frames, height, width) from vace_input_frames:

If height < 3 or width < 3, pad to the minimum safe size using F.pad
vace_input_masks (if provided) is padded to match
block_state.height/width are updated so the downstream resolution assertion still passes
A WARNING is emitted so the unusual input remains visible in logs

Testing

The fix is a pure defensive guard — normal inputs (height ≥ 3, width ≥ 3) are completely unaffected. Abnormal inputs (< 3 on either axis) will now warn instead of crash.

Related: PR #674 (temporal kernel guard, same block)

…derflow The WAN VAE encoder contains a 3×3 spatial convolution kernel. When the input chunk has spatial dimensions < 3 on either axis the forward pass raises: RuntimeError: Calculated padded input size per channel: (2 x 513). Kernel size: (3 x 3). Kernel size can't be greater than actual input size Observed in prod logs (2026-03-15, 10:48–10:59 UTC) on krea-realtime-video pipeline, fal.ai job 5193400c-da0f-4eef-8bdd-dd0fdd26c1db: 2 372 errors over 11 minutes (~4 errors/second) from an input with height=2 pixels. Fix: in _encode_with_conditioning, detect when height or width < 3 and pad to the minimum safe size using F.pad. The corresponding masks tensor is also padded to keep shapes consistent. block_state.height/width are updated so the downstream resolution check still passes. A WARNING is emitted so the unusual input remains visible in logs without a crash. This is the spatial analogue of the 3×1×1 temporal kernel guard (issue #673, PR #674). Fixes #557 Signed-off-by: livepeer-robot <robot@livepeer.org>

coderabbitai · 2026-03-15T18:19:30Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 8588b9f9-cc03-445d-b85a-4a084dd85548

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch fix/vace-spatial-kernel-underflow-557

📝 Coding Plan

Generate coding plan for human review comments

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-03-15T18:25:41Z

🚀 fal.ai Preview Deployment


App ID	`daydream/scope-pr-696--preview`
WebSocket	`wss://fal.run/daydream/scope-pr-696--preview/ws`
Commit	`0d2e60a`

Testing

Connect to this preview deployment by running this on your branch:

uv run build && SCOPE_CLOUD_APP_ID="daydream/scope-pr-696--preview/ws" uv run daydream-scope

🧪 E2E tests will run automatically against this deployment.

github-actions · 2026-03-15T18:28:53Z

✅ E2E Tests passed


Status	passed
fal App	`daydream/scope-pr-696--preview`
Run	View logs

Test Artifacts

Check the workflow run for screenshots.

livepeer-tessa requested review from emranemran and mjh1 March 15, 2026 18:19

livepeer-tessa mentioned this pull request Mar 15, 2026

VaceEncodingBlock fails on extremely narrow input images (kernel size > input size) #557

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: pad vace_input_frames to min spatial size to avoid 3×3 kernel underflow#696

fix: pad vace_input_frames to min spatial size to avoid 3×3 kernel underflow#696
livepeer-tessa wants to merge 1 commit intomainfrom
fix/vace-spatial-kernel-underflow-557

livepeer-tessa commented Mar 15, 2026

Uh oh!

coderabbitai bot commented Mar 15, 2026

Review skipped

Uh oh!

github-actions bot commented Mar 15, 2026

Uh oh!

github-actions bot commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

livepeer-tessa commented Mar 15, 2026

Summary

Root Cause

Observed in Prod (2026-03-15)

Fix

Testing

Uh oh!

coderabbitai bot commented Mar 15, 2026

Review skipped

Uh oh!

github-actions bot commented Mar 15, 2026

🚀 fal.ai Preview Deployment

Testing

Uh oh!

github-actions bot commented Mar 15, 2026

✅ E2E Tests passed

Test Artifacts

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant