Allow `VideoProcessor` to Accept Single Image Inputs by dg845 · Pull Request #13084 · huggingface/diffusers

dg845 · 2026-02-05T02:50:33Z

What does this PR do?

This PR adds support for single-image inputs to VideoProcessor.preprocess_video. As VideoProcessor.preprocess_video uses VaeImageProcessor.preprocess under the hood, the PR also changes preprocess_video to forward keyword arguments to preprocess. This allows it to support arguments that preprocess supports, such as resize_mode.

Changelist

Adds support for single image videos for VideoProcessor.preprocess_video.
VideoProcessor.preprocess_video and VideoProcessor.postprocess_video now forward keyword arguments to VaeImageProcessor.preprocess and VaeImageProcessor.postprocess, respectively.
Improve docstrings in VideoProcessor.

Inspired by discussion in #13058.

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sayakpaul
@yiyixuxu

…ocess

HuggingFaceDocBuilderDev · 2026-02-05T02:59:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2026-02-05T03:59:29Z

src/diffusers/video_processor.py


    def postprocess_video(
-        self, video: torch.Tensor, output_type: str = "np"
+        self, video: torch.Tensor, output_type: str = "np", **kwargs


What would kwargs facilitate?

Currently, this would facilitate passing the do_denormalize flag to VaeImageProcessor.postprocess. But it's intended more as a forward-looking change which allows postprocess_video to support any arguments that postprocess might want.

sayakpaul

Just one comment. I guess we can create an LTX-2 specific video processor subclassing from the current one and implement center cropping logic?

dg845 · 2026-02-05T05:04:42Z

For LTX-2, the idea is that after the changes in this PR, we can call preprocess_video(..., resize_mode="crop") to preprocess the conditions with center cropping like in the original code. I tried this on the bird image used in the FLF2V example and the result looks very close to the original code:

sayakpaul · 2026-02-05T05:15:57Z

@yiyixuxu WDYT?

dg845 added 2 commits February 5, 2026 02:52

Forward kwargs from preprocess/postprocess_video to preprocess/postpr…

8a91357

…ocess

Allow VideoProcessor.preprocess_video to accept single-image inputs

4ebcdb6

dg845 requested a review from sayakpaul February 5, 2026 02:50

dg845 mentioned this pull request Feb 5, 2026

Add LTX2 Condition Pipeline #13058

Open

sayakpaul reviewed Feb 5, 2026

View reviewed changes

sayakpaul approved these changes Feb 5, 2026

View reviewed changes

Merge branch 'main' into video-processor-accept-imagelike-inputs

c30acad

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow `VideoProcessor` to Accept Single Image Inputs#13084

Allow `VideoProcessor` to Accept Single Image Inputs#13084
dg845 wants to merge 3 commits intomainfrom
video-processor-accept-imagelike-inputs

dg845 commented Feb 5, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2026

Uh oh!

sayakpaul Feb 5, 2026

Uh oh!

dg845 Feb 5, 2026

Uh oh!

sayakpaul left a comment

Uh oh!

dg845 commented Feb 5, 2026

Uh oh!

sayakpaul commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

dg845 commented Feb 5, 2026

What does this PR do?

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2026

Uh oh!

sayakpaul Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

dg845 Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

dg845 commented Feb 5, 2026

Uh oh!

sayakpaul commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants