LTX2.3 Image-to-Video Audio-Visual Sync HD Edition

Description:

This workflow uses the LTX2.3 Distilled v1.1 model together with the VBVR Spatial Reasoning LoRA to generate video from an input image, and it generates the accompanying audio automatically.

Execution is split into two sampling stages. In the second stage, the official upscaling model performs a second enhancement pass on the video, using the original input first frame as a reference, to produce high-definition output. This second stage also applies the IC Detail LoRA and Refocus LoRA to further improve video detail.
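
The two-stage layout described above can be summarized as a small data structure. This is only an illustrative sketch, not the workflow's actual node graph: the names `Stage`, `build_plan`, `WORKFLOW_LORAS`, and the stage labels are hypothetical, and no sampler settings or resolutions are shown because the description does not state them.

```python
from dataclasses import dataclass, field


@dataclass
class Stage:
    """One sampling stage of the workflow (hypothetical helper type)."""
    name: str
    uses_upscaler: bool = False            # True if the official upscaling model is used
    first_frame_reference: bool = False    # True if the input first frame guides this stage
    extra_loras: list[str] = field(default_factory=list)


# LoRA named in the description as part of the workflow as a whole.
WORKFLOW_LORAS = ["VBVR Spatial Reasoning LoRA"]


def build_plan() -> list[Stage]:
    """Return the two stages in execution order (labels are illustrative)."""
    return [
        Stage(name="stage_1_sampling"),
        Stage(
            name="stage_2_upscale_enhance",
            uses_upscaler=True,
            first_frame_reference=True,
            extra_loras=["IC Detail LoRA", "Refocus LoRA"],
        ),
    ]


if __name__ == "__main__":
    print("Workflow LoRAs:", ", ".join(WORKFLOW_LORAS))
    for stage in build_plan():
        print(stage.name, "| upscaler:", stage.uses_upscaler,
              "| extra LoRAs:", stage.extra_loras)
```

Listing the detail LoRAs only on the second stage mirrors the description, where they are introduced as part of the enhancement pass rather than the initial sampling.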

Model Requirements