Voltar
Wan 2.2 14B Fun Camera Control

This ComfyUI workflow turns a single still image into a smooth, camera-driven video using Wan 2.2 14B Fun Camera Control. It loads the Wan 2.2 model with UNETLoader and ModelSamplingSD3, encodes your text prompt with CLIPLoader + CLIPTextEncode, and builds a camera path with WanCameraEmbedding. The WanCameraImageToVideo node fuses your prompt, start image, and camera embeddings to generate temporally consistent frames that follow pan, zoom, and rotation cues. Frames are decoded with VAELoader + VAEDecode, assembled with CreateVideo, and written out via SaveVideo.

Under the hood, the workflow runs a two-pass denoise with KSamplerAdvanced: a High noise phase to establish motion, structure, and parallax, followed by a Low noise phase to refine details and reduce flicker. This separation helps keep subjects stable while letting the virtual camera move dynamically. The template includes an optional LightX2V 4 Steps LoRA (via LoraLoaderModelOnly) to accelerate generation at the cost of motion richness and fine detail; use the Default (Without Lightning LoRA) path for the best dynamics. The provided fp8_scaled Wan 2.2 checkpoint is optimized for 640×640 and, on an RTX 4090D 24GB, typically uses ~84% VRAM and takes about 536s for the first run and ~513s for subsequent runs at that resolution.