
The Capybara: Image Edit workflow supports image and video editing driven by plain-text instructions. Using the Capybara model, it handles editing tasks such as style changes, object replacement, and time-of-day adjustments. A LoadImage node supplies the input and a SaveImage node writes the output, while the core processing runs in a dedicated node identified by its unique ID (4f4ddf39-1508-4d34-a35c-ff10e6ce995b). Loading, processing, and saving are therefore wired together in a single graph, which works for both simple and complex edits.
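As a rough illustration of that node layout, the sketch below checks whether a workflow contains the nodes named above. The `workflow` dictionary is a hypothetical, heavily trimmed stand-in, and the `nodes`/`type` layout is an assumption about how an exported workflow file is structured, not a confirmed schema. The helper names `node_types` and `missing_nodes` are illustrative only.

```python
from collections import Counter

# Hypothetical, trimmed stand-in for an exported workflow file.
# Assumption: nodes live in a top-level "nodes" list and each has a "type" field.
workflow = {
    "nodes": [
        {"id": 1, "type": "LoadImage"},
        {"id": 2, "type": "4f4ddf39-1508-4d34-a35c-ff10e6ce995b"},  # core processing node
        {"id": 3, "type": "SaveImage"},
    ]
}


def node_types(wf: dict) -> Counter:
    """Count how many nodes of each type appear in the workflow."""
    return Counter(node.get("type", "<unknown>") for node in wf.get("nodes", []))


def missing_nodes(wf: dict, required: list[str]) -> list[str]:
    """Return the required node types that are absent from the workflow."""
    present = node_types(wf)
    return [name for name in required if name not in present]


if __name__ == "__main__":
    required = ["LoadImage", "SaveImage", "4f4ddf39-1508-4d34-a35c-ff10e6ce995b"]
    print("node types:", dict(node_types(workflow)))
    print("missing:", missing_nodes(workflow, required))
```

A check like this can be run on a downloaded workflow before loading it, to confirm that the input, output, and core nodes are all present.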

Technically, the workflow relies on the Capybara model to interpret text instructions and produce the requested edits. The model is paired with a text encoder, qwen_2.5_vl_7b.safetensors, which processes the user's instructions. Resolution is configurable: the Capybara Resolution Configuration lets users set output dimensions by aspect ratio and pixel count. This flexibility makes the workflow useful for content creators and digital artists who want precise, creative control over their media.
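To make the relationship between aspect ratio and pixel count concrete, here is a minimal sketch that derives a width and height from the two. It does not reproduce the Capybara Resolution Configuration itself. The function name `dims_for`, the 1024 x 1024 pixel budget, and the rounding to a multiple of 16 are all assumptions chosen for illustration.

```python
import math


def dims_for(aspect_w: int, aspect_h: int, target_pixels: int, multiple: int = 16) -> tuple[int, int]:
    """Pick a (width, height) close to target_pixels with the given aspect ratio.

    Assumption: dimensions are snapped to a multiple of `multiple`; the real
    configuration may use different constraints.
    """
    if aspect_w <= 0 or aspect_h <= 0 or target_pixels <= 0 or multiple <= 0:
        raise ValueError("all arguments must be positive")

    # Scale factor s such that (aspect_w * s) * (aspect_h * s) == target_pixels.
    scale = math.sqrt(target_pixels / (aspect_w * aspect_h))

    def snap(value: float) -> int:
        # Round to the nearest multiple, never going below one multiple.
        return max(multiple, round(value / multiple) * multiple)

    return snap(aspect_w * scale), snap(aspect_h * scale)


if __name__ == "__main__":
    budget = 1024 * 1024  # hypothetical pixel budget
    for ratio in [(1, 1), (16, 9), (9, 16), (4, 3)]:
        print(ratio, dims_for(*ratio, budget))
```

Because of the snapping step, the resulting pixel count and aspect ratio are only approximately equal to the requested values.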