Ideogram v4: Text to Image - ComfyUI Workflow

This ComfyUI tutorial workflow demonstrates how to generate images with Ideogram 4.0 using either a simple text prompt or a structured JSON caption for precise visual control. The Ideogram4 Caption Prompt Template subgraph guides you to describe a scene summary, style block, background, and optional per-object elements with bounding boxes and hex color palettes. A ResolutionSelector node standardizes width and height, while a MarkdownNote explains the JSON schema so you can author layout-aware prompts without guessing. PreviewAny provides live feedback, and SaveImage writes the final output to disk.
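
As a rough illustration of the structured caption described above, the sketch below assembles a scene summary, style block, background, and one per-object element with a bounding box and hex palette, then serializes it to JSON. The field names (`summary`, `style`, `background`, `elements`, `description`, `bbox`, `palette`) and the normalized `[x0, y0, x1, y1]` box convention are assumptions made for illustration only; the MarkdownNote in the workflow is the authority on the actual schema.

```python
import json
import re

# Six-digit hex colors such as "#1A2B3C".
HEX_COLOR = re.compile(r"^#[0-9a-fA-F]{6}$")


def make_element(description, bbox, palette):
    """Build one per-object entry. Field names are hypothetical."""
    x0, y0, x1, y1 = bbox
    # Assumed convention: coordinates normalized to [0, 1], top-left to bottom-right.
    if not (0.0 <= x0 < x1 <= 1.0 and 0.0 <= y0 < y1 <= 1.0):
        raise ValueError(f"bbox must be normalized with x0 < x1 and y0 < y1: {bbox}")
    bad = [c for c in palette if not HEX_COLOR.match(c)]
    if bad:
        raise ValueError(f"invalid hex colors: {bad}")
    return {"description": description, "bbox": list(bbox), "palette": list(palette)}


def make_caption(summary, style, background, elements=()):
    """Assemble the full caption. Key names are hypothetical."""
    return {
        "summary": summary,
        "style": style,
        "background": background,
        "elements": list(elements),
    }


caption = make_caption(
    summary="A red fox sitting in a snowy clearing at dawn",
    style="soft watercolor illustration",
    background="pale pine forest fading into mist",
    elements=[
        make_element(
            "red fox, facing left",
            bbox=(0.1, 0.2, 0.6, 0.9),
            palette=["#C1440E", "#F5F5F5"],
        )
    ],
)

# This string is what you would paste into the prompt input.
caption_json = json.dumps(caption, indent=2)
print(caption_json)
```

Validating boxes and colors before pasting the JSON into the workflow catches malformed layout hints early, before an inference run is spent on them.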

Under the hood, the workflow routes your prompt into a custom Ideogram v4 inference node backed by the ideogram4_fp8_scaled diffusion model and a compatible VAE (flux2-vae). If you provide plain text, the template wraps it with sensible defaults; if you provide structured JSON, it passes through so the model can honor layout, color, and style constraints. This setup mirrors the official caption schema Ideogram 4.0 was trained on, giving you reproducible control over composition and color on your local machine. If you prefer a visual editor, you can optionally install KJNodes and use the Ideogram 4 Prompt Builder KJ node to create bounding boxes and palettes interactively.
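
The plain-text versus JSON routing described above can be pictured with a small sketch. This is not the template's actual implementation: the function name `prepare_prompt` and the single `summary` key used for wrapping are hypothetical, and the real subgraph may apply different defaults.

```python
import json


def prepare_prompt(user_input: str) -> str:
    """Return a JSON caption string for the inference node (illustrative only)."""
    text = user_input.strip()
    try:
        parsed = json.loads(text)
    except json.JSONDecodeError:
        parsed = None

    if isinstance(parsed, dict):
        # Structured caption: pass it through with its content untouched.
        return json.dumps(parsed)

    # Plain text (or JSON that is not an object): wrap it in a minimal caption.
    # The "summary" key is an assumed default, not the documented schema.
    return json.dumps({"summary": text})


print(prepare_prompt("a red fox in the snow"))
print(prepare_prompt('{"summary": "a red fox", "elements": []}'))
```

Checking for a JSON object rather than any valid JSON keeps inputs such as a bare number or a quoted string on the plain-text path.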