Qwen Image Edit 2509: Combine Multiple Images Into One Scene for Fashion, Products, Poses & more

Prompt: A woman in image 1 is riding a vintage in image 2. A green trash bin in image 3 is visible beside her.

Ever wanted to merge the perfect outfit with the ideal pose, or showcase different products all in a single, seamless scene? Whether you’re a designer looking to preview fashion combinations, a marketer building composite product showcases, or an artist bringing creative concepts to life, this workflow gives you the power to experiment freely and see instant, consistent results.

What is Qwen Image Edit 2509? Why is it better?

Source: Qwen Image

Qwen Image Edit 2509 lets you combine multiple images into a single scene using text prompts. Reference each input by number in your prompt ("image 1," "image 2," "image 3") and the model places them together.
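If you script your edits (for example, through ComfyUI's API) instead of typing prompts by hand, the numbering convention is easy to automate. Here's a minimal sketch; the helper and its template are hypothetical, not part of the workflow:

# Hypothetical helper: build a prompt that references each input as
# "image N", where N follows the order of the workflow's Load Image nodes.
def build_prompt(template: str, subjects: list[str]) -> str:
    refs = {f"img{i + 1}": f"{desc} in image {i + 1}"
            for i, desc in enumerate(subjects)}
    return template.format(**refs)

prompt = build_prompt(
    "{img1} is riding {img2}. {img3} is visible beside her.",
    ["A woman", "the vintage scooter", "A green trash bin"],
)
# -> "A woman in image 1 is riding the vintage scooter in image 2.
#     A green trash bin in image 3 is visible beside her."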

Released in September 2025 (the "2509" stands for the year and month of release), this update handles complex multi-image edits better than the original Qwen Image Edit: faster processing, better alignment between elements, and more realistic results when combining different inputs.

Useful for product mockups, fashion previews, or any project where you need to merge different elements into one cohesive image.

Prompt: An orange cat in image 1 and a white dog in image 2 meet and greet each other at the grassy place in image 3.

Why settle for one-at-a-time edits, when your next masterpiece could start with everything you need, all at once?

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: September 29, 2025

ComfyUI v0.3.60 with qwen_image_edit_2509_fp8_e4m3fn.safetensors model support

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, the minimum requirement is the Turbo 24GB machine, but we recommend the Ultra 48GB machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow is missing required nodes. Install those custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager > Click Install Missing Custom Nodes
  2. Check the list below for any custom nodes that need to be installed, and click Install.

Required Models

For this guide you'll need to download these 4 recommended models.

1. qwen_image_edit_2509_fp8_e4m3fn.safetensors
2. Qwen-Image-Lightning-4steps-v1.0.safetensors
3. qwen_2.5_vl_7b_fp8_scaled.safetensors
4. qwen_image_vae.safetensors
  1. Go to ComfyUI Manager > Click Model Manager
  2. Search for the models above; when you find the exact model you're looking for, click Install, and make sure to press Refresh when you are finished.

If Model Manager doesn't have them: Use the direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You can also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory

qwen_image_edit_2509_fp8_e4m3fn.safetensors → .../comfyui/models/diffusion_models/
Qwen-Image-Lightning-4steps-v1.0.safetensors → .../comfyui/models/lora/
qwen_2.5_vl_7b_fp8_scaled.safetensors → .../comfyui/models/text_encoders/
qwen_image_vae.safetensors → .../comfyui/models/vae/
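If you're running ComfyUI locally rather than on ThinkDiffusion, a short script can place each file into the matching folder. This is a minimal sketch assuming a default local install layout; the direct download links ship with the workflow, so they are left blank here rather than guessed:

import urllib.request
from pathlib import Path

COMFYUI_MODELS = Path("ComfyUI/models")  # adjust to your install

# Destination subfolders, mirroring the table above.
MODELS = {
    "qwen_image_edit_2509_fp8_e4m3fn.safetensors": "diffusion_models",
    "Qwen-Image-Lightning-4steps-v1.0.safetensors": "lora",  # "loras" on stock ComfyUI
    "qwen_2.5_vl_7b_fp8_scaled.safetensors": "text_encoders",
    "qwen_image_vae.safetensors": "vae",
}
URLS = {}  # fill in the direct links bundled with the workflow

for filename, subdir in MODELS.items():
    dest = COMFYUI_MODELS / subdir / filename
    dest.parent.mkdir(parents=True, exist_ok=True)
    if filename in URLS and not dest.exists():
        print(f"Downloading {filename} -> {dest}")
        urllib.request.urlretrieve(URLS[filename], dest)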

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well with the default settings. Here are a few steps where you might want to pay extra attention.

1. Set the Models

Set the required models as seen in the image.
2. Load the Input Image

Upload your input images. You can use from 1 up to 3 images; the workflow works with a single input image as well, just bypass the unused image nodes.
3. Write the Prompt

Write a detailed prompt. Designate each input as "image <number>" in the prompt, then describe whatever scene and actions you want.
4. Check the Sampling Settings

Check the sampling settings. If you want higher-quality output you can use the full model, but it needs a larger machine than Ultra; otherwise, use the fp8 model.
5. Check the Output Image


Insights

💡
I can't generate new images from scratch; the model only lets me edit or modify images that I provide as input. If I try complex compositional changes like switching backgrounds, inserting several new objects, or making extensive additions, I often find that the model struggles with these tasks.
💡
If my edits involve faces, personal data, explicit imagery, or copyrighted material, my requests might get denied or the renders might be incomplete. Sometimes, I run into limitations with supported file formats or maximum image dimensions.
💡
I'm aware that it can sometimes generate images with a "plasticky" or artificial AI look, especially in areas like skin, faces, or complex edits. This characteristic is a known limitation and has been discussed in the user community as an effect where the result lacks realism, often showing overly smooth textures, unnatural glossiness, or uniform surfaces that do not appear lifelike.

Examples

Prompt: A woman in image 1 is holding the white in image 2. She is sitting in the living room in image 3.
Prompt: A toy robot in image 1, a toy crane in image 2 and a shoe in image 3 are visible in the kitchen.
Prompt: A man facing front in image 1 wears the dress in image 2. He is holding a basket of colorful eggs in image 3. The background is a city street.

Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools.
Flux Krea Dev: Photorealistic Portraits Without the AI Look - Workflow + Guide

Prompt: A realistic person wearing a bright yellow raincoat stands smiling near the right edge of a wide, landscape-oriented frame. The background is a vibrant urban skatepark, with colorful ramps and graffiti, set after a rain—puddles reflecting bold street art and cloudy sky. The person has authentic skin texture and natural facial features, fully clothed (no topless or naked appearance), and holds a skateboard with one hand. The words "Flux Krea" are clearly visible as part on the clothing. Lighting is soft and natural, with realistic shadows and reflections. No plasticky skin, no overly smooth surfaces, no AI artifacts, no uncanny valley. The composition is dynamic and wide, blending urban energy with lifelike detail. Highly detailed, organic, and human appearance

Flux Krea Dev is built to create photorealistic images without the usual AI giveaways: no plastic skin, no overly smooth textures, no uncanny-valley weirdness.

This is a collaboration between Black Forest Labs and Krea AI. It's a 12-billion parameter model that handles realistic skin tones, natural lighting, and follows prompts accurately. Works especially well for portraits and character shots.

It's faster than standard Flux Dev and produces more varied, lifelike results.

What is Flux Krea Dev?

Source: Flux Krea Dev

Flux Krea Dev is designed to generate photorealistic images while avoiding common AI artifacts like plastic-looking skin or overprocessed textures. The 12-billion parameter model maintains realistic detail, handles nuanced lighting well, and produces natural skin tones.

It supports fine-tuning, works with existing Flux workflows, and generates images quickly. Good for portraits, character work, and any project where you need genuinely realistic human features.

Comparison of Flux Krea Dev with the Standard Model

Prompt: A female assassin dances fluidly atop a city rooftop at night, her sleek, dark attire blending modern tactical gear with elegant, flowing elements. Neon lights from the city skyline reflect off her outfit as she moves with precision and grace, her silhouette striking against the urban backdrop. Seed - 521404981828559, Euler, Simple, Steps 25, CFG 1

Flux Krea Dev is a major upgrade from the original Flux Dev, designed to create genuinely lifelike images. Where Flux Dev often produces flat, repetitive, or generic results, Flux Krea Dev excels in capturing photorealistic detail, prompt accuracy, and expressive variety—making it the best open-source choice for anyone seeking high-quality, realistic AI-generated art and portraits.

Get ready for a hands-on experience designed to make your art leap off the screen and make people ask, “Is that really AI?”

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: June 27, 2025

ComfyUI v0.3.47 with flux1-krea-dev.safetensors model support

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, the minimum requirement is the Turbo 24GB machine, but we recommend the Ultra 48GB machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow is missing required nodes. Install those custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager > Click Install Missing Custom Nodes
  2. Check the list below for any custom nodes that need to be installed, and click Install.

Required Models

For this guide you'll need to download these 4 recommended models.

1. flux1-krea-dev.safetensors
2. t5xxl_fp32.safetensors
3. clip_l.safetensors
4. ae.safetensors
  1. Go to ComfyUI Manager > Click Model Manager
  2. Search for the models above; when you find the exact model you're looking for, click Install, and make sure to press Refresh when you are finished.

If Model Manager doesn't have them: Use the direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You can also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory

flux1-krea-dev.safetensors → .../comfyui/models/diffusion_models/
t5xxl_fp32.safetensors → .../comfyui/models/clip/
clip_l.safetensors → .../comfyui/models/clip/
ae.safetensors → .../comfyui/models/vae/
💡
If flux1-krea-dev.safetensors is already a pre-loaded model, you don't need to upload it.

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well with the default settings. Here are a few steps where you might want to pay extra attention.

1. Set the Models

Set the models as shown in the image.
2. Write a Prompt

Write a prompt with a realistic description. It is best to depict a character or portrait in the prompt.
3. Check the Sampling

Set the sampling settings as shown in the image. Don't change the scheduler or sampler, as doing so may result in poor quality.
4. Check the Generated Image


Examples

Prompt: A vivid portrait of a modern street artist, spray paint stains on their hands and a colorful bandana covering part of their face. The character stands against a bright graffiti wall, with energetic splashes of neon paint in the background. Their eyes are lively and creative, capturing urban spirit and bold individuality. Studio-quality lighting, crisp detail, portrait format
Prompt: A close-up portrait of a mysterious elf scholar, pointed ears visible beneath tousled silver hair, intricate crystal earrings, and a deep blue cloak embroidered with ancient runes. The character is posed against a backdrop of weathered library shelves filled with glowing magical tomes. Soft, moody lighting highlights delicate facial features and thoughtful eyes. Highly detailed, fantasy avatar style, portrait orientation
Prompt: A close-up shot of a young woman with curly light brown hair. The woman's hair is blowing in the wind and cascades over her shoulders. She is wearing a white tank top with a white strap on the back. Her eyes are open and her lips are pursed. The background is blurred out, but it appears to be a sunny day. The sky is a light blue, and the sun is shining down on the left side of the image.
Prompt: A realistic person shown from the waist up in a landscape-oriented image, positioned randomly within the frame (off-center, left, right, or anywhere unexpected). The person is fully clothed in casual, modern attire—no topless or naked appearance. The background is visually interesting and randomly chosen, such as a lively city street, tranquil beach, cozy café, lush garden, misty park, or an abstract blur. The person has authentic skin texture, natural facial features, and expressive emotion, with realistic lighting and true color tones. No plasticky skin, no overly smooth surfaces, no AI artifacts, no uncanny valley. Composition is wide and balanced, with distinct background elements contributing to the atmosphere. Highly detailed, organic, human appearance
Prompt: An indoor shot of a man smoking a cigarette. The smoke is coming from behind his head, obscuring his face in the upper right corner of the frame. His left hand is resting on his chin, while his right hand is holding the cigarette in his left hand. His mouth is slightly open, as if he is about to smoke the cigarette. His right ear is visible, and his upper left ear is showing. His eyes are dark, and he has a small amount of light shining on his face. The lighting is subdued, as evidenced by the shadow of the man's face on the right side of the image.
Prompt: A medium-sized woman stands in front of a colorful graffiti wall. The woman's hair is long and cascades down to her shoulders. She is wearing a black short-sleeved t-shirt and black pants. The word "FLOYO" is written in large, bold letters in a vibrant shade of pink, orange, and black. The letters are outlined in a darker shade of black. The wall behind the woman is a vibrant combination of blue, green, yellow, and orange. The ground beneath her is a dark gray asphalt.
Prompt: A highly realistic portrait of a person with authentic skin texture and natural facial features, expressive eyes, and genuine emotion. The background is random and visually interesting—such as an urban street, cozy café, lush garden, or abstract blurred scene—adding character to the composition. Lighting is soft and flattering, with true-to-life colors and gentle shadows. No plasticky skin, no AI artifacts, no overly smooth surfaces, no uncanny valley. Rich detail, organic appearance

Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

If you're having issues with the workflow, visit us on Discord at #Help Desk, or email us at [email protected].

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools.
Top 5 ComfyUI Flux Workflows

Flux is one of the most popular AI image models in ComfyUI right now. It handles text-to-image generation well, follows prompts accurately, and works across different art styles.

Here are the 5 Flux workflows people are using the most. Each one does something different, from filling in parts of images to training custom models to keeping characters looking consistent across scenes.

Pick whichever matches what you're trying to do.

  • Inpainting with Reference using Flux Fill and Flux Redux workflow: lets you naturally fill or edit image areas to match a reference image's style.
  • Train Flux Models using Flux workflow: lets you quickly fine-tune AI models with your own images in ComfyUI.
  • Image2Image using Flux ControlNet workflow: lets you transform or enhance images with precise structure and style control by combining your input image, prompts, and advanced ControlNet guidance.
  • Consistent Character Creating using Flux workflow: lets you generate multiple images of the same character with matching features, style, and identity across different poses and scenes.
  • Intro to Flux workflow: Flux is a state-of-the-art AI model for generating high-quality, detailed images from text prompts, known for its versatility, prompt accuracy, and support for diverse artistic styles.

Inpainting with Reference using Flux Fill and Flux Redux


What it's great for:

  • Seamlessly inpaints, outpaints, and fills missing or extended areas with natural blending and high visual consistency.
  • Accurately follows user text prompts for content replacement, restyling, and creative modifications
  • Combines styles or images to generate unique variations while preserving important details.
  •  Supports adjustments in aspect ratio, guidance scale, resolution, and blending intensity for tailored results.

Inpainting using reference images is ideal for digital artists, photographers, designers, and content creators who want to restore, enhance, or modify images seamlessly. It benefits anyone needing to remove objects, repair photos, or add new elements that match the original style, making it useful for both professionals and hobbyists.

The Inpainting Revolution: How Reference Images with Flux Fill and Flux Redux Are Changing the Game
Ever needed to add something to an image that wasn’t there before? That’s where Flux Fill and Flux Redux come in – they’re changing the game for image editing by making inpainting (filling in parts of images) look natural and professional.

Train Flux Models using Flux


What it's great for:

  • Train Flux models efficiently on machines with low VRAM.
  • Node-based interface to manage the entire training process within the same environment as your image generation workflows.
  • Easily train on small datasets (10–30 images), mix multiple datasets, and apply data augmentation or custom captions for diverse results.
  • Fine-tune training with adjustable parameters like batch size, learning rate, optimizer type.
  • Produce detailed and consistent models suitable for portraits, objects, styles, and more.

Training Flux models with ComfyUI is great for digital artists, designers, developers, and content creators who want to create custom AI image styles or assets. It’s accessible even for beginners and hobbyists, thanks to its user-friendly interface and low hardware requirements, making advanced AI image generation possible for anyone interested in creative or professional projects.

Building Better Models: Flux LoRAs in ComfyUI
What if you could make every image you generate conform to a certain style or person? This is exactly what using a LoRA model with Flux AI in ComfyUI does. In this guide, we’ll explore how Flux can help you build stronger, more efficient models with ease.

Image2Image using Flux Controlnet


What it's great for:

  • Integrates edge detection (Canny, HED) and depth maps as information to guide image transformation.
  • All models are trained and optimized for 1024x1024 resolution, enabling the generation of detailed, high-quality images suitable for professional and creative use.

Image2Image Flux with ControlNet is ideal for digital artists, designers, animators, and content creators who want precise control over image transformations. It benefits anyone looking to guide AI-generated images with edge, pose, or depth maps, making it useful for creative projects, marketing visuals, game assets, or personal artwork.

Precision in Flux Art: Harnessing the Power of ControlNet
Flux lets you create impressive images from text prompts. ControlNet is a significant tool for controlling the precise composition of AI images.

Consistent Character Creating using Flux


What it's great for:

  • Generates multiple character poses and expressions from a single reference image.
  • Maintains strong identity consistency across all outputs (face, clothing, colors).
  • Supports multi-angle and multi-scene character generation for comics, games, and animation.
  • Allows easy style, background, and attribute customization with prompts.

Artists, animators, game developers, marketers, and hobbyists can all benefit from Flux’s consistent character workflow. It’s perfect for anyone who needs uniform character designs across multiple scenes, poses, or styles, making it useful for comics, animation, games, branding, and personal creative projects.

Utilizing Flux in ComfyUI for Consistent Character Creation
Creating characters that look the same every time is crucial. This guide will show you how to maintain consistency using the workflow in ComfyUI.

Intro to Flux


What it's great for:

  • Ensures consistent character appearance across multiple images and poses.
  • Works with both reference images and text prompts for flexible input.
  • Offers modular, user-friendly workflows for generation, upscaling, and detailing.
  • Supports various art styles and creative applications like comics, games, and animation.
  • Provides advanced controls for fine-tuning and precise customization.

Flux in ComfyUI is great for digital artists, designers, marketers, content creators, and hobbyists. Anyone who needs quick, high-quality, and consistent AI-generated images for creative projects, branding, or personal use can benefit from using it.

Introduction to Flux - Quick Guide
Flux has burst onto the scene as the de facto AI art model. It’s here and easy to use on ThinkDiffusion, so let’s dive in and show you how it works!

If your computer's struggling or installation is giving you headaches, try these workflows in your browser with ThinkDiffusion. We provide the GPU power so you can focus on creating.

Enjoy experimenting with these workflows! And remember - every pro started as a beginner once.

If you enjoy ComfyUI and want to test out HyperSD in ComfyUI and Blender in real time, feel free to check out Real-Time Creativity: Leveraging Hyper SD and Blender with ComfyUI. And have fun out there!

Wan2.2 Workflow + Guide: Turn Text Into Cinematic Video

Prompt: A shy apprentice mage, cloaked in tattered robes, stands beside a glowing portal deep within a misty, enchanted forest at twilight. Strange fireflies flicker around ancient twisted trees, and distant magical runes pulse gently on mossy stones. The camera spirals in from above, capturing the mage’s hesitant gestures as arcane sparks dance between their fingers. The sound of whispering leaves and a faint, mystical melody fills the air—immersing viewers in a fantastical, atmospheric scene with lifelike lighting, rich magical effects, and cinematic visual storytelling.

Type a scene description, get a video. That's Wan2.2.

This is an open-source text-to-video model that uses a Mixture-of-Experts system to create realistic motion and accurate visuals. It handles 720p videos with smoother animation and fewer artifacts than version 2.1, and it runs on standard GPUs without needing a server farm.

Describe a fantasy world, a dramatic scene, or something everyday—Wan2.2 turns it into video. Works well for storyboarding, concept testing, or just seeing your ideas move.

What we'll cover

  1. What Wan2.2 is and how it's better than 2.1
  2. Getting the workflow running on ThinkDiffusion
  3. Installing the 6 models you need
  4. Walking through the workflow settings
  5. Real video examples across different styles
  6. Common issues and fixes

The Wan2.2 Release


Wan2.2 is a next-generation, open-source text-to-video model featuring a Mixture-of-Experts (MoE) system for dramatically more realistic motion, prompt accuracy, and cinema-quality visuals compared to Wan2.1. With much larger training data and smart MoE design, it delivers fluid, artifact-free 720p videos quickly and efficiently, even on standard GPUs. Artists, animators, filmmakers, and creators at any level will find Wan2.2 superior for its greater detail, smoother animation, and enhanced creative control—making it the go-to tool for high-quality, prompt-driven video generation.
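That MoE design is also why this workflow loads two diffusion models: a high-noise expert that shapes the early, noisy denoising steps and a low-noise expert that refines the later ones. Here is a conceptual sketch of the idea, not the real implementation (the switch point below is illustrative only):

# Two-expert denoising, per Wan2.2's published MoE description: the
# high-noise expert runs first, then hands off to the low-noise expert.
def moe_denoise(latent, timesteps, high_noise_expert, low_noise_expert,
                switch_fraction=0.5):  # illustrative, not Wan2.2's threshold
    n = len(timesteps)
    for i, t in enumerate(timesteps):
        expert = high_noise_expert if i < n * switch_fraction else low_noise_expert
        latent = expert(latent, t)  # one denoising step with the active expert
    return latent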


Prompt: A teenage boy in a faded hoodie bicycles down a rain-slicked suburban street under a brooding twilight sky. The houses’ windows glow warmly as he pedals past, his breath visible in the cold air. The camera tracks alongside at wheel level, water spraying from the tires and reflecting streetlights. Wind ruffles fallen leaves, dogs bark in the distance, and the sound of passing cars merges with distant thunder—evoking a moody, authentic suburban scene with nuanced lighting and a strong sense of realism.

Whether you’re experimenting with fantasy worlds, dramatic scenes, or lifelike moments that pulse with real atmosphere, prepare to be amazed—because with Wan2.2, the magic of cinematic video is just a sentence away, waiting to be brought to life by your imagination.

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: July 9, 2025

ComfyUI v0.3.47 with wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors and wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors, plus the LoRAs high_noise_model.safetensors and low_noise_model.safetensors

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, the minimum requirement is the Turbo 24GB machine, but we recommend the Ultra 48GB machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow is missing required nodes. Install those custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager > Click Install Missing Custom Nodes
  2. Check the list below for any custom nodes that need to be installed, and click Install.

Required Models

For this guide you'll need to download these 6 recommended models.

1. wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors
2. wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors
3. umt5-xxl-enc-bf16.safetensors
4. wan_2.1_VAE_bf16.safetensors
5. high_noise_model.safetensors
6. low_noise_model.safetensors
  1. Go to ComfyUI Manager > Click Model Manager
  2. Search for the models above; when you find the exact model you're looking for, click Install, and make sure to press Refresh when you are finished.

If Model Manager doesn't have them: Use the direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You can also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory

wan2.2_t2v_high_noise_14B_fp8_scaled.safetensors → .../comfyui/models/diffusion_models/
wan2.2_t2v_low_noise_14B_fp8_scaled.safetensors → .../comfyui/models/diffusion_models/
umt5-xxl-enc-bf16.safetensors → .../comfyui/models/text_encoders/
wan_2.1_VAE_bf16.safetensors → .../comfyui/models/vae/
high_noise_model.safetensors → .../comfyui/models/lora/
low_noise_model.safetensors → .../comfyui/models/lora/
💡
You need to rename the Lightning x2v LoRA file to your desired name; you can do this before you upload, or rename it directly in your ThinkDiffusion My Files. It needs to be renamed because the LightningX2V T2V and I2V LoRAs share the same filename.
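On a local install the rename is a one-liner. A sketch with example target names (any names that keep the T2V and I2V copies distinct will do):

from pathlib import Path

lora_dir = Path("ComfyUI/models/lora")  # "loras" on stock ComfyUI installs
# Rename the T2V LoRAs so they can't collide with the I2V LoRAs, which
# ship under the same filenames. The new names here are examples only.
for old, new in [
    ("high_noise_model.safetensors", "lightx2v_t2v_high_noise.safetensors"),
    ("low_noise_model.safetensors", "lightx2v_t2v_low_noise.safetensors"),
]:
    src, dst = lora_dir / old, lora_dir / new
    if src.exists() and not dst.exists():
        src.rename(dst)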

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well with the default settings. Here are a few steps where you might want to pay extra attention.

1. Set the Models

Set the models as shown in the image. Enable the low-mem load option if you experience out-of-memory errors.
2. Write a Prompt

Write a detailed prompt; Wan2.2 has strong prompt adherence. Set the size to 480p only. Wan2.2 is compatible with 720p and 1080p resolutions and higher frame counts, but you need a larger machine for that.
3. Check Sampling

Set the sampling settings as shown in the image. Since the workflow uses a Lightning x2v LoRA, the inference steps should be set to 4 only; otherwise, it will result in an error.
4. Check the Video

💡
I was only able to run the workflow at 480p resolution and process up to 81 frames due to the limitations of my current hardware. However, if you have a more powerful ComfyUI setup, I recommend utilizing the higher-capacity Wan2.2 model for improved performance and output quality. Robust hardware resources will allow you to take full advantage of advanced models and process higher resolutions or longer sequences more efficiently.
💡
From my experience, using the Lightning x2v LoRA in the workflow is crucial for optimizing generation speed. Whenever I include this LoRA, I’ve noticed a significant reduction in processing time, allowing me to complete tasks far more efficiently. Without it, the generation process can be extremely sluggish—sometimes taking up to 30 minutes for a single output. Leveraging the Lightning x2v LoRA has become an essential part of my workflow to ensure fast and reliable results.

Examples


Prompt: A curious animated raccoon wearing a tiny yellow raincoat tiptoes through a moonlit alley crowded with overflowing trash bins and twinkling puddles. Neon reflections shimmer on slick cobblestones as the raccoon sniffs around, occasionally startled by animated cats darting past or cans tumbling noisily. The camera follows low to the ground, highlighting the raccoon’s expressive eyes, fluffy tail, and gentle paws. Soft jazz plays in the background, muffled by distant city hum, enriching the whimsical, atmospheric animated scene with vivid nighttime textures and lively character animation.


Prompt: A timid maintenance worker, clutching a flashlight, descends into a labyrinthine subway tunnel after midnight. Shadows creep along cracked tiles and ancient graffiti as unnatural screeches echo from deep within the darkness. The camera pans slowly through the eerie silence, glancing over the worker’s terrified face just as a monstrous, skeletal creature with glowing eyes slithers from the shadows behind rusted pipes. Flickering lights reveal jagged claws and a sinister grin, while the tunnel fills with chilling whispers, distant thunder, and heart-pounding footsteps—immersing viewers in a suspenseful, terrifying scene with detailed, cinematic horror atmosphere and truly frightening monster realism.


Prompt: A cheerful animated robot with bright, glowing eyes and spindly limbs bounces across a moonlit carnival filled with oversized balloons and swirling rides. Colorful lights flash across the robot’s reflective body as it weaves between laughing animated animals and swirling confetti. The camera smoothly follows its energetic movements, capturing sparks that fly from its hands as it dances on a spinning carousel. Background music mixes playful electronic beats with distant giggles, bringing the whimsical animated scene to life with vivid detail, dynamic lighting, and expressive character animation.


Prompt: A reserved florist in an apron quietly arranges bouquets in her shop during a gentle afternoon rain. Outside, a young musician stands beneath the awning, strumming a soft tune as passersby hurry past. The camera glides from the rain-dappled window to the florist’s thoughtful smile, then out to the musician meeting her gaze. Petals scatter on the countertop, rain streaks the glass, and the subdued city sounds blend with tender chords—creating an intimate, atmospheric romance scene, brought to life with authentic lighting, emotion, and cinematic mood.


Prompt: A retired detective in a worn trench coat quietly surveys an abandoned train station shrouded in thick morning fog. He holds an old photograph, scanning empty benches and flickering overhead lights. The camera moves slowly between cracked tiles and rusty tracks, focusing on the detective’s tense posture and sharp gaze. Distant echoes of footsteps, the hum of departing trains, and swirling mist fill the soundscape—creating a suspenseful, moody mystery scene with cinematic depth, authentic atmosphere, and lifelike environmental details.


Prompt: A weary space mechanic, dressed in a patched jumpsuit, floats outside a battered starship as a nebula glows in the distance. His helmet visor reflects flickers from distant lightning storms. The camera glides slowly along the hull, capturing him making delicate repairs against the eerie, luminous backdrop. Tools drift beside him, while transmission crackles and distant alarms layer the soundscape. Each movement creates swirling flashes of color and shadow—evoking the tension and isolation of sci-fi space travel with atmospheric, movie-quality lighting and realistic technical detail.


Prompt: An exhausted firefighter, his face streaked with soot and sweat, stands in the middle of a rain-soaked street at night. Neon signs flicker in the hazy background as emergency lights flash across glistening puddles. The camera starts at ground level, slowly dollying upward and forward to frame the firefighter’s determined expression in close-up. Street reflections shimmer, rain falls gently, and distant sirens echo—capturing an atmosphere of tension, resilience, and gritty realism with movie-grade lighting and natural color grading.


Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory (Allocation on Device error): Use smaller expansion factors, lower resolution, or fewer frames. Optionally, try higher-VRAM machines such as ULTRA or the newly available NITRO (beta feature). Note that even with more VRAM, you may still encounter out-of-memory issues with Wan2.2.
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

If you're having issues with the workflow, visit us on Discord at #Help Desk, or email us at [email protected].

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools.
Qwen Image2Image Edit: Run in the Browser + Guide

Prompt: Transform the image into realistic image.

Change backgrounds. Swap objects. Add stuff. Remove stuff. Adjust styles. All through simple text prompts instead of wrestling with complicated tools.

Qwen is Alibaba's image editing model, built on their 20B-parameter foundation. It handles object manipulation, style transfers, and even text editing inside images. The results are surprisingly realistic, and it keeps context better than you'd expect.

Useful if you're creating content, designing stuff, running social media, or just want to edit images without learning Photoshop.

What we'll cover

  1. What Qwen Image Edit actually is and what makes it different
  2. Getting the workflow running on ThinkDiffusion
  3. Installing the models and custom nodes you need
  4. Walking through the workflow step-by-step
  5. Real examples of what it can do
  6. Common issues and how to fix them

What is Qwen Image Edit?

Source: Qwen Image

Qwen Image Edit is a model developed by Alibaba's Qwen team, built upon their robust 20B-parameter Qwen-Image foundation. This model brings precise object manipulation, accurate style and background transfer, and dual-language text editing directly within images. It does a solid job with realism and keeping details intact, even when you're asking it to do tricky edits.

Ideal for content creators, designers, marketers, social media teams, localization experts, e-commerce businesses, and anyone seeking intuitive, professional-grade image editing through the power of natural language.

Prompt: Transform the image crochet style.

So go ahead: give your edits a voice, and see just how far your words can take your next photo adventure!

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: September 5, 2025

ComfyUI v0.3.57 with qwen_image_edit_fp8_e4m3fn.safetensors and qwen_2.5_vl_7b_fp8_scaled.safetensors

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, the minimum requirement is the Turbo 24GB machine, but we recommend the Ultra 48GB machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow is missing required nodes. Install those custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager > Click Install Missing Custom Nodes
  2. Check the list below for any custom nodes that need to be installed, and click Install.

Required Models

For this guide you'll need to download these 3 recommended models.

1. qwen_image_edit_fp8_e4m3fn.safetensors
2. qwen_2.5_vl_7b_fp8_scaled.safetensors
3. qwen_image_vae.safetensors
  1. Go to ComfyUI Manager > Click Model Manager
  2. Search for the models above; when you find the exact model you're looking for, click Install, and make sure to press Refresh when you are finished.

If Model Manager doesn't have them: Use the direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You can also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory

qwen_image_edit_fp8_e4m3fn.safetensors → .../comfyui/models/diffusion_models/
qwen_2.5_vl_7b_fp8_scaled.safetensors → .../comfyui/models/text_encoders/
qwen_image_vae.safetensors → .../comfyui/models/vae/

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well with the default settings. Here are a few steps where you might want to pay extra attention.

1. Load an Input Image

Load an input image. Any reasonable resolution will do as long as the image is high quality.
2. Set the Models

Set the models as shown in the image. If you have good hardware you can use the full model, but it has to be downloaded manually.
3. Write the Prompt

Write a simple prompt that serves as the instruction for the kind of image edit you want to make.
4. Check the Sampling

Set the sampling settings as shown in the image.
5. Check the Output

💡
When I try complex edits or chain too many instructions, the results can include artifacts, context loss, or image offsets, especially in heavy transformation scenarios.
💡
While text rendering and semantic edits are strong, I have to watch out for occasional mismatches, artifacts, or unexpected outcomes until newer versions address these issues.
💡
Sometimes, when I edit an image, Qwen Image Edit changes the aspect ratio or introduces zoom, which means the output doesn’t perfectly match my original framing or pixel dimensions.

Examples

IP Creation

Prompt: This dog wears a doctor suit, no helmet on its head and wears a stethoscope.

Novel View Synthesis

Prompt: Obtain the back-side of toy.

Avatar Creator

Prompt: Transform the image into Ghibli style.

Object Add

Prompt: Add a realistic cat beside the dog.

Object Removal

Prompt: Remove the bird.

Object Replace

Prompt: Replace the coffee with coke in can.

Background Swap

Prompt: Replace the background with beach.

Virtual Try-On

Prompt: Replace the woman dress into a futuristic cyberpunk dress.

Text Editing

Prompt: Replace the 'Hard Rock' to 'ThinkDiffusion'

Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools.
No More Heavy Models! WAN2.2 Rapid-AllInOne Makes Video Generation Easy

Prompt: A male assassin dances fluidly atop a city rooftop at night, full dark robe attire blending modern tactical gear with elegant, flowing elements. Neon lights from the city skyline reflect off his outfit as he moves with precision and grace, his silhouette striking against the urban backdrop.

Tired of dealing with complex setups and high memory demands just to make a single video? WAN2.2 Rapid-AllInOne changes the game by combining the best of WAN 2.2 and its accelerators into one lightweight, user-friendly model that works fast—even on lower VRAM systems.

Making AI videos usually means downloading multiple huge model files, dealing with memory errors, and babysitting complicated setups. WAN 2.2 Rapid-AllInOne rolls everything into a single file that runs faster and needs less VRAM.

It combines WAN 2.2, its accelerators, CLIP, and VAE into one model created by Phr00t. You load one file instead of several, it runs in 4 sampling steps, and works on lower-end hardware. Whether you're starting from an image or a text prompt, you get smooth motion without the usual technical headaches.

What we'll cover:

  1. Why this model is easier than regular WAN 2.2
  2. How to install and set up the workflow
  3. Running different generation modes (image-to-video, text-to-video, etc.)
  4. Examples and troubleshooting

Why Use Wan 2.2 Rapid-AllInOne?


Prompt: A toy vehicle sits on a polished wooden tabletop, captured from a dramatic front-facing perspective. Suddenly, a pair of playful child’s fingers gently grasp the miniature vehicle and rotate it, revealing its detailed rear view. 

Rapid-AllInOne is a fast, all-in-one AI video generation model developed by the creator “Phr00t,” who merged WAN 2.2 and various accelerators, along with CLIP and VAE, to deliver rapid, simplified video creation for image-to-video and text-to-video workflows. Its standout features are single-file convenience, suitable for low VRAM usage, and native integration of CLIP, VAE, and WAN accelerators for high performance and flexible output. The model is designed for speed, requiring only 4 sampling steps and 1 CFG, supporting dynamic tasks like last-frame and first-to-last-frame generation while ensuring compatibility with both WAN 2.1 and 2.2 LORAs.
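For reference, those speed-critical settings land in the sampler node. Below is a sketch of the relevant inputs as they might appear in an API-format ComfyUI workflow export; only steps=4 and cfg=1 come from the model's documentation, and the remaining values are placeholders:

# KSampler input fragment for the Rapid-AllInOne checkpoint. Only "steps": 4
# and "cfg": 1.0 are from the model's stated requirements; the sampler and
# scheduler below are placeholders - keep whatever the downloaded workflow sets.
ksampler_inputs = {
    "seed": 0,               # randomize per run for varied outputs
    "steps": 4,              # Rapid-AllInOne is tuned for 4 sampling steps
    "cfg": 1.0,              # and CFG 1; raising either mostly wastes time
    "sampler_name": "euler",
    "scheduler": "simple",
    "denoise": 1.0,
    # "model", "positive", "negative", "latent_image" connect to other nodes
}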

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: September 5, 2025

ComfyUI v0.3.57 with wan2.2-rapid-mega-aio-v1.safetensors model support

Minimum Machine Size: Turbo

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, the minimum requirement is the Turbo 24GB machine, but we recommend the Ultra 48GB machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow is missing required nodes. Install those custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager > Click Install Missing Custom Nodes
  2. Check the list below for any custom nodes that need to be installed, and click Install.

Required Models

For this guide you'll need to download this 1 recommended model.

1. wan2.2-rapid-mega-aio-v1.safetensors
  1. Go to ComfyUI Manager > Click Model Manager
  2. Search for the model above; when you find the exact model you're looking for, click Install, and make sure to press Refresh when you are finished.

If Model Manager doesn't have it: Use the direct download link (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You can also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory

wan2.2-rapid-mega-aio-v1.safetensors → .../comfyui/models/checkpoint/

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well with the default settings. Here are a few steps where you might want to pay extra attention.

1. Set Model

Set the exact model as shown in the image. Use only the rapid mega model, which can handle the different generation modes.
2. Load Input

Load an input image. You can bypass any of these nodes depending on which generation mode you are using.
3. Write Prompt

Write the required prompt. It is recommended to write a detailed prompt.
No More Heavy Models! WAN2.2 Rapid-AllInOne Makes Video Generation Easy
4. Check Sampling

Check the sampling settings and set them based on the recommended settings shown in the image.
No More Heavy Models! WAN2.2 Rapid-AllInOne Makes Video Generation Easy
5. Check Output

No More Heavy Models! WAN2.2 Rapid-AllInOne Makes Video Generation Easy
💡
I2V mode: Just bypass the "end frame" so the "start frame" will be your I2V starting frame. Keep everything else the same.

T2V mode: Bypass "end frame", "start frame" and the "VACEFirstToLastFrame" node. Set strength to 0 for WanVaceToVideo.

Last Frame mode: Just bypass the "start frame" and keep "end frame". Keep everything else the same as in the picture.

First->Last Frame mode: Use the default workflow of the page.
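As a quick reference, the bypass rules above can be collapsed into one place (purely illustrative; the names match the labels used in this workflow):

```python
# Which nodes to bypass for each generation mode, per the tips above.
MODE_BYPASS = {
    "I2V": ["end frame"],
    "T2V": ["end frame", "start frame", "VACEFirstToLastFrame"],
    "Last Frame": ["start frame"],
    "First->Last Frame": [],        # default workflow, nothing bypassed
}

# T2V additionally needs strength = 0 on the WanVaceToVideo node.
WAN_VACE_TO_VIDEO_STRENGTH = {"T2V": 0.0}
```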

EXAMPLES

Image to Video

0:00
/0:03

Prompt: An enchanting, lifelike painting comes to life, portraying a cheerful young boy with tousled hair who eagerly greets a smiling girl amid the vibrant greenery of a sunlit park. Their joyful expressions illuminate the scene as sunlight dances across their faces, casting soft shadows beneath swaying trees. Surrounding them, blooming flowers and fluttering butterflies add colorful accents, while distant laughter and a gentle breeze infuse the moment with a sense of carefree happiness.

Text to Video

0:00
/0:03

Prompt: A woman in casual attire strolls across a vast, sunlit expanse of dry land under a brilliant blue sky. Her relaxed outfit flutters gently in the warm breeze as she walks, surrounded by greeny grasses and distant hills. The scene is rendered in a whimsical, soft Ghibli art style, with vibrant colors and a peaceful, dreamy atmosphere.

Last Frame Video

0:00
/0:03

Prompt: Vivid blue butterflies flutter gracefully down onto the soft, emerald-green forest floor, their delicate wings shimmering in the dappled sunlight that filters through the towering canopy above. Around them, a sparse cluster of wild mushrooms rises from the moss, their ivory caps speckled with dew, adding texture and subtle color to the woodland scenery. 

First - Last Frame Video

0:00
/0:03

Prompt: A graceful tabby cat with soft golden fur, alert green eyes, and delicate whiskers strolls quietly through the lush, dew-covered grass on a misty morning. With each step, the gentle sunlight illuminates tiny droplets clinging to her paws. Suddenly, she pauses, her gaze fixed on a woven basket nestled among wildflowers—a basket brimming with freshly laid eggs. The scene glows with warm, natural light, highlighting the intricate textures of the cat’s fur, the glistening grass, and the smooth eggshells.


Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools. | 5573 members
No More Heavy Models! WAN2.2 Rapid-AllInOne Makes Video Generation Easy
]]>
<![CDATA[How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow]]>
Prompt: A hyper-realistic image of a man relaxing on a sunny Mediterranean terrace, dressed in breezy coastal resort fashion: lightweight linen button-down shirt, white cuffed chinos, woven leather sandals, vintage sunglasses, and a woven straw fedora. He sits comfortably in a rattan lounge chair surrounded by vibrant ceramic planters, sun-bleached
]]>
https://learn.thinkdiffusion.com/how-to-use-qwen-image-with-instantx-union-controlnet-in-comfyui-guide-workflow/68c2becf2dfdfd00012b03bfWed, 24 Sep 2025 08:58:16 GMTHow to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
Prompt: A hyper-realistic image of a man relaxing on a sunny Mediterranean terrace, dressed in breezy coastal resort fashion: lightweight linen button-down shirt, white cuffed chinos, woven leather sandals, vintage sunglasses, and a woven straw fedora. He sits comfortably in a rattan lounge chair surrounded by vibrant ceramic planters, sun-bleached stone flooring, trailing bougainvillea vines, sparkling blue sea in the distant background, sharp daylight and natural shadows, tranquil afternoon ambiance, detailed skin and fabric textures
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow

Generating Qwen images with Controlnet unlocks a powerful way to guide your AI creations using visual structure, lines, and forms drawn or extracted from reference images. Want better control over your AI image generation? Here's how to use Qwen Image with InstantX Union ControlNet to guide your creations with poses, edges, and depth maps.

With just a simple pose, edge, depth map, or quick sketch, you can shape exactly how your output looks. Whether you're working on precise designs or expressive portraits, this workflow gives you the control you need without the complexity.

Here's what we'll cover:
1. Why InstantX Union beats DiffSynth
2. Getting the workflow set up
3. Required models and custom nodes
4. Step-by-step walkthrough
5. Real examples and troubleshooting

Why is InstantX Union Better than DiffSynth?

How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
Source: Qwen Image InstantX Union

InstantX Union ControlNet combines four control types (canny, soft edge, depth, and pose) into one model file. Instead of downloading separate models for each control type, you get everything in one package.

Unlike DiffSynth, which makes you load different models for different tasks, InstantX Union lets you switch between control types instantly. Less storage, less setup, same quality.
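For instance, the canny mode expects an edge map as its control image. If you ever need to pre-compute one outside the workflow, a standard OpenCV sketch looks like this (the file name and the 100/200 thresholds are common starting points, not values mandated by InstantX Union):

```python
import cv2

# Build a canny edge map to use as a ControlNet control image.
img = cv2.imread("reference.png")                 # placeholder input path
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
edges = cv2.Canny(gray, 100, 200)                 # tune thresholds per image
cv2.imwrite("canny_control.png", edges)
```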

How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
Prompt: A cozy Bohemian living room, layered patterned rugs over rustic hardwood floors, low-slung vintage couches with colorful embroidered pillows, eclectic gallery wall art, clusters of hanging macramé planters and trailing greenery, carved wooden coffee table, rattan accent chairs, textured woven blankets, lantern-style ambient lighting, warm earthy color palette, relaxed and inviting atmosphere, artistic and collected look

For Qwen Image users, this means creating complex, high-quality images with simpler setup, better compatibility, and instant access to the most common control modes all in a unified, user-friendly package.

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: September 5, 2025

ComfyUI v0.3.57 using the qwen_image_fp8_e4m3fn.safetensors model

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow lacks certain required nodes. Install the missing custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
  2. Check the list below for any custom nodes that need to be installed, then click Install.
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow

Required Models

For this guide you'll need to download these 4 recommended models.

1. qwen_image_fp8_e4m3fn.safetensors
2. qwen_2.5_vl_7b_fp8_scaled.safetensors
3. qwen_image_vae.safetensors
4. Qwen-Image-InstantX-ControlNet-Union.safetensors
  1. Go to ComfyUI Manager  > Click Model Manager
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
  2. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when finished.
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow

If Model Manager doesn't have them: Use direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You could also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory
qwen_image_fp8_e4m3fn.safetensors → .../comfyui/models/diffusion_models/
qwen_2.5_vl_7b_fp8_scaled.safetensors → .../comfyui/models/text_encoders/
qwen_image_vae.safetensors → .../comfyui/models/vae/
Qwen-Image-InstantX-ControlNet-Union.safetensors → .../comfyui/models/controlnet/

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

Steps Recommended Nodes
1. Load Input Image

Load an image. The image should be good quality; any resolution will do.
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
2. Set a Controlnet

Set your desired ControlNet based on your preferences. If there is a human in the image, you can use pose.
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
3. Set the Models

Set the exact models as seen on the image.
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
4. Write a Prompt

Write a detailed description of the new image you want to generate from the input image.
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
5. Check Sampling

Set the sampling as seen on the image.
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
6. Check Output

How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
💡
I use Canny for crisp, accurate line control in things like architecture or detailed designs. I switch to Soft Edge when I want smoother, more natural guidance for portraits or landscapes. I rely on Depth whenever 3D space and realistic perspective are important, for things like background consistency or lighting. For Pose, I apply it to human figures when I need precise control over body position or gestures, making sure characters look natural and expressive.
💡
I experimented with several ControlNet models beyond the standard four, exploring a range of options to expand the workflow’s capabilities. However, I observed that using these alternative models often leads to unpredictable or suboptimal results, such as unusual visual distortions or unintended effects. For this reason, I recommend exercising caution when integrating non-standard ControlNet models into your workflow and thoroughly testing each model to ensure consistent output quality.

Examples

How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
Prompt: A highly-detailed robotic rhinoceros, gleaming chrome armor and neon blue LED accents, imposing mechanical form, walking through a bustling futuristic cyberpunk city at night, surrounded by towering skyscrapers with holographic billboards, atmospheric neon lights, reflective wet streets, flying vehicles in the background, electric mist and digital rain, vibrant but moody color scheme, cinematic composition, inspired by sci-fi and cyberpunk visuals, sharp focus, dynamic lighting, techno-futuristic aesthetic
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
Prompt: A playful dog with expressive features, set in a dreamlike landscape of floating islands with impossible geometry, lush grass, sparkling waterfalls cascading into the clouds, vibrant pastel skies, hints of rainbow light, whimsical and surreal atmosphere, tranquil and magical mood, high detail, painterly finish
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
Prompt: A sinister-looking boy with sharp eyes and an intense expression, standing in a shadowy villain lair filled with glowing red and green control panels, massive digital screens flashing ominous warnings, metallic walls lined with exposed wires and pulsing energy conduits, dark atmospheric lighting, mysterious swirling smoke at his feet, futuristic weapons and artifacts scattered around, sleek black and dark purple clothing with high-collar jacket and metallic accents, glowing symbol on his glove, intimidating and clever appearance, cinematic mood, high detail
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
Prompt: An ancient stone building with massive weathered walls, rough-hewn blocks and primitive mortar, narrow arched doorways, tiny slit windows for defense, heavy wooden gates reinforced with iron, moss and creeping vines covering the crumbling exterior, worn flagstones leading to the entrance, rustic torch sconces, simple geometric carvings, historic atmosphere reminiscent of early medieval or prehistoric architecture, cloudy skies and soft diffused lighting, emphasis on age and durability

Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools. | 5510 members
How to Use Qwen Image with InstantX Union ControlNet in ComfyUI - Guide + Workflow
]]>
<![CDATA[Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide]]>
0:00
/0:20

Ever found yourself wishing a portrait could actually speak, sharing stories with real movement and emotion? Now, that spark of imagination is within reach—no complicated setups required. With just a bit of creative input, you can watch your favorite images transform into

]]>
https://learn.thinkdiffusion.com/latest-in-lipsync-infinitetalk-video2video-comfyui-guide/68b04d1abf2ce000013f3963Tue, 02 Sep 2025 16:41:16 GMT
0:00
/0:20
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide

Ever found yourself wishing a portrait could actually speak, sharing stories with real movement and emotion? Now, that spark of imagination is within reach—no complicated setups required. With just a bit of creative input, you can watch your favorite images transform into lifelike, expressive talking portraits that surprise, engage, and even make you do a double-take.

What is InfiniteTalk? What are the Key Features?

0:00
/0:23

InfiniteTalk is a powerful audio-driven video generation model designed to create unlimited-length talking avatar videos with exceptionally accurate lip sync, natural head and body movements, and stable facial expressions—all seamlessly aligning to input audio. What sets InfiniteTalk apart from MultiTalk is its enhanced stability, dramatically reduced distortions in hands and body, and superior lip synchronization, making each generated video look more realistic and less prone to awkward or exaggerated motion.

InfiniteTalk is an audio-driven video generation model that creates realistic talking avatar videos from static images or existing videos. It provides:

  • Unlimited video length - Generate videos of any duration
  • Accurate lip sync - Audio perfectly matches mouth movements
  • Natural motion - Realistic head and body movements
  • Multi-person support - Handle multiple speakers in one video
  • Enhanced stability - Reduced distortions compared to MultiTalk

Perfect for content creators, educators, marketers, and developers who need professional talking avatars.

0:00
/0:20

If you thought only big studios could achieve this kind of realism, prepare to be amazed—InfiniteTalk Video to Video hands you the power to let your portraits do the talking!

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: August 21, 2025

ComfyUI v0.3.50 using the Wan2_1-InfiniTetalk-Single_fp16.safetensors and wan2.1_i2V_480p_14B_fp16.safetensors models

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow lacks certain required nodes. Install the missing custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
  2. Check the list below for any custom nodes that need to be installed, then click Install.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide

Required Models

For this guide you'll need to download these 6 recommended models.

1. lightx2v_I2V_14B_480p_cfg_step_distill_rank64_bf16.safetensors
2. Wan2_1-InfiniTetalk-Single_fp16.safetensors
3. clip_vision_h.safetensors
4. wan2.1_i2V_480p_14B_fp16.safetensors
5. Wan2_1_VAE_bf16.safetensors
6. TencentGameMate/chinese-wav2vec2-base
  1. Go to ComfyUI Manager  > Click Model Manager
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
  2. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when finished.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide

If Model Manager doesn't have them: Use direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You could also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory
lightx2v_I2V_14B_480p_cfg_step_distill_rank64_bf16.safetensors → .../comfyui/models/loras/
Wan2_1-InfiniTetalk-Single_fp16.safetensors → .../comfyui/models/diffusion_models/
clip_vision_h.safetensors → .../comfyui/models/clip_vision/
wan2.1_i2V_480p_14B_fp16.safetensors → .../comfyui/models/diffusion_models/
Wan2_1_VAE_bf16.safetensors → .../comfyui/models/vae/
TencentGameMate/chinese-wav2vec2-base → auto-download
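The last entry, TencentGameMate/chinese-wav2vec2-base, is the audio encoder the lip sync is driven from, and the workflow fetches it automatically. If you ever want to sanity-check an audio clip outside ComfyUI, a minimal sketch with the transformers library (the file name and the default extractor settings are assumptions) looks like:

```python
import torch
import torchaudio
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

# Default extractor settings (16 kHz mono) match how wav2vec2-base models are normally fed.
extractor = Wav2Vec2FeatureExtractor()
model = Wav2Vec2Model.from_pretrained("TencentGameMate/chinese-wav2vec2-base")

wav, sr = torchaudio.load("speech.wav")                           # placeholder audio file
wav = torchaudio.functional.resample(wav, sr, 16000).mean(dim=0)  # resample to 16 kHz mono

inputs = extractor(wav.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state
print(hidden.shape)  # (1, time_steps, 768): the audio features that drive the lip sync
```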

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

Steps Recommended Nodes
1. Set the Models

Set the models as seen on the image. If you get an out-of-memory error, enable the low mem load option and use the fp8 versions of the models.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
2. Set the Input

Set the input as seen on the image. If you have a machine larger than Ultra, you can set the resolution to 720p.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
3. Load the Audio

Load an audio file; it should be high quality.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
4. Load the Video

Load a video. If the video is vertical, use the vertical input dimensions; otherwise, use the horizontal settings.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
5. Write the Prompt

A simple prompt is enough here.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
6. Check the Sampling

Set the sampling as seen on the image.
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
7. Check the Output

Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide

Examples

0:00
/0:40
0:00
/0:20
0:00
/0:20
0:00
/0:40

Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools. | 5510 members
Latest in Lipsync: InfiniteTalk Video2Video ComfyUI Guide
]]>
<![CDATA[Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow]]>
0:00
/0:06

Prompt: A view of the forest in a upward camera view.

💡
Credits to the awesome Benji for this workflow.

Original Link - https://www.youtube.com/watch?v=b69Qs0wvaFE&t=311s

Uni3C is a ComfyUI model by Alibaba that converts static

]]>
https://learn.thinkdiffusion.com/uni3c-copy-camera-motion-from-any-video-full-guide-workflow/68859b53977a06000110c7f0Thu, 28 Aug 2025 16:08:55 GMT
0:00
/0:06
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow

Prompt: A view of the forest in a upward camera view.

💡
Credits to the awesome Benji for this workflow.

Original Link - https://www.youtube.com/watch?v=b69Qs0wvaFE&t=311s

Uni3C is a ComfyUI model by Alibaba that converts static images into dynamic videos by transferring camera movements from reference videos. This tutorial covers complete setup and usage.

What is Uni3C?

0:00
/0:04

Source: The Uni3C

Uni3C is a unified 3D-enhanced framework integrated into ComfyUI that enables precise, simultaneous control over both camera motion and human animation within video generation workflows. By leveraging a lightweight plug-and-play control module, Uni3C extracts and transfers motion—such as camera movements and character actions—from reference videos directly onto new scenes or images, eliminating the need for complex manual rigging or joint annotation.

This technology is especially valuable for digital artists, animators, filmmakers, virtual avatar creators, educators, and anyone in content creation seeking to bring static visuals to life with realistic, controllable movement—all with the creative freedom and modular workflow of ComfyUI.

0:00
/0:06

Prompt: A woman in a man sitting and it follows the camera angle.

Whether you’re just starting out or looking to elevate your animation game, let’s explore together how a few simple steps can transform your scenes from static to spectacular!

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: June 27, 2025

ComfyUI v0.3.44 using the Wan2_1-I2V-14B-720P_fp8_e4m3fn.safetensors and Wan21_Uni3C_controlnet_fp16.safetensors models

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow lacks certain required nodes. Install the missing custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
  2. Check the list below for any custom nodes that need to be installed, then click Install.
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow

Required Models

For this guide you'll need to download these 6 recommended models.

1. Wan2_1-I2V-14B-720P_fp8_e4m3fn.safetensors
2. umt5-xxl-fp16.safetensors
3. Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors
4. clip_vision_h.safetensors
5. Wan2_1_VAE_bf16.safetensors
6. Wan21_Uni3C_controlnet_fp16.safetensors
  1. Go to ComfyUI Manager  > Click Model Manager
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
  2. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when finished.
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow

If Model Manager doesn't have them: Use direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You could also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory
Wan2_1-I2V-14B-720P_fp8_e4m3fn.safetensors → .../comfyui/models/diffusion_models/
umt5-xxl-fp16.safetensors → .../comfyui/models/text_encoders/
Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors → .../comfyui/models/loras/
clip_vision_h.safetensors → .../comfyui/models/clip/
Wan2_1_VAE_bf16.safetensors → .../comfyui/models/vae/
Wan21_Uni3C_controlnet_fp16.safetensors → .../comfyui/models/diffusion_models/

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

Steps Recommended Nodes
1. Set the Height, Width and Frames

Set the frames up to 125. You can set the resolution up to 720 or 1080.
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
2. Load an Input Video for Control Reference

Load any type of video. The video must contain a scene with camera movement.
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
3. Load an Input Image

Upload an image which serves as the base image for the generation. It works with no subject or even with multiple subjects.
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
4. Set the Models

Set the models as seen on the image.
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
5. Write Prompt and Check Sampling

Write a simple prompt that supports the kind of camera or depth movement. Check the sampling settings as seen on the image.
Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
6. Check the Output

Uni3C: Copy Camera Motion from Any Video - Full Guide + Workflow
💡
The initial generation isn’t always perfect, so I often need to review the outputs carefully and select the best one. Sometimes, if the results don’t meet my expectations, I find it necessary to rerun the prompt to achieve the desired outcome. This iterative approach helps ensure that I consistently produce high-quality and visually appealing results, even if it takes a few attempts to get everything just right.
💡
In my experience, this method may struggle to capture very small details in the input image. I’ve found it essential to start with a clear, high-quality image that’s taken from a close, direct viewpoint. Using such input not only helps preserve important features but also leads to more accurate and visually appealing results in the final output.

Examples

0:00
/0:06

Prompt: A cat at the grass and follows the camera view.

0:00
/0:06

Prompt: A view of the living and follows the camera view.

0:00
/0:06

Prompt: A view of a woman reading at the table.


Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

]]>
<![CDATA[Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow]]>
0:00
/0:02

Prompt: A loving mother, dressed in a simple white outfit, gently lifts her baby - also clothed in white - into the air against a soft, warm background. As the animation progresses, the scene smoothly transitions: the mother brings the baby down into a

]]>
https://learn.thinkdiffusion.com/wan-start-end-frame-with-vace-and-flux-kontext-complete-guide-workflow/687a349551ec4500015ba8ddTue, 26 Aug 2025 16:57:25 GMT
0:00
/0:02
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow

Prompt: A loving mother, dressed in a simple white outfit, gently lifts her baby - also clothed in white - into the air against a soft, warm background. As the animation progresses, the scene smoothly transitions: the mother brings the baby down into a gentle embrace, ending with both of them sharing a heartfelt hug. Throughout the sequence, their expressions remain tender and natural, with the mother and baby’s white clothing staying crisp and unchanged. The background, atmosphere, and lighting remain consistent, with a seamless, natural transformation between lifting and hugging poses. 

What This Workflow Does

This ComfyUI workflow creates smooth animations by:

  • Taking your starting image
  • Generating an end frame with AI
  • Creating seamless transitions between both frames
  • Maintaining consistent subjects and backgrounds throughout
💡
Credits to the awesome TheArtOfficial for this workflow.
Original Link: https://www.youtube.com/watch?v=hB7dSagdLS8

This ComfyUI workflow creates smooth animations by taking your starting image, generating an AI-powered end frame, and creating seamless transitions between both frames while maintaining consistent subjects and backgrounds throughout. The result is professional-quality animations with cinematic transitions, subject consistency across all frames, and context-aware scene evolution without jarring cuts or morphing artifacts.

Think of it as directing a movie scene where you define the beginning and ending poses, and the AI fills in all the natural movement between them. Whether you want a mother lifting her baby into an embrace, a flower blooming to reveal something magical, or a peaceful landscape transforming dramatically, this workflow handles the complex interpolation while keeping everything visually coherent.

Why Do VACE and Flux Kontext Have the Advantage?

0:00
/0:02

Prompt: A windswept grassy field stretches into the distance, blades of grass bending gently in the breeze beneath a partly cloudy sky. As the animation progresses, a girl stands amidst the tall grass—at first with her back to the camera or positioned away. The wind tousles her hair and clothing as she slowly turns her body to face the viewer, her expression brightening into a happy, genuine smile. The final moment captures her smiling directly at the camera, radiating joy and warmth, while the lively movement of grass and hair continues throughout. 

The workflow integrates VACE, a powerful, unified model that brings precision, subject consistency, and cinematic quality to each frame, enabling coherent background changes and nuanced transformations. Flux Kontext, a context-aware image editing model, further refines the process by allowing detailed, prompt-driven modifications while maintaining object integrity and stylistic consistency from the start frame to the end frame.

By combining VACE and Flux Kontext in the Wan Start-End Frame workflow, users gain the ability to craft sophisticated, professional-grade animations with smooth transitions, reliable subject identity, and creatively controlled scene evolution that simply isn’t possible with basic interpolation alone.
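To see why that matters, the "basic interpolation" baseline is essentially a per-pixel cross-fade between the two keyframes, which makes subjects dissolve into each other instead of moving. A toy numpy illustration of that naive baseline:

```python
import numpy as np

def naive_crossfade(start: np.ndarray, end: np.ndarray, n_frames: int):
    """Per-pixel linear blend between two frames - no real motion, just ghosting."""
    frames = []
    for i in range(n_frames):
        t = i / max(n_frames - 1, 1)
        frames.append(((1 - t) * start + t * end).astype(start.dtype))
    return frames
```

VACE instead synthesizes genuine in-between motion conditioned on both keyframes and the prompt, which is why subjects stay solid across the transition.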

Let’s begin the journey toward truly seamless storytelling.

Download Workflow

Getting started requires downloading the workflow file and setting up the necessary components. First, download the workflow file and open ComfyUI, either locally or through ThinkDiffusion. Simply drag the workflow file into the ComfyUI window to load it.

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: June 27, 2025

ComfyUI v0.3.44 using the flux1-kontext-dev.safetensors, wan2.1_t2v_14B_fp8_e4m3fn.safetensors, and Wan2_1-VACE_module_14B_fp8_e4m3fn.safetensors models

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow lacks certain required nodes. Install the missing custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
  2. Check the list below for any custom nodes that need to be installed, then click Install.
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow

Required Models

For this guide you'll need to download these 9 recommended models.

1. flux1-kontext-dev.safetensors
2. clip_l.safetensors
3. t5xxl_fp8_e4m3fn.safetensors
4. ae.safetensors
5. Wan2_1_VAE_bf16.safetensors
6. umt5_xxl_fp8_e4m3fn_scaled.safetensors
7. wan2.1_t2v_14B_fp8_e4m3fn.safetensors
8. Wan21_T2V_14B_lightx2v_cfg_distill_lora_rank32.safetensors
9. Wan2_1-VACE_module_14B_fp8_e4m3fn.safetensors
  1. Go to ComfyUI Manager  > Click Model Manager
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
  2. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when finished.
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow

If Model Manager doesn't have them: Use direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You could also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory
flux1-kontext-dev.safetensors → .../comfyui/models/diffusion_models/
clip_l.safetensors → .../comfyui/models/clip/
t5xxl_fp8_e4m3fn.safetensors → .../comfyui/models/text_encoders/
ae.safetensors → .../comfyui/models/vae/
Wan2_1_VAE_bf16.safetensors → .../comfyui/models/vae/
umt5_xxl_fp8_e4m3fn_scaled.safetensors → .../comfyui/models/text_encoders/
wan2.1_t2v_14B_fp8_e4m3fn.safetensors → .../comfyui/models/diffusion_models/
Wan21_T2V_14B_lightx2v_cfg_distill_lora_rank32.safetensors → .../comfyui/models/loras/
Wan2_1-VACE_module_14B_fp8_e4m3fn.safetensors → .../comfyui/models/diffusion_models/

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

Steps Recommended Nodes
1. Load First Frame Image

Input your image as the first frame. You can use either a horizontal or vertical image.
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
2. Generate End Frame Image

Set your end frame image by generating it using the flux kontext prompt.
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
3. Set Models

Set the models as seen on the image. If you run out of memory, reduce the weights further to fp8.
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
4. Write Prompt for First and End Frame

Write a complete prompt here for your start and end frames. It should be detailed and describe how the animation looks from start to end.
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
5. Check Sampling

Set the sampling settings as seen on the image.
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
6. Check Output

Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow
💡
I find the workflow quite demanding, especially since it already employs two samplers. To ensure it runs smoothly on my system, I typically work with a resolution of 480x832. Based on my experience, I strongly recommend using a machine with more than 48GB of VRAM for optimal performance when handling this workflow.
💡
The resolution I’m using may be too low for many users, but I address this by applying post-processing techniques to upscale the output in a separate workflow. This approach allows me to achieve higher quality results, even when starting with lower-resolution images.

EXAMPLES

0:00
/0:02

Prompt: An empty living room, neatly arranged and softly illuminated by natural light, sits quiet and undisturbed. As the animation progresses, a man enters the scene from off-frame, walks purposefully toward the television across the room, and reaches out to switch it on. The transition is smooth, showing each action clearly—the man's movement is gradual and natural, and the television’s screen comes to life in the final moment. Throughout the sequence, all background elements, lighting, and room arrangement stay consistent, with the only changes being the man’s presence and the activation of the television. No abrupt shifts, morphing, or object distortion—maintain environment and subject clarity at all times.

0:00
/0:02

Prompt: A delicate yellow flower bud stands upright against a soft, green background, its petals gently wrapped and untouched. As the animation progresses, the bud slowly opens and blossoms into a fully bloomed yellow flower, each petal unfurling with smooth, natural motion. At the heart of the blooming flower, a tiny, peaceful baby appears asleep—nestled comfortably at the center, with serene features and a gentle posture. The background remains undisturbed throughout the sequence, focusing all changes on the opening of the bud and the baby’s gentle emergence within the flower’s center. Maintain vibrant yellow tones, realistic textures, and sharp focus on both the flower and the sleeping baby, creating a tranquil and dreamlike atmosphere.

0:00
/0:04

Prompt: A breathtaking view of Mount Fuji rises majestically above lush forests and a crystal-clear lake, set beneath bright blue skies with gentle daylight. As the animation progresses, the serene scene transforms: Mount Fuji suddenly erupts, sending torrents of fiery magma and ash into the air. Bright lava bursts from the crater, smoke billows, and the once-clear skies grow dark and turbulent, filled with swirling ash clouds. The mountain’s distinct shape and the surrounding landscape remain consistent, with only the dramatic volcanic eruption and atmospheric changes unfolding. Ensure the transition from tranquil beauty to eruption is smooth and cinematic, with natural motion in the flow of lava, spreading smoke, and gradual darkening of the environment. 


Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

If you're having issues with the workflow, visit us at the Discord #Help Desk, or you may opt to email us at [email protected]

Join the ThinkDiffusion Discord Server!
ThinkDiffusion is your Stable Diffusion workspace in the cloud with unrestricted, bleeding edge opensource AI art tools. | 5510 members
Wan Start-End Frame with VACE and Flux Kontext - Complete Guide & Workflow

]]>
<![CDATA[Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide]]>
0:00
/0:12

Transform static portraits into realistic talking videos with perfect lip-sync using MultiTalk AI. No coding required.

Difficulty: Beginner-friendly
Setup Time: 15 minutes

What You'll Create

Turn any portrait - artwork, photos, or digital characters - into speaking, expressive videos that sync perfectly

]]>
https://learn.thinkdiffusion.com/transform-any-portrait-into-a-talking-character-wan-multitalk-image-to-video-guide/686fb42af92f2900013f9b4eMon, 28 Jul 2025 09:59:21 GMT
0:00
/0:12
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide

Transform static portraits into realistic talking videos with perfect lip-sync using MultiTalk AI. No coding required.

Difficulty: Beginner-friendly
Setup Time: 15 minutes

What You'll Create

Turn any portrait - artwork, photos, or digital characters - into speaking, expressive videos that sync perfectly with audio input. MultiTalk handles lip movements, facial expressions, and body motion automatically.

Example Results:

  • Portrait paintings that recite poetry
  • Character artwork that delivers dialogue
  • Profile photos that sing songs
  • Multiple characters having conversations

In this tutorial, you’ll discover a surprisingly easy way to bridge the gap between static images and expressive animation. Whether you’re looking to enhance your social media posts, create memorable content for friends, or explore new storytelling techniques, this guide will open the door to a world where your characters can truly interact and entertain.

What is MultiTalk?

0:00
/0:12

MultiTalk is an open-source AI framework that converts static images into realistic talking videos using audio input. Built by MeiGen AI, it accurately syncs lip movements and facial expressions to speech or singing, supporting both single and multi-person scenes.

With support for single or multi-person scenes, text prompts for emotion and behavior control, and compatibility with real or stylized characters, MultiTalk offers incredible creative flexibility. Integrated into ComfyUI and optimized for fast performance, it’s ideal for digital artists, content creators, educators, and developers who want to bring portraits, avatars, or original characters to life in seconds.

MultiTalk Framework

Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide
Source: MultiTalk
💡
In this work, I present MultiTalk, an audio-driven video generation framework capable of creating realistic talking-head animations from speech input. It enables the injection of multiple audio streams simultaneously. It also integrates improved audio cross-attention layer to better align speech features with visual motion, resulting in more natural, expressive video generation.

Let’s explore how you can turn imagination into motion, and watch as your creative visions become animated realities! The possibilities are as limitless as your imagination, so let’s get started and see where your characters can take you next!

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes
💡
Update 08/04/2025 - Workflow updated to its compatible version

Verified to work on ThinkDiffusion Build: June 27, 2025

ComfyUI v0.3.44 using the Wan14Bi2vFusionX.safetensors and multitalk.safetensors models

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow lacks certain required nodes. Install the missing custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide
  2. Check the list below for any custom nodes that need to be installed, then click Install.
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide

Required Models

For this guide you'll need to download these 8 recommended models.

1. detailz-wan.safetensors
2. Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors
3. Wan14Bi2vFusionX.safetensors
4. clip_vision_h.safetensors
5. Wan2_1_VAE_bf16.safetensors
6. umt5-xxl-enc-bf16.safetensors
7. multitalk.safetensors
8. TencentGameMate/chinese-wav2vec2-base
  1. Go to ComfyUI Manager  > Click Model Manager
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide
  2. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when finished.
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide

If Model Manager doesn't have them: Use direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You could also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory
detailz-wan.safetensors → .../comfyui/models/loras/
Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors → .../comfyui/models/loras/
Wan14Bi2vFusionX.safetensors → .../comfyui/models/diffusion_models/
clip_vision_h.safetensors → .../comfyui/models/clip/
Wan2_1_VAE_bf16.safetensors → .../comfyui/models/vae/
umt5-xxl-enc-bf16.safetensors → .../comfyui/models/text_encoders/
WanVideo_2_1_Multitalk_14B_fp8_e4m3fn.safetensors → .../comfyui/models/diffusion_models/
TencentGameMate/chinese-wav2vec2-base → auto-download
💡
If I want to achieve higher quality in my generated videos, I use the full multitalk.safetensors model. However, based on its advanced architecture and resource demands, I make sure my machine is equipped with at least 48GB of RAM and 64GB of VRAM to ensure smooth performance and optimal results. This setup allows me to take full advantage of the model’s capabilities when producing high-resolution, detailed video content.

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

Steps Recommended Nodes
1. Load an Image

Load an input image. It should be clear and high in quality.
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide
2. Set the Models

Set the models as seen on the image. Don't change any settings, as this may lead to out-of-memory errors; this workflow's settings are already at the limit.
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide
3. Write a Prompt

Write a prompt. Include the word "detailz", the trigger word for the LoRA.
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide
4. Check Sampling

Check the sampling settings. Keep the step count as it is, because the workflow uses a LoRA. Set the settings as seen on the image.
Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide
5. Check Output

Transform Any Portrait Into a Talking Character: Wan MultiTalk Image-to-Video Guide

Examples

0:00
/0:12
0:00
/0:11
0:00
/0:12
0:00
/0:12
0:00
/0:12

Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

]]>
<![CDATA[How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI]]>
0:00
/0:31

Flux Kontext expands your images into panoramic views directly in ComfyUI. Instead of cropping or stretching, it intelligently generates new content that extends beyond your image borders, creating seamless panoramic scenes.

What You'll Get

This workflow takes a standard image and generates

]]>
https://learn.thinkdiffusion.com/how-to-use-flux-kontext-for-image-to-panorama-3d-in-comfyui/686d15a61b332100013ff5f3Thu, 10 Jul 2025 13:44:07 GMT
0:00
/0:31
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI

Flux Kontext expands your images into panoramic views directly in ComfyUI. Instead of cropping or stretching, it intelligently generates new content that extends beyond your image borders, creating seamless panoramic scenes.

What You'll Get

This workflow takes a standard image and generates extended panoramic versions by:

  • Analyzing your input image's context and style
  • Generating new content that naturally extends the scene
  • Creating horizontal panoramas up to 3x the original width
  • Maintaining consistent lighting, perspective, and artistic style

Best for: Landscapes, cityscapes, interior shots, and any scene where you want to reveal "what's beyond the frame."

What is Image-to-Panorama?

0:00
/0:31

A panorama is an image that captures a much wider field of view than a standard photograph, often stretching across an entire landscape or environment to reveal far more than what a single frame can show. Traditionally created by stitching together multiple overlapping photos, panoramas provide a seamless, immersive scene that can be horizontal, vertical, or even a full 360-degree view. In image generation, panoramas are especially useful because they allow creators to expand the context and storytelling potential of a single image, offering a broader perspective and more detail. This capability is valuable for photographers and artists seeking to produce striking, high-resolution visuals, as well as for architects and real estate professionals who need to showcase spaces in a comprehensive, interactive way.
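One practical detail worth knowing: full 360-degree panoramas are conventionally stored as equirectangular images with a 2:1 width-to-height ratio, which is what most 360 viewers expect. A quick Pillow sketch (file names are placeholders) to check and, if needed, conform an output:

```python
from PIL import Image

# "panorama.png" is a placeholder for the workflow's saved output.
pano = Image.open("panorama.png")
w, h = pano.size
if abs(w / h - 2.0) > 0.01:
    # Resize so a 360 viewer can wrap the image around a full sphere.
    pano = pano.resize((2 * h, h))
    pano.save("panorama_2to1.png")
print(f"{w}x{h}, aspect {w / h:.2f}")
```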

0:00
/0:31

Marketers, educators, and those in travel or tourism can also benefit, using image-to-panorama and 3D view features to create virtual tours, interactive content, and engaging educational materials. Ultimately, panoramas and 3D views transform ordinary images into immersive experiences, making them more informative, captivating, and useful for a wide range of creative and professional applications.

Get ready to see your pictures in a whole new light!

Download Workflow

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes
💡
Credits to the awesome Dennis Schöneberg for this workflow.

Original Civitai Link: https://civitai.com/models/682349/360-degree-flux-and-kontext

Verified to work on ThinkDiffusion Build: June 27, 2025

ComfyUI v0.3.42 using the flux1-kontext-dev.safetensors model

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow lacks certain required nodes. Install the missing custom nodes for the workflow to work.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
  2. Check the list below for any custom nodes that need to be installed, then click Install.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI

Required Models

For this guide you'll need to download these 8 recommended models.

1. flux1-kontext-dev.safetensors
2. clip_l.safetensors
3. t5xxl_fp16.safetensors
4. ae.safetensors
5. alimama-creative-FLUX1-Turbo-Alpha.safetensors
6. HDR360.safetensors
7. 4x-ClearRealityV1.pth
8. Florence-2-base
  1. Go to ComfyUI Manager  > Click Model Manager
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
  2. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when finished.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI

If Model Manager doesn't have them: Use direct download links (included with the workflow) and upload through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You could also use the model path source instead by pasting the model's link address into ThinkDiffusion MyFiles using Upload URL.

Model Name → ThinkDiffusion Upload Directory
flux1-kontext-dev.safetensors → .../comfyui/models/diffusion_models/
clip_l.safetensors → .../comfyui/models/clip/
t5xxl_fp16.safetensors → .../comfyui/models/text_encoders/
ae.safetensors → .../comfyui/models/vae/
alimama-creative-FLUX1-Turbo-Alpha.safetensors → .../comfyui/models/loras/
HDR360.safetensors → .../comfyui/models/loras/
4x-ClearRealityV1.pth → .../comfyui/models/upscale_models/
Florence-2-base → auto-download
💡
If the file you upload for alimama-creative-FLUX1-Turbo-Alpha.safetensors arrives named diffusion_pytorch_model.safetensors, just rename it to its exact expected name.

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

Steps Recommended Nodes
1. Load an Image

Load an image that is suitable for a 3D panorama view, such as a surrounding scene, landscape, or interior.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
2. Set the Models

Set the models as seen on the image.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
3. Check Prompt

Check the prompt. You don't need to write anything more.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
4. Check Sampling

Check the sampling settings it should be the same as seen on the image.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
5. Check Crop and Panorama Settings

Leave the crop and panorama settings at their defaults; don't change them.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
6. Check Upscale

Check the upscale settings. Keep the upscale factor at 2x only; otherwise the workflow will crash.
How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
7. Check Output

How to Use Flux Kontext for Image-to-Panorama 3D in ComfyUI
💡
From my perspective, the seamless quality of these panoramas truly stands out. I’ve found them to be ideal for VR environments, immersive displays, and any project where a genuine 360-degree visual experience is essential. The absence of jarring transitions or visible seams significantly enhances the sense of immersion, making the visuals feel smooth and uninterrupted. This level of quality has made a noticeable difference in my work, especially when aiming to create captivating and realistic virtual experiences.
💡
When preparing to load an image as input, I always ensure that the image is not captured from a 3D perspective or already in an isometric view. Using such angles can cause the generated panorama to appear distorted or misaligned. Instead, I select images taken from a direct, head-on angle, as this provides the most accurate and seamless results. Additionally, I prioritize high-quality images to maintain clarity and detail throughout the panorama generation process. This careful selection helps me achieve visually consistent and professional panoramic outputs.
💡
When I want to achieve a noticeable boost in both image quality and fine detail, I make it a point to apply tiled diffusion upscaling. This technique allows me to enhance the resolution and sharpness of the final output, ensuring that even the smallest features are rendered with impressive clarity. 

Examples

0:00
/0:32
0:00
/0:23
0:00
/0:45

Troubleshooting

Red Nodes: Install missing custom nodes through ComfyUI Manager
Out of Memory: Use smaller expansion factors or switch to Ultra machine
Poor Quality: Check input image resolution and adjust kontext strength
Visible Seams: Lower strength and ensure good prompt description

If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

]]>
<![CDATA[Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!]]>

Want to create custom AI image models but find the process intimidating? This guide shows you how to train your own LoRA models using FluxGym - no coding experience required.

Whether you want to generate images in a specific art style, create consistent characters, or adapt AI models for your

]]>
https://learn.thinkdiffusion.com/make-your-character-style-lora-stand-out-easy-lora-training-with-fluxgym/6826d355ba14220001ac324bTue, 08 Jul 2025 06:12:07 GMTMake Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!

Want to create custom AI image models but find the process intimidating? This guide shows you how to train your own LoRA models using FluxGym - no coding experience required.

Whether you want to generate images in a specific art style, create consistent characters, or adapt AI models for your unique needs, you'll learn everything you need to know in about an hour.

Let’s dive in and unleash your creative potential with FluxGym!

What is LoRA?

Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!

LoRA, or Low-Rank Adaptation, is a technique in artificial intelligence that enables efficient and targeted fine-tuning of large pre-trained models, such as those used in image generation or language processing, without the need to retrain or modify the entire model.

Think of it as teaching an existing AI model new tricks—like recognizing a specific art style or character—while keeping all its original knowledge intact.
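For the technically curious, here is a tiny NumPy sketch of that idea (purely illustrative, not FluxGym's code): the original weight matrix stays frozen, and LoRA trains two much smaller matrices whose product acts as a low-rank correction on top of it.

    import numpy as np

    d = 1024      # width of the original weight matrix (illustrative)
    r = 8         # LoRA rank, much smaller than d
    alpha = 16    # LoRA scaling factor

    W = np.random.randn(d, d)           # frozen pretrained weights
    A = np.random.randn(r, d) * 0.01    # small trainable matrix
    B = np.zeros((d, r))                # small trainable matrix, starts at zero

    # Effective weights at inference time: original plus a low-rank update
    W_effective = W + (alpha / r) * (B @ A)

    # Only A and B are trained: 2*d*r parameters instead of d*d
    print(W.size, A.size + B.size)      # 1048576 vs 16384

That parameter gap is why a trained LoRA file stays small compared to the base model it adapts.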

This makes LoRA perfect for:

  • Creating images in a particular artistic style
  • Generating consistent characters for stories or projects
  • Adapting AI models for specific visual themes
  • Customizing outputs without massive computing resources
💡
It is especially useful in AI generation because it allows creators to quickly and efficiently tailor models for specific purposes, such as generating images in a particular style, capturing the likeness of a character, or adapting to a new domain, while preserving the general knowledge and capabilities of the original model. This makes LoRA a powerful tool for artists, developers, and businesses seeking flexible, scalable, and cost-effective ways to adapt AI models to their unique needs.


What is FluxGym?

Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!

FluxGym is an open-source, web-based tool designed to make training Flux LoRA models accessible and straightforward, especially for users with limited hardware resources. Created by the developer behind Pinokio, it stands out for its user-friendly Gradio interface.

Key features:

  • Easy-to-use web interface
  • Real-time training progress tracking
  • Automatic sample image generation
  • Works with limited hardware resources
  • No command line or coding required
💡
Key features include an intuitive web UI for configuring training parameters, real-time tracking of training progress, automated sample image generation, and support for custom base models and advanced training options.

Its simplicity, flexibility, and low hardware demands make FluxGym a standout choice for both beginners and experienced users seeking to train custom Flux LoRA models.


How to Train a Flux LoRA using FluxGym?

Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!
💡
Collecting images for LoRA datasets requires a different approach for each type of LoRA that you want to train.

The first part is preparing your dataset.

Tips for Datasets

  • Use short, descriptive filenames for your images.
  • Use high-quality, sharp images (ideally 1024x1024) with the subject clearly visible and centered.
  • Maintain consistency in style, lighting, and subject focus across your dataset.
  • Include variety by showing the subject in different poses, angles, and backgrounds to help the model generalize.
  • Crop to a square aspect ratio and organize images in a clearly named folder.
  • Write accurate captions for each image, describing the subject and style; review auto-generated captions for accuracy (see the example layout after this list).
  • Aim for quality over quantity: 20–30 well-chosen images are often better than a larger, inconsistent set.
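For reference, trainers like the one FluxGym wraps typically consume a folder of images paired with same-named caption text files, with your trigger word leading each caption. FluxGym builds this for you from the web UI, so treat the layout below as an illustrative sketch only (filenames and caption text are made up):

    my_lora_dataset/
        img_001.png
        img_001.txt    (e.g. "my_trigger_word, portrait, studio lighting, looking at camera")
        img_002.png
        img_002.txt
        ...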

What you'll need

- 20-30 high-quality images (1024x1024 recommended)
- Access to FluxGym (via ThinkDiffusion or local install)
- About 2+ hours for training (depends on the size of your dataset)
- Basic understanding of your creative goal

Procedures

Step 1

Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!
Step 1 Procedures:
1. Enter the name for your LoRA
2. Write a trigger word; it should be unique.
3. Choose flux-dev as the base model for training
4. Set the VRAM to 20GB
5. Set repeat training per image to 10
Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!
6. Set Train Epochs to 15
7. Training steps are computed automatically (see the worked example after this list)
8. Write sample image prompts and include the trigger word in them
9. Set Resize dataset images to 1024
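To sanity-check these settings, the auto-computed step count generally works out (assuming a batch size of 1) to the number of images times repeats per image times epochs. With a 20-image dataset and the settings above:

    total steps = images × repeats per image × epochs
                = 20 × 10 × 15
                = 3,000 steps

This matches the training settings used for the example character LoRAs later in this guide.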

Step 2

Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!
Step 2 Procedures:
1. Verify you have the correct number of images in your dataset
2. All files should be in PNG format
Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!
3. Click the AI captions button to generate captions automatically; captioning the images helps the training.

Step 3

Make Your Character & Style LoRA Stand Out - Easy LoRA Training with FluxGym!
Step 3 Procedures:
1. After completing Step 1 and Step 2, click Start Training.
2. Don't edit the auto-generated train script; this section is shown for reference and is not editable.
3. Monitor the training in the training log below.
Training may take a while, depending on the size of your image dataset and the training steps you set.
4. When training is complete, the training log will show it as completed.
5. Find your trained LoRA in .../fluxgym/output/<name of your LoRA folder>/

Examples of LoRA

Rodrigo Duterte (Character LoRA)

💡
Training Settings: 20 Image Dataset, 10 repeat images per train, 15 epochs, 3000 steps, Trigger word - Rodrigo_Duterte

Dan Mumford Art (Style LoRA)

💡
Training Settings: 30 Image Dataset, 10 repeat images per train, 10 epochs, 3000 steps, Trigger word - danmumford art

We would like to credit Dan Mumford for the concept and style of his art, which served as inspiration for this LoRA's training dataset.

Sarah Duterte (Character LoRA)

💡
Training Settings: 20 Image Dataset, 10 repeat images per train, 15 epochs, 3000 steps, Trigger word - Sara_Duterte

Anato Finnstark Art (Style LoRA)

💡
Training Settings: 30 Image Dataset, 10 repeat images per train, 10 epochs, 3000 steps, Trigger word - dark_fantasy anato_finnstark

We would like to credit Anato Finnstark for the concept and style of his art, which served as inspiration for this LoRA's training dataset.

💡
From my experience, I’ve found that there are many other types of LoRA models I can train beyond the usual examples, such as Object LoRA, Pose LoRA, Modifier LoRA, etc. Working with these specialized LoRA models gives me the flexibility to fine-tune outputs for specific objects, unique poses, or particular stylistic effects, depending on what my project needs. By experimenting with and integrating these different LoRA variants, I’m able to tailor my image generation process more precisely, which has been incredibly valuable for both creative and technical tasks within Stable Diffusion workflows.

Once you're comfortable with character and style LoRAs, you can experiment with:

  • Object LoRAs: For specific items or props
  • Pose LoRAs: For particular poses or compositions
  • Modifier LoRAs: For specific effects or modifications

Each type serves different creative needs and can be combined for even more precise control over your AI-generated images.


If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

If you're ready to try this yourself but need more powerful hardware, ThinkDiffusion offers FluxGym access through your browser with high-end GPUs.

The key to success is starting simple: pick one clear goal (like a specific art style), gather quality images, and follow the steps carefully. With a bit of practice, you'll be creating custom AI models that perfectly match your creative vision.

If you enjoy ComfyUI and you want to test out creating awesome animations, then feel free to check out this Flux LoRA Training using ComfyUI. And have fun out there with your noodles!

]]>
<![CDATA[Total Image Control with Flux Kontext: Complete Tutorial]]>https://learn.thinkdiffusion.com/total-image-control-with-flux-kontext-complete-tutorial/685e60aebf31340001f896d0Fri, 04 Jul 2025 01:55:25 GMTTotal Image Control with Flux Kontext: Complete Tutorial
Prompt: This man at the kitchen.
Total Image Control with Flux Kontext: Complete Tutorial

Flux Kontext lets you edit images precisely using text descriptions. Tell it what to change, and it modifies only those specific parts while keeping the rest intact. This is a gamechanger for control, and delivers incredible quality.

What is Flux Kontext?

Total Image Control with Flux Kontext: Complete Tutorial
Source: Flux Kontext

Flux Kontext is an AI image editing model by Black Forest Labs that excels at targeted modifications. Instead of generating entirely new images, it edits existing ones based on your text instructions.

Core capabilities:

  • Local editing: Change specific parts without affecting the whole image
  • Character consistency: Keep people looking the same across multiple edits
  • Style transfer: Apply artistic styles to existing images
  • Multi-round editing: Make several edits in sequence
  • Object manipulation: Add, remove, or modify objects

Available versions: Pro, Max, and Dev. This guide covers Flux1-kontext-dev, which is freely available under a non-commercial license.

💡
This guide is dedicated exclusively to Flux1-kontext-dev, a model whose weights are openly available under a non-commercial license, breaking the trend of powerful image-editing models being locked behind proprietary APIs.

Multi-Round Editing (Iterative)

Total Image Control with Flux Kontext: Complete Tutorial
Source: Flux Kontext
💡
Flux1 Dev Kontext enables you to perform multi-round image editing using just one input image, allowing you to apply a series of targeted edits while keeping the original style and details consistent. Each edit builds on the last, so you can refine or transform specific parts of your image step by step without losing quality or introducing inconsistencies. This fast, interactive workflow is ideal for artists and creators who want precise, context-aware control over their image editing process, making it easy to experiment and achieve complex results with ease.

How Multi-Round Editing Works

Flux Kontext's strength is iterative editing. You can:

  1. Start with one image
  2. Make a targeted edit
  3. Use the result for the next edit
  4. Continue refining step by step

Each edit builds on the previous one while maintaining visual consistency and quality.

Limitations of Flux Kontext

Total Image Control with Flux Kontext: Complete Tutorial
Illustration of a FLUX.1 Kontext failure case: After six iterative edits, the generation is visually degraded and contains visible artifacts. Source: Limitations of Flux Kontext
💡
In my experience with FLUX.1 Dev Kontext, I’ve noticed a few limitations in its current implementation. When I engage in extended, multi-turn editing sessions, I sometimes encounter visual artifacts that reduce the overall image quality. There have also been instances where the model doesn’t completely follow my instructions, occasionally missing specific details from my prompts. I’ve found that its limited world knowledge can make it challenging to generate content that is truly contextually accurate. Additionally, I’ve observed that the distillation process can introduce its own set of artifacts, which can impact the fidelity of the final output.
  • Quality degradation: After 6+ iterative edits, images may show artifacts
  • Instruction following: Sometimes misses specific prompt details
  • Limited world knowledge: May struggle with contextually accurate content
  • Distillation artifacts: Processing can introduce visual issues

Get ready to unlock a smarter, sharper, and more creative approach to image generation, because your next masterpiece is just around the corner!

How to Use Flux Kontext in ComfyUI

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
    • ComfyUI Manager > Install Missing Custom Nodes

Verified to work on ThinkDiffusion Build: June 27, 2025

ComfyUI v0.3.42 using the flux1-kontext-dev.safetensors model

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow is missing some of the required nodes. Install the custom nodes so the workflow can run.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
Total Image Control with Flux Kontext: Complete Tutorial
  1. Check the list to see which custom nodes need to be installed, then click Install.
Total Image Control with Flux Kontext: Complete Tutorial

Required Models

For this guide you'll need to download these 4 recommended models.

1. flux1-kontext-dev.safetensors
2. clip_l.safetensors
3. t5xxl_fp16.safetensors
4. ae.safetensors
  1. Go to ComfyUI Manager  > Click Model Manager
Total Image Control with Flux Kontext: Complete Tutorial
  1. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when you are finished.
Total Image Control with Flux Kontext: Complete Tutorial

If Model Manager doesn't have them: Use the direct download links (included with the workflow) and upload them through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You can also use the model path source below instead: copy the model's link address and paste it into ThinkDiffusion MyFiles using Upload URL.

Model Name | ThinkDiffusion Upload Directory
flux1-kontext-dev.safetensors | .../comfyui/models/diffusion_models/
clip_l.safetensors | .../comfyui/models/clip/
t5xxl_fp16.safetensors | .../comfyui/models/text_encoders/
ae.safetensors | .../comfyui/models/vae/

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

Steps Recommended Nodes
1. Load an Image

Load an image. Make sure it is clear, sharp, and free from artifacts. If you want to combine two images, just enable the 2nd Load Image node.
Total Image Control with Flux Kontext: Complete Tutorial
2. Set the Models

Set the models as shown in the image.
Total Image Control with Flux Kontext: Complete Tutorial
3. Write a Prompt

Write a simple prompt that names the subject and describes what you want in the output. See the examples below for guidance.
Total Image Control with Flux Kontext: Complete Tutorial
4. Check the Sampling

Use the settings shown in the image. Do not raise the CFG, as higher values may cause artifacts.
Total Image Control with Flux Kontext: Complete Tutorial
5. Check the Output

Total Image Control with Flux Kontext: Complete Tutorial

Examples

Character Consistency

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Use this woman, create an image broadcasting news in television
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Use this woman, create an image running on a race track

Add / Edit Text

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Replace the "LAS VEGAS NEVADA" to "SEBASTIAN THE AI EXPERT"
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Replace the "Wheeler" to "Lonely"

Remove Objects

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Remove the Beard and moustache, revealing his cleaner face.
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Remove the bicycles, revealing their legs naturally.

Style Reference

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Using this style, create an image of New York city at night
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Using this crochet art style, create a image of a human family in the living room

Switch View

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Rotate the cat 180 degrees to view directly from behind the cat, showing its back and tail
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Rotate the camera 90 degrees to view directly from side of the car, showing its side while maintaining its color and shape

Multiple Input Images

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: 1 sweet couple walking at the park and their eyes facing the camera. The woman man is holding a bouquet of flowers and holds the man's arm.
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: 2 pair of shoes displayed at the shoe store.

Change Light

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Convert to afternoon scene with soft golden sunset light and gentle dusk mist, maintaining the same composition and architectural details
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Convert to noon scene with sunlight above and gentle noon heat, maintaining the same composition and architectural details

Image Editing

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Turn the SUV vehicle into green color
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Turn his coffee into a barbeque

Restyle

Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Transform to watercolor art style
Total Image Control with Flux Kontext: Complete Tutorial
Prompt: Transform this into minimalist lineart style, its an image about the landscape where mountains, sky and trees are visible

Tips for Better Results

  1. Start simple: Begin with basic edits before complex changes
  2. Be specific: Clear descriptions work better than vague ones
  3. Check quality: Monitor for artifacts after each edit
  4. Limit iterations: Avoid more than 5-6 sequential edits
  5. Use good source images: High-quality inputs produce better outputs

Troubleshooting

  • Red nodes: Install missing custom nodes via ComfyUI Manager
  • Model errors: Verify all 4 models are downloaded correctly
  • Poor results: Simplify prompts and retry
  • Artifacts: Reduce CFG settings or start with fresh image




If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

]]>
<![CDATA[MAGREF: Generate AI Videos with Multiple People and Objects from Images]]>https://learn.thinkdiffusion.com/magref-generate-ai-videos-with-multiple-people-and-objects-from-images/685a9ce64b09670001f6e8e7Wed, 02 Jul 2025 13:31:03 GMT
0:00
/0:03
MAGREF: Generate AI Videos with Multiple People and Objects from Images

Prompt: A cute white cat walking along the park. A big green trash bin is visible near the cat.

MAGREF lets you create videos from multiple reference images while keeping each person or object looking consistent throughout the video. Instead of generating random characters, you can use your own photos to control exactly who or what appears in the final video. This guide shows you how to set up and use MAGREF in ComfyUI to create videos with multiple subjects that maintain their original appearance.

What is MAGREF?

0:00
/0:23

Source: MAGREF

MAGREF, or Masked Guidance for Any-Reference Video Generation, is a diffusion-based AI framework that generates videos from multiple reference images while preserving subject identity. It uses region-aware dynamic masking to handle any number of subjects and pixel-wise channel concatenation to maintain fine details. This makes it especially useful for creating videos where specific people, objects, or backgrounds need to appear exactly as they do in your source images.
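As a loose illustration of what pixel-wise channel concatenation means here (this is not MAGREF's actual code, just the general shape of the idea), the encoded reference images are stacked along the channel axis so every spatial position carries information from each reference:

    import numpy as np

    # Encoded reference features, shape (channels, height, width); values are dummies
    person     = np.random.randn(4, 60, 104)
    obj        = np.random.randn(4, 60, 104)
    background = np.random.randn(4, 60, 104)

    # Stack the references along the channel axis so each pixel location
    # keeps details from every reference image
    conditioning = np.concatenate([person, obj, background], axis=0)
    print(conditioning.shape)   # (12, 60, 104)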

MAGREF is especially valuable for creators, animators, and researchers seeking to produce customizable, multi-subject videos with exceptional fidelity and control, making it a standout solution for both creative and professional video generation needs.

Types of MAGREF

MAGREF: Generate AI Videos with Multiple People and Objects from Images
Source: Github Repo

MAGREF offers three flexible video generation modes: 

  • Single ID, which uses one reference image to keep a single subject consistent throughout the video; 
  • Multi-ID, which allows multiple subjects from different reference images to appear together while maintaining their unique identities;
  • ID-Object-Background, which lets users combine references for people, objects, and backgrounds to create complex, multi-layered scenes.

These options make MAGREF suitable for everything from simple, personalized videos to rich, detailed multi-subject compositions.
0:00
/0:03

Prompt: A man and woman happily hug each other.

With just a few easy steps, you’ll see how effortless and fun it can be to bring your memories to life in ways you never thought possible!

How to Use Wan MAGREF for Video Generation

Installation guide

  1. Download the workflow file
  2. Open ComfyUI (local or ThinkDiffusion)
  3. Drag the workflow file into the ComfyUI window
  4. If you see red nodes, install missing components:
  • ComfyUI Manager > Install Missing Custom Nodes
💡
Update 08/13/2025: Workflow replaced

Verified to work on ThinkDiffusion Build: June 6, 2025

ComfyUI v0.3.47 with support for the Wan2_1-Wan-I2V-MAGREF-14B_fp16_pure.safetensors model

Note: We specify the build date because ComfyUI and custom node versions updated after this date may change the behavior or outputs of the workflow.

Minimum Machine Size: Ultra

Use the specified machine size or higher to ensure it meets the VRAM and performance requirements of the workflow

💡
Download the workflow and drag & drop it into your ComfyUI window, whether locally or on ThinkDiffusion. If you're using ThinkDiffusion, minimum requirement is the Turbo 24gb machine, but we do recommend the Ultra 48gb machine.

Custom Nodes

If there are red nodes in the workflow, it means the workflow is missing some of the required nodes. Install the custom nodes so the workflow can run.

  1. Go to the ComfyUI Manager  > Click Install Missing Custom Nodes
MAGREF: Generate AI Videos with Multiple People and Objects from Images
  1. Check the list to see which custom nodes need to be installed, then click Install.
MAGREF: Generate AI Videos with Multiple People and Objects from Images

Required Models

For this guide you'll need to download these 5 recommended models.

1. Wan2_1-Wan-I2V-MAGREF-14B_fp16_pure
2. umt5_xxl_fp16.safetensors
3. wan_2.1_vae.safetensors
4. Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors
5. clip_vision_h.safetensors
  1. Go to ComfyUI Manager  > Click Model Manager
MAGREF: Generate AI Videos with Multiple People and Objects from Images
  1. Search for the models listed above; when you find the exact model you're looking for, click Install, and press Refresh when you are finished.
MAGREF: Generate AI Videos with Multiple People and Objects from Images

If Model Manager doesn't have them: Use the direct download links (included with the workflow) and upload them through ThinkDiffusion MyFiles > Upload URL. Refer to our docs for more guidance on this.

You can also use the model path source below instead: copy the model's link address and paste it into ThinkDiffusion MyFiles using Upload URL.

Model Name | ThinkDiffusion Upload Directory
Wan2_1-Wan-I2V-MAGREF-14B_fp16_pure.safetensors | .../comfyui/models/diffusion_models/
umt5_xxl_fp16.safetensors | .../comfyui/models/text_encoders/
wan_2.1_vae.safetensors | .../comfyui/models/vae/
Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors | .../comfyui/models/lora/
clip_vision_h.safetensors | .../comfyui/models/clip_vision/

Step-by-step Workflow Guide

This workflow was pretty easy to set up and runs well from the default settings. Here are a few steps where you might want to take extra note.

There are two node groups to keep in mind.
You can disable the 2nd group at first, so you can check the brightness and saturation of each input image in the 1st group before doing a full run of the workflow. Mismatched brightness or saturation across the images can lead to uneven color in the output.
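If you want a quick, optional way to compare your inputs before a full run, a small script outside ComfyUI can report each image's average brightness (requires Pillow; the filenames below are placeholders for your own reference images):

    from PIL import Image, ImageStat

    for path in ["person.png", "object.png", "background.png"]:   # your reference images
        img = Image.open(path).convert("L")              # grayscale for a rough brightness estimate
        mean_brightness = ImageStat.Stat(img).mean[0]    # 0 = dark, 255 = bright
        print(f"{path}: mean brightness {mean_brightness:.1f}")

    # Large gaps between these numbers suggest the images need levels or color
    # adjustment before they go into the 1st group.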

Steps Recommended Nodes
1. Load Image
(1st Group)

Load a high-quality image; it should not be pixelated or blurred. Just follow the settings as shown in the image.
MAGREF: Generate AI Videos with Multiple People and Objects from Images
Check Bridge Settings
(optional)

This area is optional and is where you choose which type of MAGREF you want to use. If you're using only one image, route it through a single bridge and connect it to the Resize Image and Get Image Size nodes. If you need multiple images, use the Image Concatenate Multi node, wire it up according to your number of inputs, and reconnect it on the other side.
MAGREF: Generate AI Videos with Multiple People and Objects from Images
2. Set Input Settings
(2nd Group)

Set the input settings for the image. The model is only compatible with 480p and 720p. Take note that the frame count should not go above 81, as the workflow may crash beyond that.
MAGREF: Generate AI Videos with Multiple People and Objects from Images
3. Set Models
(2nd Group)

Set the models as seen on the image.
MAGREF: Generate AI Videos with Multiple People and Objects from Images
4. Write Prompt
(2nd Group)

Write a simple prompt; you don't need to be specific or super detailed. Describe the subject and add some simple action words.
MAGREF: Generate AI Videos with Multiple People and Objects from Images
5. Check Sampling Settings
(2nd Group)

Check the sampling settings as shown in the image. Steps should be 4-6 and CFG should be 1-2 only, nothing else.
MAGREF: Generate AI Videos with Multiple People and Objects from Images
6. Check Output
(2nd Group)

Check the generated output. It may not be perfect on the first run; if you're not happy with the result, just tweak the prompt and rerun the workflow.
MAGREF: Generate AI Videos with Multiple People and Objects from Images

Examples

Single ID Video

0:00
/0:03

Prompt: A man is having a concert at the stage, behind him are his band mates holding instrument.

0:00
/0:03

Prompt: A man wearing this shirt and he is walking at the street.


Multi-ID Video

0:00
/0:03

Prompt: 4 persons are having a meeting at the office, discussing some serious matter.

0:00
/0:03

Prompt: A donkey, an orange cat and a cute white dog were seen walking at the farm.


ID + Object + Background

0:00
/0:03

Prompt: A girl is drinking an Asahi canned beer at her living room with green sofa.

0:00
/0:03

Prompt: A girl reading the Holy Bible in her room.


If you’re having issues with installation or slow hardware, you can try any of these workflows on a more powerful GPU in your browser with ThinkDiffusion.

]]>