Wan 2.6 Video Generation

Wan 2.6 Prompting Guide

Wan 2.6 works best when you prompt it like a film shot instead of a still-image caption. It responds especially well to clear scene setup, explicit motion, deliberate camera direction, timed action beats, and strong continuity instructions.

Best Overall Formula

Subject + environment + action + camera + lighting + style + timing + constraints

Why Wan 2.6 Feels Different

Wan 2.6 is strong at cinematic short-form video, multi-shot structure, image-to-video continuity, and prompt-driven motion, so it tends to reward prompts that read like direction for a film crew.

Single Biggest Rule

Prompt the sequence like a shot, not like a poster.

Best Order to Write

Who + where + what happens + camera + light + style + timing + negatives

Prompt Anatomy

1) Subject

Define the main subject clearly with only the details that affect identity, wardrobe, mood, or framing.

2) Environment

Anchor the shot in a specific place with enough lighting, texture, and background detail to prevent scene drift.

3) Action

Describe visible motion in a simple sequence with verbs and small timed beats instead of static description.

4) Camera

Specify framing, movement, and perspective so the shot feels intentional and cinematic.

Text-to-Video

For text-to-video, define the whole visual situation from scratch and choreograph the shot in a clear sequence.

A middle-aged jazz pianist plays alone in a smoky underground club, seated at a black grand piano. The camera begins in a wide shot from the back of the room and slowly dollies toward him as he plays. Candlelit tables, warm amber practical lights, drifting cigarette smoke, elegant moody noir atmosphere, subtle reflections on the piano lid. In the middle of the shot he closes his eyes and leans into the performance, and in the last moment he opens them and glances toward the audience. Realistic human motion, natural hand movement, stable facial features, detailed fingers, no flicker, no distortion.

Image-to-Video

For image-to-video, use the source image as the anchor and describe what should stay fixed, what should animate, and how the camera should behave.

Preserve the woman’s pose, outfit, face, and overall framing from the source image. She stands on a rooftop at sunset while wind moves her hair and jacket naturally. The camera makes a slow cinematic push-in. Soft golden-hour light, realistic skin texture, subtle background city motion, elegant premium fashion-film aesthetic. Stable face, natural blink, no warping, no body shape changes, no extra limbs.

Motion Guidance

Low Motion

subtle breathing
blinking
small head turn
fabric movement
hair moving in wind

Medium Motion

walking slowly
turning toward camera
raising a hand
looking back
stepping into frame

High Motion

running
fight choreography
multi-subject movement
fast environmental motion
heavy camera movement

Camera Movement

Wan 2.6 usually responds best when you give it one clear framing choice and one deliberate camera move.

close-up
medium shot
wide shot
over-the-shoulder
slow push-in
dolly out
tracking shot
handheld follow
locked-off shot
gentle orbit

Timing and Shot Beats

First, she looks down at the letter. Then she slowly lifts her gaze toward camera. In the final moment, a faint smile appears as the wind catches her hair.

Wan 2.6 tends to perform better when the action unfolds in simple beats instead of everything happening at once.

Lighting and Style

cinematic realism, luxury commercial, documentary realism, moody noir, dreamlike fantasy, prestige TV drama, glossy music video, high-end fashion film, naturalistic daylight realism, shallow depth of field, anamorphic feel, soft bokeh

Useful Negatives

no flicker
no jitter
no warped face
no asymmetrical eyes
no distorted mouth
no bad hands
no extra fingers
no extra limbs
no duplicate subject
no deformed anatomy
no unstable background

Strong Master Template

[Main subject] in [specific environment], performing [clear action].
The camera [framing] and [camera movement].
[Visual style / realism level], [lighting], [color palette], [lens or depth-of-field feel].
During the shot, [timed action beat 1], then [timed action beat 2], ending with [final beat].
Natural realistic motion, stable anatomy, consistent facial features, coherent background, no flicker, no distortion, no extra limbs.

Bottom Line

Give Wan 2.6 a clear subject, a real setting, a visible action, a deliberate camera move, and simple timed beats.