Turn One Product Photo Into a Branded Vertical Video With Gemini
Gemini's built-in Veo 3.1 image-to-video makes a social-ready 9:16 clip from a single photo in under five minutes, no editor and no freelancer.
What matters today
Gemini's built-in Veo 3.1 image-to-video makes a social-ready 9:16 clip from a single photo in under five minutes, no editor and no freelancer.
Key points
- What image-to-video does
- Step 1: Open the video tool in Gemini
- Step 2: Upload your product image
- Step 3: Write a prompt that controls motion and audio
- Step 4: Generate, review, and regenerate
What you'll learn in this article:
- How Gemini's image-to-video works and what it can produce
- The exact steps to turn a product photo into a 9:16 vertical clip
- A reusable prompt structure for motion, camera moves, and voiceover
- How to keep a product looking consistent across multiple clips
- Realistic limits, so you know when to use it and when not to
A short product video used to mean a brief, a freelancer, a few hundred dollars, and a week of waiting. For a single clip to test on Reels or Shorts, that math never works. So most small teams skip video entirely and post another static image, then wonder why the feed feels flat.
Gemini changed the math. Its built-in Veo 3.1 image-to-video turns one photo into an 8-second clip with native audio, and it now generates native vertical 9:16 video with 1080p and 4K upscaling, the exact format social platforms reward. It lives inside the Gemini app, so there is no separate video tool to learn and no export dance between apps.
The result is that a marketing lead can produce a branded vertical clip from a single product photo in under five minutes, for the cost of a Gemini subscription instead of a $500 to $2,000 edit. This article walks through the exact workflow, the prompt structure that produces usable output instead of weird artifacts, and the limits worth knowing before you build a campaign around it.
What image-to-video does
Veo 3.1 takes a still image and animates it into a short clip with camera movement, motion, and natively generated audio. The version in Gemini produces 8-second clips and supports 720p, 1080p, and 4K, with native 9:16 vertical output for social. You can also supply up to three reference images to keep a product or character consistent across multiple shots, which is what makes it usable for a brand rather than a one-off novelty.
It has been available in the Gemini app since late 2025 and continues to get capability upgrades, including the vertical reference-image workflow. For a business user, the headline is simple: a photo goes in, a short branded video comes out, inside one app.
Step 1: Open the video tool in Gemini
Open the Gemini app and tap into the prompt box. Find the Video option. On some devices it shows directly; on others it sits under a More menu or the "..." control. Select it, then choose From photo. This tells Gemini you want image-to-video rather than text-to-video.
Step 2: Upload your product image
Choose a clean, well-lit photo of the product on a simple background. The clearer the subject and the less visual clutter, the better the animation. A crisp studio-style shot animates far more cleanly than a busy lifestyle photo with competing elements.
Step 3: Write a prompt that controls motion and audio
The difference between a usable clip and a strange one is the prompt. Describe the camera move, the motion, and any voiceover line explicitly. Use this structure:
GEMINI IMAGE-TO-VIDEO PROMPT
Animate this product photo as an 8-second vertical 9:16 clip. Camera: slow push-in toward the product, then a gentle orbit to the right. Motion: subtle, premium, no fast cuts. Lighting: keep the existing studio lighting, add a soft highlight sweep across the surface. Audio: a calm voiceover saying "Built for the work that matters." End on a clean hold of the product centered in frame.
Naming the camera move (push-in, orbit, pan), the pace (subtle, no fast cuts), and the exact voiceover line gives Veo the constraints it needs to produce something on-brand instead of generic.
Step 4: Generate, review, and regenerate
Generate the clip and watch it at full size. Image-to-video is probabilistic, so the first result may not nail the motion. Regenerate with a sharper instruction: if the orbit was too fast, write "slow the orbit by half"; if the product warped, write "keep the product shape rigid and stable throughout." Two or three iterations usually land it.
Step 5: Keep products consistent across a set
For a campaign, consistency matters. When you make a second or third clip of the same product, supply the same reference image (and up to three references total) so the product reads identically across the set. This is how you build a small library of clips that look like they came from the same shoot.
A reusable workflow for a content batch
The real leverage is batching. Pick five product photos on a Monday morning, run each through the same prompt structure with the product-specific details swapped, and you have five vertical clips before lunch. Schedule them across the week. The whole batch costs the time it used to take to brief a single freelance edit.
Realistic limits
Image-to-video is strong for short, simple product and brand clips. It is not the tool for a 90-second explainer with synced dialogue, complex scene changes, or precise text overlays, those still need a real editor. Eight seconds is the canvas. Treat each clip as a single strong moment, a hook, a reveal, a product hero shot, rather than a full story, and the output holds up.
Action Steps Summary
- Open the video tool: In the Gemini app, tap the prompt box, find the Video option, and choose From photo.
- Use a clean image: Upload a crisp, well-lit product photo on a simple background.
- Control the clip with the prompt: Name the camera move, the pace, and the exact voiceover line.
- Iterate fast: Regenerate with sharper instructions if the motion or shape is off.
- Batch for a campaign: Run multiple photos through the same prompt structure and keep one reference image for consistency.
Three deep dives. Four useful moves. One email worth opening.
PromptHacker turns the AI firehose into practical next steps for work, health, family, and everything time keeps trying to steal.