Two starting points: text or image
Start from a written idea when you only have a concept, or start from a reference image when you want the prompt to inherit the visual style of an existing frame.
Convert a text idea or reference image into a cinematic prompt covering subject, motion, scene, camera, lighting, and atmosphere — formatted for the model you actually use, whether that's Sora, Runway, or Kling.
Write a short idea or upload a reference image to generate a production-ready video prompt.
Video prompts work best when they describe motion, camera, and atmosphere — not just static details.
Start from a written idea when you only have a concept, or start from a reference image when you want the prompt to inherit the visual style of an existing frame.
Video prompts need movement, camera language, and atmosphere — not just subject and style. Each output is structured around motion logic so the resulting clip looks intentional, not random.
Sora prompts, Runway prompts, and Kling prompts all read differently. Switch the target model and the prompt rewrites itself in the right format.
When the prompt looks right, push it straight into image-to-video or text-to-video generation without leaving the workbench.
Type your idea or upload a frame. Both produce a complete cinematic prompt.
Sora, Runway, Kling, or general — the prompt format adapts to each.
Use the next-step actions to launch image-to-video or text-to-video without losing your prompt.
A video prompt generator turns a text idea or reference image into a structured prompt that includes motion, camera, scene, and atmosphere — ready to use with text-to-video or image-to-video models like Sora, Runway, and Kling.
Image prompts focus on a single frame. Video prompts must describe motion over time, camera behaviour, and how the scene evolves — this generator structures every output around those dimensions.
Sora, Runway, Kling, and a general format that works across most other text-to-video tools.
Yes. Switch to image-input mode, upload a frame, and the generator will lift composition and style from the image while structuring the motion logic for video.