ClawKit Reliability Toolkit
Official · Verified · Media · Safety 4/5

sora

Generate videos using OpenAI's Sora API. Use when the user asks to generate, create, or make videos from text prompts or reference images. Supports image-to-video generation with automatic resizing.

Why use this skill?

Use the OpenClaw Sora skill to generate professional AI videos from text and images. It is fast and automated, and supports multiple resolutions and model types.


Install via CLI (Recommended)

clawhub install openclaw/skills/skills/pauldelavallaz/sora-video-gen

What This Skill Does

The Sora video generation skill empowers the OpenClaw AI agent to create high-quality, cinematic video content directly from text-based instructions or existing reference images. By leveraging the advanced OpenAI Sora API, this tool allows for the seamless creation of short-form video content, ranging from 4 to 12 seconds in duration, with support for various aspect ratios and professional-grade resolutions. Whether you are building brand assets, prototyping visual concepts, or generating social media content, this skill automates the entire rendering pipeline, from prompt processing to final file delivery.
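At its core, each generation is a single request with a prompt and a handful of rendering options. The sketch below shows what assembling such a request might look like; the parameter names (`model`, `prompt`, `seconds`, `size`) and the 4–12 second bounds mirror this page's description, but the exact API schema is an assumption — consult the official Sora API reference before relying on it.

```python
def build_video_request(prompt, model="sora-2", seconds=8, size="1280x720"):
    """Assemble a JSON-ready body for a text-to-video generation call.

    Field names are illustrative, not the confirmed API schema.
    """
    if not 4 <= seconds <= 12:
        raise ValueError("clips described here run 4 to 12 seconds")
    return {
        "model": model,
        "prompt": prompt,
        "seconds": seconds,
        "size": size,
    }
```

For example, asking for a 12-second clip on the pro model would produce `build_video_request("neon-lit Tokyo street at night", model="sora-2-pro", seconds=12)`.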

Installation

To integrate this functionality into your environment, run the following command in your terminal:

clawhub install openclaw/skills/skills/pauldelavallaz/sora-video-gen

Ensure you have your OPENAI_API_KEY set as an environment variable, or have it ready to pass via the --api-key flag during execution to authenticate with the OpenAI platform.
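The credential lookup described above — an explicit flag taking precedence over the environment variable — can be sketched in a few lines of Python. `resolve_api_key` is a hypothetical helper written for illustration, not part of the skill itself:

```python
import os

def resolve_api_key(cli_key=None, env=os.environ):
    """Prefer an explicit --api-key value; fall back to OPENAI_API_KEY."""
    key = cli_key or env.get("OPENAI_API_KEY")
    if not key:
        raise SystemExit("Set OPENAI_API_KEY or pass --api-key")
    return key
```

A CLI wrapper would call this once at startup and fail fast with a clear message when neither source is set.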

Use Cases

This skill is perfect for marketers looking to create quick lifestyle commercial snippets, designers needing to visualize motion in a scene, and content creators aiming to generate b-roll for longer projects. You can use it to turn static product photography into engaging "cinematic" product reveals, or to generate abstract atmospheric footage by simply describing the lighting, mood, and camera movement you desire.

Example Prompts

  1. "Create a 12-second video of a bustling Tokyo street at night, focusing on neon lights reflecting on wet pavement, using the sora-2-pro model."
  2. "Take this image (reference.png) and turn it into an 8-second video where the clouds move slowly in the background and the lighting changes to sunset."
  3. "Generate a short video clip showing a slow dolly shot of a luxury watch, highlighting the metallic reflections and premium craftsmanship."

Tips & Limitations

To get the best results, structure your prompts to include specific camera movements (e.g., pan, zoom, tracking shot), lighting conditions, and atmospheric details. Remember that the system will automatically resize your input images to match the target resolution, so expect some cropping if aspect ratios differ significantly. Be mindful that video generation is a resource-intensive process that typically takes between 1 and 3 minutes. Finally, download your generated files promptly: the output storage is ephemeral, and videos expire from the cache after approximately one hour.
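Because generation takes a few minutes and outputs expire within about an hour, a client typically polls the job and downloads the file as soon as it completes. A minimal sketch of that loop — `fetch_status` stands in for whatever API client you use, and the `status`/`url` field names are assumptions:

```python
import time

def wait_for_video(job_id, fetch_status, poll_every=10, timeout=300):
    """Poll a generation job until it finishes, then return the download URL.

    `fetch_status` is any callable taking a job id and returning a dict
    like {"status": "queued" | "running" | "completed" | "failed", "url": ...}.
    Download the result immediately; cached outputs are short-lived.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        job = fetch_status(job_id)
        if job["status"] == "completed":
            return job["url"]
        if job["status"] == "failed":
            raise RuntimeError(f"generation failed for {job_id}")
        time.sleep(poll_every)
    raise TimeoutError(f"{job_id} not finished within {timeout}s")
```

Injecting `fetch_status` keeps the loop testable and lets you swap in the real API call without changing the polling logic.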

Metadata

Stars: 1217
Views: 1
Updated: 2026-02-20
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-pauldelavallaz-sora-video-gen": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags

#sora #ai-video #generative-media #openai #video-automation
Safety Score: 4/5

Flags: file-write, file-read, external-api, code-execution

Related Skills

morpheus-fashion-design

Generate professional advertising images with AI models holding or wearing products.

✅ USE WHEN:
- Need a person/model in the image WITH a product
- Creating fashion ads, product campaigns, commercial photography
- Want a consistent model face across multiple shots
- Need professional lighting/camera simulation
- Input: product image + model reference (or catalog)

❌ DON'T USE WHEN:
- Just editing/modifying an existing image → use nano-banana-pro
- Product-only shot without a person → use nano-banana-pro
- Already have the hero image, need variations → use multishot-ugc
- Need video, not image → use veed-ugc after generating the image
- URL-based product fetch with a brand profile → use ad-ready instead

OUTPUT: Single high-quality PNG image (2K-4K resolution)

pauldelavallaz 1217

veed-ugc

Generate UGC-style promotional videos with AI lip-sync. Takes an image (person with product from Morpheus/Ad-Ready) and a script (pure dialogue), creates a video of the person speaking. Uses ElevenLabs for voice synthesis.

pauldelavallaz 1217

ugc-manual

Generate a lip-sync video from an image plus the user's own audio recording.

✅ USE WHEN:
- User provides their OWN audio file (voice recording)
- Want to sync an image to specific audio/voice
- User recorded the script themselves
- Need exact audio timing preserved

❌ DON'T USE WHEN:
- User provides a text script (not audio) → use veed-ugc
- Need AI to generate the voice → use veed-ugc
- Don't have an audio file yet → use veed-ugc with a script

INPUT: Image + audio file (user's recording)
OUTPUT: MP4 video with lip-sync to the provided audio

KEY DIFFERENCE: veed-ugc = script → AI voice → video; ugc-manual = user audio → video (no voice generation)

pauldelavallaz 1217

ad-ready

Generate advertising images automatically from a product URL plus a brand profile.

✅ USE WHEN:
- User provides a product URL (e-commerce link)
- Want automated product scraping + image generation
- Have a brand profile to apply (70+ brands available)
- Need funnel-stage targeting (awareness/consideration/conversion)
- Want AI to auto-select model, scene, and lighting based on the brand

❌ DON'T USE WHEN:
- User provides a local product image file → use morpheus-fashion-design
- Don't need a person in the image → use nano-banana-pro
- Want manual control over model, scene, packs → use morpheus-fashion-design
- Already have the hero image, need variations → use multishot-ugc
- Need video output → use veed-ugc after image generation

INPUT: Product URL + brand name (optional) + funnel stage (optional)
OUTPUT: PNG advertising image with product + model

pauldelavallaz 1217

sora

Generate videos from text prompts or reference images using OpenAI Sora.

✅ USE WHEN:
- Need AI-generated video from a text description
- Want image-to-video (animate a still image)
- Creating cinematic/artistic video content
- Need motion/animation without lip-sync

❌ DON'T USE WHEN:
- Need lip-sync (person speaking) → use veed-ugc or ugc-manual
- Just need image generation → use nano-banana-pro or morpheus
- Editing existing videos → use Remotion
- Need a UGC-style talking head → use veed-ugc

INPUT: Text prompt + optional reference image
OUTPUT: MP4 video (various resolutions/durations)

pauldelavallaz 1217