doubao-image-video

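Native skill for Doubao image and video generation. Use it when the user mentions Doubao, text-to-image, image-to-image, text-to-video, image-to-video, querying a video generation task, waiting for a task to complete, or downloading the final video. It calls the Volcengine Ark API directly and does not depend on an external MCP service.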

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/156554395/doubao-image-video

Doubao Native Media Skill

This is a native OpenClaw skill. Do not spin up the upstream MCP server unless the user explicitly asks for MCP compatibility.

Use this skill for

Doubao / 豆包 text-to-image
image-to-image or multi-reference image generation
Doubao text-to-video or image-to-video
querying an async Doubao video task by task_id
troubleshooting Volcengine Ark endpoint/model issues

Commands

Generate an image

python3 {baseDir}/scripts/doubao_media.py image \
  --prompt "A cinematic cyberpunk alley in rain" \
  --size 2560x1440

Generate a video

python3 {baseDir}/scripts/doubao_media.py video \
  --prompt "A panda astronaut waves on the moon" \
  --video-duration 5 \
  --fps 24 \
  --resolution 1080p

Query a video task

python3 {baseDir}/scripts/doubao_media.py task --task-id your-task-id

Wait for a video task and optionally download the result

python3 {baseDir}/scripts/doubao_media.py wait \
  --task-id your-task-id \
  --timeout 600 \
  --interval 5 \
  --download-to ./doubao-result.mp4
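The polling behind a wait-style command can be sketched roughly as follows. This is a minimal illustration, not the script's actual implementation: the function name wait_for_task, the fetch_status callback, and the status strings in TERMINAL_STATES are all assumptions made for the example; only the timeout and interval parameters mirror the flags shown above.

```python
import time
from typing import Callable, Dict

# Hypothetical terminal states; the real API may use different status names.
TERMINAL_STATES = {"succeeded", "failed", "cancelled"}


def wait_for_task(
    fetch_status: Callable[[], Dict],
    timeout: float = 600.0,
    interval: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict:
    """Poll fetch_status() until the task reaches a terminal state.

    fetch_status is a caller-supplied function (assumed here) that returns
    the latest task record as a dict with a "status" key.
    """
    deadline = clock() + timeout
    while True:
        task = fetch_status()
        if task.get("status") in TERMINAL_STATES:
            return task
        # Give up if the next poll would land past the deadline.
        if clock() + interval > deadline:
            raise TimeoutError(
                f"task still {task.get('status')!r} after {timeout}s"
            )
        sleep(interval)
```

Passing sleep and clock as parameters keeps the loop testable without real delays; a caller would normally leave both at their defaults.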

Input rules

Always prefer --endpoint-id when the user has a provisioned Volcengine Ark endpoint.
Fall back to model names only when endpoint ids are unavailable.
For video generation, this skill mirrors the upstream behavior and appends --dur, --fps, --rs, and --ratio to the prompt when they are not already present.
If the user supplies image URLs, pass them through exactly; do not download or re-host unless asked.
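The prompt-suffix rule for video generation can be illustrated with a short sketch. The flag names --dur, --fps, --rs, and --ratio come from the rule above; the function name append_video_flags, its parameters, and the space-separated "flag value" format are assumptions for illustration and may differ from what the script actually emits.

```python
from typing import Optional, Union


def append_video_flags(
    prompt: str,
    duration: Optional[Union[int, str]] = None,
    fps: Optional[Union[int, str]] = None,
    resolution: Optional[str] = None,
    ratio: Optional[str] = None,
) -> str:
    """Append video flags to the prompt unless the prompt already has them."""
    flags = [
        ("--dur", duration),
        ("--fps", fps),
        ("--rs", resolution),
        ("--ratio", ratio),
    ]
    existing = set(prompt.split())  # whole-token match, so "--rs" != "--rsx"
    parts = [prompt.strip()]
    for flag, value in flags:
        if value is None:
            continue  # nothing requested for this flag
        if flag in existing:
            continue  # the user's own value wins
        parts.append(f"{flag} {value}")
    return " ".join(parts)


print(append_video_flags("A panda astronaut waves on the moon",
                         duration=5, fps=24, resolution="1080p"))
```

In this sketch a flag already typed into the prompt always takes precedence over the CLI value, which is one reasonable reading of "when they are not already present".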

Troubleshooting

If neither --endpoint-id nor a default endpoint env var is set, the script falls back to the default model env var.
If the API returns InvalidEndpointOrModel.NotFound, ask the user to verify the Volcengine Ark endpoint authorization first.
Video generation is async. When the submission succeeds, capture the returned task_id and query it later with the task subcommand, or use wait for automatic polling.
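The endpoint-versus-model fallback order described above might look like the sketch below. The environment variable names DOUBAO_ENDPOINT_ID and DOUBAO_MODEL and the function resolve_target are hypothetical placeholders; references/api-notes.md is the place to check the names the script really reads.

```python
import os
from typing import Mapping, Optional, Tuple

# Hypothetical variable names, used only for this illustration.
ENDPOINT_ENV = "DOUBAO_ENDPOINT_ID"
MODEL_ENV = "DOUBAO_MODEL"


def resolve_target(
    endpoint_id: Optional[str],
    env: Mapping[str, str] = os.environ,
) -> Tuple[str, str]:
    """Pick what to send as the request target, in priority order:

    1. an explicit --endpoint-id value,
    2. a default endpoint from the environment,
    3. a default model name from the environment.
    """
    if endpoint_id:
        return ("endpoint", endpoint_id)
    if env.get(ENDPOINT_ENV):
        return ("endpoint", env[ENDPOINT_ENV])
    if env.get(MODEL_ENV):
        return ("model", env[MODEL_ENV])
    raise ValueError(
        f"no endpoint id given and neither {ENDPOINT_ENV} nor {MODEL_ENV} is set"
    )
```

Returning the kind alongside the value makes it easy to tell, when an InvalidEndpointOrModel.NotFound error comes back, whether an endpoint id or a model name was actually sent.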

References

Read references/api-notes.md when you need request shapes, defaults, or caveats.

Metadata

Author: @156554395

Stars: 4473

Updated: 2026-05-01

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-156554395-doubao-image-video": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety Note: ClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

amoeba-management-analysis

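Amoeba management analysis skill. Based on Kazuo Inamori's Amoeba Management philosophy, it provides per-unit-time accounting, analysis of management accounting reports, assessment of how the organization is divided into amoeba units, and diagnosis for performance improvement. Triggered when the user needs Amoeba management analysis, per-unit-time accounting, management accounting, amoeba unit division, profit-center analysis, internal transaction pricing, or performance evaluation. Trigger words (matched in Chinese): 阿米巴、阿米巴经营、单位时间核算、经营会计、利润中心、内部交易、阿米巴划分、巴长、稻盛和夫、京瓷会计学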

156554395 4473

tx-cos

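Native Tencent Cloud COS operations skill for OpenClaw. Use it to upload local files, sync directories in bulk, upload large files in multipart chunks, generate temporary signed URLs, browse objects and folder views, copy/move/rename/delete objects, measure directory sizes, and troubleshoot COS credentials and bucket configuration.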

156554395 4473

zhipu-free-image-video

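Zhipu free image and video generation skill. Use it when the user wants to generate images with Zhipu, produce images in batches, generate short videos, query video task results, wait for a video to finish, or prefer free or low-cost models for producing creative content quickly.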

156554395 4473

bigmodel-image-video

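Generates high-quality images and videos with the BigModel (CogView/CogVideoX) API. Use this skill when the user asks to "generate an image", "make a video", "AI painting", "create a cover", "design a poster", "generate visual content", or in any scenario that requires creating image or video content. Even when "generate" is not mentioned explicitly, use this skill proactively whenever the user needs to create, design, or produce visual content (such as novel covers, product images, promotional graphics, or short videos).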

156554395 4473

novel-writer

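Novel-writing assistant. Trigger words (matched in Chinese): 写小说/创作小说/生成小说/帮我写个小说/小说大纲/网文创作. Workflow: gather requirements (genre/style/word count) → AI generates an outline (world-building/characters/plot/chapter plan) → user confirms → chapters are generated automatically one by one → each chapter is saved to its own file. Supports mainstream web-fiction genres including xuanhuan fantasy, xianxia, urban, science fiction, historical, wuxia, online-game, supernatural, military, and competitive sports, and is aimed at novice authors.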

156554395 4473