ClawKit Logo
ClawKitReliability Toolkit
Back to Registry
Official Verified

doubao-image-video

豆包图片与视频生成原生技能。适用于用户提到豆包、文生图、图生图、文生视频、图生视频、查询视频生成任务、等待任务完成或下载最终视频时,直接调用火山引擎 Ark 接口,不依赖外部 MCP 服务。

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/156554395/doubao-image-video
Or

Doubao Native Media Skill

This is a native OpenClaw skill. Do not spin up the upstream MCP server unless the user explicitly asks for MCP compatibility.

Use this skill for

  • Doubao / 豆包 text-to-image
  • image-to-image or multi-reference image generation
  • Doubao text-to-video or image-to-video
  • querying an async Doubao video task by task_id
  • troubleshooting Volcengine Ark endpoint/model issues

Commands

Generate an image

python3 {baseDir}/scripts/doubao_media.py image \
  --prompt "A cinematic cyberpunk alley in rain" \
  --size 2560x1440

Generate a video

python3 {baseDir}/scripts/doubao_media.py video \
  --prompt "A panda astronaut waves on the moon" \
  --video-duration 5 \
  --fps 24 \
  --resolution 1080p

Query a video task

python3 {baseDir}/scripts/doubao_media.py task --task-id your-task-id

Wait for a video task and optionally download the result

python3 {baseDir}/scripts/doubao_media.py wait \
  --task-id your-task-id \
  --timeout 600 \
  --interval 5 \
  --download-to ./doubao-result.mp4

Input rules

  • Always prefer --endpoint-id when the user has a provisioned Volcengine Ark endpoint.
  • Fall back to model names only when endpoint ids are unavailable.
  • For video generation, this skill mirrors the upstream behavior and appends --dur, --fps, --rs, and --ratio to the prompt when they are not already present.
  • If the user supplies image URLs, pass them through exactly; do not download or re-host unless asked.

Troubleshooting

  • If neither --endpoint-id nor a default endpoint env var exists, the script falls back to the default model env var.
  • If the API returns InvalidEndpointOrModel.NotFound, ask the user to verify the Volcengine Ark endpoint authorization first.
  • Video generation is async. If generation succeeds, capture task_id and query it later with the task subcommand, or use wait for automatic polling.

References

  • Read references/api-notes.md when you need request shapes, defaults, or caveats.

Metadata

Author@156554395
Stars4473
Views1
Updated2026-05-01
View Author Profile
AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill
Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-156554395-doubao-image-video": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

amoeba-management-analysis

阿米巴经营分析技能。基于稻盛和夫阿米巴经营理念,提供单位时间核算、经营会计报表分析、阿米巴组织划分评估、业绩改善诊断等能力。 当用户需要做阿米巴经营分析、单位时间核算、经营会计、阿米巴组织划分、利润中心分析、内部交易定价、业绩评价时触发。 触发词:阿米巴、阿米巴经营、单位时间核算、经营会计、利润中心、内部交易、阿米巴划分、巴长、稻盛和夫、京瓷会计学

156554395 4473

tx-cos

面向 OpenClaw 的腾讯云 COS 原生操作技能。适用于上传本地文件、批量同步目录、大文件分片上传、生成临时签名链接、浏览对象与文件夹视图、复制/移动/重命名/删除对象、统计目录体量,以及排查 COS 凭证和 Bucket 配置。

156554395 4473

zhipu-free-image-video

智谱免费图片与视频生成技能。适用于用户想用智谱生成图片、批量出图、生成短视频、查询视频任务结果、等待视频完成,或优先使用免费/低成本模型快速产出创意内容时。

156554395 4473

bigmodel-image-video

使用 BigModel (CogView/CogVideoX) API 生成高质量图片和视频。当用户需要"生成图片"、"制作视频"、"AI 绘画"、"创建封面"、"设计海报"、"视觉内容生成"、或任何需要创建图像/视频内容的场景时使用此技能。即使没有明确提到"生成",只要用户需要创建、设计或制作视觉内容(如小说封面、产品图片、宣传图、短视频等),都应该主动使用此技能。

156554395 4473

novel-writer

小说写作助手。触发词:写小说/创作小说/生成小说/帮我写个小说/小说大纲/网文创作。工作流:需求收集(类型/风格/字数)→ AI生成大纲(世界观/人物/剧情/章节规划)→ 用户确认 → 自动逐章生成 → 一章一个文件保存。支持玄幻、仙侠、都市、科幻、历史、武侠、网游、灵异、军事、竞技等主流网文类型,面向新手作者。

156554395 4473