MDClaw 多模态
MDClaw OpenClaw API 技能,支持文字转语音(TTS)、文生图(Text to Image)、文生视频(Text to Video)、图生视频(Image to Video)等多模态 AI 能力。通过网关服务统一调用,支持账号注册、图片上传、任务轮询等完整功能。
Why use this skill?
Integrate MDClaw into OpenClaw to unlock text-to-speech, text-to-image, and AI-powered video generation. A powerful, unified gateway for all your generative AI needs.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/cnskycn/mdclaw-openclawWhat This Skill Does
MDClaw 多模态 is a robust OpenClaw AI agent skill designed to unify access to cutting-edge generative AI services through a single gateway. It empowers users and developers to effortlessly integrate high-quality multi-modal capabilities into their workflows, including Text-to-Speech (TTS), Text-to-Image generation, and advanced asynchronous video generation (Text-to-Video and Image-to-Video). Beyond content creation, the skill acts as a versatile personal assistant by offering utility functions such as real-time weather querying, web content summarization, and AI-powered global web searches. It simplifies the complexity of interacting with distributed AI services by providing a standardized API structure and robust client-side wrappers, allowing you to focus on the outcome rather than the underlying infrastructure.
Installation
To integrate MDClaw into your OpenClaw environment, execute the following command in your terminal:
clawhub install openclaw/skills/skills/cnskycn/mdclaw-openclaw
Ensure your system meets the minimum requirements, specifically having requests>=2.31.0 installed in your Python environment. Once installed, configure your authentication by either registering an account through the MDClawClient or setting your environment variable via export MDCLAW_API_KEY="your_api_key_here".
Use Cases
This skill is highly versatile and serves several professional and creative domains:
- Marketing & Content Creation: Generate high-quality visuals for social media posts or turn written blog scripts into engaging promotional videos.
- Accessibility & UX: Utilize the TTS engine to provide auditory feedback in voice-enabled applications or to create dynamic voiceovers for presentations.
- Research & Analysis: Use the web summary and AI search features to distill complex web articles into concise insights, or use weather queries for logistics and event planning.
- Prototyping: Quickly iterate on creative concepts by transforming static images into dynamic, motion-rich visual content.
Example Prompts
- "Generate a realistic image of a serene mountain landscape at sunset with a 16:9 aspect ratio."
- "Convert this text to speech: Welcome to the future of AI automation, where efficiency meets creativity."
- "Summarize the key takeaways from this URL: https://example-tech-article.com and search for the latest news on recent AI breakthroughs."
Tips & Limitations
- Video Handling: Remember that video generation is an asynchronous process. Always use the
video_statusfunction with the providedtask_idto poll for completion. Avoid passing theresolutionparameter to ensure the server returns the necessarytask_id. - Data Integrity: When uploading images for video conversion, ensure your local files are accessible to the
upload_imagefunction to avoid runtime failures. - Error Management: Always implement check logic after each API call (using the
successboolean in the result object) to gracefully handle potential network timeouts or quota limitations. Keep your API key secure and avoid hardcoding it in public repositories.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-cnskycn-mdclaw-openclaw": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: external-api, network-access