What This Skill Does

MDClaw 多模态 is a robust OpenClaw AI agent skill designed to unify access to cutting-edge generative AI services through a single gateway. It empowers users and developers to effortlessly integrate high-quality multi-modal capabilities into their workflows, including Text-to-Speech (TTS), Text-to-Image generation, and advanced asynchronous video generation (Text-to-Video and Image-to-Video). Beyond content creation, the skill acts as a versatile personal assistant by offering utility functions such as real-time weather querying, web content summarization, and AI-powered global web searches. It simplifies the complexity of interacting with distributed AI services by providing a standardized API structure and robust client-side wrappers, allowing you to focus on the outcome rather than the underlying infrastructure.

Installation

To integrate MDClaw into your OpenClaw environment, execute the following command in your terminal: clawhub install openclaw/skills/skills/cnskycn/mdclaw-openclaw Ensure your system meets the minimum requirements, specifically having requests>=2.31.0 installed in your Python environment. Once installed, configure your authentication by either registering an account through the MDClawClient or setting your environment variable via export MDCLAW_API_KEY="your_api_key_here".

Use Cases

This skill is highly versatile and serves several professional and creative domains:

Marketing & Content Creation: Generate high-quality visuals for social media posts or turn written blog scripts into engaging promotional videos.
Accessibility & UX: Utilize the TTS engine to provide auditory feedback in voice-enabled applications or to create dynamic voiceovers for presentations.
Research & Analysis: Use the web summary and AI search features to distill complex web articles into concise insights, or use weather queries for logistics and event planning.
Prototyping: Quickly iterate on creative concepts by transforming static images into dynamic, motion-rich visual content.

Example Prompts

"Generate a realistic image of a serene mountain landscape at sunset with a 16:9 aspect ratio."
"Convert this text to speech: Welcome to the future of AI automation, where efficiency meets creativity."
"Summarize the key takeaways from this URL: https://example-tech-article.com and search for the latest news on recent AI breakthroughs."

Tips & Limitations

Video Handling: Remember that video generation is an asynchronous process. Always use the video_status function with the provided task_id to poll for completion. Avoid passing the resolution parameter to ensure the server returns the necessary task_id.
Data Integrity: When uploading images for video conversion, ensure your local files are accessible to the upload_image function to avoid runtime failures.
Error Management: Always implement check logic after each API call (using the success boolean in the result object) to gracefully handle potential network timeouts or quota limitations. Keep your API key secure and avoid hardcoding it in public repositories.

MDClaw 多模态

Why use this skill?

Install via CLI (Recommended)

What This Skill Does

Installation

Use Cases

Example Prompts

Tips & Limitations

Metadata

Tags(AI)