alicloud-ai-audio-asr
Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when converting recorded audio files to text, generating transcripts with timestamps, or documenting DashScope/OpenAI-compatible ASR request and response fields.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/cinience/alicloud-ai-audio-asr
What This Skill Does
The alicloud-ai-audio-asr skill connects the OpenClaw AI agent to Alibaba Cloud Model Studio's Qwen ASR (Automatic Speech Recognition) services and converts recorded speech into punctuated text. It supports synchronous transcription for short audio clips and asynchronous workflows for long-form recordings such as interviews, lectures, or lengthy meetings. Short clips go through `qwen3-asr-flash`; long files go through `qwen3-asr-flash-filetrans`.
Installation
Install the skill with your agent manager:
clawhub install openclaw/skills/skills/cinience/alicloud-ai-audio-asr
Set `DASHSCOPE_API_KEY` in your environment or add it to your `~/.alibabacloud/credentials` file. The skill uses standard Python libraries, so it needs no dependencies beyond the core OpenClaw environment. To verify the setup, create the output directory and compile-check the transcription script (this confirms that the script parses; it does not call the API):
mkdir -p output/alicloud-ai-audio-asr && python -m py_compile skills/ai/audio/alicloud-ai-audio-asr/scripts/transcribe_audio.py
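The key lookup described above (environment variable first, then the credentials file) could be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: the helper name `resolve_api_key`, the INI layout of the credentials file, and the `[default]` section with a `dashscope_api_key` entry are all assumptions made for the example.

```python
import configparser
import os
from pathlib import Path


def resolve_api_key(env=None, credentials_path=None):
    """Return the DashScope API key, or None if none is configured.

    Hypothetical helper: checks DASHSCOPE_API_KEY first, then falls back
    to a credentials file.
    """
    env = os.environ if env is None else env
    key = env.get("DASHSCOPE_API_KEY", "").strip()
    if key:
        return key

    path = (
        Path(credentials_path)
        if credentials_path is not None
        else Path.home() / ".alibabacloud" / "credentials"
    )
    if not path.is_file():
        return None

    parser = configparser.ConfigParser()
    parser.read(path, encoding="utf-8")
    # Assumption: the file is INI-formatted and stores the key as
    # `dashscope_api_key` under a `[default]` section.
    key = parser.get("default", "dashscope_api_key", fallback="").strip()
    return key or None


if __name__ == "__main__":
    # Report only whether a key was found; never print the key itself.
    print("API key configured:", resolve_api_key() is not None)
```

Passing `env` and `credentials_path` explicitly keeps the lookup easy to test without touching the real environment or home directory.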
Use Cases
- Meeting Intelligence: Automatically transcribe hour-long meeting recordings to generate searchable text logs for your team.
- Content Creation: Convert voice memos or podcast raw files into draft articles or blog posts.
- Accessibility: Generate transcripts for audio files to assist hearing-impaired users.
- Data Analysis: Extract keywords and topics from customer service audio records to identify support trends.
- Historical Archiving: Process large libraries of recorded interviews into standardized text formats.
Example Prompts
- "Transcribe the interview audio located at ./audios/interview_01.mp3 and summarize the key technical requirements discussed."
- "Please process the meeting recording https://example.com/daily_scrum.wav using the flash transcription model and provide a transcript with sentence-level timestamps."
- "Transcribe the attached voice note using the async long-file worker and save the raw API JSON response to the output directory."
Tips & Limitations
- Choosing the Right Model: Use `qwen3-asr-flash` for fast responses on short snippets. For anything exceeding several minutes, always opt for `qwen3-asr-flash-filetrans` to handle the asynchronous job queue effectively.
- Language Support: While the model is highly capable in multiple languages, always provide the `language_hints` parameter when you know the input language to ensure maximum accuracy.
- Environment Security: Keep your `DASHSCOPE_API_KEY` private and never hardcode it into scripts shared in public repositories.
- Output Management: Ensure your storage path is correctly configured in `output/alicloud-ai-audio-asr/` to keep your workspace organized and prevent overwriting critical transcripts.
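The model-choice and output-management tips above can be illustrated with a small sketch. It is only a sketch under stated assumptions: the 180-second cutoff is an arbitrary reading of "several minutes", and the helper names `choose_model` and `transcript_path` are hypothetical, not part of the skill.

```python
from pathlib import Path

# Assumption: "several minutes" is read here as 3 minutes; tune as needed.
SHORT_CLIP_MAX_SECONDS = 180


def choose_model(duration_seconds):
    """Pick the synchronous model for short clips, the async one otherwise."""
    if duration_seconds <= SHORT_CLIP_MAX_SECONDS:
        return "qwen3-asr-flash"
    return "qwen3-asr-flash-filetrans"


def transcript_path(audio_name, output_dir="output/alicloud-ai-audio-asr"):
    """Return a JSON path in output_dir that does not already exist.

    A numeric suffix is appended rather than overwriting an earlier
    transcript for the same audio file.
    """
    base = Path(output_dir)
    stem = Path(audio_name).stem
    candidate = base / f"{stem}.json"
    counter = 1
    while candidate.exists():
        candidate = base / f"{stem}-{counter}.json"
        counter += 1
    return candidate


if __name__ == "__main__":
    print(choose_model(45))        # short voice memo
    print(choose_model(60 * 60))   # hour-long meeting
    print(transcript_path("interview_01.mp3"))
```

Checking for an existing file before writing is one simple way to avoid overwriting transcripts; timestamped file names would work equally well.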
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-cinience-alicloud-ai-audio-asr": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags: AI
Flags: network-access, file-write, file-read, external-api
Related Skills
volcengine-compute-ecs
Manage Volcengine ECS instances and related resources. Use when users need instance inventory, lifecycle operations, troubleshooting, or automation templates for ECS.
alicloud-ai-search-opensearch
Use OpenSearch vector search edition via the Python SDK (ha3engine) to push documents and run HA/SQL searches. Ideal for RAG and vector retrieval pipelines in Claude Code/Codex.
alicloud-storage-oss-ossutil
Alibaba Cloud OSS CLI (ossutil 2.0) skill. Install, configure, and operate OSS from the command line based on the official ossutil overview.
alicloud-platform-openapi-product-api-discovery
Discover and reconcile Alibaba Cloud product catalogs from Ticket System, Support & Service, and BSS OpenAPI; fetch OpenAPI product/version/API metadata; and summarize API coverage to plan new skills. Use when you need a complete product list, product-to-API mapping, or coverage/gap reports for skill generation.
alicloud-ai-image-qwen-image
Generate images with Model Studio DashScope SDK using Qwen Image generation models (qwen-image, qwen-image-plus, qwen-image-max and snapshots). Use when implementing or documenting image.generate requests/responses, mapping prompt/negative_prompt/size/seed/reference_image, or integrating image generation into the video-agent pipeline.