acestep
Use ACE-Step API to generate music, edit songs, and remix music. Supports text-to-music, lyrics generation, audio continuation, and audio repainting. Use this skill when users mention generating music, creating songs, music production, remix, or audio continuation.
Why use this skill?
Generate professional music, edit songs, and create vocal tracks using the ACE-Step API. Features text-to-music, lyrics generation, and remixing tools.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/dumoedss/acestepWhat This Skill Does
The ACE-Step Music Generation skill serves as your comprehensive production assistant within the OpenClaw environment. Powered by the ACE-Step V1.5 API, this tool enables the synthesis of professional-grade audio, including text-to-music generation, lyrical composition, audio continuation, and complex audio repainting. By abstracting the API layer behind the scripts/acestep.sh command, the skill ensures that all musical assets—from lo-fi beats to complex electronic pop—are generated with high fidelity while maintaining a standardized output structure in the acestep_output directory.
Installation
To integrate this skill into your OpenClaw agent, execute the following command in your terminal:
clawhub install openclaw/skills/skills/dumoedss/acestep
Once installed, verify the connection to the underlying model service by navigating to the skill directory and running ./scripts/acestep.sh health to ensure the environment variables and API endpoints are correctly configured.
Use Cases
This skill is ideal for creators, developers, and producers who need to generate original background music, sound effects, or full-length vocal tracks. It is specifically designed to handle:
- Music Production: Generating custom tracks with specific BPM, key, and instrumentation.
- Audio Continuation: Expanding existing audio clips or loops into longer compositions.
- Remixing & Repainting: Modifying existing audio samples by applying new stylistic parameters.
- Vocals: Integrating structured lyrics with melodic generation for high-quality singing synthesis.
Example Prompts
- "Generate a high-energy electronic pop song at 128 BPM with a female vocal track that has an uplifting tone. Here are the lyrics: [Verse 1] The sun rises over the city... [Chorus] We are reaching for the stars..."
- "I need a 60-second jazz instrumental track that features a prominent saxophone solo for my video project."
- "Create a cheerful, simple song about spring using random parameters to get some creative inspiration."
Tips & Limitations
To maximize the quality of your output, always prefer Caption mode over Random mode. When working with vocals, the best results are achieved by using the acestep-songwriting skill to structure your lyrics before passing them to the generator. You must pass the complete, unabridged lyric block to the -l parameter; truncation will result in fragmented audio. Note that for advanced productions, such as music videos with synced lyrics, you will need to chain this skill with acestep-lyrics-transcription and acestep-simplemv. Ensure you manage your storage space in the project root, as generated audio files can occupy significant disk space over time.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-dumoedss-acestep": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read, external-api, code-execution
Related Skills
acestep-songwriting
Music songwriting guide for ACE-Step. Provides professional knowledge on writing captions, lyrics, choosing BPM/key/duration, and structuring songs. Use this skill when users want to create, write, or plan a song before generating it with ACE-Step.
acestep-lyrics-transcription
Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API. Outputs LRC, SRT, or JSON with word-level timestamps. Use when users want to transcribe songs, generate LRC files, or extract lyrics with timestamps from audio.
acestep-simplemv
Render music videos from audio files and lyrics using Remotion. Accepts audio + LRC/JSON lyrics + title to produce MP4 videos with waveform visualization and synced lyrics display. Use when users mention MV generation, music video rendering, creating video from audio/lyrics, or visualizing songs.