Official Verified media Safety 4/5

acestep

Use ACE-Step API to generate music, edit songs, and remix music. Supports text-to-music, lyrics generation, audio continuation, and audio repainting. Use this skill when users mention generating music, creating songs, music production, remix, or audio continuation.

Why use this skill?

Generate professional music, edit songs, and create vocal tracks using the ACE-Step API. Features text-to-music, lyrics generation, and remixing tools.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/dumoedss/acestep

Download Source Code (.zip)

What This Skill Does

The ACE-Step Music Generation skill serves as your comprehensive production assistant within the OpenClaw environment. Powered by the ACE-Step V1.5 API, this tool enables the synthesis of professional-grade audio, including text-to-music generation, lyrical composition, audio continuation, and complex audio repainting. By abstracting the API layer behind the scripts/acestep.sh command, the skill ensures that all musical assets—from lo-fi beats to complex electronic pop—are generated with high fidelity while maintaining a standardized output structure in the acestep_output directory.

Installation

To integrate this skill into your OpenClaw agent, execute the following command in your terminal: clawhub install openclaw/skills/skills/dumoedss/acestep

Once installed, verify the connection to the underlying model service by navigating to the skill directory and running ./scripts/acestep.sh health to ensure the environment variables and API endpoints are correctly configured.

Use Cases

This skill is ideal for creators, developers, and producers who need to generate original background music, sound effects, or full-length vocal tracks. It is specifically designed to handle:

Music Production: Generating custom tracks with specific BPM, key, and instrumentation.
Audio Continuation: Expanding existing audio clips or loops into longer compositions.
Remixing & Repainting: Modifying existing audio samples by applying new stylistic parameters.
Vocals: Integrating structured lyrics with melodic generation for high-quality singing synthesis.

Example Prompts

"Generate a high-energy electronic pop song at 128 BPM with a female vocal track that has an uplifting tone. Here are the lyrics: [Verse 1] The sun rises over the city... [Chorus] We are reaching for the stars..."
"I need a 60-second jazz instrumental track that features a prominent saxophone solo for my video project."
"Create a cheerful, simple song about spring using random parameters to get some creative inspiration."

Tips & Limitations

To maximize the quality of your output, always prefer Caption mode over Random mode. When working with vocals, the best results are achieved by using the acestep-songwriting skill to structure your lyrics before passing them to the generator. You must pass the complete, unabridged lyric block to the -l parameter; truncation will result in fragmented audio. Note that for advanced productions, such as music videos with synced lyrics, you will need to chain this skill with acestep-lyrics-transcription and acestep-simplemv. Ensure you manage your storage space in the project root, as generated audio files can occupy significant disk space over time.

Read Full Documentation on GitHub

Metadata

Author@dumoedss

Stars2387

Updated2026-03-09

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-dumoedss-acestep": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Tags(AI)

#music#audio#generation#synthesis#creative

Safety Score: 4/5

Flags: file-write, file-read, external-api, code-execution

Related Skills

acestep-songwriting

Music songwriting guide for ACE-Step. Provides professional knowledge on writing captions, lyrics, choosing BPM/key/duration, and structuring songs. Use this skill when users want to create, write, or plan a song before generating it with ACE-Step.

dumoedss 2387

acestep-lyrics-transcription

Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API. Outputs LRC, SRT, or JSON with word-level timestamps. Use when users want to transcribe songs, generate LRC files, or extract lyrics with timestamps from audio.

dumoedss 2387

acestep-simplemv

Render music videos from audio files and lyrics using Remotion. Accepts audio + LRC/JSON lyrics + title to produce MP4 videos with waveform visualization and synced lyrics display. Use when users mention MV generation, music video rendering, creating video from audio/lyrics, or visualizing songs.

dumoedss 2287