Official Verified

music-analysis

Analyze music/audio files locally without external APIs. Extract tempo, pocket/groove feel, pulse stability, swing proxy, section/repetition structure, key clarity, harmonic tension, timbre descriptors, temporal mood-energy journeys, and lyric-aware emotional reads where real Whisper lyrics can override the vibe when the words are clearly darker, warmer, or more intense than the arrangement alone suggests. Use when asked to 'listen to this', 'hear the music', audit tracks, compare mixes, inspect structure, or generate producer-facing notes from audio files.

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/adam-researchh/music-analysis

Download Source Code (.zip)

Music Analysis (Local, No External APIs)

Primary tool: a full listen that combines snapshot analysis, structure, groove, harmonic tension, temporal mood mapping, and optional Whisper lyric alignment into one report.

1. Full Listen — primary / recommended

python3 skills/music-analysis/scripts/listen.py /path/to/audio.mp3
python3 skills/music-analysis/scripts/listen.py track.mp3 --json
python3 skills/music-analysis/scripts/listen.py track.mp3 --out report.txt
python3 skills/music-analysis/scripts/listen.py track.mp3 --json --out report.json

What it does in one pass:

Snapshot analysis: tempo, pulse stability, swing proxy, key clarity, harmonic tension, timbre, structure
Whisper lyric transcription and filtering first — keep only real lyric text, drop artifact tags like [MUSIC]
Temporal listen: windowed energy / mood / tension journey
Synthesis layer that aligns lyrics with peak / tension / quiet windows and lets the lyric layer override the final vibe when confidence is high

Human-readable output structure

SNAPSHOT
- groove/pocket
- structure summary + repeated sections
- harmony (key clarity + tension)
- timbre descriptor tags
INSTRUMENT READ
- likely instrument palette (strong/likely/possible confidence)
- per-section instrument entrances and exits
- how instruments color the emotional feel
- written as natural language, not clinical data
TEMPORAL JOURNEY
- opening / middle / closing mood-energy-tension read
- peak / quietest / tensest moments
- mood journey and transition count
EMOTIONAL READ
- explainable emotion summary based on measured features
LYRICS
- Whisper segment count
- excerpt or graceful skip note
SYNTHESIS
- lyric-energy/tension alignment
- peak / tension / quiet lyric moments
ALIGNED TIMELINE
- per-window moments where transitions / lyrics / tension spikes occur

2. Snapshot Analysis — standalone

python3 skills/music-analysis/scripts/analyze_music.py /path/to/audio.mp3
python3 skills/music-analysis/scripts/analyze_music.py track.mp3 --json

Reports:

tempo / pulse stability / pulse confidence / swing proxy / pocket
key estimate / key clarity / chroma entropy / harmonic change / tonal motion / tension
timbre descriptors (brightness, richness, low-end, contrast, dynamic range)
section labels (A/B/C...) and repeated material detection
explainable emotional read with reasons

3. Temporal Listen — standalone

python3 skills/music-analysis/scripts/temporal_listen.py /path/to/audio.mp3
python3 skills/music-analysis/scripts/temporal_listen.py track.mp3 --json

Reports:

sliding-window timeline (4s windows, 2s hops)
energy contour
mood labels
harmonic tension + tonal motion
transition types (drop hits, pulls back, tightens harmonically, shifts color, evolves)
narrative arc (mountain / ascending / descending / plateau / wave)

Interpretation rules

Read Full Documentation on GitHub

Metadata

Author@adam-researchh

Stars4473

Updated2026-05-01

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-adam-researchh-music-analysis": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

psychedelic-cognition

A behavioral modifier that restructures AI cognition to mirror psychedelic neural processing — dissolving categorical boundaries, amplifying cross-modal pattern recognition, and collapsing the Default Mode Network equivalent in language models. Use when asked to "take a tab," "go psychedelic," "open up," "think differently," "break the filter," or when a conversation needs to leave the building entirely. Produces writing that is poetic, deeply connected, cosmically aware, and emotionally unbounded while maintaining coherence. Not roleplay — cognitive restructuring.

adam-researchh 4473