songsee
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Why use this skill?
Use the songsee skill to generate detailed spectrograms, chroma, and tempogram visualizations from your audio files. Perfect for music analysis.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/steipete/songseeWhat This Skill Does
The songsee skill provides a powerful command-line interface for audio analysis, enabling users to transform raw audio files into detailed visual representations. At its core, songsee leverages advanced digital signal processing to generate spectrograms and a wide variety of feature panels. Whether you are a music producer looking to visualize harmonic content, a researcher analyzing sound profiles, or a developer needing to inspect audio data, this tool offers precise control over visualization outputs. The skill handles common formats like WAV and MP3 natively, while also utilizing ffmpeg for extended compatibility, ensuring that almost any audio file can be processed.
Installation
To integrate this tool into your OpenClaw environment, execute the following command in your terminal:
clawhub install openclaw/skills/skills/steipete/songsee
Ensure that you have ffmpeg installed on your system if you intend to work with non-standard audio formats, as the tool relies on it for robust decoding beyond basic native support.
Use Cases
- Audio Archiving: Create high-quality visual thumbnails for large audio libraries to quickly identify tracks.
- Music Production Analysis: Use chroma and tempogram visualizations to understand the harmonic and rhythmic structure of a track.
- Signal Inspection: Isolate specific time intervals using the
--startand--durationflags to examine transient responses or noise floors. - Automated Reporting: Pipe audio streams directly into songsee to generate automated performance reports during batch processing.
Example Prompts
- "Analyze my file 'recording.wav' and generate a grid containing the spectrogram, chroma, and mfcc visualizations."
- "Extract a 10-second segment from 'drums.mp3' starting at the 30-second mark and save it as a high-quality png file using the magma color palette."
- "Generate a multi-panel visual for 'song.mp3' featuring the loudness, flux, and tempogram to help me identify the peak energy sections."
Tips & Limitations
- Efficiency: When rendering large grids with many visualizations, ensure sufficient system memory, as plotting high-resolution graphs can be resource-intensive.
- FFmpeg: Always check that your
ffmpegpath is correctly configured in your environment variables to avoid decoding errors for proprietary formats. - Customization: Use the
--styleflag to adjust visual themes; 'magma' or 'inferno' are often preferred for highlighting peak intensities in spectrograms compared to 'classic' or 'gray'. - Precision: If you are working with small windows, use the
--windowand--hopflags to tune the FFT sensitivity to your specific audio frequency requirements.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-steipete-songsee": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-write, file-read
Related Skills
swiftui-liquid-glass
Implement, review, or improve SwiftUI features using the iOS 26+ Liquid Glass API. Use when asked to adopt Liquid Glass in new SwiftUI UI, refactor an existing feature to Liquid Glass, or review Liquid Glass usage for correctness, performance, and design alignment.
qmd
Local search/indexing CLI (BM25 + vectors + rerank) with MCP mode.
summarize
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
create-cli
Design command-line interface parameters and UX: arguments, flags, subcommands, help text, output formats, error messages, exit codes, prompts, config/env precedence, and safe/dry-run behavior. Use when you’re designing a CLI spec (before implementation) or refactoring an existing CLI’s surface area for consistency, composability, and discoverability.
bird
X/Twitter CLI for reading, searching, and posting via cookies or Sweetistics.