nightingale-karaoke
ML-powered Karaoke app in Rust using Bevy, WhisperX, and Demucs for stem separation, lyrics transcription, and pitch scoring.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/adisinghstudent/nightingale-karaokeNightingale Karaoke Skill
Skill by ara.so — Daily 2026 Skills collection.
Nightingale is a self-contained, ML-powered karaoke application written in Rust (Bevy engine). It scans a local music folder, separates vocals from instrumentals (UVR Karaoke model or Demucs), transcribes lyrics with word-level timestamps (WhisperX), and plays back with synchronized highlighting, real-time pitch scoring, player profiles, and GPU shader / video backgrounds. Everything — ffmpeg, Python, PyTorch, ML models — is bootstrapped automatically on first launch.
Installation
Pre-built Binary (Recommended)
Download the latest release from the Releases page for your platform and run it.
macOS only — remove quarantine after extracting:
xattr -cr Nightingale.app
Build from Source
Prerequisites:
- Rust 1.85+ (edition 2024)
- Linux additionally needs:
libasound2-dev libudev-dev libwayland-dev libxkbcommon-dev
git clone https://github.com/rzru/nightingale
cd nightingale
# Development build
cargo build --release
# Run directly
./target/release/nightingale
Release Packaging
# Linux / macOS
scripts/make-release.sh
# Windows (PowerShell)
powershell -ExecutionPolicy Bypass -File scripts/make-release.ps1
Outputs a .tar.gz (Linux/macOS) or .zip (Windows) ready for distribution.
First Launch / Bootstrap
On first run, Nightingale downloads and configures:
ffmpegbinaryuv(Python package manager)- Python 3.10 via uv
- PyTorch + WhisperX + audio-separator in a virtual environment
- UVR Karaoke ONNX model and WhisperX
large-v3model
This takes 2–10 minutes depending on network speed. A progress screen is shown in-app.
To force re-bootstrap at any time:
./nightingale --setup
Bootstrap completion is marked by ~/.nightingale/vendor/.ready.
CLI Flags
| Flag | Description |
|---|---|
--setup | Force re-run of the first-launch bootstrap (re-downloads vendor deps) |
Keyboard & Gamepad Controls
Navigation
| Action | Keyboard | Gamepad |
|---|---|---|
| Move | Arrow keys | D-pad / Left stick |
| Confirm | Enter | A (South) |
| Back | Escape | B (East) / Start |
| Switch panel | Tab | — |
| Search | Type to filter | — |
Playback
| Action | Keyboard | Gamepad |
|---|---|---|
| Pause / Resume | Space | Start |
| Exit to menu | Escape | B (East) |
| Toggle guide vocals | G | — |
| Guide volume up/down | + / - | — |
| Cycle background | T | — |
| Cycle video flavor | F | — |
| Toggle microphone | M | — |
| Next microphone | N | — |
| Toggle fullscreen | F11 | — |
Configuration
Main Config
Located at ~/.nightingale/config.json. Edit directly or via in-app settings.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-adisinghstudent-nightingale-karaoke": {
"enabled": true,
"auto_update": true
}
}
}Related Skills
Oh My Openagent Omo
Skill by adisinghstudent
Planning With Files Manus Workflow
Skill by adisinghstudent
mirofish-offline-simulation
Fully local multi-agent swarm intelligence simulation engine using Neo4j + Ollama for public opinion, market sentiment, and social dynamics prediction.
ghostling-libghostty-terminal
Build minimal terminal emulators using the libghostty-vt C API with Raylib for windowing and rendering
Obra Superpowers Agentic Workflow
Skill by adisinghstudent