Official Verified

Percept Speaker Id

Skill by jarvis563

skill-install — Terminal

Install via CLI (Recommended)

clawhub install openclaw/skills/skills/jarvis563/percept-speaker-id

Download Source Code (.zip)

percept-speaker-id

Speaker identification and management for multi-person conversations.

What it does

Tracks who said what in conversations. Maps anonymous speaker labels (SPEAKER_0, SPEAKER_1) to real names, maintains speaker profiles, and gates voice command authorization.

When to use

User asks "who said that?" or wants speaker-attributed transcripts
User wants to configure which people can trigger voice commands
Agent needs to know who is speaking in a multi-person conversation

Requirements

percept-listen skill installed and running
Omi pendant (provides is_user flag for primary speaker)

How it works

Omi sends transcript segments with speaker labels (SPEAKER_0, SPEAKER_1, etc.)
Percept resolves labels to names using the speakers registry
is_user flag from Omi identifies the pendant wearer as the primary speaker
Speaker profiles track first/last seen timestamps and authorization status

Speaker registry

Located at percept/data/speakers.json:

{
  "SPEAKER_00": {
    "name": "David",
    "is_owner": true,
    "approved": true
  },
  "SPEAKER_01": {
    "name": "Rob",
    "is_owner": false,
    "approved": true
  }
}

Manage via Percept dashboard (port 8960) → Settings → Speakers.

Authorization levels

Owner (is_owner: true): Full command access, always authorized
Approved (approved: true): Can trigger wake word commands
Unknown: Logged only, commands not executed

Future: Voice embeddings

Planned: pyannote speaker diarization with 192-dim voice embeddings for automatic speaker recognition via cosine similarity. Currently speaker mapping is manual.

Links

GitHub: https://github.com/GetPercept/percept

Read Full Documentation on GitHub

Metadata

Author@jarvis563

Stars1947

Updated2026-03-04

View Author Profile

AI Skill Finder

Not sure this is the right skill?

Describe what you want to build — we'll match you to the best skill from 16,000+ options.

Find the right skill

Add to Configuration

Paste this into your clawhub.json to enable this plugin.

{
  "plugins": {
    "official-jarvis563-percept-speaker-id": {
      "enabled": true,
      "auto_update": true
    }
  }
}

Safety NoteClawKit audits metadata but not runtime behavior. Use with caution.

Related Skills

Percept Ambient

Skill by jarvis563

jarvis563 1947

Percept Summarize

Skill by jarvis563

jarvis563 1947

Percept Voice Cmd

Skill by jarvis563

jarvis563 1947

browser-audio-capture

Capture audio from any browser tab — meetings, YouTube, podcasts, courses, webinars — and stream to any AI agent. Zero API keys, works with any framework.

jarvis563 1947

stable-browser

Set up reliable browser automation using Chrome DevTools Protocol (CDP) instead of the flaky browser extension relay. Use when browser relay keeps disconnecting, throwing WebSocket 403 errors, or when you need stable headless/headed browser control for web scraping, form filling, social media posting, or any browser automation task. Replaces profile="chrome" with a rock-solid CDP connection.

jarvis563 1947