Langfuse Observability
Skill by aiwithabidi
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/aiwithabidi/langfuse-observabilityWhat This Skill Does
The Langfuse Observability skill acts as the central nervous system for OpenClaw agents, providing a comprehensive toolkit for monitoring, tracing, and analyzing agent behavior in real-time. By integrating the Langfuse v3 platform, this skill allows users to gain full visibility into every step of an agent's logic. It automatically captures LLM inputs and outputs, API requests, tool execution payloads, and custom events. With built-in cost tracking, you can monitor token usage per model to prevent budget overruns, while session grouping allows you to correlate multiple interactions into a unified narrative. Beyond simple logging, the tool enables evaluation scoring and provides health monitoring dashboards to ensure your agents are performing reliably in production environments. It is the essential diagnostic layer for any agent developer looking to move from prototype to production.
Installation
To integrate this skill into your existing agent environment, ensure you have the OpenClaw CLI configured, then execute the following command in your terminal: clawhub install openclaw/skills/skills/aiwithabidi/langfuse-observability. Once the installation is complete, the module becomes available in your workspace. You can import the necessary tracing decorators and functions directly into your scripts using: from langfuse_hub import traced, trace_llm, trace_api, trace_tool, trace_event, flush. This enables granular control over which parts of your agent's execution pipeline are sent to your Langfuse instance.
Use Cases
- Debugging Complex Chains: Identify exactly where an agent's reasoning failed or where a specific tool call returned malformed data.
- Cost Governance: Track the financial impact of your agents by monitoring token consumption per model call, helping you optimize prompt engineering for cheaper execution.
- Performance Auditing: Maintain an audit trail of every decision made by the agent for compliance and security reviews.
- Quality Assurance: Use evaluation scores to automatically grade agent responses against ground-truth data, allowing for iterative improvement of your agent's logic.
- Operational Monitoring: Receive periodic reports via Telegram using the
langfuse_cron.pyscript to stay updated on system health without constantly checking the dashboard.
Example Prompts
- "OpenClaw, run a health report for my agent and show me the total token cost for the last 24 hours using the Langfuse integration."
- "Trace the last failed execution in my session and identify which tool call caused the bottleneck."
- "Summarize the top three most expensive prompt chains executed by the agent this week and suggest optimizations for cost reduction."
Tips & Limitations
For optimal performance, always ensure you call the flush() function at the end of your agent's execution sequence to ensure all buffer data is transmitted to the Langfuse server. Be aware that excessive logging of high-frequency tool calls can increase network overhead, so filter sensitive data before transmission. The internal dashboard is available at http://langfuse-web:3000, which must be accessible from your agent's network environment. While this tool provides powerful visibility, it is recommended to manage your API keys via secure environment variables rather than hardcoding them into your scripts.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-aiwithabidi-langfuse-observability": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: network-access, external-api
Related Skills
freshsales
Freshsales CRM integration — manage contacts, leads, deals, accounts, tasks, and sales sequences via the Freshsales API. Track deal pipelines, automate lead assignments, log activities, and generate sales reports. Built for AI agents — Python stdlib only, no dependencies. Use for sales CRM, contact management, deal tracking, pipeline reporting, and sales automation.
gemini-video-analyzer
Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe speech, identify objects and actions. Use when: (1) User sends a video file and wants it analyzed, (2) Video summarization or description needed, (3) Extracting text, UI elements, or information from screen recordings, (4) Answering questions about video content, (5) Comparing multiple videos, (6) Analyzing tutorials, demos, or walkthroughs.
agent-memory
Full AI agent memory stack — Mem0 unified memory engine with vector search (Qdrant) and knowledge graph (Neo4j), plus SQLite for structured data. Complete setup script and tools. Give your OpenClaw agent a real brain with semantic recall, entity relationships, and structured storage.
neon
Neon serverless Postgres — manage projects, branches, databases, roles, endpoints, and compute via the Neon API. Create database branches for development, manage connection endpoints, scale compute, and monitor usage. Built for AI agents — Python stdlib only, zero dependencies. Use for serverless Postgres, database branching, database management, development workflows, and cloud database automation.
onepassword
1Password Connect — vaults, items, secrets management for server-side applications.