llmwhisperer
Extract text and layout from images and PDFs using LLMWhisperer API. Good for handwriting and complex forms.
Why use this skill?
Extract accurate text and preserve layouts from handwritten notes, complex forms, and PDFs using the LLMWhisperer API within your OpenClaw agent workflow.
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/gumadeiras/llmwhispererWhat This Skill Does
The llmwhisperer skill is a specialized OCR (Optical Character Recognition) and layout extraction tool designed to interface with the LLMWhisperer API by Unstract. Unlike standard text extractors, this skill is purpose-built to handle complex document structures, tables, and handwritten notes. It excels at preserving the semantic layout of a document, making it an essential utility for digitizing physical paperwork, converting legacy forms into machine-readable data, and interpreting challenging visual media.
Installation
To begin using llmwhisperer, first ensure you have the OpenClaw environment configured. Install the skill by running: clawhub install openclaw/skills/skills/gumadeiras/llmwhisperer. Once installed, you must register for a free API key at the Unstract website. Save this key into your configuration file located at ~/.clawdbot/.env by appending LLMWHISPERER_API_KEY=your_key_here. The skill is designed to automatically detect this key from your environment, ensuring a seamless authentication process every time you invoke the tool.
Use Cases
This skill is highly effective for professionals dealing with high-density information. Common use cases include: digitizing handwritten medical or research notes, extracting financial data from non-searchable PDF invoices, reading text from complex marketing flyers with overlapping elements, and processing scanned government or legal forms where standard OCR tools fail to maintain column integrity.
Example Prompts
- "OpenClaw, please run the llmwhisperer skill on the scanned_invoice.pdf file in my downloads folder and output the text directly to the console so I can review it."
- "Process the image of my handwritten meeting notes using llmwhisperer and save the resulting text content into a new file called meeting_summary.txt."
- "Use the llmwhisperer skill to extract all data from this image of a complex table; I need the layout preserved so I can parse it into a spreadsheet later."
Tips & Limitations
To maximize the performance of this skill, ensure that your input files (JPG or PDF) are clear and high-resolution. While LLMWhisperer is exceptionally robust, extremely low-light or blurry photos may impact accuracy. Note that the API has a usage limit of 100 pages per day on the free tier. When using the command, remember that the tool performs a direct binary upload; if your local environment has restricted internet access, the tool will fail as it requires a connection to the Unstract API endpoints. Always verify your .env file permissions if you encounter authentication errors.
Metadata
Not sure this is the right skill?
Describe what you want to build — we'll match you to the best skill from 16,000+ options.
Find the right skillPaste this into your clawhub.json to enable this plugin.
{
"plugins": {
"official-gumadeiras-llmwhisperer": {
"enabled": true,
"auto_update": true
}
}
}Tags(AI)
Flags: file-read, external-api, network-access
Related Skills
gotrain
MTA system train departures (NYC Subway, LIRR, Metro-North). Use when the user wants train times, schedules, or service alerts for MTA transit. Covers MTA Subway, LIRR, and Metro-North across the greater New York area.
apple-mail-search
Apple Mail search on macOS with fast metadata and full body lookup. Use for finding messages in Mail.app by subject/sender/recipient/date, opening messages, and reading full body text.
lastfm
Access Last.fm listening history, music stats, and discovery. Query recent tracks, top artists/albums/tracks, loved tracks, similar artists, and global charts.
little-snitch
Control Little Snitch firewall on macOS. View logs, manage profiles and rule groups, monitor network traffic. Use when the user wants to check firewall activity, enable/disable profiles or blocklists, or troubleshoot network connections.
calcurse
A text-based calendar and scheduling application. Use strictly for CLI-based calendar management.