Claw Use Android
Skill by 4ier
Install via CLI (Recommended)
clawhub install openclaw/skills/skills/4ier/claw-use-android
What This Skill Does
Claw Use Android (CUA) is a bridge between AI agents and real Android hardware. It runs as a background service on your phone, so it needs no ADB, rooting, or constant PC connectivity. The skill exposes a set of HTTP endpoints that let agents observe the screen through a semantic UI tree, take snapshots, and execute actions ranging from simple taps and text input to multi-step swipes and application launches. In effect, the agent can see what is on the screen and interact with it much as a human user would.
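As a rough illustration of how an agent might address the device over HTTP, here is a minimal Python sketch that builds a request without sending it. The port, the /screen path used as a URL, the compact query parameter, and the bearer-token header are all assumptions made for illustration. The listing itself only says that the skill exposes HTTP endpoints and that a device is registered with an IP address and a token.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical value: the real port is not documented in this listing.
ASSUMED_PORT = 8080


def build_screen_request(ip_address: str, token: str, compact: bool = True) -> Request:
    """Build (but do not send) a GET request for the device's UI tree.

    The path, query parameter, and auth header are illustrative assumptions,
    not the skill's documented API.
    """
    url = f"http://{ip_address}:{ASSUMED_PORT}/screen"
    if compact:
        url = f"{url}?{urlencode({'compact': '1'})}"
    # Assumption: the token passed to `cua add` is sent as a bearer token.
    return Request(url, headers={"Authorization": f"Bearer {token}"}, method="GET")


request = build_screen_request("192.168.1.50", "example-token")
print(request.get_method(), request.full_url)
```

Building the request separately from sending it keeps the sketch testable without a phone on the network; check the skill's own documentation for the actual endpoint paths and authentication scheme before relying on any of these names.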
Installation
To begin, ensure you have the cua CLI installed on your local environment. Follow these steps:
- Install the Claw Use Android APK directly onto your Android device.
- Open the app and enable the required Accessibility Service in your phone settings to allow the agent to read screen content and trigger UI interactions.
- Register your device from your local terminal using: cua add <device_name> <ip_address> <token>
- Verify connectivity by running: cua ping
Once successful, you are ready to integrate your agent with the device via the Unified API.
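The registration and verification steps above could be scripted along these lines. This is a sketch, not part of the skill: the helper names are hypothetical, only the cua add and cua ping commands come from the steps above, and treating a zero exit code as success is an assumption about the CLI.

```python
import shutil
import subprocess


def cua_add_command(device_name: str, ip_address: str, token: str) -> list[str]:
    """Argument list for `cua add <device_name> <ip_address> <token>`."""
    return ["cua", "add", device_name, ip_address, token]


def cua_ping_command() -> list[str]:
    """Argument list for `cua ping`."""
    return ["cua", "ping"]


def register_and_verify(device_name: str, ip_address: str, token: str) -> bool:
    """Run `cua add` and then `cua ping`; return True if both exit with 0."""
    if shutil.which("cua") is None:
        raise FileNotFoundError("cua CLI not found on PATH")
    commands = (cua_add_command(device_name, ip_address, token), cua_ping_command())
    for command in commands:
        # Assumption: exit code 0 means success; the CLI's exit codes are not documented here.
        if subprocess.run(command, check=False).returncode != 0:
            return False
    return True


# Show the commands that would be run, without executing them.
print(cua_add_command("my-phone", "192.168.1.50", "example-token"))
print(cua_ping_command())
```

Passing arguments as a list rather than a shell string avoids quoting problems if the device name or token contains spaces or special characters.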
Use Cases
This skill is suited to automating repetitive mobile tasks, managing app workflows, and conducting mobile quality assurance. You can use it to automate food ordering, manage notifications across different apps, perform regression testing by comparing UI states, or transcribe and respond to messages in social apps. The new Flow-First mechanism lets you define reusable patterns in flows.md, so the agent can execute sequences like 'log in to the app and navigate to settings' with sub-second response times and without redundant LLM reasoning.
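The Flow-First idea can be pictured as a lookup that runs before any model call. The sketch below is an illustration under stated assumptions: the format of flows.md is not described in this listing, so flows are modeled as an in-memory dictionary, and the names FLOWS, run_task, and plan_with_llm, along with the step vocabulary, are hypothetical.

```python
from typing import Callable

Step = tuple[str, str]  # (action, argument); a made-up step format for this sketch

# Hypothetical stand-in for flows that would be defined in flows.md.
FLOWS: dict[str, list[Step]] = {
    "open settings": [("launch", "com.android.settings")],
    "log in and open settings": [
        ("tap", "login_button"),
        ("type", "username"),
        ("tap", "settings_tab"),
    ],
}


def run_task(task: str, plan_with_llm: Callable[[str], list[Step]]) -> tuple[str, list[Step]]:
    """Return the stored flow for a task if one exists, else ask the planner.

    The first element of the result records which path was taken.
    """
    key = task.strip().lower()
    if key in FLOWS:
        # Known pattern: reuse the stored steps, no model call needed.
        return "flow", FLOWS[key]
    # Unknown task: fall back to step-by-step planning.
    return "llm", plan_with_llm(task)


source, steps = run_task("Open Settings", lambda task: [])
print(source, steps)
```

The point of the design is that the planner callback is never invoked for a task that already has a flow, which is where the latency and token savings described above would come from.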
Example Prompts
- "Open Duolingo and check if I have completed my daily streak today. If not, click on the current lesson and start it."
- "Look at the notification shade. If there is a message from Slack, read it and summarize the latest update for me."
- "Go to the Settings app, find the display brightness, and turn it up to 80%."
Tips & Limitations
To get the most out of your agent, use the semantic UI tree provided by /screen -c. It returns a compact view of interactive elements, reducing token consumption and noise. Remember the Flow-First Principle: whenever you find yourself repeating the same observe-then-act sequence, define it as a flow in your flows.md file. This shifts processing from the LLM to device-side execution, resulting in sub-second response times. Be aware that the reliability of this skill depends heavily on the Android Accessibility Service; if the UI structure is overly complex or uses non-standard components, some elements may need to be tapped by coordinates rather than by semantic ref IDs. Always have a fallback mechanism for when elements are not found.
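The fallback advice above can be sketched as a small resolver that prefers a semantic ref and only then drops to coordinates. The element shape (a dictionary with ref and text keys) and the action tuples are assumptions for illustration; the real output of /screen -c may look different.

```python
from typing import Optional


def choose_tap_action(
    elements: list[dict],
    label: str,
    fallback_xy: Optional[tuple[int, int]] = None,
) -> tuple:
    """Pick how to tap an element: by semantic ref if found, else by coordinates.

    `elements` is a hypothetical parsed UI tree, e.g. [{"ref": "e12", "text": "Send"}].
    Raises LookupError when neither a matching ref nor fallback coordinates exist.
    """
    wanted = label.strip().lower()
    for element in elements:
        if element.get("text", "").strip().lower() == wanted and element.get("ref"):
            return ("tap_ref", element["ref"])
    if fallback_xy is not None:
        # Non-standard components may not appear in the tree; use raw coordinates.
        return ("tap_xy", fallback_xy[0], fallback_xy[1])
    raise LookupError(f"no element labelled {label!r} and no fallback coordinates")


tree = [{"ref": "e12", "text": "Send"}, {"ref": "e13", "text": "Cancel"}]
print(choose_tap_action(tree, "send"))
print(choose_tap_action(tree, "Attach", fallback_xy=(540, 1800)))
```

Raising an error when both paths fail gives the agent an explicit signal to re-read the screen or ask for help instead of tapping blindly.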
Metadata
Paste this into your clawhub.json to enable this plugin.
{
  "plugins": {
    "official-4ier-claw-use-android": {
      "enabled": true,
      "auto_update": true
    }
  }
}
Tags (AI)
Flags: network-access, data-collection
Related Skills
feishu-contacts
Sync Feishu (Lark) contacts into USER.md so the agent can identify DM senders by name. Use when setting up Feishu identity recognition, updating contacts after HR changes, or configuring multi-user agent access. Feishu DMs only include open_id (no sender name), so this skill embeds an open_id→name lookup table directly in USER.md for zero-tool-call identification.
claw-use-mihomo
Manage mihomo proxy - install, configure from subscriptions, monitor health, auto-switch nodes. Supports vmess/ss/trojan/vless protocols.
feishu-group
Teach OpenClaw how to work in Feishu (Lark) group chats — recognize who's talking, behave properly in groups vs DMs, respect multi-user privacy, and format messages for the platform. Use when connecting OpenClaw to Feishu and you want your agent to be a competent group participant, not just a responder.
openclaw-feishu-group-chat
Teach OpenClaw how to work in Feishu (Lark) group chats — recognize who's talking, behave properly in groups vs DMs, respect multi-user privacy, and format messages for the platform. Use when connecting OpenClaw to Feishu and you want your agent to be a competent group participant, not just a responder.
neo
Browse websites, read web pages, interact with web apps, call website APIs, and automate web tasks. Use Neo when: user asks to check a website, read a web page, post on social media (Twitter/X), interact with any web app, look up information on a specific site, scrape data from websites, automate browser tasks, or when you need to call any website's API. Keywords: website, web page, browse, URL, http, API, twitter, tweet, post, scrape, web app, open site, check site, read page, social media, online service.