MacLayer exposes Apple's native frameworks — ScreenCaptureKit, Vision, SFSpeechRecognizer, AXUIElement, Simulator — as REST + MCP endpoints any AI agent can call.
MacLayer wraps Apple's restricted native frameworks as clean JSON APIs — no Swift required on your side.
Extract structured text from any window, region, or display. Uses ScreenCaptureKit + Vision — fully on-device, sub-100ms, 99%+ accuracy on Apple Silicon.
Dump any macOS app's full UI tree as JSON — buttons, fields, labels, hierarchy. Agents can navigate any app without screen scraping or hardcoded coordinates.
Apple's on-device SFSpeechRecognizer. Near-Whisper accuracy, zero API cost, fully private. File or live mic. Returns timestamped segments.
Boot simulators, install apps, tap, swipe, screenshot, inspect UI trees. The missing piece for AI-assisted iOS development in Cursor or Claude Code.
Real-time CPU, memory, disk, battery, thermal, network, and top processes snapshot. Build monitoring agents and resource-aware automation.
Spotlight search with Quick Look previews and rich metadata. Returns file thumbnails, content snippets, and optional change-event webhooks.
Send, list, dismiss and stream macOS notifications. Agents can surface alerts, monitor for responses, and close the automation loop with the user.
Read and write the full macOS clipboard including text, images, RTF, HTML, and file references. Full rich type support via NSPasteboard.
Capture microphone or system audio output. Pipe directly into transcription. macOS 14.2+ system audio tap via CATapDescription — no extra apps needed.
Semantic search across Apple Photos by date, face, location, or natural language query. Returns thumbnails and metadata via PhotoKit + Vision.
Calendar events, reminders, secure credential store, and Focus mode control are on the roadmap. Vote on GitHub Discussions.
A Swift daemon holds your macOS entitlements. You talk to it via REST or MCP. No Swift, no Xcode, no entitlement headaches.
maclayerd as a signed LaunchAgent. It holds Screen Recording, Accessibility, and Microphone entitlements for you.maclayer setup walks you through macOS System Settings, granting each permission with a clear explanation of why it's needed.screen_ocr, get_accessibility_tree, or any other tool. Credits are deducted, response is instant.Buy credits once, use them forever. The more you use, the cheaper each call gets. Self-host is always free.
| Endpoint | Credits | Notes |
|---|---|---|
| POST /v1/screen/ocr | 2 | Full screen or window |
| POST /v1/screen/ocr | 1 | Region < 400×400px |
| POST /v1/accessibility/tree | 3 | Full UI tree dump |
| POST /v1/accessibility/element/click | 1 | Per action |
| POST /v1/speech/transcribe | 5/min | Per minute of audio |
| POST /v1/simulator/screenshot | 1 | |
| GET /v1/simulator/ui-tree | 3 | |
| GET /v1/system/snapshot | 1 | All metrics at once |
| POST /v1/files/search | 2 | With Quick Look preview |
| POST /v1/notifications/send | 1 | |
| GET /v1/clipboard/read | 1 | |
| POST /v1/photos/search | 3 |
From zero to your first OCR call in under 5 minutes. Requires macOS 14 Sonoma or later.
MacLayer is listed across every major agent tool directory. Discover it wherever you build.
Join the early access list. 1,000 free credits on signup. No credit card required.