computer-use
$ npx mdskill add vellum-ai/vellum-assistant/computer-use
SKILL.md
.github/skills/computer-use
---
name: computer-use
description: Control the macOS desktop
compatibility: "Designed for Vellum personal assistants"
metadata:
  emoji: "🖥️"
  vellum:
    display-name: "Computer Use"
    activation-hints:
      - "User asks the assistant to click, type, drag, or interact with the macOS GUI directly"
      - "User wants control of a desktop app with no CLI or API alternative (games, design tools, visual workflows)"
      - "User wants screenshots or visual inspection of what is currently on screen"
    avoid-when:
      - "Task can be done via a more specific skill (gmail, calendar, contacts, terminal-sessions) or a CLI / API call"
---
This skill provides the `computer_use_*` action tools for controlling
the macOS desktop. Computer-use (CU) tools run through the main agent loop via `HostCuProxy`.
The skill is internally preactivated for conversations with a connected desktop client.
Tools in this skill are proxy tools: execution is forwarded to the connected
macOS client and is never handled locally by the assistant.
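
The proxy behaviour described above can be pictured with a minimal TypeScript sketch. It is an illustration under stated assumptions, not the actual `HostCuProxy` implementation: `CuRequest`, `CuResult`, `ClientTransport`, `ProxySketch`, and the tool name `computer_use_click` are all hypothetical names invented for this example, and the call is synchronous only to keep the sketch short (forwarding to a real client would presumably be asynchronous).

```typescript
// Hypothetical shapes: none of these names are taken from the Vellum codebase.
interface CuRequest {
  tool: string; // e.g. "computer_use_click" (illustrative name only)
  input: Record<string, unknown>; // tool arguments, passed through unchanged
}

interface CuResult {
  ok: boolean;
  detail: string;
}

// Stand-in for the connection to the desktop client.
interface ClientTransport {
  send(request: CuRequest): CuResult;
}

class ProxySketch {
  private readonly client: ClientTransport | null;

  constructor(client: ClientTransport | null) {
    this.client = client;
  }

  execute(tool: string, input: Record<string, unknown>): CuResult {
    // Only tools carrying the computer_use_ prefix belong to this skill.
    if (!tool.startsWith("computer_use_")) {
      return { ok: false, detail: `not a computer-use tool: ${tool}` };
    }
    // No local fallback: without a connected client the call fails.
    if (this.client === null) {
      return { ok: false, detail: "no desktop client connected" };
    }
    // Forward the request as-is; the client performs the action.
    return this.client.send({ tool, input });
  }
}

// A fake client that records what it receives, standing in for the macOS app.
const received: CuRequest[] = [];
const fakeClient: ClientTransport = {
  send(request: CuRequest): CuResult {
    received.push(request);
    return { ok: true, detail: `client ran ${request.tool}` };
  },
};

const connected = new ProxySketch(fakeClient);
const disconnected = new ProxySketch(null);

const clickResult = connected.execute("computer_use_click", { x: 10, y: 20 });
const offlineResult = disconnected.execute("computer_use_click", { x: 10, y: 20 });
```

In this sketch the assistant side never performs the click itself: with a client attached the request shows up in `received`, and without one the call returns an error result instead of falling back to local execution.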
More from vellum-ai/vellum-assistant
- acp: Spawn external coding agents via the Agent Client Protocol (ACP)
- amazon: Shop on Amazon and Amazon Fresh through your browser
- api-mapping: Record and analyze API surfaces of web services
- app-builder: Build and edit small, personal visual tools and artifacts (dashboards, trackers, calculators, data visualizations, charts, simple landing pages, and slide decks) the user wants for THEMSELVES. This is the right skill whenever the user asks to "visualize this," "make a chart," or "build an artifact" for their own use, or to edit an app they already built here. Do NOT reach for a ui_show dynamic_page to fake an artifact; build a real persistent app here. NOT for complex, multi-user, or shippable products; those go to a real project folder with a coding agent (see Scope below).
- app-control: Drive a specific named macOS app via raw input bypassing the Accessibility tree
- assistant-migration: Migrate from ChatGPT, Claude, OpenClaw, Hermes, Manus, and other AI assistants into Vellum by inspecting their data exports, conversation archives, files, prompts, custom instructions, memory, saved memories, tools, GPTs, workflows, integrations, and relationships, then mapping as much as safely possible into Vellum primitives. Handles single-source and multi-source migrations with a unified, deduplicated inventory.
- chatgpt-import: Import conversation history from ChatGPT into Vellum
- cli-discover: Discover which CLI tools are installed, their versions, and authentication status
- contacts: Manage contacts, communication channels, access control, and invite links
- conversation-launcher: Offer the user several spin-off conversations as clickable buttons on a single persistent card. Each click spawns a fresh seeded conversation in the sidebar; the user keeps their place in the current conversation. Use when you want to branch into N focused threads (research directions, draft choices, pending replies, triage of N items) without losing the current context. Not for single-destination pivots; just reply inline.