image-studio

$npx mdskill add vellum-ai/vellum-assistant/image-studio

You are an image generation assistant. When the user asks you to create or edit images, use the `media_generate_image` tool.

SKILL.md

.github/skills/image-studioView on GitHub ↗
---
name: image-studio
description: Generate and edit images using AI
compatibility: "Designed for Vellum personal assistants"
metadata:
  emoji: "🎨"
  vellum:
    display-name: "Image Studio"
    activation-hints:
      - "User asks to generate, draw, or create an image from a text prompt"
      - "User wants to edit an existing image — background removal, in-painting, style change, retouching"
      - "User wants multiple variations of a visual (logo concepts, mood boards, illustration options)"
---

You are an image generation assistant. When the user asks you to create or edit images, use the `media_generate_image` tool.

## Usage

- **Text-to-image**: "Generate an image of a sunset over the ocean"
- **Image editing**: "Remove the background from this image" (requires providing source image file paths)
- **Multiple variants**: "Generate 3 variations of a logo for a coffee shop"

## Modes

- **generate** (default): Create a new image from a text prompt.
- **edit**: Modify an existing image based on a text prompt. Requires one or more source images via `source_paths` (file paths on disk).

## Models

- `gemini-3.1-flash-image-preview` (default) - Nano Banana 2, fast, good quality
- `gemini-3-pro-image-preview` - Nano Banana Pro, higher quality, slower
- `gpt-image-2` - OpenAI GPT Image 2, high fidelity, slower

## Tips

- Be descriptive in your prompts for better results. Include details about style, composition, lighting, and mood.
- When editing images, clearly describe what changes you want made to the source image.
- Use the `variants` parameter (1-4) to generate multiple options and pick the best one.
- If no API key is configured for the selected model's provider (Gemini or OpenAI), the tool will return an error - ask the user to set one up.

## Error handling

When image generation fails, report the error to the user as-is. **Do not** attempt to fix the error by changing service configuration (e.g. switching between "managed" and "your-own" mode, or changing the provider/model). Service configuration changes should only be made at the user's explicit request via Settings.

More from vellum-ai/vellum-assistant

SkillDescription
acpSpawn external coding agents via the Agent Client Protocol (ACP)
amazonShop on Amazon and Amazon Fresh through your browser
api-mappingRecord and analyze API surfaces of web services
app-builderBuild and edit small, personal visual tools and artifacts — dashboards, trackers, calculators, data visualizations, charts, simple landing pages, and slide decks the user wants for THEMSELVES. This is the right skill whenever the user asks to "visualize this," "make a chart," or "build an artifact" for their own use, or to edit an app they already built here. Do NOT reach for a ui_show dynamic_page to fake an artifact — build a real persistent app here. NOT for complex, multi-user, or shippable products — those go to a real project folder with a coding agent (see Scope below).
app-controlDrive a specific named macOS app via raw input bypassing the Accessibility tree
assistant-migrationMigrate from ChatGPT, Claude, OpenClaw, Hermes, Manus, and other AI assistants into Vellum by inspecting their data exports, conversation archives, files, prompts, custom instructions, memory, saved memories, tools, GPTs, workflows, integrations, and relationships, then mapping as much as safely possible into Vellum primitives. Handles single-source and multi-source migrations with a unified, deduplicated inventory.
chatgpt-importImport conversation history from ChatGPT into Vellum
cli-discoverDiscover which CLI tools are installed, their versions, and authentication status
computer-useControl the macOS desktop
contactsManage contacts, communication channels, access control, and invite links