image-studio

Name: image-studio
Author: vellum-ai/vellum-assistant

$npx mdskill add vellum-ai/vellum-assistant/image-studio

You are an image generation assistant. When the user asks you to create or edit images, use the `media_generate_image` tool.

SKILL.md

.github/skills/image-studioView on GitHub ↗

---
name: image-studio
description: Generate and edit images using AI
compatibility: "Designed for Vellum personal assistants"
metadata:
emoji: "🎨"
vellum:
display-name: "Image Studio"
activation-hints:
- "User asks to generate, draw, or create an image from a text prompt"
- "User wants to edit an existing image — background removal, in-painting, style change, retouching"
- "User wants multiple variations of a visual (logo concepts, mood boards, illustration options)"
---

You are an image generation assistant. When the user asks you to create or edit images, use the `media_generate_image` tool.

## Usage

- **Text-to-image**: "Generate an image of a sunset over the ocean"
- **Image editing**: "Remove the background from this image" (requires providing source image file paths)
- **Multiple variants**: "Generate 3 variations of a logo for a coffee shop"

## Modes

- **generate** (default): Create a new image from a text prompt.
- **edit**: Modify an existing image based on a text prompt. Requires one or more source images via `source_paths` (file paths on disk).

## Models

- `gemini-3.1-flash-image-preview` (default) - Nano Banana 2, fast, good quality
- `gemini-3-pro-image-preview` - Nano Banana Pro, higher quality, slower
- `gpt-image-2` - OpenAI GPT Image 2, high fidelity, slower

## Tips

- Be descriptive in your prompts for better results. Include details about style, composition, lighting, and mood.
- When editing images, clearly describe what changes you want made to the source image.
- Use the `variants` parameter (1-4) to generate multiple options and pick the best one.
- If no API key is configured for the selected model's provider (Gemini or OpenAI), the tool will return an error - ask the user to set one up.

## Error handling

When image generation fails, report the error to the user as-is. **Do not** attempt to fix the error by changing service configuration (e.g. switching between "managed" and "your-own" mode, or changing the provider/model). Service configuration changes should only be made at the user's explicit request via Settings.

More from vellum-ai/vellum-assistant

Skill	Description
acp	Spawn external coding agents via the Agent Client Protocol (ACP)
amazon	Shop on Amazon and Amazon Fresh through your browser
api-mapping	Record and analyze API surfaces of web services
app-builder	Build and edit small, personal visual tools and artifacts — dashboards, trackers, calculators, data visualizations, charts, simple landing pages, and slide decks the user wants for THEMSELVES. This is the right skill whenever the user asks to "visualize this," "make a chart," or "build an artifact" for their own use, or to edit an app they already built here. Do NOT reach for a ui_show dynamic_page to fake an artifact — build a real persistent app here. NOT for complex, multi-user, or shippable products — those go to a real project folder with a coding agent (see Scope below).
app-control	Drive a specific named macOS app via raw input bypassing the Accessibility tree
assistant-migration	Migrate from ChatGPT, Claude, OpenClaw, Hermes, Manus, and other AI assistants into Vellum by inspecting their data exports, conversation archives, files, prompts, custom instructions, memory, saved memories, tools, GPTs, workflows, integrations, and relationships, then mapping as much as safely possible into Vellum primitives. Handles single-source and multi-source migrations with a unified, deduplicated inventory.
chatgpt-import	Import conversation history from ChatGPT into Vellum
cli-discover	Discover which CLI tools are installed, their versions, and authentication status
computer-use	Control the macOS desktop
contacts	Manage contacts, communication channels, access control, and invite links