test-triage
$
npx mdskill add openai/plugins/test-triageRun focused macOS tests to isolate failures and classify errors.
- Helps narrow test scope and distinguish build from execution failures.
- Integrates with xcodebuild and swift test commands.
- Decides execution by selecting smallest failing targets first.
- Delivers clear summaries of failure types and next proof steps.
SKILL.md
.github/skills/test-triageView on GitHub ↗
--- name: test-triage description: Triage macOS tests across Xcode and SwiftPM. Use when narrowing failures, explaining assertions or crashes, or separating setup from regressions. --- # Test Triage ## Quick Start Use this skill to run the smallest meaningful test scope first, classify failures precisely, and avoid treating every test failure like a product bug. ## Workflow 1. Detect the test harness. - Use `xcodebuild test` for Xcode-based projects. - Use `swift test` for SwiftPM packages. 2. Narrow the scope. - If the user gave a target, product, or test filter, use it. - If not, prefer the smallest likely failing target before a full suite. 3. Classify the result. - Build failure - Assertion failure - Crash or signal - Async timing or flake - Environment or fixture setup issue - Missing entitlement or host app issue 4. Rerun intelligently. - Use focused reruns when a specific case fails. - Avoid burning time on full-suite reruns without new information. 5. Summarize clearly. - What command ran - Which tests failed - What kind of failure it was - The best next proof step or fix path ## Guardrails - Distinguish compilation failures from test execution failures. - Call out when a test appears to assume iOS-only or simulator-only behavior. - Mark likely flakes as such instead of overstating confidence. ## Output Expectations Provide: - the command used - the smallest failing scope - the top failure category - a concise explanation of the likely cause - the next rerun or fix step
More from openai/plugins
- accessibility-and-inclusive-visualizationMake data visualizations accessible and inclusive. Use when the user needs chart or diagram accessibility guidance, text alternatives for complex visuals, color and contrast review, keyboard support, reduced-motion behavior for animation or parallax, or an accessibility QA workflow for exported figures, UML-like diagrams, and dashboards.
- agent-browserBrowser automation CLI for AI agents. Use when the user needs to interact with websites, verify dev server output, test web apps, navigate pages, fill forms, click buttons, take screenshots, extract data, or automate any browser task. Also triggers when a dev server starts so you can verify it visually.
- agent-browser-verifyAutomated browser verification for dev servers. Triggers when a dev server starts to run a visual gut-check with agent-browser — verifies the page loads, checks for console errors, validates key UI elements, and reports pass/fail before continuing.
- agents-sdkBuild AI agents on Cloudflare Workers using the Agents SDK. Load when creating stateful agents, durable workflows, real-time WebSocket apps, scheduled tasks, MCP servers, or chat applications. Covers Agent class, state management, callable RPC, Workflows integration, and React hooks. Biases towards retrieval from Cloudflare docs over pre-trained knowledge.
- ai-elementsAI Elements component library guidance — pre-built React components for AI interfaces built on shadcn/ui. Use when building chat UIs, message displays, tool call rendering, streaming responses, reasoning panels, or any AI-native interface with the AI SDK.
- ai-gatewayVercel AI Gateway expert guidance. Use when configuring model routing, provider failover, cost tracking, or managing multiple AI providers through a unified API.
- ai-generation-persistenceAI generation persistence patterns — unique IDs, addressable URLs, database storage, and cost tracking for every LLM generation
- ai-sdkVercel AI SDK expert guidance. Use when building AI-powered features — chat interfaces, text generation, structured output, tool calling, agents, MCP integration, streaming, embeddings, reranking, image generation, or working with any LLM provider.
- aiq-deploy|
- aiq-research|