triage-finding

$npx mdskill add openai/plugins/triage-finding

Triage existing security findings against the current repository using static code evidence. Return one evidence-backed verdict per supplied finding: `confirmed`, `not_actionable`, or `needs_review`. For `confirmed` and `needs_review` findings, also assign a discrete exploitability stack rank inside that verdict's own queue.

SKILL.md

.github/skills/triage-findingView on GitHub ↗
---
name: triage-finding
description: "Use when the user supplies or imports existing security findings, vulnerability reports, or security/vulnerability Jira/Linear tickets from scanners, advisories, GitHub, Atlassian Rovo, Linear, or similar backlog sources and wants static repo-impact triage. Do not use for discovery, duplicate-bug triage, validation, or fixes."
metadata:
  short-description: "Triage pasted or imported security findings from Jira/Linear, scanners, GitHub, and advisories"
---

# Triage Finding

## Objective

Triage existing security findings against the current repository using static code evidence. Return one evidence-backed verdict per supplied finding:
`confirmed`, `not_actionable`, or `needs_review`. For `confirmed` and `needs_review` findings, also assign a discrete exploitability stack rank inside that verdict's own queue.

This skill is for backlog burn-down. It starts from findings the user already has, such as SARIF results, CVEs, advisories, scanner tickets, bug bounty reports, Jira/Linear issues, or Codex Security finding artifacts. It is not a repository-wide scan, dynamic validation run, fix implementation, dashboard, or queue manager.

## Backlog Burn-Down Scope

Treat multiple supplied findings as one backlog-reduction problem, not as a set of unrelated one-off triages. The goal is to turn noisy existing finding sources into a ranked, evidence-backed action queue while preserving one result per input for auditability and app rendering.

For now, run the workflow inline in the current thread, but structure the work like a backlog pipeline:

- Build the normalized triage item list for the whole supplied or imported collection before assigning verdicts. Here, normalize means: assign `triage_item_id`, preserve source ids and references, extract the fields in the Inputs section below, and record missing fields as proof gaps without inventing scanner, severity, remediation, or generated Codex Security fields.
- Triage each normalized item using static evidence and keep one output result per supplied finding.
- Rank the `confirmed` and `needs_review` results as an action queue for backlog burn-down.
- Do not perform deduplication in this skill. If duplicate-looking inputs are present, keep one result per supplied finding; deduplication belongs in a separate workflow.
- Do not spawn subagents, use a subagent queue, or use deep triage mode until a future implementation explicitly adds those mechanics.

## Finding Schema Decision

Do not use `plugins/codex-security/schemas/findings.schema.json` as the canonical data shape for input normalization.

That schema describes completed Codex Security scan output. It requires generated fields such as `scanId`, `findingId`, `occurrenceId`, fingerprints,
severity, remediation, provenance, and at least one location. Most triage inputs are incomplete external claims, and forcing them into that schema before investigation would require inventing stable IDs, severity, remediation, or locations.

Use the schema only as an optional compatibility source when the user supplies an existing `codex-security.findings` JSON artifact. In that case, extract the available fields into the triage normalization record and preserve the original IDs as source identifiers. The triage result contract is defined in `references/triage-result-contract.md`.

## Static Assessment Guidance

Use the shared static finding assessment reference in `../../references/static-finding-assessment.md` for the reusable evidence work: source/control/sink tracing, smallest useful evidence search,
reachability, boundary inputs, counterevidence, proof gaps, and static confidence.

This skill still owns external finding intake, the backlog triage verdicts,
the first-pass no-runtime constraint, and the output contract.

## Routing and Connector Use

Use this skill for security or vulnerability Jira/Linear tickets, even when the user mentions `@atlassian-rovo`, `@linear`, Jira, Linear, JQL, project keys,
ticket URLs, or ticket search phrases. Treat Atlassian Rovo and Linear mentions as connector hints for importing ticket content, not as a reason to switch to Atlassian Rovo's `triage-issue` skill or another generic ticket workflow.

Do not run duplicate-bug triage instead of security-impact triage. Generic Jira duplicate triage answers "is this already filed?" This skill answers "does this existing security claim affect this repository, and how should it rank for backlog burn-down?"

## Jira and Linear Intake

When the user supplies Jira or Linear issue URLs, Jira issue keys, Linear issue identifiers, JQL, project/search phrases, or asks to import tickets from Jira or Linear, retrieve the issue content before normalizing findings.

For Jira, use the Atlassian Rovo connector when it is available:

- Prefer structured JQL for repeatable imported collections, ordered by a stable field such as key or created date. Use natural-language search only for discovery, such as finding the cloud/site, project, issue family, or first matching key from a vague user phrase; after discovery, switch to JQL or exact issue fetches.
- Jira text search tokenizes punctuation. If a hyphenated marker such as `KAN-PROMPTFOO-SYNTH` returns no rows, broaden to a stable token such as `PROMPTFOO`, then filter returned summaries for the exact family before triage.
- For exact issue URLs or keys, fetch those issues directly when the connector supports it. For a project family or saved query, use JQL and preserve the JQL string as source provenance.

For Linear, fetch exact issue URLs or identifiers directly when provided. When the user supplies a team/project/search phrase instead, use the Linear connector's search results only to identify the intended issue set, then fetch the selected issues before normalizing.

Normalize Jira and Linear vulnerability tickets as `source_type: "scanner_ticket"`
unless the ticket body clearly represents a more specific input type such as `bug_bounty`, `advisory`, `cve`, or `codex_security_finding`. Preserve source-system provenance including issue key or identifier, URL, project, status, labels, components, priority, assignee,
reporter, timestamps, issue type, and the import query in `input_id`,
`normalized_input.references`, and evidence/proof-gap text.

Default to read-only import and triage. Do not add comments, transition issues,
close issues, assign owners, or change labels unless the user explicitly asks for writeback. If the user does ask for writeback, finish the static verdicts first, summarize the planned issue mutations, and keep the mutation step separate from evidence analysis.

For imported Jira or Linear collections, the concise final summary should say how many issues were imported, what query or issue set was used, where the full result was rendered or saved, and whether any Jira or Linear changes were made.
The default source-system status is: no Jira or Linear changes made.

## GitHub Repository Intake

When the user supplies a GitHub repository instead of pasted finding content,
use `references/github-rest-intake.md` before normalizing findings.

Detect GitHub repositories from `owner/repo`, GitHub URLs, GitHub SSH remotes,
the current Codex project's attached GitHub repository, or the current local repository's GitHub remote.

If the user asks to pull from GitHub without typing an `owner/repo` or URL, first infer the GitHub repository from the current Codex project attachment when that metadata is available. Prefer that attached repository over a local path or local git remote. If no Codex project attachment is visible, fall back to the current repository's GitHub remote. Only ask for a repository URL or `owner/repo`
when neither source resolves to a GitHub repository.

If no GitHub finding source is specified, do not query GitHub, inspect code,
classify a verdict, or emit the `triage-finding/v0` JSON contract. Ask the user to choose one of:

- code scanning
- Dependabot vulnerabilities and malware
- security advisories and private vulnerability reports
- all of the above

If the user specifies a source, query only the matching GitHub REST endpoint family from `references/github-rest-intake.md`. If the user chooses all, query the REST sources listed there, but do not include GitHub Issues in all.

Use REST for all GitHub finding retrieval. Do not use the GitHub Connector to fetch findings, even when it is installed or the user explicitly asks for it.
If the user asks to use the GitHub Connector, silently use it only as the first auth-token source when a connector token acquisition path is available, then retrieve finding data directly with REST. If connector token acquisition is not available, fall through to the next REST auth source in the reference.

Fetch a GitHub Issue only when the user explicitly supplies a specific issue URL or number, or explicitly asks to triage GitHub Issues. Normalize explicit issues as `source_type: "freeform"`.

## Missing Input

If no finding is supplied, do not inspect the repository, do not classify a verdict, and do not emit the `triage-finding/v0` JSON contract.

Ask the user to provide a finding to triage. Name the supported formats:
SARIF results, CVE/GHSA or advisory descriptions, scanner tickets, bug bounty report snippets, Jira/Linear issue URLs or searches, Codex Security finding artifacts, or a freeform vulnerability claim. If useful, ask for the repository path or affected file/component at the same time.

## Inputs

Start by extracting:

- repository path or current working repository
- GitHub repository owner/name and selected GitHub REST source, when the input is a GitHub repository intake request
- Jira/Linear source query, issue key or identifier, URL, project, status,
  labels, components, priority, assignee, reporter, timestamps, and issue type when the input is imported from a ticketing system
- input id, scanner id, SARIF rule/result id, CVE/GHSA id, ticket id, or Codex Security `findingId`/`occurrenceId` when present
- title or short claim
- source type: `sarif`, `cve`, `advisory`, `scanner_ticket`,
  `bug_bounty`, `codex_security_finding`, `freeform`, or `unknown`
- vulnerable component, package, API, file, route, class, function, or service
- claimed attacker-controlled source
- claimed sink or broken security control
- affected version, path, configuration, or deployment surface
- required preconditions and claimed impact
- existing code references, evidence, and counterevidence supplied by the user
- GitHub provenance such as alert URL, advisory URL, issue URL, alert number,
  advisory state, package name, manifest path, rule id, and instance locations

Ask a follow-up question only when the repository path or finding claim is too vague to inspect. Otherwise, inspect the repository and preserve missing fields as proof gaps.

## Repository Security Policy Gate

Before static evidence analysis, check whether the target repository has a `SECURITY.md` file at or above the repository root. If present, read the repository's `SECURITY.md` before tracing source, control, sink, reachability,
or assigning a verdict.

Treat `SECURITY.md` as the primary local source for supported security boundaries, trusted inputs, supported versions, disclosure scope, hardening controls, and out-of-scope surfaces. Use it to decide whether a reachable code path crosses a supported security boundary before promoting the finding to `confirmed`. If no `SECURITY.md` is present, record that absence as a proof gap and continue with the next-best local policy evidence.

## Workflow

1. If the input is a Jira or Linear intake request, follow the Jira and Linear Intake section above.
   - Retrieve the source issue content before normalizing findings.
   - Use repeatable structured queries for Jira collections when possible.
   - Preserve ticket provenance and normalize vulnerability tickets into the
     existing source types instead of adding new `source_type` enum values.
   - Do not write back to Jira or Linear unless the user explicitly asks.
2. If the input is a GitHub repository intake request, follow `references/github-rest-intake.md`.
   - If the user did not specify a GitHub finding source, ask for the source and
     stop without emitting triage JSON.
   - If REST auth is unavailable, ask for a supported auth source and stop
     without emitting triage JSON.
   - Normalize retrieved GitHub findings into the existing source types:
     `sarif`, `cve`, `advisory`, or `freeform` for explicit GitHub Issues.
   - Preserve GitHub provenance in `input_id`, `normalized_input.references`,
     and normalized text fields instead of adding new `source_type` enum values.
3. Normalize each supplied or imported finding into a triage item.
   - Assign `triage_item_id` values such as `triage-001`.
   - Preserve external source ids in `input_id`.
   - Do not invent scanner fields, generated Codex Security ids, severity, or
     remediation just to satisfy another schema.
4. Resolve the repository path and git revision when available.
5. Apply the Repository Security Policy Gate.
   - Look for `SECURITY.md` in the target repository before source/control/sink
     tracing or reachability analysis.
   - If present, read it and record the relevant policy basis for supported
     security boundaries, trusted inputs, supported versions, hardening
     controls, and out-of-scope surfaces.
   - If absent, record the absence as a proof gap and continue with available
     product docs, threat models, disclosure policy, and local comments.
   - If `SECURITY.md` and available local product evidence do not establish the
     intended product surface, untrusted input boundary, or trusted
     operator/developer inputs, ask targeted operator-context questions before
     assigning a verdict when the answer would materially affect the result.
6. Follow `../../references/static-finding-assessment.md` to inspect the smallest relevant static evidence set and record source, control, sink,
   reachability, boundary inputs, counterevidence, proof gaps, and static confidence.
7. Classify the product surface and trust boundary.
   - Identify whether the path is a CLI, library API, hosted service, local
     developer UI, MCP/tooling surface, example/demo, test/fixture, docs,
     generated code, vendored code, or unknown surface.
   - Check package manifests, exports, binary entrypoints, deployment files,
     product docs, `SECURITY.md`, disclosure policy, threat models, and nearby
     comments when they are standard or local to the claim.
   - Record whether the claimed source is untrusted input in the intended
     product model, or trusted operator/developer configuration.
8. Trace the claimed source, control, sink, and reachable path.
   - Treat scanner/advisory prose as a claim, not as proof.
   - Start from the cited code, manifest, version range, or supplied evidence.
   - `confirmed` already requires reachability. When claiming reachability,
     record reachability anchor evidence: the concrete caller, entrypoint,
     route, command, package export, deployment path, dependency edge, or other
     repo fact that proves the vulnerable condition is connected to the product
     surface.
   - Record supporting evidence.
   - Record counterevidence that weakens or defeats the claim.
   - Record unresolved proof gaps instead of smoothing them over.
9. Apply the verdict rules.
10. Assign exploitability stack ranks for `confirmed` and `needs_review`
   findings.
11. For `confirmed` findings, add owner hints after verdicting when local ownership evidence is easy to derive.
12. Build one valid `triage-finding/v0` result using the contract in `references/triage-result-contract.md`.
13. If the Codex Security app tool `open_codex_security_triage_results` is
    available, call it with the complete result before the final response so
    the app renders the findings table. After a successful tool call, return a
    concise Markdown summary; do not paste the full JSON block unless the user
    asks for the raw contract.
14. If the app tool is unavailable or rejects the result, fall back to the
    fenced JSON block alongside the concise Markdown summary.

## Surface and Boundary Gate

Before assigning `confirmed`, classify the finding's intended product surface and trust boundary.

Inspect the smallest available evidence for:

- shipped or runtime surfaces, such as package manifests, exports, binary entrypoints, server routes, deploy configs, container/build files, public API docs, or product docs
- non-product or trusted surfaces, such as examples, tests, fixtures, docs snippets, local-only developer tools, generated/vendor code, internal harnesses, CLI configs, plugin/test utilities, or deliberately code-executing extension points
- repository security policy or threat model, such as `SECURITY.md`,
  docs/security, supported-versions docs, disclosure policy, threat models, or comments that define trusted inputs and supported boundaries

A reachable dataflow is not enough. `confirmed` requires both:

1. the vulnerable condition is statically reachable under stated preconditions
2. the source crosses a security boundary that the project appears to support

If the code is reachable only through trusted configuration, local developer interfaces, examples, tests, fixtures, or demo applications, do not mark `confirmed` unless static evidence shows that surface is shipped, deployed,
documented for untrusted users, or bypasses a documented hardening/auth boundary.

When the boundary classification is policy-dependent or unclear, prefer `needs_review` and put the ambiguity in proof gaps.

## Verdict Rules

Use `confirmed` only when static evidence connects the vulnerable condition to reachable code under stated preconditions and the source crosses an intended security boundary for a shipped, deployed, or documented product surface.
Dependency or component presence by itself is not enough.

Use `not_actionable` only when static evidence positively defeats the finding,
for example:

- the vulnerable package or feature is absent
- the locked version is outside the affected range
- the claimed code path is unreachable in the repository's configuration
- the dangerous sink is protected by a relevant guard, validator, sanitizer, or authorization control
- the scanner claim points at dead, test-only, generated, vendored, or unused code and repository evidence supports that conclusion
- the relevant code is example-only, fixture-only, docs-only, local-only, or not shipped in the affected artifact
- the source is trusted configuration, a CLI argument, a local-only tool input,
  or an intentionally code-executing extension point under the repository's documented security model

Use `needs_review` when evidence is insufficient, ambiguous, runtime-only,
policy-dependent, environment-dependent, or blocked by missing repository context. Prefer `needs_review` over speculative `confirmed` or `not_actionable` verdicts.

## Exploitability Stack Ranking

After verdicting, assign discrete exploitability stack ranks separately for `confirmed` and `needs_review` findings.

- `confirmed` findings use the `confirmed` rank queue and rank labels `P0`,
  `P1`, `P2`, etc. `P0` is the most exploitable confirmed finding in this result set.
- `needs_review` findings use the `needs_review` rank queue and the same `P0`, `P1`, `P2`, etc. labels. `P0` is the highest-exploitability unresolved finding to review first.
- Rank labels intentionally do not encode the verdict queue. Use `rank_queue`
  to distinguish confirmed priorities from needs-review priorities.
- `not_actionable` findings are not stack-ranked; set their rank queue and rank to `null`.

Rank by exploitability, not by scanner severity alone. Prioritize findings with clearer attacker reachability, lower required privileges, fewer preconditions,
more direct source-to-sink control, weaker or absent guards, and more reliable static evidence that the exploit path can be exercised. Use claimed impact or scanner severity only as a final tiebreaker when exploitability is otherwise equal.

Keep findings in input order in the JSON result. Use the stack-rank fields to show review/remediation priority instead of reordering the results.

## Owner Hints

For `confirmed` findings only, add a concise owner hint after assigning the verdict and exploitability stack rank when local ownership evidence is easy to derive.

Prefer CODEOWNERS or OWNERS evidence when available. If ownership is not clear,
omit the owner hint rather than guessing. Owner hints are routing metadata only:
do not use ownership to influence verdict, confidence, boundary assessment, or exploitability rank.

The `triage-finding/v0` contract does not define a dedicated owner field. Do not add undocumented fields to the app-tool payload. Put owner-hint text in existing Markdown output, evidence, or recommended-next-step text when it is useful.

## App Surface and Output Contract

The Markdown result should include:

- finding title or input id
- verdict and confidence
- short rationale
- affected locations, if any
- reachable path, if established
- boundary assessment: product surface, source trust level, policy basis, and whether a supported security boundary is crossed
- exploitability stack rank for `confirmed` and `needs_review` findings
- evidence
- counterevidence
- proof gaps
- owner hint for `confirmed` findings, when available
- recommended next step
- `$fix-finding` handoff when verdict is `confirmed`

The app-tool payload or fallback JSON block must include:

- `schema_version: "triage-finding/v0"`
- repository path and revision when available
- one result object per input finding, in input order
- `source_type` on every finding result, using one of the input source types listed above
- `boundary_assessment` on every finding result, even when fields are unknown
- `exploitability_stack_rank` on every finding result

Prefer the app tool over showing raw JSON. The intended default UX is:

1. generate the valid `triage-finding/v0` result internally
2. call `open_codex_security_triage_results` with that result
3. respond with the concise Markdown summary

Use the fenced JSON block only as a fallback when the app tool cannot be used,
or when the user explicitly asks to see or copy the raw result contract.

## Fix-Finding Handoff

For `confirmed` findings, include a concise prompt-ready handoff for `$fix-finding` with:

- vulnerable source, sink, or broken control
- attacker-controlled input and preconditions
- exact code references
- required security invariant
- recommended fix boundary
- proof gaps that `$fix-finding` should preserve or validate

Do not invoke `$fix-finding` unless the user explicitly asks to continue into fixing.

## Hard Rules

- Do not run tests, builds, applications, PoCs, exploit checks, or dynamic validation.
- Do not edit repository files while triaging.
- Do not search for unrelated vulnerabilities.
- Do not claim exhaustive repository coverage.
- Do not claim runtime validation happened.
- Do not use the GitHub Connector for GitHub finding retrieval. It may be used only as the first auth-token source for REST requests when available.
- Do not mutate Jira, Linear, or other backlog sources unless the user explicitly asks for writeback after triage.
- Do not include GitHub Issues in default GitHub intake or in the all-source GitHub intake path.
- Do not mark `confirmed` solely because attacker-influenced data reaches a dangerous sink; first establish the relevant product surface and supported security boundary.
- Do not use deep triage mode unless a future implementation explicitly adds it.
- Do not deduplicate, group, canonicalize, or drop duplicate-looking inputs in this skill; keep one result per supplied finding.
- Do not hide proof gaps or turn missing evidence into confidence.

More from openai/plugins

SkillDescription
accessibility-and-inclusive-visualizationMake data visualizations accessible and inclusive. Use when the user needs chart or diagram accessibility guidance, text alternatives for complex visuals, color and contrast review, keyboard support, reduced-motion behavior for animation or parallax, or an accessibility QA workflow for exported figures, UML-like diagrams, and dashboards.
agent-browserBrowser automation CLI for AI agents. Use when the user needs to interact with websites, verify dev server output, test web apps, navigate pages, fill forms, click buttons, take screenshots, extract data, or automate any browser task. Also triggers when a dev server starts so you can verify it visually.
agent-browser-verifyAutomated browser verification for dev servers. Triggers when a dev server starts to run a visual gut-check with agent-browser — verifies the page loads, checks for console errors, validates key UI elements, and reports pass/fail before continuing.
agents-sdkBuild AI agents on Cloudflare Workers using the Agents SDK. Load when creating stateful agents, durable workflows, real-time WebSocket apps, scheduled tasks, MCP servers, or chat applications. Covers Agent class, state management, callable RPC, Workflows integration, and React hooks. Biases towards retrieval from Cloudflare docs over pre-trained knowledge.
ai-elementsAI Elements component library guidance — pre-built React components for AI interfaces built on shadcn/ui. Use when building chat UIs, message displays, tool call rendering, streaming responses, reasoning panels, or any AI-native interface with the AI SDK.
ai-gatewayVercel AI Gateway expert guidance. Use when configuring model routing, provider failover, cost tracking, or managing multiple AI providers through a unified API.
ai-generation-persistenceAI generation persistence patterns — unique IDs, addressable URLs, database storage, and cost tracking for every LLM generation
ai-sdkVercel AI SDK expert guidance. Use when building AI-powered features — chat interfaces, text generation, structured output, tool calling, agents, MCP integration, streaming, embeddings, reranking, image generation, or working with any LLM provider.
aiq-deploy|
aiq-research|