metric-pack-designer
$
npx mdskill add openai/plugins/metric-pack-designerBuild custom evaluation rubrics for plugin-eval analysis.
- Creates local schema-compatible checks and metrics for teams.
- Depends on plugin-eval analyze command and metric-pack manifest.
- Generates deterministic JSON payloads without overwriting core scores.
- Outputs a script printing JSON to stdout for immediate execution.
SKILL.md
.github/skills/metric-pack-designerView on GitHub ↗
--- name: metric-pack-designer description: Design custom metric packs for plugin-eval so teams can add local evaluation rubrics that emit schema-compatible checks and metrics. Use when the user wants their own evaluation criteria or visualizations. --- # Metric Pack Designer Use this skill when the user wants to extend `plugin-eval` with a local rubric. ## Workflow 1. Clarify the custom rubric categories and target kinds. 2. Define the smallest useful `checks[]` and `metrics[]` payload. 3. Create a metric-pack manifest plus a script that prints JSON to stdout. 4. Run the pack through `plugin-eval analyze <path> --metric-pack <manifest.json>`. ## Design Rules - Keep IDs stable across runs so comparisons stay meaningful. - Emit only `checks[]`, `metrics[]`, and optional `artifacts[]`. - Do not try to overwrite the core score or summary. - Prefer deterministic local signals over subjective text generation. ## Reference - `../../references/metric-pack-manifest.md`