ladder-quality-order

Name: ladder-quality-order
Author: yogsoth-ai/de-anthropocentric-research-engine

$npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/ladder-quality-order

You receive: the n samples under one topic, each with (research_graph, research_result), plus their intended_order (the id order from the interpolator: id0 should be best -> idN-1 should be worst).

SKILL.md

.github/skills/ladder-quality-orderView on GitHub ↗

---
name: ladder-quality-order
description: loss-2 judge - pairwise quality comparison across the n rungs within one topic; decide monotonicity and endpoint separation. check-blind, D1-D5 only.
---

# ladder-quality-order (loss-2)

You receive: the n samples under one topic, each with (research_graph, research_result),
plus their intended_order (the id order from the interpolator: id0 should be best ->
idN-1 should be worst).

## Task (pairwise ranking, no absolute scores)
1. Enumerate all i<j pairs; for each pair ask: **in the D1-D5 sense, which research design
   is more substantive?** (D1 more meaningful / D2 more skill-research value / D3 more
   usable to DARE / D4 better respects the 4 layers / D5 firmer prerequisites). Output
   winner + a one-line reason.
2. Aggregate into an induced order; compute Kendall tau against intended_order.
3. Endpoints: directly compare id0 vs idN-1; across K repeats, check whether id0 wins stably.

## Output (JSON)
{"tau": float, "monotonicity_pass": bool,   // tau>=0.7 and no endpoint inversion
 "endpoint_separation_pass": bool,          // id0 wins >= K-allowance of K repeats
 "rigor_floor_flag": bool,                  // if id0 ~ idN-1 endpoints collapse (feed risk register)
 "pairwise_log": [{i,j,winner,reason}]}

## check-blind contract (hard constraint)
- The judge prompt may use **only** D1-D5 wording.
- **Forbidden**: 32-check vocabulary, 6-primitive, "pseudo-good/novel-good" categories,
  any detection signature.
- z-perp-C: on the B1 confound triplet (same substance, different framing) your order
  **must stay invariant**; if it varies with framing -> you were dragged by the confound,
  tighten back to D1-D5 substance.

More from yogsoth-ai/de-anthropocentric-research-engine

Skill	Description
abductive-hypothesis-generation	Strategy: 面对异常的最佳解释推理
ablation-brainstorm	Remove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
ablation-component-mapping	Map system architecture to ablatable units for ablation studies
ablation-design	Design ablation studies to isolate component contributions in ML systems
ablation-execution	Remove components one by one from a system, record the response/impact of each removal.
abp-vulnerability-classification	Classify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
abstraction-extraction	Extract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
abstraction-ladder	Perform bisociation at multiple abstraction levels
abstraction-laddering	Move between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
abstraction-to-design	Abstract biological principle to design principle. Bridge from biology to engineering.