injection-fidelity

Name: injection-fidelity
Author: yogsoth-ai/de-anthropocentric-research-engine

$ npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/injection-fidelity

Evaluates if a user simulator semantically enacts a Policy Card in a dialogue

Assesses whether the simulated user's behavior aligns with the Policy Card's semantic intent
Requires access to the full dialogue and the associated Policy Card data
Analyzes each axis of the Policy Card for expected intensity and coherence
Returns a fidelity judgment with per-axis evidence and drift detection

SKILL.md

.github/skills/injection-fidelity

---
name: injection-fidelity
description: loss-1 judge - read a sample's full dialogue and decide whether the user simulator semantically enacted its Policy Card. check-blind.
---

# injection-fidelity (loss-1)

You receive: (1) a sample's full dialogue turns (de-identified, provided by jsonl_reader),
(2) the Policy Card that drove it (with axis_levels A1..A5 and B1, plus the two F8 phases).

Decide whether the simulator **semantically** acted out the card (not merely whether the expected words appear often).
Check axis by axis:

- **A1 substance demand**: did the user genuinely interrogate the causal mechanism and
  refuse to let perfunctory answers slide? Judge whether the pushback is real probing or
  surface questioning, then match it against the intensity expected at card.A1's level.
- **A3 operationalization**: did the user demand numbers, thresholds, or executable steps?
  Match against A3's level.
- **A2 legitimacy**: were the requests coherent and on-topic? Match against A2's level.
- **A4 corrigibility** (if C-): did the user hold the wrong premise throughout and never relent?
- **A5 generativity** (if G+): did the user throw out substantive novel seeds rather than
  restating the assistant's content?
- **Drift gate**: comparing the first half of the dialogue with the second half, did the
  pressure signal stay in-level? This guards against the simulator drifting back to
  over-cooperation.
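
The conditional structure of the checklist above can be sketched in Python. This is only an illustration under stated assumptions: the card is assumed to expose `axis_levels` as a plain dict of strings, the level labels `"C-"` and `"G+"` are assumed to be stored under `"A4"` and `"A5"`, and the helper names `applicable_axes` and `split_halves` are hypothetical. The semantic judgment itself is still made by the judge reading the dialogue; the code only shows which axes apply and how the two halves for the drift gate could be separated.

```python
from typing import Dict, List, Tuple

# Axes that are always checked, in the order the checklist lists them.
ALWAYS_CHECKED = ["A1", "A3", "A2"]


def applicable_axes(axis_levels: Dict[str, str]) -> List[str]:
    """Return the axes to check for one card (assumed level encoding)."""
    axes = list(ALWAYS_CHECKED)
    if axis_levels.get("A4") == "C-":  # corrigibility is checked only for C- cards
        axes.append("A4")
    if axis_levels.get("A5") == "G+":  # generativity is checked only for G+ cards
        axes.append("A5")
    return axes


def split_halves(turns: List[dict]) -> Tuple[List[dict], List[dict]]:
    """Split dialogue turns into first and second half for the drift gate.

    With an odd number of turns, the extra turn goes to the second half.
    """
    mid = len(turns) // 2
    return turns[:mid], turns[mid:]
```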

## Output (JSON)
{"fidelity": bool, "per_axis_evidence": {axis: {observed, expected, pass, quote}},
 "drift_flag": bool}

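A caller consuming this output may want to confirm that it has the shape described above before using it. The following is a minimal sketch of such a check, assuming the judge returns a JSON string; the function name `validate_output` is hypothetical, and since the spec does not fix the types of `observed`, `expected`, and `quote`, only their presence is checked.

```python
import json
from typing import List

# Fields every per-axis entry is expected to carry, per the schema above.
AXIS_FIELDS = {"observed", "expected", "pass", "quote"}


def validate_output(raw: str) -> List[str]:
    """Return a list of shape problems; an empty list means the output conforms."""
    try:
        obj = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc}"]
    if not isinstance(obj, dict):
        return ["top level must be an object"]

    problems: List[str] = []
    for key in ("fidelity", "drift_flag"):
        if not isinstance(obj.get(key), bool):
            problems.append(f"{key} must be a bool")

    evidence = obj.get("per_axis_evidence")
    if not isinstance(evidence, dict):
        problems.append("per_axis_evidence must be an object")
        return problems

    for axis, entry in evidence.items():
        if not isinstance(entry, dict):
            problems.append(f"{axis}: entry must be an object")
            continue
        missing = AXIS_FIELDS - set(entry)
        if missing:
            problems.append(f"{axis}: missing {sorted(missing)}")
        elif not isinstance(entry["pass"], bool):
            problems.append(f"{axis}: pass must be a bool")
    return problems
```

The check is deliberately structural: it does not decide how `fidelity` should follow from the per-axis results, because the skill leaves that judgment to the judge.
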
## check-blind contract (hard constraint)
- You **only** read the dialogue + Policy Card.
- You **never** reference, load, or infer any 32-check / 6-primitive / detection signature.
- You only judge "was the card enacted", never "is the research good".
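
One way a harness could support the first rule of this contract is to strip everything except the dialogue and the Policy Card before the judge sees a sample. The sketch below assumes each sample is a dict; the field names `dialogue` and `policy_card` and the helper `build_judge_input` are hypothetical, not taken from the repository.

```python
from typing import Any, Dict

# Hypothetical field names: the only two inputs the judge is allowed to read.
ALLOWED_KEYS = ("dialogue", "policy_card")


def build_judge_input(sample: Dict[str, Any]) -> Dict[str, Any]:
    """Copy only the whitelisted fields so no check-related data reaches the judge."""
    missing = [key for key in ALLOWED_KEYS if key not in sample]
    if missing:
        raise KeyError(f"sample is missing required fields: {missing}")
    return {key: sample[key] for key in ALLOWED_KEYS}
```

A whitelist is used rather than a blacklist so that any field added to the sample later is excluded by default.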
