robustness-testing
$
npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/robustness-testingTest whether conclusions survive across different modeling choices.
SKILL.md
.github/skills/robustness-testingView on GitHub ↗
--- name: robustness-testing description: Test conclusion robustness via multi-model convergence — enumerate assumptions, generate alternatives, compare results, flag fragile conclusions. used-by: boundary-analysis --- # Robustness Testing Test whether conclusions survive across different modeling choices. ## Budget | Base SOP | Target | ±10% Range | |----------|--------|------------| | web-search | 20 | 18–22 | | web-research | 10 | 9–11 | | paper-overview | 30 | 27–33 | | paper-search | 25 | 22–28 | | paper-research | 15 | 13–17 | ## State Ledger ``` <HARD-GATE> | SOP | Done | Target | % | |-----|------|--------|---| | web-search | ? | 20 | ? | | web-research | ? | 10 | ? | | paper-overview | ? | 30 | ? | | paper-search | ? | 25 | ? | | paper-research | ? | 15 | ? | Budget Gate: OPEN/CLOSED (>=80% required to exit) </HARD-GATE> ``` ## Available Tactics - multi-model-convergence ## Available SOPs **Import:** web-search, web-research, paper-overview, paper-search, paper-research **Subagent:** assumption-enumeration, alternative-model-generation, convergence-assessment, fragility-flagging ## Execution Guidance Enumerate modeling assumptions, generate alternative models by relaxing each, compare results across alternatives, flag results that depend on specific assumptions (fragile).
More from yogsoth-ai/de-anthropocentric-research-engine
- abductive-hypothesis-generationStrategy: 面对异常的最佳解释推理
- ablation-brainstormRemove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
- ablation-component-mappingMap system architecture to ablatable units for ablation studies
- ablation-designDesign ablation studies to isolate component contributions in ML systems
- ablation-executionRemove components one by one from a system, record the response/impact of each removal.
- abp-vulnerability-classificationClassify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
- abstraction-extractionExtract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
- abstraction-ladderPerform bisociation at multiple abstraction levels
- abstraction-ladderingMove between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
- abstraction-to-designAbstract biological principle to design principle. Bridge from biology to engineering.