statistical-method-selection
$
npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/statistical-method-selectionSelects appropriate statistical methods for analyzing experiments
- Solves the problem of choosing the right statistical test for experimental data
- Leverages metric specification and sample size estimation tools
- Uses data characteristics and decision criteria to determine the best method
- Returns a pre-registered analysis plan with recommended statistical methods
SKILL.md
.github/skills/statistical-method-selectionView on GitHub ↗
--- name: statistical-method-selection description: "Select appropriate statistical methods for experiment analysis" version: 1.0.0 category: experiment-execution type: tactic used-by: experiment-design orchestrates: - metric-specification - sample-size-estimation --- # Tactic: Statistical Method Selection ## Orchestration Pattern 1. **Assess Data Characteristics** → Determine distribution type, sample size, pairing structure 2. **metric-specification** → Ensure metrics are well-defined and measurable 3. **Select Test Family** → Choose between parametric, non-parametric, or Bayesian 4. **sample-size-estimation** → Power analysis for the selected test 5. **Define Analysis Pipeline** → Pre-register the complete analysis plan ## Decision Criteria | Condition | Recommended Method | |-----------|-------------------| | Normal data, 2 groups, paired | Paired t-test | | Normal data, 2 groups, unpaired | Welch's t-test | | Normal data, 3+ groups | ANOVA + post-hoc (Tukey HSD) | | Non-normal, 2 groups | Wilcoxon signed-rank / Mann-Whitney U | | Non-normal, 3+ groups | Kruskal-Wallis + Dunn's test | | Multiple datasets, multiple methods | Friedman + Nemenyi / critical difference | | Want probability of superiority | Bayesian comparison (Benavoli 2017) | | Small sample, no distributional assumptions | Permutation test | | Variance estimation needed | Bootstrap confidence intervals | | Multiple comparisons | Apply Holm-Bonferroni correction | ## Quality Checks - Is the test appropriate for the data type (continuous, ordinal, categorical)? - Are independence assumptions met? (If not, use paired/repeated-measures variants) - Is the sample size sufficient for the chosen test's power requirements? - Are multiple comparison corrections applied when testing multiple hypotheses? - Is the effect size reported alongside p-values? - Is the significance threshold pre-registered (not chosen post-hoc)?
More from yogsoth-ai/de-anthropocentric-research-engine
- abductive-hypothesis-generationStrategy: 面对异常的最佳解释推理
- ablation-brainstormRemove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
- ablation-component-mappingMap system architecture to ablatable units for ablation studies
- ablation-designDesign ablation studies to isolate component contributions in ML systems
- ablation-executionRemove components one by one from a system, record the response/impact of each removal.
- abp-vulnerability-classificationClassify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
- abstraction-extractionExtract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
- abstraction-ladderPerform bisociation at multiple abstraction levels
- abstraction-ladderingMove between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
- abstraction-to-designAbstract biological principle to design principle. Bridge from biology to engineering.