statistical-method-selection

Name: statistical-method-selection
Author: yogsoth-ai/de-anthropocentric-research-engine

$npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/statistical-method-selection

Selects appropriate statistical methods for analyzing experiments

Solves the problem of choosing the right statistical test for experimental data
Leverages metric specification and sample size estimation tools
Uses data characteristics and decision criteria to determine the best method
Returns a pre-registered analysis plan with recommended statistical methods

SKILL.md

.github/skills/statistical-method-selectionView on GitHub ↗

---
name: statistical-method-selection
description: "Select appropriate statistical methods for experiment analysis"
version: 1.0.0
category: experiment-execution
type: tactic
used-by: experiment-design
orchestrates:
  - metric-specification
  - sample-size-estimation
---

# Tactic: Statistical Method Selection

## Orchestration Pattern

1. **Assess Data Characteristics** → Determine distribution type, sample size, pairing structure
2. **metric-specification** → Ensure metrics are well-defined and measurable
3. **Select Test Family** → Choose between parametric, non-parametric, or Bayesian
4. **sample-size-estimation** → Power analysis for the selected test
5. **Define Analysis Pipeline** → Pre-register the complete analysis plan

## Decision Criteria

| Condition | Recommended Method |
|-----------|-------------------|
| Normal data, 2 groups, paired | Paired t-test |
| Normal data, 2 groups, unpaired | Welch's t-test |
| Normal data, 3+ groups | ANOVA + post-hoc (Tukey HSD) |
| Non-normal, 2 groups | Wilcoxon signed-rank / Mann-Whitney U |
| Non-normal, 3+ groups | Kruskal-Wallis + Dunn's test |
| Multiple datasets, multiple methods | Friedman + Nemenyi / critical difference |
| Want probability of superiority | Bayesian comparison (Benavoli 2017) |
| Small sample, no distributional assumptions | Permutation test |
| Variance estimation needed | Bootstrap confidence intervals |
| Multiple comparisons | Apply Holm-Bonferroni correction |

## Quality Checks

- Is the test appropriate for the data type (continuous, ordinal, categorical)?
- Are independence assumptions met? (If not, use paired/repeated-measures variants)
- Is the sample size sufficient for the chosen test's power requirements?
- Are multiple comparison corrections applied when testing multiple hypotheses?
- Is the effect size reported alongside p-values?
- Is the significance threshold pre-registered (not chosen post-hoc)?

More from yogsoth-ai/de-anthropocentric-research-engine