headroom-estimation

Name: headroom-estimation
Author: yogsoth-ai/de-anthropocentric-research-engine

$ npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/headroom-estimation

Estimate performance headroom by comparing current SOTA to theoretical and practical ceilings

Quantifies remaining improvement potential for a given task
Uses task-specific metrics, human performance, and theoretical bounds
Analyzes gaps between state-of-the-art and performance limits
Returns structured JSON with confidence levels and assumptions

SKILL.md

.github/skills/headroom-estimation

---
name: headroom-estimation
description: Estimate theoretical/practical ceiling vs current SOTA gap
execution: subagent
prompt: ./prompt.md
input: task_name, current_sota, human_performance, theoretical_bounds
used-by: baseline-establishment
---

# Headroom Estimation


## Purpose

Quantify the remaining improvement potential for a task by estimating its performance ceilings and computing the gap between the current SOTA and each ceiling. The skill distinguishes between theoretical limits (information-theoretic), practical limits (within the current paradigm), and human performance baselines.
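As a minimal sketch of the gap computation described above, the snippet below assumes that headroom is the plain difference between a ceiling and the SOTA score, and that the sign flips for lower-is-better metrics. The `headroom` helper, the `higher_is_better` flag, and all numbers are hypothetical illustrations, not part of the skill itself.

```python
from typing import Dict, Optional


def headroom(sota: float, ceiling: Optional[float],
             higher_is_better: bool = True) -> Optional[float]:
    """Gap between a ceiling and the current SOTA; None if the ceiling is unknown.

    Assumption: headroom is an absolute difference on the metric's own scale.
    """
    if ceiling is None:
        return None
    gap = ceiling - sota if higher_is_better else sota - ceiling
    # Round to suppress floating-point noise in the reported gap.
    return round(gap, 6)


# Illustrative numbers only; they do not come from any real benchmark.
sota_score = 0.90
ceilings: Dict[str, Optional[float]] = {
    "theoretical": 1.00,
    "human": 0.95,
    "practical": None,  # unknown ceilings stay null in the output
}

gaps = {f"vs_{name}": headroom(sota_score, value)
        for name, value in ceilings.items()}
print(gaps)
```

Under this definition, a negative value would mean the SOTA already exceeds that ceiling, for example a model that scores above the human baseline.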

## Input Schema

| Field | Type | Description |
|-------|------|-------------|
| task_name | string | The target task |
| current_sota | object | {method, score, metric, dataset, date} |
| human_performance | object | {score, conditions, source} or null |
| theoretical_bounds | object | {bound_type, value, derivation} or null |
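
The table above could be checked with a small validator before the subagent runs. This is a sketch under assumptions: the `validate_input` function is hypothetical, it treats every key listed in braces as required, and the example payload contains placeholder values only.

```python
from typing import Any, Dict, List

# Key sets copied from the Input Schema table.
SOTA_KEYS = {"method", "score", "metric", "dataset", "date"}
NULLABLE_OBJECTS = {
    "human_performance": {"score", "conditions", "source"},
    "theoretical_bounds": {"bound_type", "value", "derivation"},
}


def validate_input(payload: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the payload fits the table."""
    problems: List[str] = []
    if not isinstance(payload.get("task_name"), str):
        problems.append("task_name must be a string")

    sota = payload.get("current_sota")
    if not isinstance(sota, dict):
        problems.append("current_sota must be an object")
    else:
        missing = SOTA_KEYS - sota.keys()
        if missing:
            problems.append(f"current_sota missing keys: {sorted(missing)}")

    # These two fields may be null (None), so only check them when present.
    for field, keys in NULLABLE_OBJECTS.items():
        value = payload.get(field)
        if value is None:
            continue
        if not isinstance(value, dict):
            problems.append(f"{field} must be an object or null")
        elif keys - value.keys():
            problems.append(f"{field} missing keys: {sorted(keys - value.keys())}")
    return problems


# Placeholder payload for illustration; none of these values are real results.
example_input = {
    "task_name": "example-task",
    "current_sota": {"method": "example-method", "score": 0.90,
                     "metric": "accuracy", "dataset": "example-dataset",
                     "date": "2024-01-01"},
    "human_performance": {"score": 0.95, "conditions": "untimed",
                          "source": "placeholder"},
    "theoretical_bounds": None,
}
print(validate_input(example_input))
```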

## Output Schema

```json
{
  "task": "string",
  "dataset": "string",
  "metric": "string",
  "ceilings": {
    "theoretical": {
      "value": null,
      "type": "information_theoretic|bayes_optimal|combinatorial",
      "derivation": "string",
      "confidence": "high|medium|low|speculative"
    },
    "human": {
      "value": null,
      "conditions": "string",
      "source": "string",
      "is_expert": true,
      "confidence": "high|medium|low"
    },
    "practical": {
      "value": null,
      "assumptions": "string",
      "based_on": "string",
      "confidence": "medium|low|speculative"
    }
  },
  "headroom": {
    "vs_theoretical": null,
    "vs_human": null,
    "vs_practical": null,
    "interpretation": "string"
  },
  "saturation_assessment": {
    "status": "saturating|active_progress|early_stage|unknown",
    "evidence": "string",
    "years_to_human_parity": null
  }
}
```
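
The pipe-separated strings in the schema read as lists of allowed values. Assuming that interpretation, a returned object could be checked as in the sketch below; the `enum_violations` helper and the filled-in example are hypothetical, and the example values are placeholders rather than real estimates.

```python
from typing import Any, Dict, List, Set, Tuple

# Allowed values copied from the pipe-separated strings in the Output Schema.
ALLOWED: Dict[Tuple[str, ...], Set[str]] = {
    ("ceilings", "theoretical", "type"):
        {"information_theoretic", "bayes_optimal", "combinatorial"},
    ("ceilings", "theoretical", "confidence"):
        {"high", "medium", "low", "speculative"},
    ("ceilings", "human", "confidence"): {"high", "medium", "low"},
    ("ceilings", "practical", "confidence"): {"medium", "low", "speculative"},
    ("saturation_assessment", "status"):
        {"saturating", "active_progress", "early_stage", "unknown"},
}


def enum_violations(result: Dict[str, Any]) -> List[str]:
    """List every enum field whose value is missing or not in the allowed set."""
    violations: List[str] = []
    for path, allowed in ALLOWED.items():
        node: Any = result
        for key in path:
            node = node.get(key) if isinstance(node, dict) else None
        if not isinstance(node, str) or node not in allowed:
            violations.append(f"{'.'.join(path)}: {node!r}")
    return violations


# Placeholder result for illustration only.
example_result = {
    "task": "example-task", "dataset": "example-dataset", "metric": "accuracy",
    "ceilings": {
        "theoretical": {"value": 1.0, "type": "bayes_optimal",
                        "derivation": "placeholder", "confidence": "low"},
        "human": {"value": 0.95, "conditions": "untimed", "source": "placeholder",
                  "is_expert": True, "confidence": "medium"},
        "practical": {"value": None, "assumptions": "placeholder",
                      "based_on": "placeholder", "confidence": "speculative"},
    },
    "headroom": {"vs_theoretical": 0.1, "vs_human": 0.05, "vs_practical": None,
                 "interpretation": "placeholder"},
    "saturation_assessment": {"status": "active_progress",
                              "evidence": "placeholder",
                              "years_to_human_parity": None},
}
print(enum_violations(example_result))
```

Note that the schema as written does not allow `high` confidence for the practical ceiling, so a check like this would flag it.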
