benchmark-sweep
$
npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/benchmark-sweepSystematically scan all known solutions in a domain, catalog their properties, and identify gaps where no solution exists.
SKILL.md
.github/skills/benchmark-sweepView on GitHub ↗
--- name: benchmark-sweep description: Systematically scan all known solutions, identify gaps in coverage and unexplored regions of the solution space. execution: strategy used-by: systematic-enumeration --- # Benchmark Sweep Systematically scan all known solutions in a domain, catalog their properties, and identify gaps where no solution exists. ## State Ledger | Resource | Target | Current | % | |----------|--------|---------|---| | web-search | 30 | 0 | 0% | | web-research | 10 | 0 | 0% | | paper-overview | 30 | 0 | 0% | | paper-search | 20 | 0 | 0% | | paper-research | 8 | 0 | 0% | ## HARD-GATE Cannot exit strategy until ≥80% of each budget line is consumed OR yield targets are met with justification for remaining budget. ## Available Tactics | Tactic | Role | |--------|------| | coverage-analysis | Inventory → crossing → intersection evaluation pipeline | | evaluation-filtering | Score and filter generated gap-filling ideas | ## Available SOPs | SOP | Role | |-----|------| | benchmark-inventory | Catalog all known solutions with performance/applicability/limitations | | method-problem-crossing | Build cross-reference matrix from inventory | | intersection-evaluation | Annotate matrix cells as explored/partial/unexplored | | enumeration-synthesis | Synthesize sweep findings into structured report | ## Execution Guidance 1. **Inventory**: Run benchmark-inventory to catalog all known methods 2. **Structure**: Use method-problem-crossing to organize into matrix form 3. **Evaluate**: Run intersection-evaluation to find gaps 4. **Generate**: For each gap, brainstorm potential solutions 5. **Filter**: Apply evaluation-filtering to rank gap-filling ideas 6. **Synthesize**: Produce final report via enumeration-synthesis
More from yogsoth-ai/de-anthropocentric-research-engine
- abductive-hypothesis-generationStrategy: 面对异常的最佳解释推理
- ablation-brainstormRemove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
- ablation-component-mappingMap system architecture to ablatable units for ablation studies
- ablation-designDesign ablation studies to isolate component contributions in ML systems
- ablation-executionRemove components one by one from a system, record the response/impact of each removal.
- abp-vulnerability-classificationClassify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
- abstraction-extractionExtract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
- abstraction-ladderPerform bisociation at multiple abstraction levels
- abstraction-ladderingMove between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
- abstraction-to-designAbstract biological principle to design principle. Bridge from biology to engineering.