evidence-tournament

Name: evidence-tournament
Author: yogsoth-ai/de-anthropocentric-research-engine

$ npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/evidence-tournament

Gathers, challenges, and scores evidence for structured debate

Solves the problem of verifying claims with reliable and relevant evidence
Uses subagents like evidence-scout, debate-critic, and debate-judge
Evaluates evidence based on relevance, reliability, recency, and specificity
Produces a ranked bracket of evidence quality for decision-making

SKILL.md

.github/skills/evidence-tournament

---
name: evidence-tournament
description: "Tactic: Evidence gathering, cross-examination, and quality judgment. External evidence is collected, presented, challenged, and scored for relevance and reliability."
type: tactic
used-by: [multiagent-debate]
strategies: [courtroom-structured, critic-defender-judge]
---

# Evidence Tournament Tactic

Structured evidence competition — gather, present, cross-examine, and judge evidence quality.

## Orchestration

1. **evidence-scout** searches for evidence supporting the artifact's claims
2. **evidence-scout** searches for evidence opposing the artifact's claims
3. **debate-critic** presents opposing evidence as structured arguments
4. **debate-defender** presents supporting evidence as counter-arguments
5. **cross-examination** probes both evidence sets:
   - Source reliability assessment
   - Relevance to specific claims
   - Recency and applicability
   - Potential confounds or alternative interpretations
6. **debate-judge** scores each piece of evidence and produces the tournament bracket result
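
The six steps above can be read as a simple sequential pipeline. The following is a minimal Python sketch under stated assumptions, not the engine's actual dispatch code: the `Evidence` record, the `run_tournament` function, and the callable signatures for each subagent are all hypothetical names introduced for illustration. Each subagent is modeled as a plain function so the ordering of the steps is easy to see.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Evidence:
    # Hypothetical record; the tactic does not define an evidence schema.
    claim: str
    source: str
    stance: str  # "pro" or "con"
    notes: List[str] = field(default_factory=list)  # cross-examination findings


def run_tournament(
    claims: List[str],
    scout: Callable[[List[str], str], List[Evidence]],
    critic: Callable[[List[Evidence]], List[Evidence]],
    defender: Callable[[List[Evidence]], List[Evidence]],
    cross_examine: Callable[[List[Evidence]], List[Evidence]],
    judge: Callable[[List[Evidence]], List[Evidence]],
) -> List[Evidence]:
    """Run the six orchestration steps in order and return the judge's output."""
    pro = scout(claims, "pro")          # 1. evidence supporting the claims
    con = scout(claims, "con")          # 2. evidence opposing the claims
    opposing = critic(con)              # 3. critic structures the opposing case
    supporting = defender(pro)          # 4. defender structures the counter-case
    examined = cross_examine(supporting + opposing)  # 5. probe both sets
    return judge(examined)              # 6. score and produce the result


# Stub subagents, used only to show that the pipeline wires together.
def stub_scout(claims: List[str], stance: str) -> List[Evidence]:
    return [Evidence(claim=c, source=f"{stance}-source", stance=stance) for c in claims]


def passthrough(items: List[Evidence]) -> List[Evidence]:
    return items


result = run_tournament(
    ["claim A"], stub_scout, passthrough, passthrough, passthrough, passthrough
)
print([e.stance for e in result])  # ['pro', 'con']
```

In a real run each stub would be replaced by a call to the corresponding subagent; how those calls are made is outside what this tactic specifies.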

## Evidence Quality Criteria

- **Relevance**: Direct bearing on the claim (0.0–1.0)
- **Reliability**: Source credibility and methodology (0.0–1.0)
- **Recency**: Temporal applicability (0.0–1.0)
- **Specificity**: How precisely it addresses the claim (0.0–1.0)
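
One way to represent the four criteria is a small validated record with a combined score. This is a sketch under assumptions: the tactic defines only the four 0.0–1.0 scales, so the `EvidenceScore` class, the unweighted mean in `overall`, and the `rank` helper are hypothetical choices, and a flat ranking stands in here for the bracket, whose format the tactic does not define.

```python
from dataclasses import dataclass
from typing import Dict, List

CRITERIA = ("relevance", "reliability", "recency", "specificity")


@dataclass(frozen=True)
class EvidenceScore:
    relevance: float
    reliability: float
    recency: float
    specificity: float

    def __post_init__(self) -> None:
        # Each criterion is defined on the closed interval 0.0 to 1.0.
        for name in CRITERIA:
            value = getattr(self, name)
            if not 0.0 <= value <= 1.0:
                raise ValueError(f"{name} must be in [0.0, 1.0], got {value}")

    def overall(self) -> float:
        # ASSUMPTION: unweighted mean. The tactic does not say how the
        # four criteria are combined, or whether they are combined at all.
        return sum(getattr(self, name) for name in CRITERIA) / len(CRITERIA)


def rank(scored: Dict[str, EvidenceScore]) -> List[str]:
    """Return evidence ids ordered from highest to lowest overall score."""
    return sorted(scored, key=lambda key: scored[key].overall(), reverse=True)


scores = {
    "survey-2019": EvidenceScore(0.5, 0.5, 0.25, 0.75),
    "trial-2023": EvidenceScore(1.0, 0.75, 1.0, 0.75),
}
print(rank(scores))  # ['trial-2023', 'survey-2019']
```

If some criteria should count for more than others, `overall` is the place to introduce weights; the evidence ids and score values above are made up for the example.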

## Subagents Dispatched

- evidence-scout × 2 (pro and con evidence gathering)
- debate-critic (opposing evidence presentation)
- debate-defender (supporting evidence presentation)
- cross-examination (evidence probing)
- debate-judge (evidence scoring and verdict)

## Termination Conditions

- All claims have been evidenced and cross-examined
- Evidence search budget exhausted (2/5/10 searches)
- No further relevant evidence discoverable
- Judge has scored all evidence pairs
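
The conditions above can be checked against a small state record. The sketch below is illustrative only: `TournamentState`, its field names, and `met_conditions` are hypothetical, and "no further relevant evidence discoverable" is approximated by whether the most recent search returned anything new. The tactic does not say whether one condition is enough to stop or whether several must hold, so the function only reports which conditions are met and leaves that decision to the caller.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TournamentState:
    # Hypothetical bookkeeping fields; not defined by the tactic itself.
    claims_total: int
    claims_examined: int         # claims both evidenced and cross-examined
    searches_used: int
    search_budget: int           # one of the listed budgets: 2, 5, or 10
    last_search_found_new: bool  # ASSUMPTION: proxy for "nothing more to find"
    pairs_total: int
    pairs_scored: int


def met_conditions(state: TournamentState) -> List[str]:
    """Return the termination conditions that currently hold."""
    met = []
    if state.claims_examined >= state.claims_total:
        met.append("all claims evidenced and cross-examined")
    if state.searches_used >= state.search_budget:
        met.append("search budget exhausted")
    if not state.last_search_found_new:
        met.append("no further relevant evidence discoverable")
    if state.pairs_scored >= state.pairs_total:
        met.append("all evidence pairs scored")
    return met


state = TournamentState(
    claims_total=3,
    claims_examined=1,
    searches_used=5,
    search_budget=5,
    last_search_found_new=True,
    pairs_total=4,
    pairs_scored=2,
)
print(met_conditions(state))  # ['search budget exhausted']
```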

More from yogsoth-ai/de-anthropocentric-research-engine

Skill	Description
abductive-hypothesis-generation	Strategy: Inference to the best explanation when facing anomalies
ablation-brainstorm	Remove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
ablation-component-mapping	Map system architecture to ablatable units for ablation studies
ablation-design	Design ablation studies to isolate component contributions in ML systems
ablation-execution	Remove components one by one from a system, record the response/impact of each removal.
abp-vulnerability-classification	Classify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
abstraction-extraction	Extract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
abstraction-ladder	Perform bisociation at multiple abstraction levels
abstraction-laddering	Move between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
abstraction-to-design	Abstract biological principle to design principle. Bridge from biology to engineering.