scaling-design
$
npx mdskill add yogsoth-ai/de-anthropocentric-research-engine/scaling-design**Question**: How does performance scale with resources?
SKILL.md
.github/skills/scaling-designView on GitHub ↗
--- name: scaling-design description: "Design scaling experiments to characterize performance-resource relationships" version: 1.0.0 category: experiment-execution type: strategy used-by: experiment-design sops: - factor-identification - level-specification - metric-specification - sample-size-estimation - design-matrix-construction tactics: - statistical-method-selection - budget-constrained-design --- # Strategy: Scaling Design **Question**: How does performance scale with resources? ## Methodology - **Neural Scaling Laws** (Kaplan 2020, Hoffmann 2022): Power-law relationships between compute/data/parameters and loss. - **Compute-Optimal Scaling** (Chinchilla): Find optimal allocation between model size and data. - **Data Scaling**: Characterize learning curves as function of dataset size. - **Model Scaling**: Performance vs. parameter count at fixed data. - **Inference Scaling**: Throughput/latency vs. batch size, sequence length, model size. ## Execution Flow 1. **factor-identification** → Identify scaling axes (data, compute, parameters, time) 2. **level-specification** → Define scale points (geometric progression, typically 4-8 points) 3. **metric-specification** → Define metrics at each scale (loss, downstream task, efficiency) 4. **design-matrix-construction** → Build scaling experiment grid 5. **sample-size-estimation** → Determine replicates needed for reliable curve fitting 6. **budget-constrained-design** (tactic) → Optimize which scale points to run given budget ## Budget Gate | Scaling Type | Scale Points | Replicates | Min Runs | Typical Cost | |-------------|-------------|------------|----------|--------------| | Data scaling | 4-6 | 3 | 12-18 | Low (same model, subset data) | | Model scaling | 4-8 | 2-3 | 8-24 | High (different model sizes) | | Compute-optimal | 6-10 per iso-FLOP | 1-2 | 12-20 | Very high | | Inference scaling | 5-10 | 5 | 25-50 | Low (inference only) |
More from yogsoth-ai/de-anthropocentric-research-engine
- abductive-hypothesis-generationStrategy: 面对异常的最佳解释推理
- ablation-brainstormRemove components one by one, observe system changes to reveal hidden dependencies and generate ideas from structural gaps.
- ablation-component-mappingMap system architecture to ablatable units for ablation studies
- ablation-designDesign ablation studies to isolate component contributions in ML systems
- ablation-executionRemove components one by one from a system, record the response/impact of each removal.
- abp-vulnerability-classificationClassify assumptions on 2 axes — load-bearing (how much conclusion depends on it) × vulnerable (how likely to be false). Focuses attention on High-Load × High-Vulnerable quadrant.
- abstraction-extractionExtract abstract principles from concrete domain cases. Strips domain-specific details to reveal transferable mechanisms.
- abstraction-ladderPerform bisociation at multiple abstraction levels
- abstraction-ladderingMove between concrete and abstract framings — 3 levels up (Why?) and 3 levels down (How?) to find the most productive research level.
- abstraction-to-designAbstract biological principle to design principle. Bridge from biology to engineering.