datadog-metrics
$
npx mdskill add automateyournetwork/netclaw/datadog-metricsQuery Datadog metrics and explore dashboards for monitoring and analysis
- Solve tasks like monitoring infrastructure health and network performance
- Uses Datadog API to access metrics, metadata, and dashboards
- Executes user-specified queries and filters based on patterns or IDs
- Returns structured data including metric results and dashboard definitions
SKILL.md
.github/skills/datadog-metricsView on GitHub ↗
---
name: datadog-metrics
description: "Query metrics and explore dashboards in Datadog."
version: 1.0.0
license: Apache-2.0
author: netclaw
tags: []
---
# Datadog Metrics Skill
Query metrics and explore dashboards in Datadog.
## Tools
| Tool | Description |
|------|-------------|
| `query_metrics` | Execute metric queries with aggregations |
| `list_metrics` | List available metrics matching a pattern |
| `get_metric_metadata` | Get metadata for a specific metric |
| `list_dashboards` | List all available dashboards |
| `get_dashboard` | Get dashboard definition and widgets |
## Example Queries
```
Query CPU utilization for network devices
→ query_metrics(query="avg:system.cpu.user{role:network_device} by {host}", time_range="1h")
Get interface traffic metrics
→ query_metrics(query="sum:network.bytes_rcvd{*} by {interface}", time_range="4h")
List all network-related metrics
→ list_metrics(pattern="network.*")
Get a specific dashboard
→ get_dashboard(dashboard_id="abc-def-123")
```
## Workflows
### Network Performance Monitoring
1. Query interface utilization: `query_metrics(query="avg:network.bytes_sent{*} by {host,interface}")`
2. Check error rates: `query_metrics(query="sum:network.errors{*} by {host}")`
3. Monitor latency percentiles: `query_metrics(query="p99:network.latency{*}")`
4. Compare against SLA thresholds
### Infrastructure Health Assessment
1. List all infrastructure metrics: `list_metrics(pattern="system.*")`
2. Query CPU/memory across fleet: `query_metrics(query="avg:system.cpu.user{*} by {availability_zone}")`
3. Identify outliers and degradation
4. Check dashboard for historical trends
### Capacity Planning
1. Query peak utilization: `query_metrics(query="max:system.disk.used{*} by {host}", time_range="30d")`
2. Calculate growth rate
3. Forecast exhaustion dates
4. Generate capacity report
## Prerequisites
- `DD_API_KEY` Datadog API key
- `DD_APP_KEY` Datadog application key
- `DD_SITE` Datadog site (optional, defaults to datadoghq.com)
## Server
This skill uses the `datadog-mcp` server via remote MCP transport.
More from automateyournetwork/netclaw
- aap-automationRed Hat Ansible Automation Platform — inventory management, job template execution, project SCM sync, ad-hoc commands, host management, Galaxy content discovery. Use when automating infrastructure with Ansible, running playbooks, managing inventories, or searching for Ansible collections and roles.
- aap-edaEvent-Driven Ansible (EDA) — activation lifecycle, rulebook management, decision environments, event stream monitoring. Use when managing event-driven automation triggers, enabling/disabling activations, or reviewing EDA rulebooks.
- aap-lintansible-lint playbook and role validation — syntax checking, best practice enforcement, project-wide analysis, rule filtering. Use when validating Ansible playbooks, checking code quality, or enforcing automation best practices before deployment.
- aci-change-deploySafe ACI policy change deployment - ServiceNow CR lifecycle, pre/post-change fault baselines, APIC policy application, automatic rollback on fault delta, and GAIT audit trail. Use when deploying ACI policy changes, creating tenants or EPGs, pushing config to APIC, or running a change window with rollback protection.
- aci-fabric-auditComprehensive Cisco ACI fabric health audit - node status, tenant/VRF/BD/EPG policy review, contract analysis, fault triage, and endpoint learning verification. Use when auditing ACI fabric health, checking for faults, reviewing tenant policies, or running pre/post-change baselines on APIC.
- arista-cvpArista CloudVision Portal (CVP) automation via REST API — device inventory, events, connectivity monitoring, tag management (4 tools). Use when managing Arista devices, checking CloudVision events, monitoring network connectivity probes, or tagging devices in CVP.
- aruba-cx-configView and manage Aruba CX switch configurations, perform ISSU upgrades, and firmware operations
- aruba-cx-interfacesMonitor Aruba CX switch interface status, LLDP neighbors, and optical transceiver health
- aruba-cx-switchingView and manage Aruba CX switch VLANs and MAC address tables for Layer 2 operations
- aruba-cx-systemDiscover Aruba CX switch system information, firmware versions, and VSF topology