defuddle

$npx mdskill add kepano/obsidian-skills/defuddle

Extract clean markdown from web pages to save tokens.

  • Removes navigation, ads, and clutter from standard web pages.
  • Depends on the Defuddle CLI tool for parsing operations.
  • Selects output format based on command-line flags provided.
  • Delivers formatted text, JSON, or HTML directly to the agent.
SKILL.md
.github/skills/defuddleView on GitHub ↗
---
name: defuddle
description: Extract clean markdown content from web pages using Defuddle CLI, removing clutter and navigation to save tokens. Use instead of WebFetch when the user provides a URL to read or analyze, for online documentation, articles, blog posts, or any standard web page. Do NOT use for URLs ending in .md — those are already markdown, use WebFetch directly.
---

# Defuddle

Use Defuddle CLI to extract clean readable content from web pages. Prefer over WebFetch for standard web pages — it removes navigation, ads, and clutter, reducing token usage.

If not installed: `npm install -g defuddle`

## Usage

Always use `--md` for markdown output:

```bash
defuddle parse <url> --md
```

Save to file:

```bash
defuddle parse <url> --md -o content.md
```

Extract specific metadata:

```bash
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain
```

## Output formats

| Flag | Format |
|------|--------|
| `--md` | Markdown (default choice) |
| `--json` | JSON with both HTML and markdown |
| (none) | HTML |
| `-p <name>` | Specific metadata property |
More from kepano/obsidian-skills