pdf-merge-split
$
npx mdskill add TerminalSkills/skills/pdf-merge-splitMerge or split PDF files by page ranges or custom order
- Combine multiple PDFs into one or split a PDF into individual pages or sections
- Uses Python libraries like PyPDF2 or pikepdf for document manipulation
- Determines operation based on user request and specified page ranges or order
- Delivers merged or split PDFs as output files or streams to the user
SKILL.md
.github/skills/pdf-merge-splitView on GitHub ↗
---
name: pdf-merge-split
description: >-
Combine multiple PDFs into one or split a PDF into separate files. Use when a
user asks to merge PDFs, combine PDF files, join documents together, split a
PDF into pages, extract pages from a PDF, or separate a PDF into parts.
Supports page range selection and custom ordering.
license: Apache-2.0
compatibility: "Requires Python 3.9+ with PyPDF2 or pikepdf installed"
metadata:
author: terminal-skills
version: "1.0.0"
category: documents
tags: ["pdf", "merge", "split", "combine", "pages"]
use-cases:
- "Merge multiple report PDFs into a single document for distribution"
- "Split a large PDF into individual chapters or page ranges"
- "Extract specific pages from a PDF to create a new document"
agents: [claude-code, openai-codex, gemini-cli, cursor]
---
# PDF Merge & Split
## Overview
Combine multiple PDF files into a single document or split a PDF into separate files by page ranges. This skill handles merging in a specified order, splitting by page numbers, extracting specific pages, and preserving bookmarks and metadata where possible.
## Instructions
When a user asks to merge or split PDF files, follow these steps:
### Step 1: Determine the operation
Ask or infer what the user needs:
- **Merge**: Combine multiple PDFs into one output file
- **Split by pages**: Break a single PDF into multiple files by page ranges
- **Extract pages**: Pull specific pages out into a new PDF
- **Split by size**: Divide a PDF into chunks of N pages each
### Step 2: Validate the input files
Check that all input files exist and are valid PDFs:
```python
import os
from PyPDF2 import PdfReader
def validate_pdfs(file_paths):
results = []
for path in file_paths:
if not os.path.exists(path):
results.append({"file": path, "status": "not found"})
continue
try:
reader = PdfReader(path)
results.append({
"file": path,
"status": "valid",
"pages": len(reader.pages)
})
except Exception as e:
results.append({"file": path, "status": f"invalid: {e}"})
return results
```
### Step 3: Perform the operation
**For merging:**
```python
from PyPDF2 import PdfMerger
def merge_pdfs(input_paths, output_path):
merger = PdfMerger()
for path in input_paths:
merger.append(path)
merger.write(output_path)
merger.close()
return output_path
```
**For splitting by page ranges:**
```python
from PyPDF2 import PdfReader, PdfWriter
def split_pdf(input_path, ranges, output_dir="."):
reader = PdfReader(input_path)
output_files = []
for i, (start, end) in enumerate(ranges):
writer = PdfWriter()
for page_num in range(start - 1, min(end, len(reader.pages))):
writer.add_page(reader.pages[page_num])
output_path = os.path.join(output_dir, f"split_{i+1}_pages_{start}-{end}.pdf")
with open(output_path, "wb") as f:
writer.write(f)
output_files.append(output_path)
return output_files
```
**For extracting specific pages:**
```python
def extract_pages(input_path, page_numbers, output_path):
reader = PdfReader(input_path)
writer = PdfWriter()
for page_num in page_numbers:
if 1 <= page_num <= len(reader.pages):
writer.add_page(reader.pages[page_num - 1])
with open(output_path, "wb") as f:
writer.write(f)
return output_path
```
### Step 4: Verify and report results
After the operation, verify the output:
1. Confirm the output file exists and is a valid PDF
2. Report the page count of each output file
3. Show file sizes for the user
## Examples
### Example 1: Merge three reports into one
**User request:** "Combine report-q1.pdf, report-q2.pdf, and report-q3.pdf into annual-report.pdf"
**Actions taken:**
1. Validate all three input files
2. Merge in the specified order
3. Write to annual-report.pdf
**Output:**
```
Merged 3 PDF files into annual-report.pdf
Input files:
1. report-q1.pdf (12 pages)
2. report-q2.pdf (15 pages)
3. report-q3.pdf (11 pages)
Output: annual-report.pdf (38 pages, 2.4 MB)
```
### Example 2: Split a PDF into chapters
**User request:** "Split textbook.pdf into separate files: pages 1-30, 31-55, 56-80"
**Actions taken:**
1. Validate textbook.pdf (80 pages)
2. Split into three page ranges
3. Save each range as a separate file
**Output:**
```
Split textbook.pdf into 3 files:
1. split_1_pages_1-30.pdf (30 pages, 1.1 MB)
2. split_2_pages_31-55.pdf (25 pages, 0.9 MB)
3. split_3_pages_56-80.pdf (25 pages, 0.8 MB)
All files saved to current directory.
```
### Example 3: Extract specific pages
**User request:** "Pull out pages 5, 12, and 18-22 from presentation.pdf"
**Actions taken:**
1. Parse the page specification: [5, 12, 18, 19, 20, 21, 22]
2. Extract those pages from presentation.pdf
3. Save as extracted_pages.pdf
**Output:**
```
Extracted 7 pages from presentation.pdf
Pages extracted: 5, 12, 18, 19, 20, 21, 22
Output: extracted_pages.pdf (7 pages, 540 KB)
```
## Guidelines
- Always validate input files before processing. Report clear errors for missing or corrupt files.
- Preserve the original files. Never modify input PDFs in place.
- When merging, respect the order specified by the user. If no order is given, use alphabetical.
- Use 1-based page numbering in all user-facing output to match what users see in PDF viewers.
- For encrypted PDFs, inform the user that a password is needed before processing.
- When splitting, create descriptive filenames that include page ranges.
- Report file sizes alongside page counts so the user knows the output scale.
- Install PyPDF2 with `pip install PyPDF2` if not available. For advanced features like preserving form fields, use pikepdf instead.