pyopenms-skill
$
npx mdskill add aipoch/medical-research-skills/pyopenms-skillProcess mass spectrometry data with PyOpenMS algorithms.
- Converts mzML, mzXML, and MGF formats for pipeline integration.
- Applies smoothing, baseline correction, and peak picking operations.
- Detects isotope patterns and links features in proteomics workflows.
- Delivers processed feature tables ready for downstream analysis.
SKILL.md
.github/skills/pyopenms-skillView on GitHub ↗
---
name: pyopenms-skill
description: Comprehensive tool for computational mass spectrometry using PyOpenMS; use when you need to read/write MS formats (mzML/mzXML/MGF), run signal processing (smoothing/peak picking), detect isotope features, or perform peptide identification in proteomics/metabolomics workflows.
license: MIT
author: aipoch
---
> **Source**: [https://github.com/aipoch/medical-research-skills](https://github.com/aipoch/medical-research-skills)
## When to Use
- Converting, validating, or batch-processing mass spectrometry files (e.g., mzML, mzXML, MGF) as part of a pipeline.
- Cleaning raw spectra before downstream analysis (smoothing, baseline correction, denoising, peak picking).
- Detecting and linking isotope patterns / features for proteomics or metabolomics feature tables.
- Running identification-oriented steps where peptide/protein identification integration is required.
- Building custom computational MS workflows in Python while leveraging OpenMS algorithms.
## Key Features
- **MS File I/O**: Read/write common MS formats (mzML, mzXML, MGF).
- **Signal Processing**: Smoothing, baseline correction, filtering, and peak picking.
- **Feature Detection**: Isotope pattern detection and feature linking utilities.
- **Identification Support**: Hooks for peptide identification workflows via OpenMS-compatible components.
- **Scripted Workflows**: A ready-to-use “Load → Process → Analyze” workflow entry point.
## Dependencies
Install the following Python packages:
- `pyopenms` (version: compatible with your OpenMS/PyOpenMS distribution)
- `pandas` (version: latest recommended)
- `numpy` (version: latest recommended)
Installation:
```bash
uv pip install pyopenms pandas numpy
```
## Example Usage
A complete runnable example using the provided workflow script (`scripts/process_ms.py`):
```python
# run_example.py
from scripts.process_ms import run_workflow
def main():
# Load -> Process -> Analyze
# The script is expected to read the input mzML and apply optional filtering.
result = run_workflow("data.mzML", apply_filter=True)
# The returned object depends on the implementation of run_workflow.
# Common patterns include a processed experiment, a feature map, or a summary dict.
print("Workflow finished.")
print(result)
if __name__ == "__main__":
main()
```
Run:
```bash
python run_example.py
```
For manual/custom workflows, see:
- File operations: `references/file_io.md`
- Signal processing algorithms: `references/signal_processing.md`
## Implementation Details
- **Binding Layer**: This skill uses PyOpenMS, the Python bindings for the OpenMS C++ library, to expose core computational MS algorithms.
- **Workflow Pattern**: The default script follows a standard pipeline structure:
1. **Load** an MS run from disk (e.g., mzML).
2. **Process** spectra (optional filtering/smoothing/baseline correction).
3. **Analyze** results (e.g., peak picking, feature detection, or downstream summaries).
- **Configurable Processing**: The `apply_filter` flag in `run_workflow(...)` is intended to toggle one or more preprocessing steps; exact filters and parameters should be documented in `scripts/process_ms.py` and the referenced guides.
- **Algorithm Reference**: Detailed descriptions of available filters and peak pickers, including parameterization, are maintained in `references/signal_processing.md`.More from aipoch/medical-research-skills
- 3d-molecule-ray-tracerGenerate photorealistic rendering scripts for PyMOL and UCSF ChimeraX.
- abstract-summarizerTransform lengthy academic papers into concise, structured 250-word abstracts.
- abstract-trimmerPrecision editing tool that reduces abstract word count through intelligent compression techniques, maintaining scientific rigor while meeting strict journal and conference requirements.
- academic-abstract-refinerRefines long medical academic texts into SCI-style unstructured Chinese and English abstracts; use when you need to condense drafts/reports/summaries into bilingual abstracts and generate Summary_Report.md.
- academic-cv-generatorGenerate structured academic CVs from free-form Chinese/English text and export to Word (.docx). Use this skill when you are asked to organize, generate, or optimize an academic CV (e.g., publications/projects/awards) into a consistent, formatted document with uniform-colored section headers and optional bilingual output.
- academic-highlight-generatorGenerates submission-ready Elsevier/SCI Highlights from manuscript text or extracted PDF/DOCX/TXT content. Use when a user needs 3-5 concise, evidence-grounded highlight bullets for a research paper, review, meta-analysis, case report, or bioinformatics manuscript.
- academic-norm-reviewDetects content similarity, verifies standardized citations and abbreviations, and flags potential academic integrity risks; use it before submission, during academic writing QA, or for compliance reviews.
- academic-poster-generatorComplete workflow for generating academic research posters from PDF literature; use when you need to extract paper content from PDFs and produce a LaTeX-based poster (beamerposter/tikzposter/baposter) with mandatory figure generation and a final rendered HTML deliverable.
- acronym-unpackerIntelligent medical abbreviation disambiguation tool that resolves ambiguous acronyms using clinical context, specialty-specific knowledge, and document-level semantic analysis.
- active-comparator-single-soc-faers-safety-comparisonGenerates complete FAERS pharmacovigilance study designs for multi-drug or class-level safety comparison inside one predefined SOC or AE family using active comparators, disproportionality analysis, subgroup characterization, and reviewer-facing evidence control.