bio-single-cell-metabolite-communication
$
npx mdskill add GPTomics/bioSkills/bio-single-cell-metabolite-communicationPredict metabolic signaling between cell types from scRNA-seq data.
- Infers secretion and sensing patterns from enzyme and transporter expression.
- Depends on MeboCost tool for metabolic signaling inference.
- Computes communication scores based on predicted metabolite interactions.
- Outputs metabolite-mediated communication maps between cell populations.
SKILL.md
.github/skills/bio-single-cell-metabolite-communicationView on GitHub ↗
---
name: bio-single-cell-metabolite-communication
description: Analyze metabolite-mediated cell-cell communication using MeboCost for metabolic signaling inference between cell types. Predict metabolite secretion and sensing patterns from scRNA-seq data. Use when studying metabolic crosstalk between cell populations or metabolite-receptor interactions.
tool_type: python
primary_tool: MeboCost
---
## Version Compatibility
Reference examples tested with: matplotlib 3.8+, scanpy 1.10+
Before using code patterns, verify installed versions match. If versions differ:
- Python: `pip show <package>` then `help(module.function)` to check signatures
If code throws ImportError, AttributeError, or TypeError, introspect the installed
package and adapt the example to match the actual API rather than retrying.
# Metabolite-Mediated Cell Communication
**"Analyze metabolic crosstalk between cell types"** → Predict metabolite secretion-sensing interactions between cell populations based on enzyme and transporter expression patterns.
- Python: `mebocost.MeboCost(adata, groupby='cell_type')` → `run_mebocost()`
## MeboCost Overview
MeboCost infers metabolite-mediated communication by:
1. Predicting metabolite secretion from enzyme expression
2. Identifying metabolite-sensing receptors
3. Computing communication scores between cell types
## Basic Workflow
**Goal:** Infer metabolite-mediated cell-cell communication from scRNA-seq data by predicting which cell types secrete and sense specific metabolites.
**Approach:** Initialize a MeboCost object from an AnnData with cell type annotations, run permutation-based communication inference to score metabolite secretion-sensing interactions, then filter for statistically significant pairs.
```python
import mebocost as mbc
import scanpy as sc
# Load scRNA-seq data
adata = sc.read_h5ad('adata.h5ad')
# Initialize MeboCost
mebo = mbc.create_obj(
adata=adata,
group_col='cell_type', # Cell type annotation column
species='human' # 'human' or 'mouse'
)
# Infer metabolite communication
mebo.infer_commu(
n_permutations=1000, # Permutations for significance testing
seed=42
)
# Get significant interactions
sig_interactions = mebo.commu_res[mebo.commu_res['pval'] < 0.05]
```
## Prepare Data
```python
import scanpy as sc
def prepare_for_mebocost(adata, cell_type_col='cell_type', min_cells=50):
'''Prepare AnnData for MeboCost analysis
Requirements:
- Log-normalized expression (sc.pp.normalize_total, sc.pp.log1p)
- Cell type annotations
- Gene symbols (not Ensembl IDs)
'''
# Check normalization
if adata.X.max() > 50:
print('Warning: Data may not be log-normalized')
# Filter rare cell types
cell_counts = adata.obs[cell_type_col].value_counts()
valid_types = cell_counts[cell_counts >= min_cells].index
adata = adata[adata.obs[cell_type_col].isin(valid_types)].copy()
print(f'Cell types: {len(valid_types)}')
print(f'Cells: {adata.n_obs}')
return adata
```
## Run Communication Analysis
```python
def run_mebocost(adata, cell_type_col='cell_type', species='human'):
'''Run MeboCost metabolite communication analysis
Args:
adata: AnnData with log-normalized expression
cell_type_col: Column with cell type annotations
species: 'human' or 'mouse'
Returns:
MeboCost object with communication results
'''
import mebocost as mbc
# Create MeboCost object
mebo = mbc.create_obj(
adata=adata,
group_col=cell_type_col,
species=species
)
# Infer enzyme-metabolite-receptor communication
# n_permutations: Higher = more accurate p-values but slower
# 1000 is standard; use 100 for quick exploration
mebo.infer_commu(n_permutations=1000, seed=42)
return mebo
```
## Analyze Results
```python
def analyze_metabolite_communication(mebo, pval_threshold=0.05):
'''Extract and summarize significant communications
Communication flow:
Sender cell -> Enzyme -> Metabolite -> Receptor -> Receiver cell
'''
results = mebo.commu_res.copy()
# Filter significant interactions
sig = results[results['pval'] < pval_threshold]
# Summary statistics
summary = {
'total_interactions': len(results),
'significant_interactions': len(sig),
'unique_metabolites': sig['metabolite'].nunique(),
'unique_sender_types': sig['sender'].nunique(),
'unique_receiver_types': sig['receiver'].nunique()
}
# Top metabolites by frequency
top_metabolites = sig['metabolite'].value_counts().head(10)
# Top sender-receiver pairs
sig['pair'] = sig['sender'] + ' -> ' + sig['receiver']
top_pairs = sig['pair'].value_counts().head(10)
return {
'summary': summary,
'top_metabolites': top_metabolites,
'top_pairs': top_pairs,
'significant_interactions': sig
}
```
## Visualization
```python
def plot_communication_network(mebo, pval_threshold=0.05):
'''Plot metabolite communication network'''
import matplotlib.pyplot as plt
# Filter significant
sig = mebo.commu_res[mebo.commu_res['pval'] < pval_threshold]
# Aggregate by cell type pair
pair_counts = sig.groupby(['sender', 'receiver']).size().reset_index(name='count')
# Create chord diagram or heatmap
pivot = pair_counts.pivot(index='sender', columns='receiver', values='count')
pivot = pivot.fillna(0)
plt.figure(figsize=(10, 8))
plt.imshow(pivot.values, cmap='Reds')
plt.xticks(range(len(pivot.columns)), pivot.columns, rotation=45, ha='right')
plt.yticks(range(len(pivot.index)), pivot.index)
plt.colorbar(label='Number of interactions')
plt.xlabel('Receiver')
plt.ylabel('Sender')
plt.title('Metabolite Communication Network')
plt.tight_layout()
return plt.gcf()
def plot_metabolite_flow(mebo, metabolite, pval_threshold=0.05):
'''Visualize communication flow for specific metabolite'''
sig = mebo.commu_res[
(mebo.commu_res['metabolite'] == metabolite) &
(mebo.commu_res['pval'] < pval_threshold)
]
print(f'\n{metabolite} communication:')
for _, row in sig.iterrows():
print(f" {row['sender']} ({row['enzyme']}) -> "
f"{row['receiver']} ({row['receptor']})")
print(f" Score: {row['commu_score']:.3f}, p-value: {row['pval']:.4f}")
```
## Compare Conditions
```python
def compare_conditions(adata, condition_col, cell_type_col, species='human'):
'''Compare metabolite communication between conditions
Useful for:
- Tumor vs normal
- Treatment vs control
- Disease vs healthy
'''
import mebocost as mbc
conditions = adata.obs[condition_col].unique()
results = {}
for condition in conditions:
adata_subset = adata[adata.obs[condition_col] == condition].copy()
mebo = mbc.create_obj(
adata=adata_subset,
group_col=cell_type_col,
species=species
)
mebo.infer_commu(n_permutations=1000, seed=42)
results[condition] = mebo.commu_res
# Find differential communications
# Interactions significant in one condition but not another
return results
```
## Metabolite Categories
```python
# MeboCost includes curated metabolite-receptor pairs
# Major categories:
METABOLITE_CATEGORIES = {
'amino_acids': ['Glutamine', 'Glutamate', 'Tryptophan', 'Arginine'],
'lipids': ['Prostaglandin E2', 'Leukotriene B4', 'Sphingosine-1-phosphate'],
'nucleotides': ['ATP', 'Adenosine', 'UDP'],
'vitamins': ['Retinoic acid', 'Vitamin D'],
'other': ['Lactate', 'Succinate', 'Itaconate']
}
def filter_by_category(results, category):
'''Filter results to specific metabolite category'''
metabolites = METABOLITE_CATEGORIES.get(category, [])
return results[results['metabolite'].isin(metabolites)]
```
## Related Skills
- single-cell/cell-communication - Ligand-receptor communication analysis
- metabolomics/pathway-mapping - Metabolic pathway context
- systems-biology/flux-balance-analysis - Metabolic flux predictions
More from GPTomics/bioSkills
- bio-admet-predictionPredicts ADMET properties using ADMETlab 3.0 API or DeepChem models. Estimates bioavailability, CYP inhibition, hERG liability, and 119 toxicity endpoints with uncertainty quantification. Filters for PAINS and other structural alerts. Use when filtering compounds for drug-likeness or prioritizing leads by predicted safety.
- bio-alignment-amplicon-clippingTrim PCR primers from aligned reads in amplicon-panel BAMs using samtools ampliconclip. Use when processing SARS-CoV-2 ARTIC, hereditary cancer panels, ctDNA hot-spot panels, or any amplicon assay where primer-derived bases would falsely confirm reference at primer footprints.
- bio-alignment-filteringFilter alignments by flags, mapping quality, and regions using samtools view and pysam. Use when extracting specific reads, removing low-quality alignments, or subsetting to target regions.
- bio-alignment-indexingCreate and use BAI/CSI indices for BAM/CRAM files using samtools and pysam. Use when enabling random access to alignment files or fetching specific genomic regions.
- bio-alignment-ioRead, write, and convert multiple sequence alignment files using Biopython Bio.AlignIO. Supports Clustal, PHYLIP, Stockholm, FASTA, Nexus, and other alignment formats for phylogenetics and conservation analysis. Use when reading, writing, or converting alignment file formats.
- bio-alignment-msa-parsingParse and analyze multiple sequence alignments using Biopython. Extract sequences, identify conserved regions, analyze gaps, work with annotations, and manipulate alignment data for downstream analysis. Use when parsing or manipulating multiple sequence alignments.
- bio-alignment-msa-statisticsCalculate alignment statistics including sequence identity, conservation scores, substitution matrices, and similarity metrics. Use when comparing alignment quality, measuring sequence divergence, and analyzing evolutionary patterns.
- bio-alignment-multiplePerform multiple sequence alignment using MAFFT, MUSCLE5, ClustalOmega, or T-Coffee. Guides tool and algorithm selection based on dataset size, sequence divergence, and downstream application. Use when aligning three or more homologous sequences for phylogenetics, conservation analysis, or evolutionary studies.
- bio-alignment-pairwisePerform pairwise sequence alignment using Biopython Bio.Align.PairwiseAligner. Use when comparing two sequences, finding optimal alignments, scoring similarity, and identifying local or global matches between DNA, RNA, or protein sequences.
- bio-alignment-sortingSort alignment files by coordinate or read name using samtools and pysam. Use when preparing BAM files for indexing, variant calling, or paired-end analysis.