bio-systems-biology-flux-balance-analysis
$
npx mdskill add GPTomics/bioSkills/bio-systems-biology-flux-balance-analysisPredict metabolic fluxes and growth rates from genome-scale models
- Optimizes resource utilization and predicts metabolic phenotypes
- Integrates with COBRApy for linear programming and flux analysis
- Executes FBA and FVA algorithms on SBML or JSON model files
- Outputs optimal flux distributions and growth rate predictions
SKILL.md
.github/skills/bio-systems-biology-flux-balance-analysisView on GitHub ↗
---
name: bio-systems-biology-flux-balance-analysis
description: Perform flux balance analysis (FBA) and flux variability analysis (FVA) on genome-scale metabolic models using COBRApy. Predict growth rates, metabolic fluxes, and optimal resource utilization. Use when predicting metabolic phenotypes or optimizing flux distributions.
tool_type: python
primary_tool: cobrapy
---
## Version Compatibility
Reference examples tested with: COBRApy 0.29+
Before using code patterns, verify installed versions match. If versions differ:
- Python: `pip show <package>` then `help(module.function)` to check signatures
If code throws ImportError, AttributeError, or TypeError, introspect the installed
package and adapt the example to match the actual API rather than retrying.
# Flux Balance Analysis
**"Predict growth rate and metabolic fluxes for my organism"** → Solve a linear program over a genome-scale metabolic model to find the optimal flux distribution that maximizes biomass (or a custom objective), and assess flux ranges with FVA.
- Python: `model.optimize()`, `cobra.flux_analysis.flux_variability_analysis()` (COBRApy)
## Load Models
```python
import cobra
# Load built-in test models
model = cobra.io.load_model('textbook') # E. coli core (95 reactions)
model = cobra.io.load_model('iJO1366') # Full E. coli (2583 reactions)
# Load from file
model = cobra.io.read_sbml_model('model.xml')
model = cobra.io.load_json_model('model.json')
# BiGG models available at: http://bigg.ucsd.edu/models
```
## Basic FBA
**Goal:** Predict optimal metabolic flux distributions and growth rates for an organism under defined conditions.
**Approach:** Load a genome-scale metabolic model, solve the linear program to maximize the biomass objective function, then inspect flux values and solver status to interpret metabolic phenotype.
```python
import cobra
model = cobra.io.load_model('textbook')
# Run FBA (maximizes objective function, usually biomass)
solution = model.optimize()
# Growth rate interpretation:
# >0.8 h^-1: Fast growth (rich media)
# 0.3-0.8 h^-1: Moderate growth
# <0.3 h^-1: Slow growth or stress
# 0: No growth (lethal condition or missing nutrients)
print(f'Growth rate: {solution.objective_value:.4f} h^-1')
print(f'Status: {solution.status}')
# Access flux values
for rxn in model.reactions[:5]:
print(f'{rxn.id}: {solution.fluxes[rxn.id]:.4f}')
```
## Set Media Conditions
```python
def set_minimal_media(model, carbon_source='EX_glc__D_e', carbon_uptake=10):
'''Configure minimal media conditions
Args:
carbon_source: Exchange reaction ID for carbon source
carbon_uptake: Maximum uptake rate (mmol/gDW/h)
Typical glucose uptake: 10-20 mmol/gDW/h
'''
# Close all exchange reactions
for rxn in model.exchanges:
rxn.lower_bound = 0 # No uptake
# Open essential exchanges
essential = ['EX_o2_e', 'EX_h2o_e', 'EX_h_e', 'EX_nh4_e',
'EX_pi_e', 'EX_so4_e', 'EX_k_e', 'EX_mg2_e']
for ex_id in essential:
if ex_id in model.reactions:
model.reactions.get_by_id(ex_id).lower_bound = -1000
# Set carbon source
if carbon_source in model.reactions:
model.reactions.get_by_id(carbon_source).lower_bound = -carbon_uptake
return model
# Example: Compare growth on different carbon sources
carbon_sources = ['EX_glc__D_e', 'EX_ac_e', 'EX_succ_e']
for cs in carbon_sources:
with model:
set_minimal_media(model, carbon_source=cs)
sol = model.optimize()
print(f'{cs}: Growth = {sol.objective_value:.4f}')
```
## Flux Variability Analysis (FVA)
```python
from cobra.flux_analysis import flux_variability_analysis
# FVA finds the range of flux values for each reaction
# while maintaining optimal (or near-optimal) growth
# Standard FVA (at 100% optimum)
fva = flux_variability_analysis(model)
# FVA at 90% of optimal growth
# fraction_of_optimum=0.9: allows 10% suboptimal solutions
# This reveals alternative optimal flux distributions
fva = flux_variability_analysis(model, fraction_of_optimum=0.9)
# FVA for specific reactions
rxns_of_interest = ['PFK', 'PGI', 'GAPD']
fva = flux_variability_analysis(model, reaction_list=rxns_of_interest)
# Identify essential vs flexible reactions
fva['essential'] = (fva['minimum'] > 0) | (fva['maximum'] < 0)
fva['flexible'] = fva['maximum'] - fva['minimum'] > 0.01
print(fva[['minimum', 'maximum', 'essential', 'flexible']])
```
## Production Envelope
```python
from cobra.flux_analysis import production_envelope
# Analyze tradeoff between growth and product secretion
# Useful for metabolic engineering to find optimal conditions
prod_env = production_envelope(
model,
reactions=['EX_ac_e'], # Product (acetate)
objective='Biomass_Ecoli_core',
points=20
)
# prod_env is a DataFrame with:
# - EX_ac_e: acetate production rate
# - Biomass_Ecoli_core: growth rate at that production level
print(prod_env)
```
## Phenotype Phase Plane
```python
from cobra.flux_analysis import phenotype_phase_plane
# Analyze growth across two varying conditions
# Typically oxygen and carbon uptake
ppp = phenotype_phase_plane(
model,
variables=['EX_glc__D_e', 'EX_o2_e'], # X and Y axes
points=10
)
# Returns growth rate as function of both uptake rates
# Useful for identifying metabolic modes (aerobic vs anaerobic)
```
## Parsimonious FBA (pFBA)
```python
from cobra.flux_analysis import pfba
# pFBA minimizes total flux while achieving optimal growth
# Produces more biologically realistic flux distributions
pfba_solution = pfba(model)
# Compare total flux
fba_total = sum(abs(model.optimize().fluxes))
pfba_total = sum(abs(pfba_solution.fluxes))
print(f'FBA total flux: {fba_total:.1f}')
print(f'pFBA total flux: {pfba_total:.1f}')
```
## Loopless FBA
```python
from cobra.flux_analysis import loopless_solution
# Remove thermodynamically infeasible loops
# Important for realistic flux predictions
solution = loopless_solution(model)
```
## Related Skills
- systems-biology/gene-essentiality - In silico gene knockouts
- systems-biology/context-specific-models - Tissue-specific FBA
- metabolomics/pathway-mapping - Integrate metabolomics data
More from GPTomics/bioSkills
- bio-admet-predictionPredicts ADMET properties using ADMETlab 3.0 API or DeepChem models. Estimates bioavailability, CYP inhibition, hERG liability, and 119 toxicity endpoints with uncertainty quantification. Filters for PAINS and other structural alerts. Use when filtering compounds for drug-likeness or prioritizing leads by predicted safety.
- bio-alignment-amplicon-clippingTrim PCR primers from aligned reads in amplicon-panel BAMs using samtools ampliconclip. Use when processing SARS-CoV-2 ARTIC, hereditary cancer panels, ctDNA hot-spot panels, or any amplicon assay where primer-derived bases would falsely confirm reference at primer footprints.
- bio-alignment-filteringFilter alignments by flags, mapping quality, and regions using samtools view and pysam. Use when extracting specific reads, removing low-quality alignments, or subsetting to target regions.
- bio-alignment-indexingCreate and use BAI/CSI indices for BAM/CRAM files using samtools and pysam. Use when enabling random access to alignment files or fetching specific genomic regions.
- bio-alignment-ioRead, write, and convert multiple sequence alignment files using Biopython Bio.AlignIO. Supports Clustal, PHYLIP, Stockholm, FASTA, Nexus, and other alignment formats for phylogenetics and conservation analysis. Use when reading, writing, or converting alignment file formats.
- bio-alignment-msa-parsingParse and analyze multiple sequence alignments using Biopython. Extract sequences, identify conserved regions, analyze gaps, work with annotations, and manipulate alignment data for downstream analysis. Use when parsing or manipulating multiple sequence alignments.
- bio-alignment-msa-statisticsCalculate alignment statistics including sequence identity, conservation scores, substitution matrices, and similarity metrics. Use when comparing alignment quality, measuring sequence divergence, and analyzing evolutionary patterns.
- bio-alignment-multiplePerform multiple sequence alignment using MAFFT, MUSCLE5, ClustalOmega, or T-Coffee. Guides tool and algorithm selection based on dataset size, sequence divergence, and downstream application. Use when aligning three or more homologous sequences for phylogenetics, conservation analysis, or evolutionary studies.
- bio-alignment-pairwisePerform pairwise sequence alignment using Biopython Bio.Align.PairwiseAligner. Use when comparing two sequences, finding optimal alignments, scoring similarity, and identifying local or global matches between DNA, RNA, or protein sequences.
- bio-alignment-sortingSort alignment files by coordinate or read name using samtools and pysam. Use when preparing BAM files for indexing, variant calling, or paired-end analysis.