Skip to content

Add agentic-created pyopenms scripts for proteomics and metabolomics#1

Closed
Copilot wants to merge 3 commits intomainfrom
copilot/add-agentic-scripts-for-proteomics
Closed

Add agentic-created pyopenms scripts for proteomics and metabolomics#1
Copilot wants to merge 3 commits intomainfrom
copilot/add-agentic-scripts-for-proteomics

Conversation

Copy link
Contributor

Copilot AI commented Mar 24, 2026

Bootstraps the repository with a practical collection of pyopenms-based scripts covering the core analytical tasks proteomics and metabolomics users need day-to-day. Each script lives in its own subdirectory following the per-script directory structure established in main.

Proteomics (scripts/proteomics/)

  • peptide_mass_calculator/ – monoisotopic/average masses and b/y fragment ion series; supports modified sequences (e.g. PEPTM[147]IDEK) and arbitrary charge states
  • protein_digest/ – in-silico enzymatic digestion against pyopenms ProteaseDB (30+ enzymes); configurable missed cleavages and length filters
  • spectrum_file_info/ – mzML summary: MS-level counts, RT/m/z ranges, TIC statistics
  • feature_detection_proteomics/FeatureFinderCentroided wrapper; outputs featureXML

Metabolomics (scripts/metabolomics/)

  • metabolite_feature_detection/ – full FeatureFinderMetabo pipeline (mass tracing → elution peak detection → feature finding)
  • mass_accuracy_calculator/ – ppm error between theoretical m/z (from sequence or molecular formula) and one or more observed values
  • isotope_pattern_matcher/ – theoretical isotope distributions via CoarseIsotopePatternGenerator; cosine similarity scoring against observed peaks

Example

# Peptide masses + fragment ions
python scripts/proteomics/peptide_mass_calculator/peptide_mass_calculator.py --sequence PEPTIDEK --charge 2 --fragments

# ppm error for glucose
python scripts/metabolomics/mass_accuracy_calculator/mass_accuracy_calculator.py --formula C6H12O6 --observed 181.0709 181.0715

# Isotope pattern with similarity score
python scripts/metabolomics/isotope_pattern_matcher/isotope_pattern_matcher.py --formula C12H22O11 \
    --peaks 341.1056,100.0 342.1089,13.1

Also included

  • Per-script requirements.txt and isolated test suites following the repository convention
  • Per-script and top-level READMEs
  • GitHub Actions CI workflow integration (via main merge)
Original prompt

A repository of agentic-created scripts using pyopenms interesting for proteomics and metabolomics users.


⚡ Quickly spin up Copilot coding agent tasks from anywhere on your macOS or Windows machine with Raycast.

Copilot AI changed the title [WIP] Add agentic-created scripts for proteomics and metabolomics Add agentic-created pyopenms scripts for proteomics and metabolomics Mar 24, 2026
Copilot AI requested a review from ypriverol March 24, 2026 17:52
@ypriverol
Copy link
Collaborator

@copilot Can you update your branch from main.

Copy link
Contributor Author

Copilot AI commented Mar 24, 2026

@copilot Can you update your branch from main.

Done — merged origin/main into this branch in commit 4b04a21. Resolved the single README conflict by taking main's version, and removed the old flat script/test layout in favour of the per-script directory structure that landed on main.

@ypriverol ypriverol marked this pull request as ready for review March 24, 2026 20:43
@ypriverol
Copy link
Collaborator

@copilot Can you make sure you follow the standards of the main repository? Also make sure your script is not similar to any script in the repo.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants