Skip to content

[GSoC 2025] Generate MS1 and MS2 Peptide Feature Masks for Targeted Deconvolution #20

@singjc

Description

@singjc

Background

Our diffusion model for DIA-MS data deconvolution can benefit from conditioning signals to guide the deconvolution process. Similar to how segmentation masks guide image super-resolution in computer vision, peptide feature masks can provide valuable prior information about where signals are expected in the MS data. This task involves generating these masks from established DIA-MS analysis tools (OpenSwath, DIA-NN, or Spectronaut, DIA-UMPIRE) and integrating them as conditioning signals in our diffusion model.

Task Objectives

  • Extract peptide feature masks from one or more DIA analysis tools:
    • OpenSwath
    • DIA-NN
    • Spectronaut
    • DIA-UMPIRE
  • Process and format these masks to be compatible with our model's conditioning input
  • Implement the conditioning mechanism in our diffusion model architecture
  • Evaluate the impact of different mask sources on deconvolution performance

Technical Details

  • Feature masks should capture the expected m/z and RT positions of target peptides
  • Masks may need to be converted into compatible format for model input
  • Consider both binary masks and probabilistic/scored masks based on tool confidence scores

Deliverables

  • Scripts to extract and process peptide feature masks from at least one of the target tools
  • Integration of the masks as conditioning signals in the model architecture
  • Documentation explaining the mask generation process and format specifications
  • Evaluation of deconvolution performance with and without mask conditioning
  • Comparative analysis if multiple mask sources are implemented

Resources

Difficulty

Intermediate - Requires domain knowledge of proteomics tools and understanding of conditioning mechanisms in deep learning models.

Metadata

Metadata

Assignees

Labels

GSoC 2025Tasks specific for GSoC2025enhancementNew feature or requesthelp wantedExtra attention is needed

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions