Skip to content

[OpenSwathWorkflow] Add direct in-silico library searching #21

@singjc

Description

@singjc

It would be useful to support FASTA-driven in-silico assay generation directly inside OpenSwathWorkflow, similar in spirit to DIA-NN’s “library-free” mode and Spectronaut’s directDIA, where the user can provide a FASTA and run a single command end-to-end.

Currently, OpenSwathWorkflow requires a pre-built predicted assay library (e.g., generated with Prosit/PeptDeep/MS2PIP/DeepLC/DIA-NN) which is then passed into OpenSwathWorkflow.

We could integrate easypqp-rs (Rust in-silico library generation) in two ways:

  • Add a C ABI (FFI) wrapper for easypqp-rs and call it from OpenMS.

  • Call the easypqp-rs executable from OpenSwathWorkflow as an internal thirdparty preprocessing step (subprocess, probably the easiest approach?)

Considerations:

  • Need to ensure some sort of reproducible library ID mapping, so that different MS run analyses have the same library IDs (precursor, peptide, protein, gene and transition IDs)
  • We would need to perform direct corresponding decoy generation within OpenSwathWorkflow (currently done outside via OpenSwathDecoyGenerator)

Pros:

  • Depending on how it's implemented, we could aid in reducing memory usage (from the large input transition list), by chunking how many in-silico proteins are digested at a time.
  • Reduces the amount of extra work the user would need to do on their end
  • It would be possible to fine-tune the prediction models based on the iRT calibration step, which would calibrate the predictions for the specific data file being analyzed.

@timosachsenberg, @jcharkow what are your thoughts?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions