Skip to content

Conversation

@jarbesfeld
Copy link
Collaborator

Two new statistics were added to the paper, and this PR contains two new cells in the mapping_analysis.ipynb notebook to compute these statistics:

  1. The number of score sets without any discordant variants. The total number of concordant variant pairs across these score sets was then counted.
  2. The number of score sets where the MAVE target sequence and human reference sequence were equivalent. For experiment 97, we took the mapped protein variants that were provided by the submitter, so these score sets were added by default. Given score sets where the MAVE target sequence and human reference sequence are equivalent, we would expect the VRS IDs to be equivalent as the same variations are being described on the same sequence.

@jarbesfeld jarbesfeld added the enhancement New feature or request label Mar 13, 2025
@jarbesfeld jarbesfeld requested a review from jsstevenson March 13, 2025 13:36
@jarbesfeld jarbesfeld self-assigned this Mar 13, 2025
@jarbesfeld jarbesfeld merged commit b6a9cbb into main Mar 14, 2025
8 checks passed
@jarbesfeld jarbesfeld deleted the discordant-ss-data branch March 14, 2025 14:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants