Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions regarding variants #28

Open
msierk opened this issue Oct 30, 2023 · 0 comments
Open

Questions regarding variants #28

msierk opened this issue Oct 30, 2023 · 0 comments

Comments

@msierk
Copy link
Collaborator

msierk commented Oct 30, 2023

I have some questions about how we are going to do variants.

  1. We need to decide which fields from the CDA mutation endpoint we are keeping and which not. Right now cda_mutation_factory.py has 26 out of the 143 fields - is that all we're getting?

  2. I'm not sure how we want to capture things in CDA that aren't directly in the VariantDescriptor schema. For example the reference and tumor alleles, and the normal matched sample alleles. Should these go under Extension?

  3. I don't totally understand the to_ga4gh method in op_mutation.py. The Phenopacket schema has a VariantDescriptor and a VariantInterpretation. First, in the pyphetools hgvs_variant.py there is a note to change to an unambiguous name (to_ga4gh_variant_interpretation), which seems like should be done here. But more fundamentally it doesn't make sense to me to transform a variant into an interpretation - the interpretation is a field that is attached to the variant (which is how the CDA has it). It's not clear from the Phenopackets documentation how the VariantDescriptor and VariantInterpretation are linked together. It seems to me that the interpretation ought to be part of the descriptor, but even if there are two different data structures it is confusing to refer to transforming a Variant object into a VariantInterpretation.

  4. It is confusing that the VariationDescriptor has an "Expression" class which has nothing to do with gene expression, but that just refers to the syntax for "expressing" the variant. "Representation" would be less confusing. (I realize this is a Phenopackets issue, but I just wanted to point it out.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant