-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Genome accessibility/callability #1219
Comments
Sounds like adding a |
This is something I'm also interested in and I have mentioned it to you @jeromekelleher in the context of our analyses on spruce. @mufernando have you started looking into this or should I have a go? |
+1 for this. @percyfal and I have discussed this, and I would be happy to contribute here to help get this going. |
I am not working on this right now! |
It is important to consider genome accessibility when computing rates from genomic data.
scikit-allel has options to include an "accessibility mask", a boolean array indicating whether a base is accessible or not, and can be used to properly normalize quantities.
I found mentions of implementing this in #341
I am happy to help make this happen, but since I am new to the codebase I'd need some hand-helding... Ideally we would need a way of reading BED files which can be attached to the genotype dataset. Then, when computing per base statistics, we would need to intersect the accessible intervals with the windows intervals to get the right denominator.
The text was updated successfully, but these errors were encountered: