Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CHM13, high-coverage imputation #11

Open
gevro opened this issue Dec 26, 2021 · 1 comment
Open

CHM13, high-coverage imputation #11

gevro opened this issue Dec 26, 2021 · 1 comment

Comments

@gevro
Copy link

gevro commented Dec 26, 2021

Hi,
Are there reference files available to perform imputation on CHM13-v1 aligned data?

Also, would QUILT work on high-cvg (~35x) whole genome sequencing, to help find the very small number of missed variant calls?

Thanks.

@rwdavies
Copy link
Owner

Hi,

No, sorry there are not reference files that I'm aware of.

QUILT can work on high coverage WGS, in the sense that it can run, but I don't think it would be helpful in your situation. Most reference data (e.g. HRC, or 1000 Genomes) uses the same tech (I'm assuming you've got short insert paired end Illumina or similar), so you likely won't gain SNPs in uncallable regions (as the reference panel would miss the same sites), and the imputation would contain errors at least because of the different history since divergence of any long matching reference haplotype versus your current sample (gene conversion and mutations that arose on one haplotype and not the other since they last coalesced).

If you have high coverage, other approaches like mapping to population graphs and calling variants on them might be more useful (they were a big thing a few years ago, not sure how easy to use / how commonly used)

Best
Robbie

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants