-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue querying db's made with latest cyvcf2 #238
Comments
Hi Matthias, can you share a VCF with these 2 rows? |
Sure, here's an excerpt with the 10 first variants from that file. |
with this script:
run on that file, I see:
which seems to match what I expect. Can you verify that you see the same? Perhaps it's since we updated how cyvcf2 is built? |
Hi Brent I'm getting the following results. py2.7
and
py3
It seems to me the latest version encodes the strings in unicode on python2, whereas the older version did not. I can imagine thats the cause of the issues downstream. I know it's kind of stupid to still be using python2, but that's the way it currently configured in the bcbio environment. Ping @naumenko-sa. Matthias |
Hi @brentp,
I've come across a weird issue.
I've updated one of our bcbio installations, resulting in an environment with the latest vcf2db + cyvcf2=0.30.14.
When I try to query one of the db's generated with this setup using the gemini python API, I get a funky result when fetching genotypes.
when I try to print the gts field from a table row (gemini.GeminiQuery.GeminiRow) I get the following numpy array
["T" "" "" "T" "" "" "C" "" "" "/" "" "" "T" "" "" "T" "" "" "C" "" "" "T"
"" "" "/" "" "" "T" "" "" "T" "" "" "C" "" "" "" "" "" "" "" "" "" "" "T"
"" "" "/" "" "" "T" "" "" "T" "" "" "C" "" "" "" "" "" "" "" "" "" "" ""]
which should show
["TTC/TTC","T/TTC","T/TTC"]
e.g. the genotypes for three individuals.
This is the case for our older installs running cyvcf2=0.20.9.
I suppose this error may have something to do with #227.
When downgrading cyvcf2 to the older version and regenerating the db, everything seems to work again.
Any thoughts?
M
PS, it seems others have also run into similar issues: https://github.com/chapmanb/cloudbiolinux/blob/master/contrib/flavor/ngs_pipeline_minimal/packages-conda.yaml#L354
xpost from quinlan-lab/vcf2db#69
The text was updated successfully, but these errors were encountered: