Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

High Memory Consumption with Large Dataset in PC-Relate #102

Open
ecyeh opened this issue Jul 24, 2023 · 1 comment
Open

High Memory Consumption with Large Dataset in PC-Relate #102

ecyeh opened this issue Jul 24, 2023 · 1 comment

Comments

@ecyeh
Copy link

ecyeh commented Jul 24, 2023

Hello,

I'm currently using GENESIS for a large-scale genetic analysis involving approximately 500,000 samples. However, I've found that running PC-Relate with only 160,000 samples already exceeds my system's 1 TB memory capacity.

Given these memory demands, I'm wondering if there are any strategies or planned updates to reduce the memory footprint of PC-Relate. I'm also interested in any recommended approaches for handling such large datasets with GENESIS. Any advice or guidance would be greatly appreciated.
Thank you for your time and assistance.

Best Regards,
Erh-Chan

@smgogarten
Copy link
Collaborator

You can run the various steps of pcrelate separately on smaller batches of samples and then combine the results. See #38 for a description of how to do this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants