-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to handle when two contigs from the same assembly sligtly overlap #329
Comments
Hi @Isoris,
In PGGB, there is no such method to merge overlapping contigs. In |
Hi @subwaystation Thank you for your assistance, In my bacterial dataset, comprising a minimal example of three bacteria, I observed the following results: Without merging by prefixes, the visualization is as depicted here: However, after merging the paths by prefixes, the graph alters to the representation shown below: My inquiry is: Is it feasible to extract the complete paths post-merge? Specifically, I am interested in obtaining the nodes and edges present on the left-hand side of the visual. My aim is to extract these "scaffolds" or paths, enabling me to subsequently remap my short reads onto them. Ultimately, my goal is to produce a unified genome graph, as opposed to a fragmented genome graph. As evident from the left side of the visualization, the merged paths of blue and violet do share an overlap. This suggests that we possess the requisite positional or genomic context information. Using this, I hope to reconstruct a cohesive graph, wherein all nodes within this particular interval are interconnected, deriving from all the individual contigs. I believe it is possible because I have a set of 80 samples of the same species from short read data and de-novo contigs and also 2 reference genomes. I would be grateful for any suggestions. |
For instance here for another subset of the bacteria of the same species. We obtain this: Without merging by prefixes, the visualization is as depicted here: However, after merging the paths by prefixes, the graph alters to the representation shown below: We can clearly see that it is theoretically possible to integrate the floating "subgraphs" to the main graph at least to merge some of them in a larger graph. |
Dear @Isoris , thanks, |
odgi viz -M To merge prefixes. If I remember correctly.. i will send you my code later in the afternoon. Basically the prefixes have to be the same before the first separator. |
I did do it. it worked. 22-prefix is a text file with sample names, one sample per line.
|
Wow I never knew about that, thank you so much for the tips. |
Hello,
Thank you for providing this tool.
I'm currently building a species pangenome using Illumina assemblies. However, I've noticed that some contigs appear to overlap or map to the same region without collapsing when I use odgi extract followed by viz.
Is there a method to collapse overlapping alignments into a single alignment for a sample?
Thank you for your assistance.
Thank you for your answer.
Quentin
The text was updated successfully, but these errors were encountered: