-
Hello, I have a dataset with samples across an environmental gradient and I ran Atlas for the first time (with both the genomes and genecatalog workflow). I have analyzed the MAGs and their taxonomic patterns, but I would like to dive into the functional patterns now. I followed the online tutorial to analyze the output of Atlas, and I made the heatmaps that use the relative abundance of the KEGG modules (based on the matrix multiplication of the relative abundance of the MAGs and the presence matrix of the KEGG modules with step coverage>0.8). I also see a lot of papers online however that use KO abundances and I can't find these in the Atlas output folders. I am a bit confused about whether I should use the output in the ./genome folder or the ./Genecatalog folder for these functional analyses and how I can find or calculate the KO abundances. Should I use the Nmapped_reads (in ./Genecatalog/counts) and look up the KO identifier for each of the genes listed? My apologies if this is a very simple question. |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 11 replies
-
@LunavdL It is not a silly question. There are two ways of getting to KO abundances, and you mentioned both of them:
I suggest you to start with 1. The analysis would work the same as for Kegg modules with a matrix multiplication. You just need to take the KO's instead of the modules, I think it's the file For 2. You can use the eggnog annotations, and filter for the eggnogs that have a link to Kegg. and sum the abundance of these genes. I don't know which way is better. The genecatalog for sure is more comprehensive. But the eggNOG annotations are linked to somehwat outdated KOs' if I'm not wrong. |
Beta Was this translation helpful? Give feedback.
-
An other way for analysis would be to do a kind of enrichment analysis based on the genomes. E.g. from all the genomes that are changed above threshold are they enriched in a KO or in e kegg module. |
Beta Was this translation helpful? Give feedback.
-
For your information in the version 2.15.2 I added the option to annotate genes with dram that runs more precise and uptodate KO annotation than eggNOG. You need to add See also #640 |
Beta Was this translation helpful? Give feedback.
@LunavdL It is not a silly question.
Great you managed to calculate the abundance of modules.
There are two ways of getting to KO abundances, and you mentioned both of them:
I suggest you to start with 1. The analysis would work the same as for Kegg modules with a matrix multiplication. You just need to take the KO's instead of the modules, I think it's the file
genomes/annotations/dram/annotations.tsv
For 2. You can use the eggnog annotations, and filter for the eggnogs that have a link to Kegg. and sum the abundance of these genes.
I don't know which way is better. The genecatalog for sure is more comprehensive. But the eggNOG annotations …