-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CreateNetwork clarifications --- toatally confused #16
Comments
UPDATE: What I outline above does not work, since it wants only one input file. I cannot input multiple files or a collection of bed files.....can I? After some more juggling, I ran the CreateNetwork with a single "_bound.bed" file for one TF. This gives me a 1:1 relationship between the TF and other TFs. I got the following files
A. If I look at file 2, it is just a 1:1 between ANDR (androgen receptor) and another TF. Since this is a "bound" file to begin with, am I looking at TFs that ANDR regulates or whose gene has a ANDR binding site? SInce this is a 1:1 file there seems to be no reason to make a figure out of it. B. If I look at file 4, edges.txt, it has all the peak information etc., and the last column is the TFs listed in file 2. The columns have no labels though, it just says SItes1, Sites 2 etc. C. How do I create a deeper network? Can I integrate more than 1 TF bed file? Thanks! C. What program do you suggest I visualize all this with? |
Hey, you are right, to create proper networks, you will want to use all One thing you have to keep in mind is that your The File 4 is a global overview containing all information of all TFBS with an annotated target gene + their target gene. So instead of just showing which TF influences the expression of which gene, it also mentions the specific TFBS and other information contained in the input bed file. If you have more input files, it will merged all the information of every TF in this file, while the TF specific files only contain the paths there the specific TF is involved. The output files can be used as input for any software that create network visualization. There are a few, one I can recommend is Cytoscape, but you can use a tool of your choice. I hope this clears up your questions! |
Hi, Fantastic! Thanks for your clear explanation and your patience. I really appreciate it. I will have to try the multiple bed files and report back to you. Thanks! |
UPDATE: So if I point to the folder with the multiple beds and try to input multiple beds at once like this: I get the error:
Stumped here....It does not want to take multiple files?? The default structure Regards! |
UPDATE 2: So after a lot of commands I conclude it needs an explicit reference to a SINGLE input .bed file. Anything else does not work. |
I can assure you, you can and usually should use multiple input bed files. The problem with your attempt was the use of quotation marks |
You're really good! :-) Yup, removing the '....' and it reads the whole folder of beds. Of course, for all the HOCOMO TFs the output exceeds 200GB... phew!. |
I ran CreateNetwork on a batch of " _bound.bed" files and got adjancent.txt What are the column names? |
Hi,
The instructions of how to run CreateNetwork are truly murky because there is no folder called "annotated_tfbs" nor is there an example file. After a lot of head-scraching this is what I have come up with -- the "mapping_togene" file was the easiest part.
I ran the snakemake pipeline and I am attaching screenshots of my folders from the output folder. One is the whole folder, and the other is the subfolder "TFBS". And the subfolders of each TF....
$ TOBIAS CreateNetwork --TFBS test_data/annotated_tfbs/* --origin test_data/motif2gene_mapping.txt
So the annotated_tfbs/* implies the folder of bed files as in the test data. Is that right? But there is no such folder in my snakemake outputs.
So in the snakemake ouput this folder is not present, however, there is the TFBS folder (picture: "TOBIAS TFBS folder") that has individual TF files.
Within each of these folders is a separate bed folder. Within this bed folder of each TF, is a "_bound.bed" file for each of my conditions.
So this is the file I need to use. Can you please confirm that?
But then I have to copy each of these "_bound.bed" files to a separate folder and per condition.
Then I can run "CreateNetwork".
Thanks!
The text was updated successfully, but these errors were encountered: