ISEScan_stats

If you find this repo useful for your work, please cite and star it.

What is this script?

ISEScan is a great tool for locating insertion sequence (IS) elements. However, if you have run it on many genomes and want to combine the per-genome summary files (.fna.sum) into one overview CSV sheet, this simple Python 3 script does the job.

What do you need?

This script runs on Python 3.9.

You need the .sum files in one folder and these dependencies: pandas and openpyxl. The other modules it uses (glob, warnings, and argparse) are part of the Python standard library.

Run the command below and you will get two CSV sheets in the same folder as your .sum files.

"-i / --input_dir" is the path to the directory containing the .sum files.

"-p / --prefix" is your preferred prefix for the output files of this run.

python ise_stats_with_nis.py -i summ_folder/ -p Trial_two
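For readers who want to see how the two options fit together, here is a minimal sketch of a command-line interface with the same flags. It is not the script's actual code: the function name `build_parser`, the help texts, and the choice to make both options required are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Build a parser with the two options described above (hypothetical sketch)."""
    parser = argparse.ArgumentParser(
        description="Summarize ISEScan .sum files into CSV sheets."
    )
    # Assumption: both options are required; the real script may use defaults.
    parser.add_argument(
        "-i", "--input_dir", required=True,
        help="Path to the directory containing the .sum files.",
    )
    parser.add_argument(
        "-p", "--prefix", required=True,
        help="Prefix for the output CSV files.",
    )
    return parser


# Parse the same arguments as the example command above.
args = build_parser().parse_args(["-i", "summ_folder/", "-p", "Trial_two"])
print(args.input_dir, args.prefix)
```

Passing an explicit list to `parse_args` makes it possible to try the parser without going through the shell.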

Important: This analysis uses the copy number (nIS) as the value in both the frequency and the heatmap sheets.

What do you get?

Currently, two CSV files:

1- "prefix"_frequency.csv: the general one, a summary over the whole run. It gives the "frequency" of each family in your genomes.

2- "prefix"_heatmap.csv: the more detailed one, easy to feed to R heatmaps.
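To make the difference between the two sheets concrete, the sketch below builds both tables from two small in-memory examples using only the standard library. It rests on several assumptions and is not the script's own implementation: the .sum column layout (`seqid`, `family`, `nIS`, ...), the `total` row, the sample values, and the function names are all hypothetical. It also assumes "frequency" means nIS summed over all genomes and "heatmap" means a genomes-by-families matrix of nIS.

```python
from collections import defaultdict

# Hypothetical contents of two .sum files, keyed by genome name.
# The column layout and every value here are assumptions for illustration.
SAMPLE_SUMS = {
    "genome_A": """seqid family nIS %Genome bps4IS dnaLen
contig1 IS3 4 0.10 5000 5000000
contig1 IS5 2 0.05 2500 5000000
total - 6 0.15 7500 5000000
""",
    "genome_B": """seqid family nIS %Genome bps4IS dnaLen
contig1 IS3 1 0.02 1200 6000000
contig2 IS3 2 0.04 2400 6000000
total - 3 0.06 3600 6000000
""",
}


def parse_sum(text):
    """Return {family: nIS} for one genome, summed over its sequences."""
    counts = defaultdict(int)
    lines = text.strip().splitlines()
    header = lines[0].split()
    fam_idx, nis_idx = header.index("family"), header.index("nIS")
    for line in lines[1:]:
        fields = line.split()
        # Skip blank lines and the assumed per-file "total" row.
        if not fields or fields[0] == "total":
            continue
        counts[fields[fam_idx]] += int(fields[nis_idx])
    return dict(counts)


def build_tables(sums):
    """Return (frequency, heatmap) built from {genome: sum_text}."""
    per_genome = {genome: parse_sum(text) for genome, text in sums.items()}
    families = sorted({fam for row in per_genome.values() for fam in row})
    # Heatmap: one row per genome, one column per family, 0 where absent.
    heatmap = {
        genome: {fam: row.get(fam, 0) for fam in families}
        for genome, row in per_genome.items()
    }
    # Frequency: nIS of each family summed over all genomes.
    frequency = {fam: sum(row[fam] for row in heatmap.values()) for fam in families}
    return frequency, heatmap


frequency, heatmap = build_tables(SAMPLE_SUMS)
print(frequency)
print(heatmap)
```

Filling absent families with 0 keeps every heatmap row the same width, which is what most heatmap functions in R expect from a matrix-like input.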