Decision function to assign job queue
+diff --git a/pkgdown.yml b/pkgdown.yml index 91009a3c..410e43d2 100644 --- a/pkgdown.yml +++ b/pkgdown.yml @@ -2,7 +2,7 @@ pandoc: 3.1.11 pkgdown: 2.1.1 pkgdown_sha: ~ articles: {} -last_built: 2024-10-12T01:01Z +last_built: 2024-10-22T21:09Z urls: reference: https://jravilab.github.io/MolEvolvR/reference article: https://jravilab.github.io/MolEvolvR/articles diff --git a/reference/GCA2Lineage.html b/reference/GCA2Lineage.html index a3fe8deb..bdc60267 100644 --- a/reference/GCA2Lineage.html +++ b/reference/GCA2Lineage.html @@ -62,7 +62,7 @@
String of the path to the lineage lookup file (taxid to lineage mapping). This file can be generated using the -"create_lineage_lookup()" function
Takes the resulting file of an efetch run on the ipg database and
+Takes the resulting file +of an efetch run on the ipg database and
Takes the resulting file of an efetch run on the ipg database and append lineage, and taxid columns
String of the path to the lineage lookup file (taxid to lineage mapping). This file can be generated using the -"create_lineage_lookup()" function
R/assign_job_queue.R
+ calculateEstimatedWallTimeFromOpts.Rd
Given MolEvolvR advanced options and number of inputs, +calculate the total estimated walltime for the job
+calculateEstimatedWallTimeFromOpts(
+ advanced_opts,
+ n_inputs = 1L,
+ n_hits = NULL,
+ verbose = FALSE
+)
Scrape MolEvolvR logs and calculate median processes
+list names: processes; values: median runtime (seconds)
+see molevol_scripts/R/metrics.R for info on functions called here
+examples:
dir_job_results <- "/data/scratch/janani/molevolvr_out" +list_proc_medians <- calculateProcessRuntime(dir_job_results)
from outside container environment +common_root <- "/data/molevolvr_transfer/molevolvr_dev" +dir_job_results <- "/data/molevolvr_transfer/molevolvr_dev/job_results" +list_proc_medians <- calculateProcessRuntime(dir_job_results)
R/combine_files.R
+ combineFiles.Rd
Download the combined assembly summaries of genbank and refseq
+combineFiles(
+ inpath = c("../molevol_data/project_data/phage_defense/"),
+ pattern = "*full_analysis.tsv",
+ delim = "\t",
+ skip = 0,
+ col_names = T
+)
String of 'master' path where the files reside (recursive=T)
Character vector containing search pattern for files
Takes logical T/F arguments OR column names vector;
+usage similar to col_names parameter in readr::read_delim
Combining full_analysis files
+Combining clean ipr files
+Create a look up table that goes from TaxID, to Lineage
+createLineageLookup(
+ lineage_file = here("data/rankedlineage.dmp"),
+ outfile,
+ taxonomic_rank = "phylum"
+)
Path to the rankedlineage.dmp file containing taxid's and their +corresponding taxonomic rank. rankedlineage.dmp can be downloaded at +https://ftp.ncbi.nlm.nih.gov/pub/taxonomy/new_taxdump/
File the resulting lineage lookup table should be written to
The upperbound of taxonomic rank that the lineage includes. The lineaege will +include superkingdom>...>taxonomic_rank. +Choices include: "supperkingdom", "phylum", "class","order", "family", +"genus", and "species"
Quickly get the runtime weights for MolEvolvR backend processes
+list names: processes; values: median runtime (seconds)
+example: writeProcessRuntime2YML()
+alignFasta()
- alignFasta()
+ assignJobQueue()
assign_job_queue()
+ calculateEstimatedWallTimeFromOpts()
calculateProcessRuntime()
+
+ cleanClusterFile()
+
+ cleanClusters()
@@ -175,25 +187,19 @@ combine_files()
+ combineFiles()
combine_full()
+ combineFullAnalysis()
combine_ipr()
+ combineIPR()
createRepresentativeAccNum()
domain_network()
getProcessRuntimeWeights()
- get_proc_medians()
+ getTopAccByLinDomArch()
get_proc_weights()
+ mapAcc2Name()
make_opts2procs()
+ mapAdvOption2Process()
mapAcc2Name()
+ mapOption2Process()
map_advanced_opts2procs()
+ msa_pdf()
msa_pdf()
+ plotEstimatedWallTimes()
plotIPR2Viz()
@@ -499,12 +505,6 @@ prepareColumnParams()
writeProcessRuntime2TSV()
- write_proc_medians_yml()
+ writeProcessRuntime2YML()
Use MolEvolvR advanced options to get associated processes
+Construct list where names (MolEvolvR advanced options) point to processes
+Plot the estimated runtimes for different advanced options and number +of inputs
+this function was just for fun; very, very messy code
+