Skip to content

minimal example implementations for bioinformatics workflow managers

License

Notifications You must be signed in to change notification settings

kyleoconnell-NIH/bioinformatics-workflows

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers

Workflow managers provide an easy and intuitive way to simplify pipeline development. Here we provide basic proof-of-concept implementations for selected workflow managers. The analysis workflow is based on a small portion of an RNA-seq pipeline, using fastqc for quality controls and salmon for transcript quantification. These implementations are designed for basic illustrations. Workflow managers provide many more powerful features than what we use here, please visit the official documentations to explore those in detail.

The RNA-Seq workflow

The RNA-Seq analysis workflow performs quality controls with fastqc and quantifies transcripts expression using Salmon. Here we will use local installation (see documentation for salmon and fastqc). For the local installations you can add a symbolic link to the executables to your $PATH:

sudo ln -s /absolute/path/salmon/bin/salmon /usr/local/bin/salmon

sudo ln -s /absolute/path/FastQC/fastqc /usr/local/bin/fastqc

You can test the installation using the help function of the two tools (i.e. salmon -h and fastqc -h).

Test Data

This repository contains a simulated test data set which can be used to run the example implementations. The test data contains RNA-Seq reads (reads_1.fq.gz and reads_2.fq.gz), a transcriptome reference file (transcriptome.fa) and the true counts from the simulation experiments (truth.tsv)

Basic proof-of-concept implementations

Each workflow manager folder in this repository has a README detailing how to run the proof-of-concept pipeline:

Online Documentation for Workflow managers

Workflow managers have many more features which are not used in these implementations, and there are many additional workflow managers. You can read more about each workflow manager in their official documentation:

Contact and Call for Contribution

This repository was created by Laura Wratten. We very much encourage contributions by users of these workflows. If you would like to add an implementation for any of these workflow managers you can follow the template. If you would like to suggest changes to any of the existing implementations, please raise an issue and submit a pull request.

Acknowledgements

We would like to thank the following people for their contribution to this repository:

About

minimal example implementations for bioinformatics workflow managers

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 54.0%
  • Jupyter Notebook 15.9%
  • Nextflow 15.5%
  • WDL 14.6%