backend hpc

Installing and configuring for HPC backends

Either miniwdl or Cromwell can be used to run workflows on the HPC.

Installing and configuring `miniwdl`

Requirements

miniwdl >= 1.9.0
miniwdl-slurm

Configuration

An example miniwdl.cfg file is provided here. This should be placed at ~/.config/miniwdl.cfg and edited to match your slurm configuration. This allows running workflows using a basic SLURM setup.

Important

In order to simplify workflow inputs, we make use of map files to specify the input data. This allows for a more concise input file, but requires changing a miniwdl configuration option to allow workflows to access files that are not expressly supplied with workflow inputs. To enable this, add the following line to your miniwdl.cfg file:

[file_io]
allow_any_input = true

This option is already included in the example miniwdl.cfg file described in this section.

Installing and configuring `Cromwell`

Cromwell supports a number of different HPC backends; see Cromwell's documentation for more information on configuring each of the backends. Cromwell can be used in a standalone "run" mode, or in "server" mode to allow for multiple users to submit workflows. In the example below, we provide example commands for running Cromwell in "run" mode.

Note

If running Cromwell on an HPC cluster using NFS for storage, you may encounter issues with NFS latency, which can cause Cromwell to fail to read files from the filesystem. You can work around this by using script-epilogue to add a delay & sync to the end of each job. This option is added to your backend provider config.

script-epilogue = "sleep 60 && sync"

Running the workflow

Filling out workflow inputs

Fill out any information missing in the inputs file. Once you have downloaded the reference data bundle, ensure that you have replaced the <local_path_prefix> in the input template file with the local path to the reference datasets on your HPC.

See the inputs section of the singleton README for more information on the structure of the inputs.json file.

Running via miniwdl

miniwdl run workflows/singleton.wdl --input <inputs_json_file>

Running via Cromwell

cromwell run workflows/singleton.wdl --input <inputs_json_file>

Reference data bundle

Reference data is hosted on Zenodo at 10.5281/zenodo.14027047. Download the reference data bundle and extract it to a location on your HPC, then update the input template file with the path to the reference data.

## download the reference data bundle
wget https://zenodo.org/record/14027047/files/hifi-wdl-resources-v2.0.0.tar

## extract the reference data bundle and rename as dataset
tar -xvf hifi-wdl-resources-v2.0.0.tar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

backend hpc

Installing and configuring for HPC backends

Installing and configuring `miniwdl`

Requirements

Configuration

Installing and configuring `Cromwell`

Running the workflow

Filling out workflow inputs

Running via miniwdl

Running via Cromwell

Reference data bundle

PacBio WGS Variant Pipeline

Workflows

Subworkflows

Reference Inputs and Dependencies

Backends

Clone this wiki locally

backend hpc

Installing and configuring for HPC backends

Installing and configuring miniwdl

Requirements

Configuration

Installing and configuring Cromwell

Running the workflow

Filling out workflow inputs

Running via miniwdl

Running via Cromwell

Reference data bundle

PacBio WGS Variant Pipeline

Workflows

Subworkflows

Reference Inputs and Dependencies

Backends

Clone this wiki locally

Installing and configuring `miniwdl`

Installing and configuring `Cromwell`