Comprehensive benchmark of dimension reduction methods for single-cell data analysis

Software requirements

Nextflow
Docker

The list of benchmarking datasets

scATAC-seq:

How to prepare new benchmark datasets

Benchmark data is stored in anndata format. The anndata object should contain:

adata.X: cell-by-feature count matrix. For scATAC-seq, feature names should use the format chr:start-end.
adata.obs["cell_type"]: cluster/cell-type annotation of each barcode.
adata.obs["batch"]: batch annotation of each barcode (optional).
adata.obs["compare"]: differentially accessible region (DAR) comparison between two groups, ref vs. case.
The ground truth must have at least two columns: ["cell_type"] and the DARs, named by ["index"].
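As a rough illustration of the layout described above, the following sketch checks an anndata-like object before it is added to the benchmark. The helper names (is_valid_region, check_benchmark_adata) are hypothetical and not part of this repository. The regular expression is an assumed reading of "chr:start-end" (any chromosome label, then integer coordinates with start < end). The check only relies on the object exposing obs and var_names, so it should work with a real AnnData object as well as the toy stand-in used here.

```python
import re
from types import SimpleNamespace

# Assumed interpretation of "chr:start-end": a chromosome label without ':' or
# whitespace, followed by two non-negative integer coordinates.
REGION_RE = re.compile(r"^([^:\s]+):(\d+)-(\d+)$")


def is_valid_region(name):
    """Return True if name looks like chr:start-end with start < end (assumption)."""
    match = REGION_RE.match(name)
    return bool(match) and int(match.group(2)) < int(match.group(3))


def check_benchmark_adata(adata, require_regions=True):
    """Return a list of problems found; an empty list means the checks passed.

    Hypothetical helper: `adata` only needs `.obs` (supporting `.keys()`, as a
    pandas DataFrame or a dict does) and `.var_names` (an iterable of names).
    """
    problems = []
    obs_columns = set(adata.obs.keys())
    if "cell_type" not in obs_columns:
        problems.append("missing obs column: cell_type")
    # "batch" is optional according to the README, so it is not checked here.
    if require_regions:
        bad = [str(n) for n in adata.var_names if not is_valid_region(str(n))]
        if bad:
            problems.append(
                f"{len(bad)} feature name(s) not in chr:start-end format, e.g. {bad[0]}"
            )
    return problems


# Toy stand-in for an AnnData object (made-up values, for illustration only).
toy = SimpleNamespace(
    obs={"cell_type": ["B", "T"], "batch": ["b1", "b1"]},
    var_names=["chr1:100-600", "chrX:5000-5500"],
)
print(check_benchmark_adata(toy))  # prints []
```

Set require_regions=False for datasets whose features are not genomic regions.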

How to run benchmark

The pipeline runs the benchmark inside containers, so you need either Docker or Singularity installed on your machine. The pipeline also downloads large data files, so make sure you have enough free disk space (more than 20 GB) and a fast internet connection.
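Before launching a long run, it can help to verify these prerequisites up front. The sketch below is a hypothetical preflight check, not part of the pipeline: it uses only the Python standard library to report free disk space and whether the nextflow, docker, and singularity executables are on PATH. The 20 GB threshold mirrors the figure above; the executable names are assumptions about how the tools are installed.

```python
import shutil


def preflight(path=".", min_free_gb=20.0):
    """Report free disk space at `path` and which required tools are on PATH."""
    free_gb = shutil.disk_usage(path).free / 1e9  # bytes to gigabytes
    # Assumed executable names; adjust if your installation uses different ones.
    tools = {
        name: shutil.which(name) is not None
        for name in ("nextflow", "docker", "singularity")
    }
    return {
        "free_gb": free_gb,
        "enough_disk": free_gb > min_free_gb,
        "tools": tools,
        # Either container runtime is sufficient.
        "has_container_runtime": tools["docker"] or tools["singularity"],
    }


report = preflight()
print(f"free disk: {report['free_gb']:.1f} GB (enough: {report['enough_disk']})")
for name, found in report["tools"].items():
    print(f"{name}: {'found' if found else 'not found'}")
```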

Run the benchmarks with ./bench.sh -profile singularity or ./bench.sh -profile docker. The results will be stored in the ./results folder.

See nextflow.config for additional configuration options.
