This repository has been archived by the owner on Apr 24, 2023. It is now read-only.

heptagonhust/pac2021final-2

ST_BarcodeMap

This program maps the barcodes of stereomics-seq reads to the stereomics chip.

Compile

Platform & Environment

  • linux
  • gcc-8.2.0

Prerequisites

Package   Version     Description
boost     >=1.73.0    hashmap serialization
zlib      >=1.2.11    file compression and decompression
hdf5      >=1.10.7    reading mask files in HDF5 format

Build the executable

cd ST_Barcode_Map
make

Run

Usage

usage: ST_BarcodeMap-0.0.1 --in=string --out=string [options] ... 
options:
  -i, --in                   mask file of stereomics chip or input barcode_list file (string)
  -I, --in1                  the second sequencing fastq file path of read1 (string [=])
      --in2                  the second sequencing fastq file path of read2 (string [=])
      --barcodeReadsCount    the mapped barcode list file with reads count per barcode. (string [=])
  -O, --out                  output file prefix or fastq output file of read1 (string)
      --out2                 fastq output file of read2 (string [=])
      --report               logging file path. (string [=])
      --PEout                if this option is given, PE reads with barcode tag will be written
  -z, --compression          compression level for gzip output (1 ~ 9). 1 is fastest, 9 is smallest, default is 4. (int [=4])
      --unmappedOut          output file path for barcode unmapped reads of read1, if this path isn't given, discard the reads. (string [=])
      --unmappedOut2         output file path for barcode unmapped reads of read2, if this path isn't given, discard the reads. (string [=])
  -l, --barcodeLen           barcode length, default is 25 (unsigned int [=25])
      --barcodeStart         barcode start position (int [=0])
      --umiRead              read1 or read2 contains the umi sequence. (int [=1])
      --barcodeRead          read1 or read2 contains the barcode sequence. (int [=1])
      --umiStart             umi start position. if the start position is a negative number, no umi sequence will be found (int [=25])
      --umiLen               umi length. (int [=10])
      --fixedSequence        fixed sequence in read1 that will be filtered. (string [=])
      --fixedStart           fixed sequence start position can be specified. (int [=-1])
      --barcodeSegment       barcode segment for every position on the stereo-chip. (int [=1])
      --fixedSequenceFile    file containing the fixed sequences and their start positions, one sequence per line in the format: TGCCTCTCAG        -1. a position less than 0 means the start position is not specified (string [=])
      --mapSize              bucket size of the new unordered_map. (long [=0])
      --mismatch             max number of mismatches allowed when finding barcode overlap. (int [=0])
      --action               choose one action you want to run [map_barcode_to_slide = 1, merge_barcode_list = 2, mask_format_change = 3, mask_merge = 4]. (int [=1])
  -w, --thread               number of thread that will be used to run. (int [=2])
  -V, --verbose              output verbose log information (i.e. when every 1M reads are processed).
  -?, --help                 print this message
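
The positional options above (--barcodeStart, --barcodeLen, --umiStart, --umiLen) can be read as slices of a read sequence. The following is a minimal Python sketch of that idea, not the tool's own C++ implementation. It assumes 0-based positions and that the barcode and the UMI sit in the same read (the defaults --barcodeRead=1 and --umiRead=1). The function name extract_barcode_and_umi is hypothetical.

```python
from typing import Optional, Tuple


def extract_barcode_and_umi(
    read: str,
    barcode_start: int = 0,   # mirrors the default of --barcodeStart
    barcode_len: int = 25,    # mirrors the default of --barcodeLen
    umi_start: int = 25,      # mirrors the default of --umiStart
    umi_len: int = 10,        # mirrors the default of --umiLen
) -> Tuple[str, Optional[str]]:
    """Slice a barcode and an optional UMI out of one read sequence.

    Assumption: positions are 0-based offsets into the read.
    """
    if barcode_start < 0:
        raise ValueError("barcode start position must not be negative")
    barcode = read[barcode_start:barcode_start + barcode_len]
    if len(barcode) < barcode_len:
        raise ValueError("read is too short to contain the barcode")

    # Per the usage text, a negative UMI start means no UMI is looked for.
    if umi_start < 0:
        return barcode, None

    umi = read[umi_start:umi_start + umi_len]
    if len(umi) < umi_len:
        raise ValueError("read is too short to contain the UMI")
    return barcode, umi


# Made-up read: 25 barcode bases, 10 UMI bases, then a few extra bases.
example_read = "A" * 25 + "C" * 10 + "GGGG"
barcode, umi = extract_barcode_and_umi(example_read)
print(barcode, umi)
```

With the default values, the barcode occupies the first 25 bases and the UMI the 10 bases that follow, which is why --umiStart defaults to 25.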

Run example

ST_BarcodeMap-0.0.1 --in chipid.h5 --in1 read1.fq.gz --in2 read2.fq.gz --out combine_read.fq.gz --mismatch 1 --thread 2
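
The run example passes --mismatch 1, which allows a read barcode to differ from a chip barcode. This README does not describe how the program performs that search, so the Python sketch below only illustrates one plausible reading: an exact lookup first, then a scan that accepts a chip barcode within the allowed Hamming distance. The names hamming and find_barcode, the (x, y) coordinate values, and the rule of rejecting ambiguous matches are all assumptions made for illustration.

```python
from typing import Dict, Optional, Tuple

Position = Tuple[int, int]  # assumed (x, y) coordinate on the chip


def hamming(a: str, b: str) -> int:
    """Count the positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def find_barcode(
    query: str,
    chip_barcodes: Dict[str, Position],
    max_mismatch: int = 0,  # mirrors the default of --mismatch
) -> Optional[Position]:
    """Return the chip position for a read barcode, or None if unmapped."""
    # An exact match is the cheapest case: a single hash lookup.
    hit = chip_barcodes.get(query)
    if hit is not None or max_mismatch == 0:
        return hit

    # Fallback: linear scan for chip barcodes within the mismatch budget.
    candidates = [
        pos
        for barcode, pos in chip_barcodes.items()
        if len(barcode) == len(query) and hamming(query, barcode) <= max_mismatch
    ]
    # Assumption: an ambiguous match (several candidates) is left unmapped.
    return candidates[0] if len(candidates) == 1 else None


# Made-up 4-base barcodes, for illustration only.
chip = {"ACGT": (0, 0), "TTTT": (1, 0)}
print(find_barcode("ACGA", chip, max_mismatch=1))
```

A linear scan is easy to follow but scales with the number of chip barcodes. It is shown here only to make the meaning of a mismatch budget concrete.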
