Updated Expression Prediction

Analyze gene expression based on coverage distribution around TSS from cfDNA

Basic Usage: expression_prediction.py [-h] -fq FASTQ_FILE -s NAME -g {m,f} [-o OUTDIR] [-k] [-t THREADS] -cna CNA_FILE [-tmp TEMP_DIR] [-step START_STEP]

Note: This calls every step of the below list. Copy number alterations (CNAs) need to be normalized against. This is done by specifying a list of copy-number states in the format . For plasma-seq analyses this is the *.segments file.

Attention: Large hg19 reference index files for BWA are not in this commit

Also needed for analysis: . java . R (Package e1071)

-) Step1 create directory and create MD5 file of input -) Step2 Trim fastq -) Step3 alignment and conversion to BAM -) Step4 remove PCR duplicates -) Step5 analyze TSS profile in housekeeping vs. unexpressed genes and plot in R -) Step6 extract coverage parameters for expression prediction -) Step7 expression prediction

New added notes:

As Rmdup is now obsolete, and we have paired end reads, or lab uses picard's markduplicates to remove them. We then start from step 5 after duplicates have been removed and fastq's converted to bam's in the way the lab prefers. These new scripts will be uploaded to the repository.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Ref		Ref
Scripts		Scripts
Software		Software
LICENSE.txt		LICENSE.txt
README.md		README.md
Workflow		Workflow
expression_prediction.py		expression_prediction.py
script		script

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated Expression Prediction

New added notes:

About

Releases

Packages

Languages

License

DoryAbelman/ExpressionPrediction

Folders and files

Latest commit

History

Repository files navigation

Updated Expression Prediction

New added notes:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages