GitHub

HGCスパコン上でのEEM解析

インストール

git clone https://[email protected]/atusiniida/EEM2.0.git

または

wget https://github.com/atusiniida/EEM2.0/archive/master.zip  
unzip  master.zip    
rm master.zip  
mv EEM2.0-master EEM2.0

EEMを実行

発現データA.tab、遺伝子セットデータB.gmtに適用

perl EEM2.0/perl/eem.pl  A.tab B.gmt

A.tabは以下のような行列形式のフォーマット

[tab]sample1[tab]sample2[tab]sample3
gene1[tab]1.0[tab]2.0[tab]3.0
gene2[tab]4.0[tab]5.0[tab]6.0
gene3[tab]7.0[tab]8.0[tab]9.0

B.gmtのフォーマットは以下を参照
http://software.broadinstitute.org/cancer/software/genepattern/file-formats-guide#GMT

テストデータを使うと

 perl EEM2.0/perl/eem.pl EEM2.0/data/coadExp3000.tab  EEM2.0/data/hallmark.gmt

A＿B.eemができる

以下のように各行にmodule の情報がはいっている

HALLMARK_EPITHELIAL_MESENCHYMAL_TRANSITION[tab] 39.149765925967408[tab]3[tab]CoherenceBasedEEM absoluteRadius=0.3125 relativeRadius=0.050000000000000003 itrForPvalue2Calculation=300 coreGeneSize=10 seedGenes=[ABI3BP, ADAM12, ANPEP,...., VCAN] moduleGenes=[COL3A1, COL1A2, COL5A2,...., ELN] center=[-0.10535502585313571, -1.7660015061835561, -1.7153470069277423,...., 2.0250583168882716] Pvalue=39.626887180687071 Pvalue1=37.182678226670824 Pvalue2=39.626887180687071

重要なところ

1列目: gene set ID
2列目: -log10(p-value)
4列目のseedGenes: inputのgene set
4列目のmoduleGenes: seedGenesのうち共発現しているもの

post-processing

perl EEM2.0/perl/postEEMprocessing.pl  -p X -c Y  -m Z A＿B.eem A.tab

実行するにはR package "gplots"のインストール必要
-log10(p-value)がX以上のモジュールをcorrelation cutoff Yでクラスタリング、module個数Z未満のクラスターは捨てる (Default: X = 6 Y=0.7 Z =0)

例えばテストデータを使うと

perl EEM2.0/perl/postEEMprocessing.pl  -p 4 -c 0.75 coadExp3000_hallmark.eem EEM2.0/data/coadExp3000.tab

失敗する場合はR package "gplots"がインストール済みかチェック

A＿B.posteem (ディレクトリ）の中に以下のファイルができる

module.tab: module activity profileのmatrix
module.pdf: module.tabのヒートマップ colored side barがmodule clusterをしめす。
module.collapsed.tab: module activity profileのクラスターをまとめ、activity profileを平均化したたmeta module activity profile
module.collapsed.pdf: module.collapsed.tabのヒートマップ
module.gmt：module cluster中のmoduleの情報
[gene set ID].pdf: 各モジュールのseed genesのヒートマップ　colored top bar はactivity profile、red side barはmodule genesを示す。

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
R		R
data		data
eemParallel_1.0.1		eemParallel_1.0.1
perl		perl
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HGCスパコン上でのEEM解析

インストール

EEMを実行

post-processing

About

Releases

Packages

Languages

atusiniida/EEM2.0

Folders and files

Latest commit

History

Repository files navigation

HGCスパコン上でのEEM解析

インストール

EEMを実行

post-processing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages