This repository contains detailed simulation and analysis code needed to reproduce the results in this study:
Scaling tree-based automated machine learning to biomedical big data with a feature set selector
Trang T Le, Weixuan Fu, Jason H Moore
Bioinformatics (2019-06-04) https://doi.org/gf3tds
DOI: 10.1093/bioinformatics/btz470 · PMID: 31165141
Simulation analyses and visualizations are in simulation, and the real-world application to RNA-Seq data is in RNASeq.
Each folder contains TPOT-exported pipelines (with the feature set selector, FSS, in pipelines_ds and without FSS in pipelines_reg) as well as the cross-validated accuracy of each pipeline (in accuracies_*).
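As a quick orientation aid, the layout described above can be listed with a short script. This is a minimal sketch, not part of the repository: the function name list_outputs and the labels in PATTERNS are hypothetical, and it only assumes the folder and pattern names mentioned above, accepting either files or directories as matches.

```python
from pathlib import Path

# Folder and pattern names come from the description above. Whether each
# match is a file or a directory is an assumption, so both are accepted.
ANALYSES = ("simulation", "RNASeq")
PATTERNS = {
    "with_fss": "pipelines_ds",      # TPOT pipelines exported with FSS
    "without_fss": "pipelines_reg",  # TPOT pipelines exported without FSS
    "accuracies": "accuracies_*",    # cross-validated accuracies
}


def list_outputs(repo_root):
    """Map each analysis folder to the sorted entry names matching each pattern."""
    root = Path(repo_root)
    found = {}
    for analysis in ANALYSES:
        folder = root / analysis
        found[analysis] = {
            label: sorted(p.name for p in folder.glob(pattern))
            for label, pattern in PATTERNS.items()
        }
    return found


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    for analysis, groups in list_outputs(".").items():
        for label, names in groups.items():
            print(analysis, label, names)
```

A missing folder or pattern simply yields an empty list, so the script can be run on a partial checkout.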
Questions and PRs are welcome.