Skip to content

A Hierarchical Graph V-Net with Semi-supervised Pre-training for Breast Cancer Histology Image Classification" (IEEE TMI)

Notifications You must be signed in to change notification settings

lyhkevin/Graph-V-Net

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Graph V-Net

We provide Pytorch implementations for our paper "A Hierarchical Graph V-Net with Semi-supervised Pre-training for Breast Cancer Histology Image Classification" (IEEE TMI).

1. Introduction

Graph V-Net is a hierarchical graph convolutional network for patch-based breast cancer diagnosis. The proposed Graph V-Net classifies each patch within the whole slide image into four categories: normal, benign, carcinoma in situ, and invasive carcinoma.


Figure 1. An overview of the proposed Graph V-Net.

Preview:

Our proposed framework consists of two main components:

  • Pre-train the patch-level feature extractor with semi-supervised learning.

  • Fine-tune the Graph V-Net in a supervised learning manner.

2. Graph V-Net Walkthrough

  • Dataset Preparation

    Download the BACH dataset, which includes the BACH training set (ICIAR2018_BACH_Challenge.zip) and testing set (ICIAR2018_BACH_Challenge_TestDataset.zip). Unzip them in the ./dataset/ folder. Note that the BACH dataset includes both microscopy images (ROI) and whole slide images, and we use whole slide images only. The folder structure should be like this:

    /dataset/
       ├── annotations
       ├── ICIAR2018_BACH_Challenge
       ├── ICIAR2018_BACH_Challenge_TestDataset
    

    We provide refined annotations (by pathologists from Yunnan Cancer Hospital) for the BACH dataset in ./dataset/annotations/.


    Figure 2. (a) The original annotation provided by BACH organizers. (b) A WSI relabeled by the pathologists from Yunnan Cancer Hospital.

  • Date Preprocessing

    (1) Unzip all the data, and create a conda environment for preprocessing. For linux:

    conda create --name openslide python=3.8
    conda activate openslide
    conda install pytorch==1.12.1 torchvision==0.13.1 cudatoolkit=11.3 -c pytorch
    conda install -c conda-forge openslide
    conda install -c conda-forge openslide-python
    pip install -r ./requirements.txt

    To install the OpenSlide package on Windows, please refer to this tutorial. (2) Run ./preprocessing/preprocess_pretrain.py first. This script will generate non-overlapping patches (from 31 labeled WSIs and 9 unlabeled WSIs) for pre-training in ./dataset/pretrain/. The file name of a specific patch indicates its spatial location and the class (0, 1, 2, or 3). For instance, ./labeled/01/18_58_1.png indicates that the patch locates at the 18th row and 58th column of the WSI, and its class is 1 (cancerous).
    (3) Then, run ./preprocessing/preprocess_finetune.py to generate overlapping regions (from 31 labeled WSIs) for fine-tuning and testing in ./dataset/finetune/. Each cropped region consists of 64 patches and the corresponding soft label is a numpy array with a size of (8,8,4), which indicates the average label of all pixels in those 8*8 patches within the region.

  • Pre-training

    (1) Create another conda environment (without Openslide package) for pre-training, fine-tuning, and testing. You can install all the dependencies by

    conda create --name vnet python=3.8
    conda activate vnet
    conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch
    pip install -r ./requirements.txt

    (2) To pre-train the patch encoder, run ./pretrain/pretrain.py. The weights of the patch encoder will be saved in ./weight/pretrain/. Run ./pretrain/visualize_attention.py to visualize the attention maps of the patch encoder. We have provided the pre-trained weight on BACH dataset in ./weight/checkpoint.txt using --train_samples (in ./pretrain/pretrain.py. ) and all the unlabeled WSIs.


    Figure 3. The attention maps of the last self-attention layer from the patch encoder.

  • Fine-tuning

    To fine-tune our framework, run ./train.py. When fine-tuning is completed, the weights will be saved in ./weight/finetune/.

  • Testing

    Run test.py, and the results (figure 3) will be saved in ./snapshot/test/.


    Figure 3. The prediction results of our framework.

3. Citation

    @ARTICLE{10255661,
      author={Li, Yonghao and Shen, Yiqing and Zhang, Jiadong and Song, Shujie and Li, Zhenhui and Ke, Jing and Shen, Dinggang},
      journal={IEEE Transactions on Medical Imaging}, 
      title={A Hierarchical Graph V-Net with Semi-supervised Pre-training for Histological Image based Breast Cancer Classification}, 
      year={2023},
      volume={},
      number={},
      pages={1-1},
      doi={10.1109/TMI.2023.3317132}}

4. References

About

A Hierarchical Graph V-Net with Semi-supervised Pre-training for Breast Cancer Histology Image Classification" (IEEE TMI)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages