WSI reader #1548

bhashemian · 2021-02-04T15:42:41Z

Description

Training models for pathology use cases requires loading patches from Whole Slide Imaging (WSI) scans. These images are enormous in size and usually much larger than user's available RAM (#1504). This PR provide a solution by implementing whole slide image readers for CuImage and OpenSlide.

Types of changes

Non-breaking change (fix or new feature that would not break existing functionality).
Breaking change (fix or new feature that would cause existing functionality to change).
New tests added to cover the changes.
Integration tests passed locally by running ./runtests.sh --codeformat --coverage.
Quick tests passed locally by running ./runtests.sh --quick.
In-line docstrings updated.
Documentation updated, tested make html command in the docs/ folder.

Signed-off-by: Behrooz <[email protected]>

…ology_dataset Signed-off-by: Behrooz <[email protected]>

Signed-off-by: Behrooz <[email protected]>

bhashemian · 2021-02-08T05:48:12Z

@wyli, @Nic-Ma
The mypy complains are for the existing code and I think they are due to the third-party libraries update. For instance, the first one is about np.eye, which I suspect it is because of the interface file ( __init__.pyi) that has been added after numpy==1.20 where the return type of np.eye is defined as Any. Have you encounter this in any other place in MONAI?

monai/data/image_reader.py:252:9: error: Returning Any from function declared to return "ndarray"  [no-any-return]
286
monai/data/image_reader.py:280:13: error: Returning Any from function declared to return "ndarray"  [no-any-return]
287
monai/data/image_reader.py:393:9: error: Returning Any from function declared to return "ndarray"  [no-any-return]

Signed-off-by: Behrooz <[email protected]>

bhashemian · 2021-02-11T17:55:35Z

@wyli, @Nic-Ma
The mypy complains are for the existing code and I think they are due to the third-party libraries update. For instance, the first one is about np.eye, which I suspect it is because of the interface file ( __init__.pyi) that has been added after numpy==1.20 where the return type of np.eye is defined as Any. Have you encounter this in any other place in MONAI?
monai/data/image_reader.py:252:9: error: Returning Any from function declared to return "ndarray"  [no-any-return]
286
monai/data/image_reader.py:280:13: error: Returning Any from function declared to return "ndarray"  [no-any-return]
287
monai/data/image_reader.py:393:9: error: Returning Any from function declared to return "ndarray"  [no-any-return]

@wyli @Nic-Ma, what do you suggest here?

monai/data/image_reader.py

Signed-off-by: Behrooz <[email protected]>

wyli

thanks, please see some initial comments inline, I'll try it out with an EA container soon...

monai/data/image_reader.py

tests/test_openslide_reader.py

Signed-off-by: Behrooz <[email protected]>

Nic-Ma

Thanks for your quick update.
I put several minor comments inline, others look good to me.
And please add this new object to the "init.py" of mode/data/ and docs/sources/data.

Thanks.

monai/data/image_reader.py

Nic-Ma · 2021-03-03T07:52:48Z

monai/data/image_reader.py

+        for name in filenames:
+            img = self.wsi_reader(name)
+            if self.wsi_reader_name == "openslide":
+                img.shape = (img.dimensions[1], img.dimensions[0], 3)


Can it support images with channel = 1 or no channel?

It only supports RGB data as the output and it explicitly convert it to RGB, so the channels always should be 3.

thanks, the doc suggests RGBA (https://openslide.org/api/python/#openslide.OpenSlide.associated_images), I think we should make it clear in the docstring if we only support RGB

monai/data/image_reader.py

Nic-Ma · 2021-03-03T08:04:19Z

monai/data/image_reader.py

+        self,
+        img_obj,
+        location: Tuple[int, int] = (0, 0),
+        size: Optional[Tuple[int, int]] = None,


Suggest to use spatial_size, otherwise, it may be confusing whether it contains channel dim.

I kind of agree here but since both OpenSlide and cuClaraImage use size to define this input argument, I found using another name confusing for people who has ML in pathology background and has worked with WSI in the past.

Signed-off-by: Behrooz <[email protected]>

…ology_dataset

Signed-off-by: Behrooz <[email protected]>

Nic-Ma

Looks good to me.
Thanks for your quick update!

bhashemian closed this Feb 4, 2021

bhashemian force-pushed the pathology_dataset branch from 580ef07 to f7565bf Compare February 4, 2021 19:17

Implement CuImageReader and OpenSlideReader

753deca

Signed-off-by: Behrooz <[email protected]>

bhashemian reopened this Feb 4, 2021

bhashemian added 14 commits February 4, 2021 14:21

Add unittests for CuImageReader

ddbd6ab

Signed-off-by: Behrooz <[email protected]>

Add unittests for OpenSlideReader

7e77449

Signed-off-by: Behrooz <[email protected]>

Merge branch 'master' into pathology_dataset

d3dbf7d

Sort imports

c40b019

Signed-off-by: Behrooz <[email protected]>

Add correct boundaries

f8b0962

Signed-off-by: Behrooz <[email protected]>

Merge branch 'pathology_dataset' of github.com:behxyz/MONAI into path…

e4dd37d

…ology_dataset Signed-off-by: Behrooz <[email protected]>

Add test cases for reading patches on a grid for CuImage

9a3e672

Signed-off-by: Behrooz <[email protected]>

Add patch whole slide imaging dataset for pathology

b463310

Signed-off-by: Behrooz <[email protected]>

Add test case for read patches for OpenSlide

4c735cb

Signed-off-by: Behrooz <[email protected]>

flake8 and few minor changes

378893c

Signed-off-by: Behrooz <[email protected]>

black

ec5261b

Signed-off-by: Behrooz <[email protected]>

flake8

ce01a9b

Signed-off-by: Behrooz <[email protected]>

Add kwargs to CuImageReader and OpenSlideReader's read method

51c1578

Signed-off-by: Behrooz <[email protected]>

Change the type hint from np.dtype to DTypeLike

714561a

Signed-off-by: Behrooz <[email protected]>

bhashemian requested a review from Nic-Ma February 8, 2021 05:19

bhashemian marked this pull request as ready for review February 8, 2021 05:48

bhashemian added 3 commits February 8, 2021 10:57

Merge branch 'master' into pathology_dataset

f6f5cf6

Merge branch 'master' into pathology_dataset

642ee9b

Fix a bug

e83573d

Signed-off-by: Behrooz <[email protected]>

bhashemian requested a review from wyli February 11, 2021 17:55

ericspod reviewed Feb 16, 2021

View reviewed changes

monai/data/image_reader.py Outdated Show resolved Hide resolved

bhashemian added 3 commits February 22, 2021 13:57

Merge branch 'master' into pathology_dataset

1adf4ee

Implement WSIReader and unittests

097eb19

Signed-off-by: Behrooz <[email protected]>

Minor updates

356e0d4

Signed-off-by: Behrooz <[email protected]>

Merge branch 'master' into pathology_dataset

40a6f23

wyli reviewed Feb 24, 2021

View reviewed changes

monai/data/image_reader.py Show resolved Hide resolved

tests/test_openslide_reader.py Outdated Show resolved Hide resolved

bhashemian and others added 9 commits February 26, 2021 10:26

Merge branch 'master' into pathology_dataset

00b7a55

Replace the test TIFF and some upgrades

b851859

Signed-off-by: Behrooz <[email protected]>

Update dependencies for OpenSlide

0a99658

Signed-off-by: Behrooz <[email protected]>

Merge branch 'master' into pathology_dataset

dede661

Update unittests for OpenSlide and CuImage

563a4fa

Signed-off-by: Behrooz <[email protected]>

Merge branch 'pathology_dataset' of pathology_dataset

9ee2200

Fix openslide dependency

3ac12c3

Signed-off-by: Behrooz <[email protected]>

Fix doc dependencies

15c147d

Signed-off-by: Behrooz <[email protected]>

Merge branch 'master' into pathology_dataset

d9059ec

Nic-Ma reviewed Mar 3, 2021

View reviewed changes

bhashemian added 10 commits March 3, 2021 10:41

Merge branch 'master' into pathology_dataset

c394ebe

Minor changes

8a279c3

Signed-off-by: Behrooz <[email protected]>

Merge branch 'pathology_dataset' into pathology_dataset

c6171d1

Merge branch 'master' into pathology_dataset

0082ac6

Merge branch 'master' into pathology_dataset

22846f8

Few variable name changes

c8750f0

Signed-off-by: Behrooz <[email protected]>

Add EnsureChannelFirst

a440caf

Signed-off-by: Behrooz <[email protected]>

Merge branch 'pathology_dataset' of github.com:behxyz/MONAI into path…

d4ff431

…ology_dataset

Add metadata to WSIReader

652f046

Signed-off-by: Behrooz <[email protected]>

Merge branch 'master' into pathology_dataset

2ffdf58

bhashemian enabled auto-merge (squash) March 4, 2021 18:01

bhashemian requested a review from Nic-Ma March 5, 2021 00:01

bhashemian added 2 commits March 4, 2021 19:02

Merge branch 'master' into pathology_dataset

1f32f71

Merge branch 'master' into pathology_dataset

2202f57

Nic-Ma approved these changes Mar 5, 2021

View reviewed changes

bhashemian merged commit 889c9f9 into Project-MONAI:master Mar 5, 2021

bhashemian deleted the pathology_dataset branch March 5, 2021 15:20

KumoLiu mentioned this pull request Dec 28, 2023

Update openslide-python version #7344

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WSI reader #1548

WSI reader #1548

bhashemian commented Feb 4, 2021 •

edited

Loading

bhashemian commented Feb 8, 2021

bhashemian commented Feb 11, 2021

wyli left a comment

Nic-Ma left a comment

Nic-Ma Mar 3, 2021

bhashemian Mar 3, 2021

wyli Mar 3, 2021

Nic-Ma Mar 3, 2021

bhashemian Mar 3, 2021

Nic-Ma left a comment

WSI reader #1548

WSI reader #1548

Conversation

bhashemian commented Feb 4, 2021 • edited Loading

Description

Types of changes

bhashemian commented Feb 8, 2021

bhashemian commented Feb 11, 2021

wyli left a comment

Choose a reason for hiding this comment

Nic-Ma left a comment

Choose a reason for hiding this comment

Nic-Ma Mar 3, 2021

Choose a reason for hiding this comment

bhashemian Mar 3, 2021

Choose a reason for hiding this comment

wyli Mar 3, 2021

Choose a reason for hiding this comment

Nic-Ma Mar 3, 2021

Choose a reason for hiding this comment

bhashemian Mar 3, 2021

Choose a reason for hiding this comment

Nic-Ma left a comment

Choose a reason for hiding this comment

bhashemian commented Feb 4, 2021 •

edited

Loading