Histo-fetch - On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training

Brendon Lutnick; Leema Krishna Murali; Brandon Ginley; Avi Z. Rosenberg; Pinaki Sarder

doi:10.4103/jpi.jpi_59_20

School of Medicine

Research output: Contribution to journal › Article › peer-review

Abstract

Background: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient.

Methods: We created a custom pipeline (histo-fetch) to efficiently extract random patches and labels from pathology WSIs for input to a neural network on-the-fly. We prefetch these patches as needed during network training, avoiding the need for WSI preparation such as chopping/tiling.

Results & Conclusions: We demonstrate the utility of this pipeline to perform artificial stain transfer and image generation using the popular networks CycleGAN and ProGAN, respectively. For a large WSI dataset, histo-fetch is 98.6% faster to start training and used 7535x less disk space.
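
The on-the-fly sampling with prefetching described in the Methods can be illustrated with a minimal sketch. This is not the authors' histo-fetch code: it assumes small synthetic in-memory grids in place of real WSIs, uses only the Python standard library rather than TensorFlow, and the names `random_patch` and `prefetch_patches` are hypothetical. It only shows the general idea of cropping random patches on demand while a background thread keeps a small buffer filled, so that no tiled dataset is written to disk.

```python
import queue
import random
import threading


def random_patch(slide, patch_size, rng):
    """Crop one random square patch from a 2-D grid (a list of rows)."""
    height, width = len(slide), len(slide[0])
    top = rng.randrange(height - patch_size + 1)
    left = rng.randrange(width - patch_size + 1)
    return [row[left:left + patch_size] for row in slide[top:top + patch_size]]


def prefetch_patches(slides, patch_size, num_patches, buffer_size=8, seed=0):
    """Yield random patches while a background thread fills a bounded buffer."""
    rng = random.Random(seed)
    buffer = queue.Queue(maxsize=buffer_size)  # bounded, so memory stays small
    done = object()  # sentinel marking the end of the stream

    def producer():
        for _ in range(num_patches):
            slide = rng.choice(slides)  # pick a slide at random
            buffer.put(random_patch(slide, patch_size, rng))  # blocks when full
        buffer.put(done)

    # Daemon thread: it will not keep the process alive if the consumer stops early.
    threading.Thread(target=producer, daemon=True).start()
    while True:
        item = buffer.get()
        if item is done:
            return
        yield item


# Synthetic stand-ins for slides (assumption): three 64x64 grids whose values
# encode slide index, row, and column.
slides = [
    [[s * 10000 + r * 100 + c for c in range(64)] for r in range(64)]
    for s in range(3)
]

patches = list(prefetch_patches(slides, patch_size=16, num_patches=10))
print(len(patches), "patches of size", len(patches[0]), "x", len(patches[0][0]))
```

In a real training loop the consumer would be the network's input step, and the buffer size would be tuned so that patch extraction overlaps with computation.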

Original language	English (US)
Pages (from-to)	7
Number of pages	1
Journal	Journal of Pathology Informatics
Volume	13
Issue number	1
DOIs	https://doi.org/10.4103/jpi.jpi_59_20
State	Published - Jan 1 2022

Keywords

Convolutional neural network
generative adversarial network
tensorflow
whole slide images

ASJC Scopus subject areas

Health Informatics
Pathology and Forensic Medicine
Computer Science Applications

Cite this

@article{1a45c23d3abf498a8ec6ad81038c366f,

title = "Histo-fetch - On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training",

abstract = "Background: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient. Methods: We created a custom pipeline (histo-fetch) to efficiently extract random patches and labels from pathology WSIs for input to a neural network on-the-fly. We prefetch these patches as needed during network training, avoiding the need for WSI preparation such as chopping/tiling. Results & Conclusions: We demonstrate the utility of this pipeline to perform artificial stain transfer and image generation using the popular networks CycleGAN and ProGAN, respectively. For a large WSI dataset, histo-fetch is 98.6% faster to start training and used 7535x less disk space.",

keywords = "Convolutional neural network, generative adversarial network, tensorflow, whole slide images",

author = "Brendon Lutnick and Murali, {Leema Krishna} and Brandon Ginley and Rosenberg, {Avi Z.} and Pinaki Sarder",

note = "Publisher Copyright: {\textcopyright} 2022 Journal of Pathology Informatics | Published by Wolters Kluwer - Medknow.",

year = "2022",

month = jan,

day = "1",

doi = "10.4103/jpi.jpi_59_20",

language = "English (US)",

volume = "13",

pages = "7",

journal = "Journal of Pathology Informatics",

issn = "2229-5089",

publisher = "Medknow Publications and Media Pvt. Ltd",

number = "1",

}

TY - JOUR

T1 - Histo-fetch - On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training

AU - Lutnick, Brendon

AU - Murali, Leema Krishna

AU - Ginley, Brandon

AU - Rosenberg, Avi Z.

AU - Sarder, Pinaki

PY - 2022/1/1

Y1 - 2022/1/1

N2 - Background: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient. Methods: We created a custom pipeline (histo-fetch) to efficiently extract random patches and labels from pathology WSIs for input to a neural network on-the-fly. We prefetch these patches as needed during network training, avoiding the need for WSI preparation such as chopping/tiling. Results & Conclusions: We demonstrate the utility of this pipeline to perform artificial stain transfer and image generation using the popular networks CycleGAN and ProGAN, respectively. For a large WSI dataset, histo-fetch is 98.6% faster to start training and used 7535x less disk space.

AB - Background: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient. Methods: We created a custom pipeline (histo-fetch) to efficiently extract random patches and labels from pathology WSIs for input to a neural network on-the-fly. We prefetch these patches as needed during network training, avoiding the need for WSI preparation such as chopping/tiling. Results & Conclusions: We demonstrate the utility of this pipeline to perform artificial stain transfer and image generation using the popular networks CycleGAN and ProGAN, respectively. For a large WSI dataset, histo-fetch is 98.6% faster to start training and used 7535x less disk space.

KW - Convolutional neural network

KW - generative adversarial network

KW - tensorflow

KW - whole slide images

UR - http://www.scopus.com/inward/record.url?scp=85124653582&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85124653582&partnerID=8YFLogxK

U2 - 10.4103/jpi.jpi_59_20

DO - 10.4103/jpi.jpi_59_20

M3 - Article

C2 - 35136674

AN - SCOPUS:85124653582

SN - 2229-5089

VL - 13

SP - 7

JO - Journal of Pathology Informatics

JF - Journal of Pathology Informatics

IS - 1

ER -
