Histo-fetch - On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training

Brendon Lutnick, Leema Krishna Murali, Brandon Ginley, Avi Z. Rosenberg, Pinaki Sarder

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Training convolutional neural networks using pathology whole slide images (WSIs) is traditionally prefaced by the extraction of a training dataset of image patches. While effective, for large datasets of WSIs, this dataset preparation is inefficient. Methods: We created a custom pipeline (histo-fetch) to efficiently extract random patches and labels from pathology WSIs for input to a neural network on-the-fly. We prefetch these patches as needed during network training, avoiding the need for WSI preparation such as chopping/tiling. Results & Conclusions: We demonstrate the utility of this pipeline to perform artificial stain transfer and image generation using the popular networks CycleGAN and ProGAN, respectively. For a large WSI dataset, histo-fetch is 98.6% faster to start training and used 7535x less disk space.

Original languageEnglish (US)
Pages (from-to)7
Number of pages1
JournalJournal of Pathology Informatics
Volume13
Issue number1
DOIs
StatePublished - Jan 1 2022

Keywords

  • Convolutional neural network
  • generative adversarial network
  • tensorflow
  • whole slide images

ASJC Scopus subject areas

  • Health Informatics
  • Pathology and Forensic Medicine
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Histo-fetch - On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training'. Together they form a unique fingerprint.

Cite this