Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Jun Li; Junyu Chen; Yucheng Tang; Ce Wang; Bennett A. Landman; S. Kevin Zhou

doi:10.1016/j.media.2023.102762

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Jun Li, Junyu Chen, Yucheng Tang, Ce Wang, Bennett A. Landman, S. Kevin Zhou

School of Medicine

Research output: Contribution to journal › Review article › peer-review

Abstract

Transformer, one of the latest technological advances of deep learning, has gained prevalence in natural language processing or computer vision. Since medical imaging bear some resemblance to computer vision, it is natural to inquire about the status quo of Transformers in medical imaging and ask the question: can the Transformer models transform medical imaging? In this paper, we attempt to make a response to the inquiry. After a brief introduction of the fundamentals of Transformers, especially in comparison with convolutional neural networks (CNNs), and highlighting key defining properties that characterize the Transformers, we offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging and exhibit current research progresses made in the areas of medical image segmentation, recognition, detection, registration, reconstruction, enhancement, etc. In particular, what distinguishes our review lies in its organization based on the Transformer's key defining properties, which are mostly derived from comparing the Transformer and CNN, and its type of architecture, which specifies the manner in which the Transformer and CNN are combined, all helping the readers to best understand the rationale behind the reviewed approaches. We conclude with discussions of future perspectives.

Original language	English (US)
Article number	102762
Journal	Medical image analysis
Volume	85
DOIs	https://doi.org/10.1016/j.media.2023.102762
State	Published - Apr 2023

Keywords

Medical imaging
Survey
Transformer

ASJC Scopus subject areas

Radiological and Ultrasound Technology
Health Informatics
Radiology Nuclear Medicine and imaging
Computer Vision and Pattern Recognition
Computer Graphics and Computer-Aided Design

Access to Document

10.1016/j.media.2023.102762

Cite this

@article{6f732c571bd641a79d51001dbfccfcd8,

title = "Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives",

abstract = "Transformer, one of the latest technological advances of deep learning, has gained prevalence in natural language processing or computer vision. Since medical imaging bear some resemblance to computer vision, it is natural to inquire about the status quo of Transformers in medical imaging and ask the question: can the Transformer models transform medical imaging? In this paper, we attempt to make a response to the inquiry. After a brief introduction of the fundamentals of Transformers, especially in comparison with convolutional neural networks (CNNs), and highlighting key defining properties that characterize the Transformers, we offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging and exhibit current research progresses made in the areas of medical image segmentation, recognition, detection, registration, reconstruction, enhancement, etc. In particular, what distinguishes our review lies in its organization based on the Transformer's key defining properties, which are mostly derived from comparing the Transformer and CNN, and its type of architecture, which specifies the manner in which the Transformer and CNN are combined, all helping the readers to best understand the rationale behind the reviewed approaches. We conclude with discussions of future perspectives.",

keywords = "Medical imaging, Survey, Transformer",

author = "Jun Li and Junyu Chen and Yucheng Tang and Ce Wang and Landman, {Bennett A.} and Zhou, {S. Kevin}",

note = "Publisher Copyright: {\textcopyright} 2023",

year = "2023",

month = apr,

doi = "10.1016/j.media.2023.102762",

language = "English (US)",

volume = "85",

journal = "Medical image analysis",

issn = "1361-8415",

publisher = "Elsevier",

}

TY - JOUR

T1 - Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

AU - Li, Jun

AU - Chen, Junyu

AU - Tang, Yucheng

AU - Wang, Ce

AU - Landman, Bennett A.

AU - Zhou, S. Kevin

PY - 2023/4

Y1 - 2023/4

N2 - Transformer, one of the latest technological advances of deep learning, has gained prevalence in natural language processing or computer vision. Since medical imaging bear some resemblance to computer vision, it is natural to inquire about the status quo of Transformers in medical imaging and ask the question: can the Transformer models transform medical imaging? In this paper, we attempt to make a response to the inquiry. After a brief introduction of the fundamentals of Transformers, especially in comparison with convolutional neural networks (CNNs), and highlighting key defining properties that characterize the Transformers, we offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging and exhibit current research progresses made in the areas of medical image segmentation, recognition, detection, registration, reconstruction, enhancement, etc. In particular, what distinguishes our review lies in its organization based on the Transformer's key defining properties, which are mostly derived from comparing the Transformer and CNN, and its type of architecture, which specifies the manner in which the Transformer and CNN are combined, all helping the readers to best understand the rationale behind the reviewed approaches. We conclude with discussions of future perspectives.

AB - Transformer, one of the latest technological advances of deep learning, has gained prevalence in natural language processing or computer vision. Since medical imaging bear some resemblance to computer vision, it is natural to inquire about the status quo of Transformers in medical imaging and ask the question: can the Transformer models transform medical imaging? In this paper, we attempt to make a response to the inquiry. After a brief introduction of the fundamentals of Transformers, especially in comparison with convolutional neural networks (CNNs), and highlighting key defining properties that characterize the Transformers, we offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging and exhibit current research progresses made in the areas of medical image segmentation, recognition, detection, registration, reconstruction, enhancement, etc. In particular, what distinguishes our review lies in its organization based on the Transformer's key defining properties, which are mostly derived from comparing the Transformer and CNN, and its type of architecture, which specifies the manner in which the Transformer and CNN are combined, all helping the readers to best understand the rationale behind the reviewed approaches. We conclude with discussions of future perspectives.

KW - Medical imaging

KW - Survey

KW - Transformer

UR - http://www.scopus.com/inward/record.url?scp=85148964632&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85148964632&partnerID=8YFLogxK

U2 - 10.1016/j.media.2023.102762

DO - 10.1016/j.media.2023.102762

M3 - Review article

C2 - 36738650

AN - SCOPUS:85148964632

SN - 1361-8415

VL - 85

JO - Medical image analysis

JF - Medical image analysis

M1 - 102762

ER -

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this