Vision transformers for glioma classification using T1 magnetic resonance imaging
Automated image analysis and classification have increasingly advanced in recent decades owing to machine learning and computer vision. In particular, deep learning (DL) architectures have become popular in resource-limited and labor-restricted environments such as the health-care sector. Transformer architecture, a DL method with self-attention mechanism, excels in natural language processing; however, its application in image-based diagnosis in health-care sector remains limited. Herein, the feasibility, bottlenecks, and performance of transformers in magnetic resonance imaging (MRI)-based brain tumor classification were investigated. To this end, a vision transformer (ViT) model was trained and tested using the popular Brain Tumor Segmentation (BraTS) 2015 dataset for glioma classification. Owing to limited data availability, domain adaptation techniques were used to pretrain the ViT model and the BraTS 2015 dataset was used for its fine-tuning. With the model only trained for 100 epochs, the confusion matrix for the two-class problem of tumor and nontumor classification showed an overall classification accuracy of 81.8%. In conclusion, although convolutional neural networks are traditionally used for DL-based medical image classification owing to their attention mechanism and long-range dependency-capturing capability, ViTs can outperform them in MRI-based brain tumor classification.
- Pal A, Chaturvedi A, Garain U, Chandra A, Chatterjee R. Severity Grading of Psoriatic Plaques using Deep CNN Based Multi-task Learning. Mexico: ICPR; 2016. doi: 10.1109/ICPR.2016.7899846
- Wang G. A perspective on deep imaging. IEEE Access. 2016;4:8914-8924. doi: 10.1109/ACCESS.2016.2624938
- Ker J, Wang L, Rao J, Lim T. Deep learning applications in medical image analysis. IEEE Access. 2018;6:9375-9389. doi: 10.1109/ACCESS.2017.2788044
- Kabir Anaraki A, Ayati M, Kazemi F. Magnetic resonance imaging-based brain tumor grades classification and grading via convolutional neural networks and genetic algorithms. Biocybern. Biomed. Eng. 2019;39(1):63-74. doi: 10.1016/j.bbe.2018.10.004
- Kaldera HNTK, Gunasekara SR, Dissanayake MB. Brain Tumor Classification and Segmentation Using Faster R-CNN. In: Proceedings ASET. United States: IEEE; 2019. doi: 10.1109/ICASET.2019.8714263
- Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. In: NIPS’17: Proceedings of the 31st International Conference on Neural Information Processing Systems; 2017:6000-6010.
- Menze BH, Jakab A, Bauer S, et al. The multimodal brain tumor image segmentation benchmark (BRATS). IEEE Trans Med Imaging. 2015;34(10):1993-2024. doi: 10.1109/tmi.2014.2377694
- Alsaif H, Guesmi R, Alshammari BM, et al. A novel data augmentation-based brain tumor detection using convolutional neural network. Appl Sci. 2022;12(8):3773. doi: 10.3390/app12083773
- Pan X, Ge C, Lu R, Song S, Chen G, Huang Z, et al. On the Integration of Self-Attention and Convolution. In: 2022 IEEE/ CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE; 2022:805-815. doi: 10.1109/cvpr52688.2022.00089
- Devlin J, Chang MW, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Published online 2018. doi: 10.48550/ARXIV.1810.04805
- Dosovitskiy A, Beyer L, Kolesnikov A, et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. Published online 2020.doi: 10.48550/ARXIV.2010.11929
- Parmar N, Vaswani A, Uszkoreit J, et al. Image Transformer. In: JMLR Workshop and Conference Proceedings; 2018:4055- 4064.
- Zheng S, Lu J, Zhao H, et al. Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers. Published online 2020. doi: 10.48550/ARXIV.2012.15840
- Child R, Gray S, Radford A, Sutskever I. Generating Long Sequences with Sparse Transformers. Published online 2019. doi: 10.48550/ARXIV.1904.10509
- Wu H, Xiao B, Codella N, et al. CvT: Introducing Convolutions to Vision Transformers. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE; 2021:22-31. doi: 10.1109/iccv48922.2021.00009
- Carion N, Massa F, Synnaeve G, Usunier N, Kirillov A, Zagoruyko S. End-to-End Object Detection with Transformers. In: Computer Vision – ECCV 2020 (Lecture Notes in Computer Science). Springer International Publishing; 2020:213-229. doi: 10.1007/978-3-030-58452-8_13
- Aloraini M, Khan A, Aladhadh S, Habib S, Alsharekh MF, Islam M. Ombining the transformer and convolution for effective brain tumor classification using MRI Images. Appl Sci. 2023;13:3680. doi: 10.3390/app13063680
- Mehta S, Lu X, Weaver D, Elmore JG, Hajishirzi H, Shapiro L. HATNet: An End-to-End Holistic Attention Network for Diagnosis of Breast Biopsy Images. arXiv. Preprint posted online 2020. doi: 10.48550/arXiv.2007.13007
- Lan Y, Zou S, Qin B, Zhu X. Potential roles of transformers in brain tumor diagnosis and treatment. Brain-X. 2023;1(2):ae23. doi: 10.1002/brx2.23
- Courant R, Edberg M, Dufour N, Kalogeiton V. Transformers and visual transformers. In: Colliot O, editors. Machine Learning for Brain Disorders. Neuromethods. vol. 197. United States: Humana; 2023. doi: 10.1007/978-1-0716-3195-9_6
- Zunair H, Ben Hamza A. Sharp U-Net: Depthwise convolutional network for biomedical image segmentation. Comput Biol Med. 2021;136:104699. doi: 10.1016/j.compbiomed.2021.104699
- Dasanayaka C, Dharmasena B, Bandara WR, Dissanayake MB, Jayasinghe R. Segmentation of Mental Foramen in Dental Panoramic Tomography Using Deep Learning. In: 2019 IEEE 14th Conference on Industrial and Information Systems (ICIIS). IEEE; 2019:81-84. doi: 10.1109/ICIIS47346.2019.9063312
- Wang P, Yang Q, He Z, Yuan Y. Vision transformers in multi-modal brain tumor MRI segmentation: A review. Meta Radiol. 2023;1:100004. doi: 10.1016/j.metrad.2023.100004
- Marathe A, Kadam V, Chaumal A, Kodilkar S, Joshi A, Sawant S. Performance analysis of memory-efficient vision transformers in brain tumor segmentation. In: Artificial Intelligence-Based Healthcare Systems. Cham: Springer Nature Switzerland; 2023:125-133. doi: 10.1007/978-3-031-41925-6_9
- Asiri AA, Shaf A, Ali T, et al. Exploring the power of deep learning: Fine-tuned vision transformer for accurate and efficient brain tumor detection in MRI Scans. Diagnostics. 2023;13(12):2094. doi: 10.3390/diagnostics13122094
- Salama K. Image Classification with Vision Transformer; 2022. Available: https://keras.io/examples/vision/image_ classification_with_vision_transformer [Last accessed on 2022 Oct 10].
- Mabu S, Atsumo A, Kido S, Kuremoto T, Hirano Y. Investigating the effects of transfer learning on ROI-based classification of chest CT images: A case study on diffuse lung diseases. J Signal Process Syst. 2020;92:307-313. doi: 10.1007/s11265-019-01499-w
- Kanesamoorthy K, Dissanayake MB. Prediction of treatment failure of tuberculosis using support vector machine with genetic algorithm. Int J Mycobacteriol. 2021;10(3):279-284. doi: 10.4103/ijmy.ijmy_130_21
- Sun L, Zhang S, Chen H, Luo L. Brain tumor segmentation and survival prediction using multimodal MRI scans with deep learning. Front Neurosci. 2019;13:810. doi: 10.3389/fnins.2019.00810
- Latif G. DeepTumor: Framework for brain MR image classification, segmentation and tumor detection. Diagnostics (Basel). 2022;12(11):2888. doi: 10.3390/diagnostics12112888
- El-Melegy MT, El-Magd KMA. A Multiple Classifiers System for Automatic Multimodal Brain Tumor Segmentation. In: Proceedings of the 2019 15th International Computer Engineering Conference (ICENCO), Giza, Egypt. 29-30 December 2019. New York, NY, USA: IEEE; 2019. doi: 10.1109/ICENCO48310.2019.9027389
- Xue Y, Yang Y, Farhat FG, et al. Brain tumor classification with tumor segmentations and a dual path residual convolutional neural network from MRI and pathology images. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. Germany: Springer; 2020. p. 360-367. doi: 10.1007/978-3-030-46643-5_36
- Amin J, Sharif M, Gul N, Yasmin M, Shad SA. Brain tumor classification based on DWT fusion of MRI sequences using convolutional neural network. Pattern Recognit Lett. 2020;129:115-122. doi: 10.1016/j.patrec.2019.11.016
- Maram B, Rana P. Brain Tumour Detection on BraTS 2020 using U-Net. In: 2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India; 2021. p. 1-5. doi: 10.1109/ICRITO51393.2021.9596530
- Ferdous GJ, Sathi KA, Hossain MA, Hoque MM, Dewan MAA. LCDEiT: A linear complexity data-efficient image transformer for MRI brain tumor classification. IEEE Access. 2023;11:20337-20350. doi: 10.1109/ACCESS.2023.3244228
