3D Medical Image Reconstruction through Transformer-Based Neural Networks: A Comparative Study
DOI:
https://doi.org/10.51903/jtie.v4i3.453Keywords:
Transformer-Based Neural Networks, 3D Medical Image Reconstruction, Convolutional Neural Network (CNN), Deep Learning in Biomedical Engineering, CT/MRI Imaging AccuracyAbstract
Three-dimensional reconstruction of CT and MRI images remains a persistent challenge in medical imaging, where clinicians require high‐fidelity volumes that preserve subtle anatomical details while remaining computationally efficient. This study evaluates a transformer-based neural network against a conventional convolutional neural network (CNN) baseline to determine which architecture delivers superior reconstruction accuracy for clinical use. A standard deep learning pipeline was constructed, which included data curation, intensity normalization, and augmentation, prior to training the models. The experimental comparison studied two representative architectures, a 3D U-Net that served as the CNN benchmark, and a 3D Swin Transform, that served as the attention approach. The quantitative analysis showed that the transformer produced a higher Peak Signal-to-Noise-Ratio (35.8 dB vs 33.1 dB), better Structural Similarity Index Measure (0.942 vs 0.911), and better Dice coefficient (0.91 vs 0.87) with little differences with respect to inference time per volume. The visual analysis showed sharper cortical folds and clearer lesion edges, which radiologists linked with higher diagnostic confidence. The transformer’s ability to model global spatial dependencies and reduce noise artifacts facilitates accurate and clinically pertinent reconstructions. This study shows that transformer models can be computationally efficient but more precise than CNN alternatives, which support their implementation in hospital Picture Archiving and Communication Systems (PACS) and within future real time patient diagnostics workflows. Taken together, these findings support the collective efforts of engineers and healthcare providers to leverage future algorithmic improvements that can enhance patient care and the safety of imaging.
References
Ahishakiye, E., Van Gijzen, M. B., Tumwiine, J., Wario, R., & Obungoloch, J. (2021). A survey on deep learning in medical image reconstruction. In Intelligent Medicine (Vol. 1, Issue 3, pp. 118–127). Elsevier B.V. https://doi.org/10.1016/j.imed.2021.03.003
Boulogeorgos, A.-A. A., Trevlakis, S. E., Tegos, S. A., Papanikolaou, V. K., & Karagiannidis, G. K. (2020). Machine Learning in Nano-Scale Biomedical Engineering. http://arxiv.org/abs/2008.02195
Chen, J., Zhang, Y., Pan, Y., Xu, P., & Guan, C. (2022). A Transformer-based deep neural network model for SSVEP classification. http://arxiv.org/abs/2210.04172
Chen, X., Diaz-Pinto, A., Ravikumar, N., & Frangi, A. F. (2021). Deep learning in medical image registration. In Progress in Biomedical Engineering (Vol. 3, Issue 1). IOP Publishing Ltd. https://doi.org/10.1088/2516-1091/abd37c
Chen, Z., Agarwal, D., Aggarwal, K., Safta, W., Balan, M. M., Brown, K., & Squibb, B. M. (2023). Masked Image Modeling Advances 3D Medical Image Analysis. https://www.synapse.org/#!Synapse:syn3193805/wiki/89480
Dzobo, K., Adotey, S., Thomford, N. E., & Dzobo, W. (2020). Integrating Artificial and Human Intelligence: A Partnership for Responsible Innovation in Biomedical Engineering and Medicine. In OMICS A Journal of Integrative Biology (Vol. 24, Issue 5, pp. 247–263). Mary Ann Liebert Inc. https://doi.org/10.1089/omi.2019.0038
Ghofur, M. J. U., & Riyanto, E. (2025). AI-Driven Adaptive Radar Systems for Real-Time Target Tracking in Urban Environments. Journal of Technology Informatics and Engineering, 4(1). https://doi.org/10.51903/jtie.v4i1.289
Harrisha, M., Monikasree, J., Swathi, J., & Karthika, D. (2025). Smart Healthcare: Harnessing AI for Early prediction of Neurodegenerative disease. Journal of Technology Informatics and Engineering, 4(2), 214–224. https://doi.org/10.51903/jtie.v4i2.269
Hu, J., Gao, J., Fang, X., Liu, Z., Wang, F., Huang, W., wu, H., & Zhao, G. (2022). DTSyn: a dual-transformer-based neural network to predict synergistic drug combinations. https://doi.org/10.1101/2022.03.29.486200
Huang, Z., Mo, X., & Lv, C. (2021). Multi-modal Motion Prediction with Transformer-based Neural Network for Autonomous Driving. http://arxiv.org/abs/2109.06446
Ibrahim, S. M., Go, E.-M., & Iranda, J. (2024). Scalable and Secure IoT-Driven Vibration Monitoring: Advancing Predictive Maintenance in Industrial Systems. Journal of Technology Informatics and Engineering, 3(3), 370–381. https://doi.org/10.51903/jtie.v3i3.210
Kattenborn, T., Leitloff, J., Schiefer, F., & Hinz, S. (2021). Review on Convolutional Neural Networks (CNN) in vegetation remote sensing. In ISPRS Journal of Photogrammetry and Remote Sensing (Vol. 173, pp. 24–49). Elsevier B.V. https://doi.org/10.1016/j.isprsjprs.2020.12.010
Kim, J. H., Choi, K. Y., Lee, S. H., Lee, D. J., Park, B. J., Yoon, D. Y., & Rho, Y. S. (2020). The value of CT, MRI, and PET-CT in detecting retropharyngeal lymph node metastasis of head and neck squamous cell carcinoma. BMC Medical Imaging, 20(1). https://doi.org/10.1186/s12880-020-00487-y
Krasnov, L., Khokhlov, I., Fedorov, M. V., & Sosnin, S. (2021). Transformer-based artificial neural networks for the conversion between chemical notations. Scientific Reports, 11(1). https://doi.org/10.1038/s41598-021-94082-y
Krichen, M. (2023). Convolutional Neural Networks: A Survey. Computers, 12(8). https://doi.org/10.3390/computers12080151
Lother, D., Robert, M., Elwood, E., Smith, S., Tunariu, N., Johnston, S. R. D., Parton, M., Bhaludin, B., Millard, T., Downey, K., & Sharma, B. (2023). Imaging in metastatic breast cancer, CT, PET/CT, MRI, WB-DWI, CCA: review and new perspectives. In Cancer Imaging (Vol. 23, Issue 1). BioMed Central Ltd. https://doi.org/10.1186/s40644-023-00557-8
Lu, J., Tan, L., & Jiang, H. (2021). Review on convolutional neural network (CNN) applied to plant leaf disease classification. In Agriculture (Switzerland) (Vol. 11, Issue 8). MDPI AG. https://doi.org/10.3390/agriculture11080707
Luo, K., Zheng, H., & Shi, Z. (2023). A simple feature extraction method for estimating the whole life cycle state of health of lithium-ion batteries using transformer-based neural network. Journal of Power Sources, 576. https://doi.org/10.1016/j.jpowsour.2023.233139
Marcello Scotti, F., Stuepp, R. T., Dutra-Horstmann, K. L., Modolo, F., & Gusmão Paraiso Cavalcanti, M. (2022). Accuracy of MRI, CT, and Ultrasound imaging on thickness and depth of oral primary carcinomas invasion: a systematic review. Dentomaxillofacial Radiology, 51(5). https://doi.org/10.1259/dmfr.20210291
Purwono, Ma’arif, A., Rahmaniar, W., Fathurrahman, H. I. K., Frisky, A. Z. K., & Haq, Q. M. U. (2022). Understanding of Convolutional Neural Network (CNN): A Review. International Journal of Robotics and Control Systems, 2(4), 739–748. https://doi.org/10.31763/ijrcs.v2i4.888
Sholekhah, D. Z., & Noviar, D. (2025). Integrative Deep Learning Architecture for High-Accuracy Medical Image Segmentation: Combining U-Net, ResNet, and Transformers. Journal of Technology Informatics and Engineering, 4(1), 115–134. https://doi.org/10.51903/jtie.v4i1.288
Sun, H., Jian, S., Peng, B., & Hou, J. (2022). Comparison of magnetic resonance imaging and computed tomography in the diagnosis of acute pancreatitis: a systematic review and meta-analysis of diagnostic test accuracy studies. Annals of Translational Medicine, 10(7), 410–410. https://doi.org/10.21037/atm-22-812
Susatyono, J. D., Suasana, I. S., & Rozikin, K. (2024). Integrating Big Data and Edge Computing for Enhancing AI Efficiency in Real-Time Applications. Journal of Technology Informatics and Engineering, 3(3), 337–349. https://doi.org/10.51903/jtie.v3i3.204
Wang, K., He, B., & Zhu, W.-P. (2021). TSTNN: TWO-STAGE TRANSFORMER BASED NEURAL NETWORK FOR SPEECH ENHANCEMENT IN THE TIME DOMAIN.
Webber, G., & Reader, A. J. (2024). Diffusion Models for Medical Image Reconstruction. BJR|Artificial Intelligence. https://doi.org/10.1093/bjrai/ubae013
Zhang, H., & Dong, B. (2022). A Review on Deep Learning in Medical Image Reconstruction. https://doi.org/10.1007/s40305-019-00287-4
Zhang, Q., Xiao, J., Tian, C., Chun-Wei Lin, J., & Zhang, S. (2023). A robust deformed convolutional neural network (CNN) for image denoising. CAAI Transactions on Intelligence Technology, 8(2), 331–342. https://doi.org/10.1049/cit2.12110
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Wei Ling Tan, Arjun Menon

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

