3D Medical Image Reconstruction through Transformer-Based Neural Networks: A Comparative Study

Authors

  • Wei Ling Tan James Cook University Singapore, Singapore
  • Arjun Menon James Cook University Singapore, Singapore

DOI:

https://doi.org/10.51903/jtie.v4i3.453

Keywords:

Transformer-Based Neural Networks, 3D Medical Image Reconstruction, Convolutional Neural Network (CNN), Deep Learning in Biomedical Engineering, CT/MRI Imaging Accuracy

Abstract

Three-dimensional reconstruction of CT and MRI images remains a persistent challenge in medical imaging, where clinicians require high‐fidelity volumes that preserve subtle anatomical details while remaining computationally efficient. This study evaluates a transformer-based neural network against a conventional convolutional neural network (CNN) baseline to determine which architecture delivers superior reconstruction accuracy for clinical use. A standard deep learning pipeline was constructed, which included data curation, intensity normalization, and augmentation, prior to training the models. The experimental comparison studied two representative architectures, a 3D U-Net that served as the CNN benchmark, and a 3D Swin Transform, that served as the attention approach. The quantitative analysis showed that the transformer produced a higher Peak Signal-to-Noise-Ratio (35.8 dB vs 33.1 dB), better Structural Similarity Index Measure (0.942 vs 0.911), and better Dice coefficient (0.91 vs 0.87) with little differences with respect to inference time per volume. The visual analysis showed sharper cortical folds and clearer lesion edges, which radiologists linked with higher diagnostic confidence. The transformer’s ability to model global spatial dependencies and reduce noise artifacts facilitates accurate and clinically pertinent reconstructions. This study shows that transformer models can be computationally efficient but more precise than CNN alternatives, which support their implementation in hospital Picture Archiving and Communication Systems (PACS) and within future real time patient diagnostics workflows. Taken together, these findings support the collective efforts of engineers and healthcare providers to leverage future algorithmic improvements that can enhance patient care and the safety of imaging.

References

Ahishakiye, E., Van Gijzen, M. B., Tumwiine, J., Wario, R., & Obungoloch, J. (2021). A survey on deep learning in medical image reconstruction. In Intelligent Medicine (Vol. 1, Issue 3, pp. 118–127). Elsevier B.V. https://doi.org/10.1016/j.imed.2021.03.003

Boulogeorgos, A.-A. A., Trevlakis, S. E., Tegos, S. A., Papanikolaou, V. K., & Karagiannidis, G. K. (2020). Machine Learning in Nano-Scale Biomedical Engineering. http://arxiv.org/abs/2008.02195

Chen, J., Zhang, Y., Pan, Y., Xu, P., & Guan, C. (2022). A Transformer-based deep neural network model for SSVEP classification. http://arxiv.org/abs/2210.04172

Chen, X., Diaz-Pinto, A., Ravikumar, N., & Frangi, A. F. (2021). Deep learning in medical image registration. In Progress in Biomedical Engineering (Vol. 3, Issue 1). IOP Publishing Ltd. https://doi.org/10.1088/2516-1091/abd37c

Chen, Z., Agarwal, D., Aggarwal, K., Safta, W., Balan, M. M., Brown, K., & Squibb, B. M. (2023). Masked Image Modeling Advances 3D Medical Image Analysis. https://www.synapse.org/#!Synapse:syn3193805/wiki/89480

Dzobo, K., Adotey, S., Thomford, N. E., & Dzobo, W. (2020). Integrating Artificial and Human Intelligence: A Partnership for Responsible Innovation in Biomedical Engineering and Medicine. In OMICS A Journal of Integrative Biology (Vol. 24, Issue 5, pp. 247–263). Mary Ann Liebert Inc. https://doi.org/10.1089/omi.2019.0038

Ghofur, M. J. U., & Riyanto, E. (2025). AI-Driven Adaptive Radar Systems for Real-Time Target Tracking in Urban Environments. Journal of Technology Informatics and Engineering, 4(1). https://doi.org/10.51903/jtie.v4i1.289

Harrisha, M., Monikasree, J., Swathi, J., & Karthika, D. (2025). Smart Healthcare: Harnessing AI for Early prediction of Neurodegenerative disease. Journal of Technology Informatics and Engineering, 4(2), 214–224. https://doi.org/10.51903/jtie.v4i2.269

Hu, J., Gao, J., Fang, X., Liu, Z., Wang, F., Huang, W., wu, H., & Zhao, G. (2022). DTSyn: a dual-transformer-based neural network to predict synergistic drug combinations. https://doi.org/10.1101/2022.03.29.486200

Huang, Z., Mo, X., & Lv, C. (2021). Multi-modal Motion Prediction with Transformer-based Neural Network for Autonomous Driving. http://arxiv.org/abs/2109.06446

Ibrahim, S. M., Go, E.-M., & Iranda, J. (2024). Scalable and Secure IoT-Driven Vibration Monitoring: Advancing Predictive Maintenance in Industrial Systems. Journal of Technology Informatics and Engineering, 3(3), 370–381. https://doi.org/10.51903/jtie.v3i3.210

Kattenborn, T., Leitloff, J., Schiefer, F., & Hinz, S. (2021). Review on Convolutional Neural Networks (CNN) in vegetation remote sensing. In ISPRS Journal of Photogrammetry and Remote Sensing (Vol. 173, pp. 24–49). Elsevier B.V. https://doi.org/10.1016/j.isprsjprs.2020.12.010

Kim, J. H., Choi, K. Y., Lee, S. H., Lee, D. J., Park, B. J., Yoon, D. Y., & Rho, Y. S. (2020). The value of CT, MRI, and PET-CT in detecting retropharyngeal lymph node metastasis of head and neck squamous cell carcinoma. BMC Medical Imaging, 20(1). https://doi.org/10.1186/s12880-020-00487-y

Krasnov, L., Khokhlov, I., Fedorov, M. V., & Sosnin, S. (2021). Transformer-based artificial neural networks for the conversion between chemical notations. Scientific Reports, 11(1). https://doi.org/10.1038/s41598-021-94082-y

Krichen, M. (2023). Convolutional Neural Networks: A Survey. Computers, 12(8). https://doi.org/10.3390/computers12080151

Lother, D., Robert, M., Elwood, E., Smith, S., Tunariu, N., Johnston, S. R. D., Parton, M., Bhaludin, B., Millard, T., Downey, K., & Sharma, B. (2023). Imaging in metastatic breast cancer, CT, PET/CT, MRI, WB-DWI, CCA: review and new perspectives. In Cancer Imaging (Vol. 23, Issue 1). BioMed Central Ltd. https://doi.org/10.1186/s40644-023-00557-8

Lu, J., Tan, L., & Jiang, H. (2021). Review on convolutional neural network (CNN) applied to plant leaf disease classification. In Agriculture (Switzerland) (Vol. 11, Issue 8). MDPI AG. https://doi.org/10.3390/agriculture11080707

Luo, K., Zheng, H., & Shi, Z. (2023). A simple feature extraction method for estimating the whole life cycle state of health of lithium-ion batteries using transformer-based neural network. Journal of Power Sources, 576. https://doi.org/10.1016/j.jpowsour.2023.233139

Marcello Scotti, F., Stuepp, R. T., Dutra-Horstmann, K. L., Modolo, F., & Gusmão Paraiso Cavalcanti, M. (2022). Accuracy of MRI, CT, and Ultrasound imaging on thickness and depth of oral primary carcinomas invasion: a systematic review. Dentomaxillofacial Radiology, 51(5). https://doi.org/10.1259/dmfr.20210291

Purwono, Ma’arif, A., Rahmaniar, W., Fathurrahman, H. I. K., Frisky, A. Z. K., & Haq, Q. M. U. (2022). Understanding of Convolutional Neural Network (CNN): A Review. International Journal of Robotics and Control Systems, 2(4), 739–748. https://doi.org/10.31763/ijrcs.v2i4.888

Sholekhah, D. Z., & Noviar, D. (2025). Integrative Deep Learning Architecture for High-Accuracy Medical Image Segmentation: Combining U-Net, ResNet, and Transformers. Journal of Technology Informatics and Engineering, 4(1), 115–134. https://doi.org/10.51903/jtie.v4i1.288

Sun, H., Jian, S., Peng, B., & Hou, J. (2022). Comparison of magnetic resonance imaging and computed tomography in the diagnosis of acute pancreatitis: a systematic review and meta-analysis of diagnostic test accuracy studies. Annals of Translational Medicine, 10(7), 410–410. https://doi.org/10.21037/atm-22-812

Susatyono, J. D., Suasana, I. S., & Rozikin, K. (2024). Integrating Big Data and Edge Computing for Enhancing AI Efficiency in Real-Time Applications. Journal of Technology Informatics and Engineering, 3(3), 337–349. https://doi.org/10.51903/jtie.v3i3.204

Wang, K., He, B., & Zhu, W.-P. (2021). TSTNN: TWO-STAGE TRANSFORMER BASED NEURAL NETWORK FOR SPEECH ENHANCEMENT IN THE TIME DOMAIN.

Webber, G., & Reader, A. J. (2024). Diffusion Models for Medical Image Reconstruction. BJR|Artificial Intelligence. https://doi.org/10.1093/bjrai/ubae013

Zhang, H., & Dong, B. (2022). A Review on Deep Learning in Medical Image Reconstruction. https://doi.org/10.1007/s40305-019-00287-4

Zhang, Q., Xiao, J., Tian, C., Chun-Wei Lin, J., & Zhang, S. (2023). A robust deformed convolutional neural network (CNN) for image denoising. CAAI Transactions on Intelligence Technology, 8(2), 331–342. https://doi.org/10.1049/cit2.12110

Downloads

Published

2025-12-20

How to Cite

3D Medical Image Reconstruction through Transformer-Based Neural Networks: A Comparative Study. (2025). Journal of Technology Informatics and Engineering, 4(3). https://doi.org/10.51903/jtie.v4i3.453