Hybrid Explainable AI (XAI) Framework for Detecting Adversarial Attacks in Cyber-Physical Systems

Authors

  • Mohammad Taufik Universitas Tadulako, Palu, Sulawesi Tengah, Indonesia
  • Mohammad Saddam Aziz Universitas Tadulako, Palu, Sulawesi Tengah, Indonesia
  • Aulia Fitriana Universitas Tadulako, Palu, Sulawesi Tengah, Indonesia

DOI:

https://doi.org/10.51903/jtie.v4i1.295

Keywords:

Cyber-Physical Systems, Adversarial Attack Detection, Explainable Artificial Intelligence

Abstract

Cyber-Physical Systems (CPS) are increasingly deployed in critical infrastructure yet remain vulnerable to adversarial attacks that manipulate sensor data to mislead AI-based decision-making. These threats demand not only high-accuracy detection but also transparency in model reasoning. This study proposes a Hybrid Explainable AI (XAI) Framework that integrates Convolutional Neural Networks (CNN), SHAP-based feature interpretation, and rule-based reasoning to detect adversarial inputs in CPS environments. The framework is tested on two simulation scenarios: industrial sensor networks and autonomous traffic sign recognition. Using datasets of 10,000 samples (50% adversarial via FGSM and PGD), the model achieved an accuracy of 97.25%, precision of 96.80%, recall of 95.90%, and F1-score of 96.35%. SHAP visualizations effectively distinguished between normal and adversarial inputs, and the added explainability module increased inference time by only 8.5% over the baseline CNN (from 18.5 ms to 20.1 ms), making it suitable for real-time CPS deployment. Compared to prior methods (e.g., CNN + Grad-CAM, Random Forest + LIME), the proposed hybrid framework demonstrates superior performance and interpretability. The novelty of this work lies in its tri-level integration of predictive accuracy, explainability, and rule-based logic within a single real-time detection system—an approach not previously applied in CPS adversarial defense. This research contributes toward trustworthy AI systems that are robust, explainable, and secure by design.

References

Al-Essa, M., Andresini, G., Appice, A., & Malerba, D. (2022). An XAI-based adversarial training approach for cyber-threat detection. Proceedings of the 2022 IEEE International Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2022. https://doi.org/10.1109/DASC/PiCom/CBDCom/Cy55231.2022.9927842

Awotunde, J. B., Oguns, Y. J., Amuda, K. A., Nigar, N., Adeleke, T. A., Olagunju, K. M., & Ajagbe, S. A. (2023). Cyber-Physical Systems Security: Analysis, Opportunities, Challenges, and Future Prospects. Advances in Information Security, 102, 21–46. https://doi.org/10.1007/978-3-031-25506-9_2

Bai, M., Liu, P., Lv, F., Fang, D., Lv, S., Zhang, W., & Sun, L. (2024). Adversarial Attack against Intrusion Detectors in Cyber-Physical Systems With Minimal Perturbations. Proceedings - 2024 IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2024, 816–825. https://doi.org/10.1109/ISPA63168.2024.00109

Bai, Y., Park, J., Tehranipoor, M., & Forte, D. (2022). Real-time instruction-level verification of remote IoT/CPS devices via side channels. Discover Internet of Things, 2(1). https://doi.org/10.1007/s43926-022-00021-2

Bajaj, A., & Vishwakarma, D. K. (2024). A state-of-the-art review on adversarial machine learning in image classification. Multimedia Tools and Applications, 83(3), 9351–9416. https://doi.org/10.1007/s11042-023-15883-z

Doghri, W., Saddoud, A., & Chaari Fourati, L. (2022). Cyber-physical systems for structural health monitoring: sensing technologies and intelligent computing. Journal of Supercomputing, 78(1), 766–809. https://doi.org/10.1007/s11227-021-03875-5

Duo, W., Zhou, M. C., & Abusorrah, A. (2022). A Survey of Cyber Attacks on Cyber Physical Systems: Recent Advances and Challenges. IEEE/CAA Journal of Automatica Sinica, 9(5), 784–800. https://doi.org/10.1109/JAS.2022.105548

El-Kady, A. H., Halim, S., El-Halwagi, M. M., & Khan, F. (2023). Analysis of safety and security challenges and opportunities related to cyber-physical systems. Process Safety and Environmental Protection, 173, 384–413. https://doi.org/10.1016/j.psep.2023.03.012

Elgarhy, I., Badr, M. M., Mahmoud, M., Alsabaan, M., Alshawi, T., & Alsaqhan, M. (2024). XAI-Based Accurate Anomaly Detector That Is Robust Against Black-Box Evasion Attacks for the Smart Grid. Applied Sciences (Switzerland), 14(21). https://doi.org/10.3390/app14219897

Ennab, M., & Mcheick, H. (2025). Advancing AI Interpretability in Medical Imaging: A Comparative Analysis of Pixel-Level Interpretability and Grad-CAM Models. Machine Learning and Knowledge Extraction, 7(1). https://doi.org/10.3390/make7010012

Gipiskis, R., Chiaro, D., Preziosi, M., Prezioso, E., & Piccialli, F. (2023). The Impact of Adversarial Attacks on Interpretable Semantic Segmentation in Cyber-Physical Systems. IEEE Systems Journal, 17(4), 5327–5334. https://doi.org/10.1109/JSYST.2023.3281079

Hamzah, M., Islam, M. M., Hassan, S., Akhtar, M. N., Ferdous, M. J., Jasser, M. B., & Mohamed, A. W. (2023). Distributed Control of Cyber Physical Systems on Various Domains: A Critical Review. Systems, 11(4). https://doi.org/10.3390/systems11040208

Hulsen, T. (2023). Explainable Artificial Intelligence (XAI): Concepts and Challenges in Healthcare. AI (Switzerland), 4(3), 652–666. https://doi.org/10.3390/ai4030034

Islam, M. M., Rifat, H. R., Shahid, M. S. Bin, Akhter, A., Uddin, M. A., & Uddin, K. M. M. (2024). Explainable Machine Learning for Efficient Diabetes Prediction Using Hyperparameter Tuning, SHAP Analysis, Partial Dependency, and LIME. Engineering Reports. https://doi.org/10.1002/eng2.13080

Kok, I., Okay, F. Y., Muyanli, O., & Ozdemir, S. (2023). Explainable Artificial Intelligence (XAI) for Internet of Things: A Survey. IEEE Internet of Things Journal, 10(16), 14764–14779. https://doi.org/10.1109/JIOT.2023.3287678

Machlev, R., Perl, M., Levy, K. Y., Belikov, J., Mannor, S., Levron, Y., & Heistrene, L. (2022). Explainable Artificial Intelligence (XAI) techniques for energy and power systems: review, challenges and opportunities. Energy and AI, 9. https://www.sciencedirect.com/science/article/pii/S2666546822000246

Maiti, R. R., Yoong, C. H., Palleti, V. R., Silva, A., & Poskitt, C. M. (2023). Mitigating Adversarial Attacks on Data-Driven Invariant Checkers for Cyber-Physical Systems. IEEE Transactions on Dependable and Secure Computing, 20(4), 3378–3391. https://doi.org/10.1109/TDSC.2022.3194089

Minh, D., Wang, H. X., Li, Y. F., & Nguyen, T. N. (2022). Explainable artificial intelligence: a comprehensive review. Artificial Intelligence Review, 55(5), 3503–3568. https://doi.org/10.1007/s10462-021-10088-y

Momtaz, A., Basnet, N., Abbas, H., & Bonakdarpour, B. (2023). Predicate monitoring in distributed cyber-physical systems. International Journal on Software Tools for Technology Transfer, 25(4), 541–556. https://doi.org/10.1007/s10009-023-00718-x

Munshi, R. M., Cascone, L., Alturki, N., Saidani, O., Alshardan, A., & Umer, M. (2024). A novel approach for breast cancer detection using optimized ensemble learning framework and XAI. Image and Vision Computing, 142. https://doi.org/10.1016/j.imavis.2024.104910

Mustafaev, B., Kim, S., & Kim, E. (2023). Enhancing Metal Surface Defect Recognition Through Image Patching and Synthetic Defect Generation. IEEE Access, 11, 113339–113359. https://doi.org/10.1109/ACCESS.2023.3322734

Naveenan, R. V., & Suresh, G. (2023). Cyber Risk and the Cost of Unpreparedness of Financial Institutions. Cyber Security and Business Intelligence: Innovations and Machine Learning for Cyber Risk Management, 15–36. https://doi.org/10.4324/9781003285854-2

Nwakanma, C. I., Ahakonye, L. A. C., Njoku, J. N., Odirichukwu, J. C., Okolie, S. A., Uzondu, C., Ndubuisi Nweke, C. C., & Kim, D. S. (2023). Explainable Artificial Intelligence (XAI) for Intrusion Detection and Mitigation in Intelligent Connected Vehicles: A Review. Applied Sciences (Switzerland), 13(3). https://doi.org/10.3390/app13031252

Prasad, S. S., Deo, R. C., Salcedo-Sanz, S., Downs, N. J., Casillas-Pérez, D., & Parisi, A. V. (2023). Enhanced joint hybrid deep neural network explainable artificial intelligence model for 1-hr ahead solar ultraviolet index prediction. Computer Methods and Programs in Biomedicine, 241. https://doi.org/10.1016/j.cmpb.2023.107737

Rani, S., Kataria, A., Chauhan, M., Rattan, P., Kumar, R., & Kumar Sivaraman, A. (2022). Security and Privacy Challenges in the Deployment of Cyber-Physical Systems in Smart City Applications: State-of-Art Work. Materials Today: Proceedings, 62, 4671–4676. https://doi.org/10.1016/j.matpr.2022.03.123

S Band, S., Yarahmadi, A., Hsu, C. C., Biyari, M., Sookhak, M., Ameri, R., Dehzangi, I., Chronopoulos, A. T., & Liang, H. W. (2023). Application of explainable artificial intelligence in medical health: A systematic review of interpretability methods. Informatics in Medicine Unlocked, 40. https://doi.org/10.1016/j.imu.2023.101286

Schreiber, A., & Schreiber, I. (2025). AI for cyber-security risk: harnessing AI for automatic generation of company-specific cybersecurity risk profiles. Information and Computer Security. https://doi.org/10.1108/ICS-08-2024-0177

Sheikh, Z. A., Singh, Y., Singh, P. K., & Gonçalves, P. J. S. (2023). Defending the Defender: Adversarial Learning Based Defending Strategy for Learning Based Security Methods in Cyber-Physical Systems (CPS). Sensors, 23(12). https://doi.org/10.3390/s23125459

Silva-Aravena, F., Núñez Delafuente, H., Gutiérrez-Bahamondes, J. H., & Morales, J. (2023). A Hybrid Algorithm of ML and XAI to Prevent Breast Cancer: A Strategy to Support Decision Making. Cancers, 15(9). https://doi.org/10.3390/cancers15092443

Sohail, A., Fahmy, M. A., & Khan, U. A. (2023). XAI hybrid multi-staged algorithm for routine & quantum boosted oncological medical imaging. Computational Particle Mechanics, 10(2), 209–219. https://doi.org/10.1007/s40571-022-00490-w

Taherdoost, H. (2023). Deep Learning and Neural Networks: Decision-Making Implications. Symmetry, 15(9). https://doi.org/10.3390/sym15091723

Zhao, Y. (2024). LogicAL: Towards logical anomaly synthesis for unsupervised anomaly localization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern RecognitionProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop.

Published

2025-04-22

How to Cite

Hybrid Explainable AI (XAI) Framework for Detecting Adversarial Attacks in Cyber-Physical Systems. (2025). Journal of Technology Informatics and Engineering, 4(1). https://doi.org/10.51903/jtie.v4i1.295