Hybrid Explainable AI (XAI) Framework for Detecting Adversarial Attacks in Cyber-Physical Systems
DOI:
https://doi.org/10.51903/jtie.v4i1.295Keywords:
Cyber-Physical Systems, Adversarial Attack Detection, Explainable Artificial IntelligenceAbstract
Cyber-Physical Systems (CPS) are increasingly deployed in critical infrastructure yet remain vulnerable to adversarial attacks that manipulate sensor data to mislead AI-based decision-making. These threats demand not only high-accuracy detection but also transparency in model reasoning. This study proposes a Hybrid Explainable AI (XAI) Framework that integrates Convolutional Neural Networks (CNN), SHAP-based feature interpretation, and rule-based reasoning to detect adversarial inputs in CPS environments. The framework is tested on two simulation scenarios: industrial sensor networks and autonomous traffic sign recognition. Using datasets of 10,000 samples (50% adversarial via FGSM and PGD), the model achieved an accuracy of 97.25%, precision of 96.80%, recall of 95.90%, and F1-score of 96.35%. SHAP visualizations effectively distinguished between normal and adversarial inputs, and the added explainability module increased inference time by only 8.5% over the baseline CNN (from 18.5 ms to 20.1 ms), making it suitable for real-time CPS deployment. Compared to prior methods (e.g., CNN + Grad-CAM, Random Forest + LIME), the proposed hybrid framework demonstrates superior performance and interpretability. The novelty of this work lies in its tri-level integration of predictive accuracy, explainability, and rule-based logic within a single real-time detection system—an approach not previously applied in CPS adversarial defense. This research contributes toward trustworthy AI systems that are robust, explainable, and secure by design.
References
Al-Essa, M., Andresini, G., Appice, A., & Malerba, D. (2022). An XAI-based adversarial training approach for cyber-threat detection. Proceedings of the 2022 IEEE International Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2022. https://doi.org/10.1109/DASC/PiCom/CBDCom/Cy55231.2022.9927842
Awotunde, J. B., Oguns, Y. J., Amuda, K. A., Nigar, N., Adeleke, T. A., Olagunju, K. M., & Ajagbe, S. A. (2023). Cyber-Physical Systems Security: Analysis, Opportunities, Challenges, and Future Prospects. Advances in Information Security, 102, 21–46. https://doi.org/10.1007/978-3-031-25506-9_2
Bai, M., Liu, P., Lv, F., Fang, D., Lv, S., Zhang, W., & Sun, L. (2024). Adversarial Attack against Intrusion Detectors in Cyber-Physical Systems With Minimal Perturbations. Proceedings - 2024 IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2024, 816–825. https://doi.org/10.1109/ISPA63168.2024.00109
Bai, Y., Park, J., Tehranipoor, M., & Forte, D. (2022). Real-time instruction-level verification of remote IoT/CPS devices via side channels. Discover Internet of Things, 2(1). https://doi.org/10.1007/s43926-022-00021-2
Bajaj, A., & Vishwakarma, D. K. (2024). A state-of-the-art review on adversarial machine learning in image classification. Multimedia Tools and Applications, 83(3), 9351–9416. https://doi.org/10.1007/s11042-023-15883-z
Doghri, W., Saddoud, A., & Chaari Fourati, L. (2022). Cyber-physical systems for structural health monitoring: sensing technologies and intelligent computing. Journal of Supercomputing, 78(1), 766–809. https://doi.org/10.1007/s11227-021-03875-5
Duo, W., Zhou, M. C., & Abusorrah, A. (2022). A Survey of Cyber Attacks on Cyber Physical Systems: Recent Advances and Challenges. IEEE/CAA Journal of Automatica Sinica, 9(5), 784–800. https://doi.org/10.1109/JAS.2022.105548
El-Kady, A. H., Halim, S., El-Halwagi, M. M., & Khan, F. (2023). Analysis of safety and security challenges and opportunities related to cyber-physical systems. Process Safety and Environmental Protection, 173, 384–413. https://doi.org/10.1016/j.psep.2023.03.012
Elgarhy, I., Badr, M. M., Mahmoud, M., Alsabaan, M., Alshawi, T., & Alsaqhan, M. (2024). XAI-Based Accurate Anomaly Detector That Is Robust Against Black-Box Evasion Attacks for the Smart Grid. Applied Sciences (Switzerland), 14(21). https://doi.org/10.3390/app14219897
Ennab, M., & Mcheick, H. (2025). Advancing AI Interpretability in Medical Imaging: A Comparative Analysis of Pixel-Level Interpretability and Grad-CAM Models. Machine Learning and Knowledge Extraction, 7(1). https://doi.org/10.3390/make7010012
Gipiskis, R., Chiaro, D., Preziosi, M., Prezioso, E., & Piccialli, F. (2023). The Impact of Adversarial Attacks on Interpretable Semantic Segmentation in Cyber-Physical Systems. IEEE Systems Journal, 17(4), 5327–5334. https://doi.org/10.1109/JSYST.2023.3281079
Hamzah, M., Islam, M. M., Hassan, S., Akhtar, M. N., Ferdous, M. J., Jasser, M. B., & Mohamed, A. W. (2023). Distributed Control of Cyber Physical Systems on Various Domains: A Critical Review. Systems, 11(4). https://doi.org/10.3390/systems11040208
Hulsen, T. (2023). Explainable Artificial Intelligence (XAI): Concepts and Challenges in Healthcare. AI (Switzerland), 4(3), 652–666. https://doi.org/10.3390/ai4030034
Islam, M. M., Rifat, H. R., Shahid, M. S. Bin, Akhter, A., Uddin, M. A., & Uddin, K. M. M. (2024). Explainable Machine Learning for Efficient Diabetes Prediction Using Hyperparameter Tuning, SHAP Analysis, Partial Dependency, and LIME. Engineering Reports. https://doi.org/10.1002/eng2.13080
Kok, I., Okay, F. Y., Muyanli, O., & Ozdemir, S. (2023). Explainable Artificial Intelligence (XAI) for Internet of Things: A Survey. IEEE Internet of Things Journal, 10(16), 14764–14779. https://doi.org/10.1109/JIOT.2023.3287678
Machlev, R., Perl, M., Levy, K. Y., Belikov, J., Mannor, S., Levron, Y., & Heistrene, L. (2022). Explainable Artificial Intelligence (XAI) techniques for energy and power systems: review, challenges and opportunities. Energy and AI, 9. https://www.sciencedirect.com/science/article/pii/S2666546822000246
Maiti, R. R., Yoong, C. H., Palleti, V. R., Silva, A., & Poskitt, C. M. (2023). Mitigating Adversarial Attacks on Data-Driven Invariant Checkers for Cyber-Physical Systems. IEEE Transactions on Dependable and Secure Computing, 20(4), 3378–3391. https://doi.org/10.1109/TDSC.2022.3194089
Minh, D., Wang, H. X., Li, Y. F., & Nguyen, T. N. (2022). Explainable artificial intelligence: a comprehensive review. Artificial Intelligence Review, 55(5), 3503–3568. https://doi.org/10.1007/s10462-021-10088-y
Momtaz, A., Basnet, N., Abbas, H., & Bonakdarpour, B. (2023). Predicate monitoring in distributed cyber-physical systems. International Journal on Software Tools for Technology Transfer, 25(4), 541–556. https://doi.org/10.1007/s10009-023-00718-x
Munshi, R. M., Cascone, L., Alturki, N., Saidani, O., Alshardan, A., & Umer, M. (2024). A novel approach for breast cancer detection using optimized ensemble learning framework and XAI. Image and Vision Computing, 142. https://doi.org/10.1016/j.imavis.2024.104910
Mustafaev, B., Kim, S., & Kim, E. (2023). Enhancing Metal Surface Defect Recognition Through Image Patching and Synthetic Defect Generation. IEEE Access, 11, 113339–113359. https://doi.org/10.1109/ACCESS.2023.3322734
Naveenan, R. V., & Suresh, G. (2023). Cyber Risk and the Cost of Unpreparedness of Financial Institutions. Cyber Security and Business Intelligence: Innovations and Machine Learning for Cyber Risk Management, 15–36. https://doi.org/10.4324/9781003285854-2
Nwakanma, C. I., Ahakonye, L. A. C., Njoku, J. N., Odirichukwu, J. C., Okolie, S. A., Uzondu, C., Ndubuisi Nweke, C. C., & Kim, D. S. (2023). Explainable Artificial Intelligence (XAI) for Intrusion Detection and Mitigation in Intelligent Connected Vehicles: A Review. Applied Sciences (Switzerland), 13(3). https://doi.org/10.3390/app13031252
Prasad, S. S., Deo, R. C., Salcedo-Sanz, S., Downs, N. J., Casillas-Pérez, D., & Parisi, A. V. (2023). Enhanced joint hybrid deep neural network explainable artificial intelligence model for 1-hr ahead solar ultraviolet index prediction. Computer Methods and Programs in Biomedicine, 241. https://doi.org/10.1016/j.cmpb.2023.107737
Rani, S., Kataria, A., Chauhan, M., Rattan, P., Kumar, R., & Kumar Sivaraman, A. (2022). Security and Privacy Challenges in the Deployment of Cyber-Physical Systems in Smart City Applications: State-of-Art Work. Materials Today: Proceedings, 62, 4671–4676. https://doi.org/10.1016/j.matpr.2022.03.123
S Band, S., Yarahmadi, A., Hsu, C. C., Biyari, M., Sookhak, M., Ameri, R., Dehzangi, I., Chronopoulos, A. T., & Liang, H. W. (2023). Application of explainable artificial intelligence in medical health: A systematic review of interpretability methods. Informatics in Medicine Unlocked, 40. https://doi.org/10.1016/j.imu.2023.101286
Schreiber, A., & Schreiber, I. (2025). AI for cyber-security risk: harnessing AI for automatic generation of company-specific cybersecurity risk profiles. Information and Computer Security. https://doi.org/10.1108/ICS-08-2024-0177
Sheikh, Z. A., Singh, Y., Singh, P. K., & Gonçalves, P. J. S. (2023). Defending the Defender: Adversarial Learning Based Defending Strategy for Learning Based Security Methods in Cyber-Physical Systems (CPS). Sensors, 23(12). https://doi.org/10.3390/s23125459
Silva-Aravena, F., Núñez Delafuente, H., Gutiérrez-Bahamondes, J. H., & Morales, J. (2023). A Hybrid Algorithm of ML and XAI to Prevent Breast Cancer: A Strategy to Support Decision Making. Cancers, 15(9). https://doi.org/10.3390/cancers15092443
Sohail, A., Fahmy, M. A., & Khan, U. A. (2023). XAI hybrid multi-staged algorithm for routine & quantum boosted oncological medical imaging. Computational Particle Mechanics, 10(2), 209–219. https://doi.org/10.1007/s40571-022-00490-w
Taherdoost, H. (2023). Deep Learning and Neural Networks: Decision-Making Implications. Symmetry, 15(9). https://doi.org/10.3390/sym15091723
Zhao, Y. (2024). LogicAL: Towards logical anomaly synthesis for unsupervised anomaly localization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern RecognitionProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Journal of Technology Informatics and Engineering

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0).
This license allows others to copy, distribute, display, and perform the work, and derivative works based upon it, for both commercial and non-commercial purposes, as long as they credit the original author(s) and license their new creations under identical terms.
Licensed under CC BY-SA 4.0: https://creativecommons.org/licenses/by-sa/4.0/