Semantic Role Labeling in Neural Machine Translation Addressing Polysemy and Ambiguity Challenges

Yan Qin

doi:10.51903/jtie.v4i1.274

Authors

Yan Qin, Fuzhou Polytechnic, China. ORCID: https://orcid.org/0009-0004-4092-546X

Keywords:

Neural Machine Translation (NMT), Semantic Role Labeling (SRL), Translation Accuracy

Abstract

Polysemy and ambiguity continue to hinder the semantic accuracy of Neural Machine Translation (NMT), particularly for language pairs with distinct syntactic structures. While Transformer-based models such as BERT and GPT have made notable progress in capturing contextual word meanings, they still fall short in modeling explicit semantic roles. This study addresses that limitation by integrating Semantic Role Labeling (SRL) into a Transformer-based NMT framework to enhance semantic comprehension and reduce translation errors. The proposed SRL-enhanced NMT model was trained on a parallel corpus of 100,000 English-Indonesian and English-Japanese sentence pairs and evaluated against a baseline Transformer NMT model. SRL integration enabled the model to annotate semantic roles, such as agent, patient, and instrument, which were fused with encoder representations through semantic-aware attention mechanisms. Experimental results show that the SRL-integrated model significantly outperformed the baseline: BLEU rose by 6.2 points (from 32.5 to 38.7), METEOR rose by 6.3 points (from 58.5 to 64.8), and TER fell by 5.8 points (from 45.1 to 39.3). A paired t-test confirmed that these differences were statistically significant (p < 0.05). Qualitative analyses further confirmed SRL's effectiveness in resolving lexical ambiguities and syntactic uncertainties. Although SRL integration increased inference time by 12%, the trade-off was deemed acceptable for applications requiring higher semantic fidelity. The novelty of this research lies in the architectural fusion of SRL with Transformer-based attention layers in NMT, an approach seldom explored in prior studies. The model also performs robustly across linguistically divergent language pairs, suggesting broader applicability. This work contributes to the advancement of semantically aware translation systems and paves the way for future research on unsupervised SRL integration and multilingual scalability.
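The abstract reports that the metric gains were validated with a paired t-test. As a rough illustration of what that test computes, the following minimal sketch derives the t statistic from matched scores of two systems. The function name `paired_t_statistic` and the score lists are hypothetical and are not taken from the study; the abstract does not state which units (sentences, documents, or test sets) were paired.

```python
import math
from statistics import mean, stdev


def paired_t_statistic(baseline, improved):
    """Return (t, degrees_of_freedom) for a paired t-test on matched scores."""
    if len(baseline) != len(improved):
        raise ValueError("paired samples must have the same length")
    if len(baseline) < 2:
        raise ValueError("need at least two pairs")
    # Work on per-pair differences: improved score minus baseline score.
    diffs = [b - a for a, b in zip(baseline, improved)]
    sd = stdev(diffs)  # sample standard deviation (n - 1 denominator)
    if sd == 0:
        raise ValueError("all differences are identical; t is undefined")
    n = len(diffs)
    # t = mean difference divided by its standard error.
    t = mean(diffs) / (sd / math.sqrt(n))
    return t, n - 1


# Hypothetical BLEU scores on five matched evaluation subsets
# (made-up numbers for illustration only, not data from the paper).
baseline_bleu = [31.0, 33.5, 32.0, 34.0, 32.5]
srl_bleu = [37.0, 39.0, 38.5, 40.5, 38.0]

t, df = paired_t_statistic(baseline_bleu, srl_bleu)
print(f"t = {t:.2f}, df = {df}")
```

The resulting t value would then be compared against the t distribution with `df` degrees of freedom, using a table or a statistics library, to obtain the p-value.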
