A Comparative Study of the Lexical Ambiguity of Arabic, English, and French in Natural Language Processing

Bahia Zemni (1) , Mimouna Zitouni (2) , Farouk Bouhadiba (3) , Mashael Almutairi (4)
1. Department of Translation, College of Languages, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
2. Translation Department, Princess Nourah bint Abdulrahman University, Saudi Arabia
3. Translation Department, Princess Nourah bint Abdulrahman University, Saudi Arabia
4. Translation Department, Princess Nourah bint Abdulrahman University, Saudi Arabia

Abstract

Ambiguity in some syntactic structures of the same language has always posed problems to the human translator and to machine translation. These problems become more complex for the Machine Translation of genetically unrelated languages such as Arabic, English and French. Arabic Lexical ambiguity in Natural Language Processing (NLP) also poses problems when the semantic fields of Arabic words differ from those of English for instance. This often occurs when two or more words from Arabic equate to a single word in English. Semantic gaps between the two languages are also a source of ambiguity in Natural Language Processing. We shall deal with some cases of ambiguity in machine translation from Arabic to English and French and vice versa. The questions addressed in this paper relate to segmentation, determination / non-determination, coordination and the issue of the word as a meaningful and functional unit. Some aspects of the segmentation of constituents into grammatical categories and their comparison with structures of Arabic English and French are addressed in this paper.


 

Full text article

Generated from XML file

References

Al-Jefri, M., & Mahmoud, S. A. (2013). Context-Sensitive Arabic Spell Checker Using Context Words and N-Gram Language Models. In the Proceedings of Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences, Al-Madinah al-Munawwarah, Saudi Arabia, 1, 258-263. https://doi.org/10.1109/nooric.2013.59 Google Scholar | Crossref | WorldCat

Alqudsi, A., Omar, N., & Shaker, K. (2019). A hybrid rules and statistical method for Arabic to English machine translation. In 2019 2nd International Conference on Computer Applications & Information Security (ICCAIS) (pp. 1-7). IEEE. Google Scholar | WorldCat

Andrews, E. D. (2023). Introduction to the Text of the Old Testament: From the Authors and Scribes to the Modern Critical Text. Christian Publishing House. Google Scholar | WorldCat

Azmi, A. M., Almutery, M., & Aboalsamh, H. (2019). Real-Word Errors in Arabic Texts: A Better Algorithm for Detection and Correction. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(8), 1308–1320. https://doi.org/10.1109/taslp.2019.2918404 Google Scholar | Crossref | WorldCat

Ahmad, I. (2021). Emotional Regulation as a Remedy for Teacher Burnout in Special Schools: Evaluating School Climate, Teacher’s Work-Life Balance and Children Behaviour. Frontiers in Psychology, 12, 1–10. https://doi.org/10.3389/fpsyg.2021.655850 Google Scholar | Crossref | WorldCat

Bais, H., & Machkour, M. (2023). A rule-induction approach for building an Arabic language interface to databases. The International Arab Journal of Information Technology, 20(1), 49 – 56. https://doi.org/10.34028/iajit/20/1/6. Google Scholar | Crossref | WorldCat

Bekraoui, F., & Daoudim, M. (2015). Conception Et Réalisation D’un Vocaliseur Automatique De Texte Arabe “Tashkil.” El-Wahat Journal for Research and Studies, 8(2). https://doi.org/https://doi.org/10.54246/1548-008-002-036 Google Scholar | Crossref | WorldCat

Cavali-Sforza, V, Soudi, A & Terulo, M (2000). Arabic Morphology Generation Using a Concatenative Strategy. In Proceedings of 1st Meeting of NAACL, 86 -93, Seattle, USA. Google Scholar | WorldCat

Computation approaches to Semitic languages, Montreal, Canada. https://doi.org/10.3115/1621753.1621761 Google Scholar | Crossref | WorldCat

Debili, F., & Achour, H. (1998). Voyellation automatique de l’arabe. In the Proceedings of the Workshop on Google Scholar | WorldCat

Dickins, J., Hervey, S., & Higgins, I. (2016). Thinking Arabic Translation: A course in translation method: Arabic to English. London: Routledge. https://doi.org/10.4324/9781315012650 Google Scholar | Crossref | WorldCat

Gamal, M. Y. (2020). Context, field, and landscape of audiovisual translation in the Arab world. ESSACHESS-Journal for Communication Studies, 13(1(25)), 73-105. Google Scholar | WorldCat

Goldsmith, J.A. (1976). Auto-segmental Phonology. Doctoral Thesis, Massachusetts MIT, USA. Google Scholar | WorldCat

Habash, N. Y. (2010). Introduction to Arabic natural language processing. Synthesis lectures on human language technologies, 3(1), 1-187. https://doi.org/10.1007/978-3-031-02139-8 Google Scholar | Crossref | WorldCat

Kastner, I., Tucker, M. A., Alexiadou, A., Kramer, R., Marantz, A., & Massuet, I. O. (2019). Non-concatenative morphology. Ms., Humboldt-Universität zu Berlin and Oakland University. Google Scholar | WorldCat

Khalatia, M. M., & Al-Romanyb, T. A. H. (2020). Artificial intelligence development and challenges (Arabic language as a model). Artificial Intelligence, 13(5), 83-105. http://dx.doi.org/10.17977/um056v7i1p83-105 Google Scholar | Crossref | WorldCat

Leavitt, J. R. (1994). MORPHE: A morphological rule compiler. Technical Report, CMU-CMT-94-MEMO. Google Scholar | WorldCat

Leslau, W. (2021). An annotated bibliography of the Semitic languages of Ethiopia (Vol. 1). Walter de Gruyter GmbH & Co KG. Google Scholar | WorldCat

Levshina, N. (2022). Corpus-based typology: Applications, challenges and some solutions. Linguistic Typology, 26(1), 129-160. https://doi.org/10.1515/lingty-2020-0118 Google Scholar | Crossref | WorldCat

Marie-Sainte, S.L., Alalyani, N., Alotaibi, S., Ghouzali, S., & Abunadi, I. (2019). Arabic Natural Language Processing and Machine Learning-Based Systems. IEEE Access, 7, 7011–20. https://doi.org/10.1109/access.2018.2890076. Google Scholar | Crossref | WorldCat

McCarthy, J. J. (1981). A prosodic theory of nonconcatenative morphology. Linguistic Inquiry, 12(3), 373-418. https://scholarworks.umass.edu/linguist_faculty_pubs/26 Google Scholar | WorldCat

McCarthy, J. J., & Prince, A. (1996). Prosodic Morphology. In J. Goldsmith (Ed.), The Handbook of Phonological Theory, (pp. 318-366). Cambridge, MA: Blackwell. Google Scholar | WorldCat

McCarthy, J.J. (1979). Formal Problems in Semitic Phonology and Morphology. Doctoral Thesis, Massachusetts MIT, USA. Google Scholar | WorldCat

Mohamed, E., & Sadat, F. (2015). Hybrid Arabic–French machine translation using syntactic re-ordering and morphological pre-processing. Computer Speech & Language, 32(1), 135–144. https://doi.org/10.1016/j.csl.2014.10.007 Google Scholar | Crossref | WorldCat

Royster, P. D. (2020). The Geography of Racial Bias. In Decolonizing Arts-Based Methodologies, (pp. 110-149). Brill. Google Scholar | WorldCat

Shirko, O., Omar, N., Arshad, H., & Albared, M. (2010). Machine Translation of Noun Phrases from Arabic to English Using Transfer-Based Approach. Journal of Computer Science, 6(3), 350–356. https://doi.org/10.3844/jcssp.2010.350.356 Google Scholar | Crossref | WorldCat

Soudi, A., & Cavalli-Sforza, V. (2003). Interfacing an Arabic Morphology and Sentence Generation with an English-to-Arabic knowledge-based Machine Translation System. In the Proceedings of the workshop on Information Technology, Rabat, Morocco, March (pp. 17-19). Google Scholar | WorldCat

Trad, A. (2021). The Societal Transformation Framework: The Nation of Semites–The Phoenicians. In Management and Conservation of Mediterranean Environments, (pp. 227-259). IGI Global. https://doi.org/10.4018/978-1-7998-7327-3.ch015 Google Scholar | Crossref | WorldCat

Zakraoui, J., Saleh, M., Al-Maadeed, S., & Alja’am, J. M. (2020, April). Evaluation of Arabic to English machine translation systems. In 2020 11th International Conference on Information and Communication Systems (ICICS) (pp. 185-190). IEEE. Google Scholar | WorldCat

Zakraoui, J., Saleh, M., Al-Maadeed, S., & Alja’am, J. M. (2021). Arabic Machine Translation: A Survey With Challenges and Future Directions. IEEE Access, 9, 161445-161468. https://doi.org/10.1109/ACCESS.2021.3132488 Google Scholar | Crossref | WorldCat

Zaretsky, E., & Russak, S. (2023). Using oral narratives to examine the acquisition of English verb morphology among multilingual Arabic and monolingual Hebrew speakers: finding similarities with monolingual English-speaking SLIs. International Journal of Multilingualism, 1-22. https://doi.org/10.1080/14790718.2023.2232376 Google Scholar | Crossref | WorldCat

Zitouni, M., Alzahrani, A., Al Kous, N., Almutlaq, S., Abdul-Ghafour, A.-Q., & Zemni, B. (2022). The Translation of Selected Cultural Items in Al-Nawawi’s Forty Hadiths: A Descriptive and Analytical Study. Journal of Intercultural Communication, 22(3), 43-53. https://doi.org/10.36923/jicc.v22i3.74 Google Scholar | Crossref | WorldCat

Authors

Mimouna Zitouni
mimouna.zitouni.oran2@gmail.com (Primary Contact)
Author Biographies

Bahia Zemni

Bahia Zemni received her PhD in linguistics from Sorbonne-Nouvelle III University. Since 2012, she has been a professor at Princess Nourah bint Abdulrahman University, where she has headed the Languages Faculty Research Center. Currently, she leads a research unit in the Translation Department and contributes to the research project titled "Translation from Arabic to French and Vice Versa in contextual dictionaries: mechanisms and strategies." Additionally, she heads the project "Artificial Intelligence and Audiovisual Translation." Bahia has published several translations in collaboration with the Louvre Museum and publishing houses such as Skira in France and Alsaqui in Lebanon. She has participated in several national and international conferences and has published extensively in well-established journals on the subjects of Linguistics and Translations. She recently co-edited a Special Issue of Kervan - International Journal of Afro-Asiatic Studies entitled "Pratiques langagières en arabe: variables culturelles et défis de la traduction."

Mimouna Zitouni

Mimouna Zitouni is a professor of Translation Studies and Linguistics at the College of Languages, PNU. In 2017–2018, she served as a distinguished Fulbright 'SIR' visiting scholar at Coastal Carolina University (USA). She has participated in several international conferences and has published extensively on the subjects of language use and translation studies.

Farouk Bouhadiba

Farouk Bouhadiba is a professor of linguistics at the College of Foreign Languages, University of Mohamed Ben Ahmed, Oran-Algeria. He specializes in Maghrebi Linguistics and Dialectology from a Sociolinguistics perspective. He also conducts research in the field of Didactics, with a special focus on the LMD implementation in Algeria. Currently, he is engaged in Natural Language Processing and is involved in building a WordNet Arabic. Dr. Bouhadiba has participated in numerous international conferences and has published extensively on various subjects related to language use and translation.

Mashael Almutairi

Mashael Almutairi is an assistant professor at Princess Nourah bint Abdulrahman University. She obtained her PhD in Translation from Leicester University in 2018 and subsequently assumed the role of Head of the Translation Department at PNU. Almutairi served as the Dean of the College of Languages from 2019 until May 2021 when she was appointed as the Dean of the Development and Consulting Services Institute. Throughout her professional career, she has been actively involved in various academic associations. She currently serves as an associate editor and reviewer at Altralang Journal and has authored several papers and translated several books.

Zemni, B., Zitouni, M., Bouhadiba, F., & Almutairi, M. (2024). A Comparative Study of the Lexical Ambiguity of Arabic, English, and French in Natural Language Processing. Journal of Intercultural Communication, 24(1), 203-212. https://doi.org/10.36923/jicc.v24i1.171

Article Details

How to Cite

Zemni, B., Zitouni, M., Bouhadiba, F., & Almutairi, M. (2024). A Comparative Study of the Lexical Ambiguity of Arabic, English, and French in Natural Language Processing. Journal of Intercultural Communication, 24(1), 203-212. https://doi.org/10.36923/jicc.v24i1.171

Most read articles by the same author(s)