Enhanced Word Sense Disambiguation Algorithm for Afaan Oromoo

Full Text (PDF, 540KB), PP.41-50

Views: 0 Downloads: 0

Author(s)

Abdo Ababor Abafogi 1,*

1. Department of Information Technology, College of Computing and Informatics, Wolkite University, Ethiopia

* Corresponding author.

DOI: https://doi.org/10.5815/ijieeb.2023.01.04

Received: 10 Mar. 2022 / Revised: 18 May 2022 / Accepted: 14 Jul. 2022 / Published: 8 Feb. 2023

Index Terms

Afaan Oromoo, Word Sense Disambiguation, Unsupervised Approach, Adopted Algorithm for Sense Clustering

Abstract

In various circumstances, the same word can mean differently based on the usage of the word in a particular sentence. The aim of word sense disambiguation (WSD) is to precisely understand the meaning of a word in particular usage. WSD utilized in several applications of natural language to interpret an ambiguous word contextually. This paper enhances a statistical algorithm proposed by Abdo [36] that performs a task of WSD for Afaan Oromoo (one of under-resourced language spoken in East Africa by nearly 50% of Ethiopians). The paper evaluates appropriate methods that used to increase the performance of disambiguation for the language with and without morphology consideration. The algorithm evaluated by 249 sentences with four evaluation metrics: recall, precision, F1 and accuracy. The evaluation result has achieved state of the art for Afaan Oromoo. Finally, future direction is highlighted for further research of the task on the language.

Cite This Paper

Abdo Ababor Abafogi, "Enhanced Word Sense Disambiguation Algorithm for Afaan Oromoo", International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.15, No.1, pp. 41-50, 2023. DOI:10.5815/ijieeb.2023.01.04

Reference

[1]N. Bouhriz, F. Benabbou, and E. Habib, “Word Sense Disambiguation Approach for Arabic Text”, International Journal of Advanced Computer Science and Applications, Vol. 7, No. 4, 2016
[2]M. Bevilacqua, and R. Navigli, “Breaking through the 80% Glass Ceiling Raising the State of the Art in Word Sense Disambiguation by Incorporating Knowledge Graph Information”, Proc. of the 58th Annual Meeting of the Association for Computational Linguistics, July 5-10, 2020, pp. 2854–2864
[3]Workineh T., Debela T., and Teferi K., “Designing Rule Based Disambiguator for Afaan Oromo Words”, Am J Compt Sci Inform Technol, Vol. 5, No. 2, 2017 pp. 1-4
[4]W. Tesema, D. Tesfaye, and T. Kibebew, “Towards the sense disambiguation of Afan Oromoo words using hybrid approach (unsupervised machine learning and rule based)”, Ethiopian Journal of Education and Sciences Vol. 12, No.1, pp. 61-77, 2016
[5]Beekan E., “Oromoo Language (Afaan Oromoo)”,https://scholar.harvard.edu/erena/oromo-language-afaan-oromoo Accessed 20 Sept. 2021.
[6]T. Keneni, “Prospects and Challenges of Afan Oromo: A Commentary”, Theory and Practice in Language Studies, vol.11, No.6, June 2021, pp.606, Accessed 15 Sept. 2021.
[7]Guya, T. “CaasLuga Afaan Oromoo: Jildii-1”, Gumii Qormaata Afaan Oromootiin Komishinii Aadaa fi Turizimii Oromiyaa, Finfinnee, 2003.
[8]W. Tesema, and Duresa T., “Investigating Afan Oromoo Language Structure and Developing Effective File Editing Tool as Plug-in into Ms Word to Support Text Entry and Input Methods”, American Journal of Computer Science and Engineering Survey, 2021.
[9]Abdo Ababor Abafogi, “Boosting Afaan Oromo Named Entity Recognition with Multiple Methods”, International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.13, No.5, pp. 50-58, 2021. DOI: 10.5815/ijieeb.2021.05.05.
[10]K. Tune, V. Varma, P. Pingali, “Evaluation of Oromo-English Cross-language Information Retrieval”, ijcai workshop on clia, hyderabad, india, 2007
[11]B. Sankaran, K. Vijay-Shanker, “Influence of morphology in word sense disambiguation for Tamil”, Anna University and University of Delaware Proc. of International Conference on Natural Language Processing, 2003.
[12]Abdo Ababor Abafogi, “Normalized Statistical Algorithm for Afaan Oromo Word Sense Disambiguation”, International Journal of Intelligent Systems and Applications(IJISA), Vol.13, No.6, pp.40-50, 2021. DOI:10.5815/ijisa.2021.06.04.
[13]Y. Wang, M. Wang, H. Fujitas, “Word Sense Disambiguation: A comprehensive knowledge exploitation framework”, Knowledge-Based Systems, Vol. 190, 29 February 2020.
[14]D. Yarowsky, “Decision lists for lexical ambiguity resolution: Application to accent restoration in spanish and French”, In Proc. of the 32nd annual meeting on Association for Computational Linguistics, 1994, pp. 88–95.
[15]B. Scarlini, T. Pasini, and R. Navig, “SENSEMBERT:Context-Enhanced Sense Embeddings for Multilingual Word Sense Disambiguation”, The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) Association for the Advancement of Artificial Intelligence, 2020.
[16]C. Hadiwinoto, H. T. Ng, and W. C. Gan, “Improved Word Sense Disambiguation using pre-trained contextualized word representations”, In Proc. Of EMNLP, 2019, pp. 5297–5306
[17]M. Bevilacqua, and R. Navigli. “Quasi bidirectional encoder representations from Transformers for Word Sense Disambiguation”, In Proc. of RANLP, 2019, pp. 122–131.
[18]F. Scozzafava, M. Maru, F. Brignone, G. Torrisi, and R. Navigli,“Personalized PageRank with syntagmatic information for multilingual Word Sense Disambiguation”, In Proc. of ACL (demos), 2020.
[19]K. Sawan, J. Sharmistha, S. Karan, and T. Partha, “Zero-shot Word Sense Disambiguation using sense definition embeddings”, In Proc. of ACL, 2019.
[20]T. Blevins, and L. Zettlemoyer, “Moving down the long tail of Word Sense Disambiguation with gloss informed bi-encoders”, 2020.
[21]E. Barba, T. Pasini, and R. Navigli, “ESC: Redesigning WSD with extractive sense comprehension”, In Proc. of NAACL, 2021
[22]S. Conia, and R. Navigli, “Framing Word Sense Disambiguation as a multi-label problem for model-agnostic knowledge integration”, In Proc. of EACL, 2021.
[23]M. Bevilacqua, T. Pasini, A. Raganato, and R. Navigli, “Recent Trends in Word Sense Disambiguation:A Survey”, Proc. of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21).
[24]Vial L, Lecouteux B, and Schwab D, “Sense Vocabulary Compression through the Semantic Knowledge of WordNet for Neural Word Sense Disambiguation”, In Proc. of Global Wordnet Conference, 2019.
[25]A. Raganato, C. Bovi, and R. Navigli, “Neural sequence learning models for word sense disambiguation”, In Proc. of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark. Association for Computational Linguistics, 2017, pp. 1156–1167
[26]B. H. Manjunatha Kumar, “Survey on Word Sense Disambiguatio”, Sri Siddhartha Institute of Technology, 2018.Conference on Emerging Trends in Engineering, Science and Technology, 2015.
[27]I. Iacobacci, M. T. Pilehvar, and R. Navigli, “Embeddings for word sense disambiguation:An evaluation study”, In Proc. Of ACL, Vol. 1, 2016, pp. 897–907
[28]O. Melamud, J. Goldberger, and I. Dagan, “context2vec:Learning generic context embedding with bidirectional LSTM”, In Proc. of CoNLL, 2016, pp. 51–61
[29]S. Sankar, Reghu Raj, V. U. Jayan, “Unsupervised Approach to Word Sense Disambiguation in Malayalam”, International .
[30]Krishnanjan et al., “Survey and Gap Analysis of Word Sense Disambiguation Approaches on Unstructured Texts”, Proc. of the International Conference on Electronics and Sustainable Communication Systems 2020.
[31]M. Gunavathi, and S. Rajini, “The Various Approaches for Word Sense Disambiguation: A Survey”, Department of Computer Science and Engineering, Kumaraguru College of Technology, Coimbatore, India, IJIRT, Vol. 3, No. 10, March 2017.
[32]S. Olika, “word sense disambiguation for Afaan Oromoo using knowledge base”, St. University College, 2018.
[33]Yehuwalashet B., “Hybrid Word Sense Disambiguation Approach for Afan Oromoo Words”, Thesis, Department of Computer Science, Addis Ababa University, Addis Ababa, Ethiopia, 2016.
[34]W. Tesema, and D. Tesfaye, “Word Sense Disambiguation and Semantics for Afan Oromoo Words using Vector Space Model”, International Journal of Research Studies in Science, Engineering and Technology, Vol. 4, No. 6, 2017, pp. 10-15.
[35]K. Tesfa, “Word sense disambiguation for Afaan Oromoo Language”,Thesis, Department of Computer Science, Addis Ababa University, Addis Ababa, Ethiopia, 2013
[36]Getachew T., “Seerluga Afaan Oromoo”, Finfinnee Oromiyaa press, 2014.
[37]Daraje K, “Afaan Oromoo Automatic word stemmer”, Wollega University, 2018.