TY - CHAP
T1 - Building lexical resources for dialectical Arabic
AU - Al Ameri, Sumaya Sulaiman
AU - Shoufan, Abdulhadi
N1 - Publisher Copyright:
© 2021, IGI Global.
PY - 2020/7/31
Y1 - 2020/7/31
N2 - The natural language processing of Arabic dialects faces a major difficulty, which is the lack of lexical resources. This problem complicates the penetration and the business of related technologies such as machine translation, speech recognition, and sentiment analysis. Current solutions frequently use lexica, which are specific to the task at hand and limited to some language variety. Modern communication platforms including social media gather people from different nations and regions. This has increased the demand for general-purpose lexica towards effective natural language processing solutions. This chapter presents a collaborative web-based platform for building a cross-dialectical, general-purpose lexicon for Arabic dialects. This solution was tested by a team of two annotators, a reviewer, and a lexicographer. The lexicon expansion rate was measured and analyzed to estimate the overhead required to reach the desired size of the lexicon. The inter-annotator reliability was analyzed using Cohen's Kappa.
AB - The natural language processing of Arabic dialects faces a major difficulty, which is the lack of lexical resources. This problem complicates the penetration and the business of related technologies such as machine translation, speech recognition, and sentiment analysis. Current solutions frequently use lexica, which are specific to the task at hand and limited to some language variety. Modern communication platforms including social media gather people from different nations and regions. This has increased the demand for general-purpose lexica towards effective natural language processing solutions. This chapter presents a collaborative web-based platform for building a cross-dialectical, general-purpose lexicon for Arabic dialects. This solution was tested by a team of two annotators, a reviewer, and a lexicographer. The lexicon expansion rate was measured and analyzed to estimate the overhead required to reach the desired size of the lexicon. The inter-annotator reliability was analyzed using Cohen's Kappa.
UR - http://www.scopus.com/inward/record.url?scp=85112793867&partnerID=8YFLogxK
U2 - 10.4018/978-1-7998-4240-8.ch014
DO - 10.4018/978-1-7998-4240-8.ch014
M3 - Chapter
AN - SCOPUS:85112793867
SN - 9781799842408
SP - 332
EP - 364
BT - Natural Language Processing for Global and Local Business
ER -