To read this content please select one of the options below:

Evaluating transfer learning approach for detecting Arabic anti-refugee/migrant speech on social media

Djamila Mohdeb (University of Bordj Bou Arreridj, Bordj Bou Arreridj, Algeria)
Meriem Laifa (University of Bordj Bou Arreridj, Bordj Bou Arreridj, Algeria) (Laboratory of Informatics and its Applications of M'sila (LIAM), M'sila, Algeria)
Fayssal Zerargui (University of Bordj Bou Arreridj, Bordj Bou Arreridj, Algeria)
Omar Benzaoui (University of Bordj Bou Arreridj, Bordj Bou Arreridj, Algeria)

Aslib Journal of Information Management

ISSN: 2050-3806

Article publication date: 22 March 2022

Issue publication date: 29 September 2022

277

Abstract

Purpose

The present study was designed to investigate eight research questions that are related to the analysis and the detection of dialectal Arabic hate speech that targeted African refugees and illegal migrants on the YouTube Algerian space.

Design/methodology/approach

The transfer learning approach which recently presents the state-of-the-art approach in natural language processing tasks has been exploited to classify and detect hate speech in Algerian dialectal Arabic. Besides, a descriptive analysis has been conducted to answer the analytical research questions that aim at measuring and evaluating the presence of the anti-refugee/migrant discourse on the YouTube social platform.

Findings

Data analysis revealed that there has been a gradual modest increase in the number of anti-refugee/migrant hateful comments on YouTube since 2014, a sharp rise in 2017 and a sharp decline in later years until 2021. Furthermore, our findings stemming from classifying hate content using multilingual and monolingual pre-trained language transformers demonstrate a good performance of the AraBERT monolingual transformer in comparison with the monodialectal transformer DziriBERT and the cross-lingual transformers mBERT and XLM-R.

Originality/value

Automatic hate speech detection in languages other than English is quite a challenging task that the literature has tried to address by various approaches of machine learning. Although the recent approach of cross-lingual transfer learning offers a promising solution, tackling this problem in the context of the Arabic language, particularly dialectal Arabic makes it even more challenging. Our results cast a new light on the actual ability of the transfer learning approach to deal with low-resource languages that widely differ from high-resource languages as well as other Latin-based, low-resource languages.

Keywords

Citation

Mohdeb, D., Laifa, M., Zerargui, F. and Benzaoui, O. (2022), "Evaluating transfer learning approach for detecting Arabic anti-refugee/migrant speech on social media", Aslib Journal of Information Management, Vol. 74 No. 6, pp. 1070-1088. https://doi.org/10.1108/AJIM-10-2021-0293

Publisher

:

Emerald Publishing Limited

Copyright © 2022, Emerald Publishing Limited

Related articles