| publication name | Hassan, S. I., Elrefaei, L., & Andraws, M. (2023). Arabic Tweets Spam Detection Based on Various Supervised Machine Learning and Deep Learning Classifiers. MSA Engineering Journal, 2(2), 1099-1119. doi: 10.21608/msaeng.2023.291931 |
|---|---|
| Authors | Shimaa I Hassan, Mina Shoukrey Andraws, Lamiaa Elrefaei |
| year | 2023 |
| keywords | |
| journal | MSA Engineering Journal |
| volume | 2 |
| issue | 2 |
| pages | 1099-1119 |
| publisher | EKB |
| Local/International | Local |
| Paper Link | https://msaeng.journals.ekb.eg/article_291931.html |
| Full paper | download |
| Supplementary materials | Not Available |
Abstract
In this paper, different machine learning algorithms, ensemble algorithms,and deep learning algorithms are applied to Arabic tweets to detect whether ithuman-generated or not. The tweets are used twice as preprocessed and nonpreprocessed to measure the effectiveness of Arabic preprocessing in theclassification process. The data is also tokenized with various methods like unigram,trigram, and Term Frequency–Inverse Document Frequency. The experimentsshow that the support vector machine with the non-preprocessed tweets andunigram tokenization has the best performance of 83.11% and a precision of 0.9516while it predicts the spam or not in a relatively small time.