Leveraging Arabic sentiment classification using an enhanced CNN-LSTM approach and effective Arabic text preparation

Abdulaziz M. Alayba, Vasile Palade

Research output: Contribution to journalArticlepeer-review

20 Citations (Scopus)
130 Downloads (Pure)

Abstract

The high variety in the forms of the Arabic words creates significant complexity related challenges in Natural Language Processing (NLP) tasks for Arabic text. These challenges can be dealt with by using different techniques for semantic representation, such as word embedding methods. In addition, approaches for reducing the diversity in Arabic morphologies can also be employed, for example using appropriate word normalisation for Arabic texts. Deep learning has proven to be very popular in solving different NLP tasks in recent years as well. This paper proposes an approach that combines Convolutional Neural Networks (CNNs) with Long Short-Term Memory (LSTM) networks to improve sentiment classification, by excluding the max-pooling layer from the CNN. This layer reduces the length of generated feature vectors after convolving the filters on the input data. As such, the LSTM networks will receive well-captured vectors from the feature maps. In addition, the paper investigated different effective approaches for preparing and representing the text features in order to increase the accuracy of Arabic sentiment classification.

Original languageEnglish
Pages (from-to)9710-9722
Number of pages13
JournalJournal of King Saud University - Computer and Information Sciences
Volume34
Issue number10
Early online date27 Dec 2021
DOIs
Publication statusPublished - Nov 2022

Bibliographical note

This is an Open Access article distributed under the terms of the Creative
Commons Attribution Non-Commercial No Derivatives License (https://creativecommons.org/licenses/by-nc-nd/4.0/),
which permits unrestricted use, distribution, and reproduction in any medium,
provided the original work is properly cited..

Keywords

  • Arabic NLP
  • Arabic sentiment analysis
  • Arabic word normalisation
  • CNN
  • Deep learning
  • LSTM
  • Word embedding for Arabic

ASJC Scopus subject areas

  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Leveraging Arabic sentiment classification using an enhanced CNN-LSTM approach and effective Arabic text preparation'. Together they form a unique fingerprint.

Cite this