Abstract
The high variety in the forms of the Arabic words creates significant complexity related challenges in Natural Language Processing (NLP) tasks for Arabic text. These challenges can be dealt with by using different techniques for semantic representation, such as word embedding methods. In addition, approaches for reducing the diversity in Arabic morphologies can also be employed, for example using appropriate word normalisation for Arabic texts. Deep learning has proven to be very popular in solving different NLP tasks in recent years as well. This paper proposes an approach that combines Convolutional Neural Networks (CNNs) with Long Short-Term Memory (LSTM) networks to improve sentiment classification, by excluding the max-pooling layer from the CNN. This layer reduces the length of generated feature vectors after convolving the filters on the input data. As such, the LSTM networks will receive well-captured vectors from the feature maps. In addition, the paper investigated different effective approaches for preparing and representing the text features in order to increase the accuracy of Arabic sentiment classification.
Original language | English |
---|---|
Pages (from-to) | 9710-9722 |
Number of pages | 13 |
Journal | Journal of King Saud University - Computer and Information Sciences |
Volume | 34 |
Issue number | 10 |
Early online date | 27 Dec 2021 |
DOIs | |
Publication status | Published - Nov 2022 |
Bibliographical note
This is an Open Access article distributed under the terms of the CreativeCommons Attribution Non-Commercial No Derivatives License (https://creativecommons.org/licenses/by-nc-nd/4.0/),
which permits unrestricted use, distribution, and reproduction in any medium,
provided the original work is properly cited..
Keywords
- Arabic NLP
- Arabic sentiment analysis
- Arabic word normalisation
- CNN
- Deep learning
- LSTM
- Word embedding for Arabic
ASJC Scopus subject areas
- Computer Science(all)