Improving Sentiment Analysis in Arabic Using Word Representation

Abdulaziz Alayba, Vasile Palade, Matthew England, Rahat Iqbal

    Research output: Chapter in Book/Report/Conference proceedingConference proceedingpeer-review

    68 Citations (Scopus)
    205 Downloads (Pure)

    Abstract

    The complexities of Arabic language in morphology, orthography and dialects makes sentiment analysis for Arabic more challenging. Also, text feature extraction from short messages like tweets, in order to gauge the sentiment, makes this task even more difficult. In recent years, deep neural networks were often employed and showed very good results in sentiment classification and natural language processing applications. Word embedding, or word distributing approach, is a current and powerful tool to capture together the closest words from a contextual text. In this paper, we describe how we construct Word2Vec models from a large Arabic corpus obtained from ten newspapers in different Arab countries. By applying different machine learning algorithms and convolutional neural networks with different text feature selections, we report improved accuracy of sentiment classification (91%-95%) on our publicly available Arabic language health sentiment dataset [1]. Keywords - Arabic Sentiment Analysis, Machine Learning, Convolutional Neural Networks, Word Embedding, Word2Vec for Arabic, Lexicon.
    Original languageEnglish
    Title of host publicationProc. 2nd International Workshop on Arabic Script Analysis and Recognition (ASAR '18)
    PublisherIEEE Computer Society
    Pages13-18
    Number of pages6
    ISBN (Electronic)978-1-5386-1459-4
    ISBN (Print) 978-1-5386-1460-0
    DOIs
    Publication statusPublished - 4 Oct 2018
    EventIEEE International Workshop on Arabic and derived Script Analysis and Recognition - London, United Kingdom
    Duration: 12 Mar 201814 Mar 2018
    Conference number: 2
    http://asar.ieee.tn/

    Workshop

    WorkshopIEEE International Workshop on Arabic and derived Script Analysis and Recognition
    Abbreviated titleASAR
    Country/TerritoryUnited Kingdom
    CityLondon
    Period12/03/1814/03/18
    Internet address

    Bibliographical note

    © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

    Keywords

    • Arabic Sentiment Analysis
    • Machine Learning
    • Convolutional Neural Networks
    • Word Embedding
    • Word2Vec for Arabic
    • Lexicon

    Fingerprint

    Dive into the research topics of 'Improving Sentiment Analysis in Arabic Using Word Representation'. Together they form a unique fingerprint.

    Cite this