Improving generalization ability of Instance-transfer Based Imbalanced Sentiment Classification of Turn-Level Interactive Chinese Texts

Feng Tian, Fan Wu, Xiang Fei, Nazaraf Shah, Qinhua Zheng, Yuanyuan Wang

    Research output: Contribution to journalArticlepeer-review

    5 Citations (Scopus)
    93 Downloads (Pure)

    Abstract

    Generally, a classification model achieving better generalization ability means the model performs better on the future incoming data, otherwise the history dataset. Increasing the generalization ability of multi-domain and imbalanced multi-class emotion classification of turn-level interactive Chinese texts poses the challenges due to its high dimension and sparse feature values in its feature space. Moreover, the properties of different feature spaces or diverse data distributions in various domains of target dataset (T) and source dataset (S) make it difficult to employ multi-class and multi-domain instance transfer. To address these challenges, we propose a data-level sampling approach for multi-class and multi-domain instance transfer which is inspired by transfer learning. To verify the validity of our proposed method, an imbalanced dataset is taken as target dataset, while three datasets, one collected from Bulletin Board System of Xi'an Jiaotong University and other two datasets collected from China microblog platform Weibo, as source datasets. The experimental results show that the proposed approach outperforms classic algorithms by alleviating the imbalanced problem in interactive texts effectively. Moreover, a classification model that is trained on immigrated datasets produced by employing our proposed method achieves the best ability of generalization.
    Original languageEnglish
    Pages (from-to)155-167
    Number of pages13
    JournalService Oriented Computing and Applications
    Volume13
    Issue number2
    Early online date17 Jun 2019
    DOIs
    Publication statusPublished - Jun 2019

    Bibliographical note

    The final publication is available at Springer via http://dx.doi.org/10.1007/s11761-01900264-y

    Copyright © and Moral Rights are retained by the author(s) and/ or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This item cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder(s). The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders.

    Keywords

    • Generalization ability
    • Imbalanced sentiment classification
    • Instance immigration-based sampling
    • Interactive Chinese texts
    • Multi-class
    • Multi-domain

    ASJC Scopus subject areas

    • Software
    • Management Information Systems
    • Information Systems
    • Hardware and Architecture

    Fingerprint

    Dive into the research topics of 'Improving generalization ability of Instance-transfer Based Imbalanced Sentiment Classification of Turn-Level Interactive Chinese Texts'. Together they form a unique fingerprint.

    Cite this