Stacked deep convolutional auto-encoders for emotion recognition from facial expressions

Ariel Ruiz-Garcia, Mark Elshaw, Abdulrahman Altahhan, Vasile Palade

Research output: Chapter in Book/Report/Conference proceeding > Conference proceeding

16 Citations (Scopus)

Abstract

Emotion recognition is critical for everyday living and is essential for meaningful interaction. If we are to progress towards human and machine interaction that is engaging the human user, the machine should be able to recognize the emotional state of the user. Deep Convolutional Neural Networks (CNN) have proven to be efficient in emotion recognition problems. The good degree of performance achieved by these classifiers can be attributed to their ability to self-learn a down-sampled feature vector that retains spatial information through filter kernels in convolutional layers. Given the view that random initialization of weights can lead to convergence to non-optimal local minima, in this paper we explore the impact of training the initial weights in an unsupervised manner. We study the effect of pre-training a Deep CNN as a Stacked Convolutional Auto-Encoder (SCAE) in a greedy layer-wise unsupervised fashion for emotion recognition using facial expression images. When trained with randomly initialized weights, our CNN emotion recognition model achieves a performance rate of 91.16% on the Karolinska Directed Emotional Faces (KDEF) dataset. In contrast, when each layer of the model, including the hidden layer, is pre-trained as an Auto-Encoder, the performance increases to 92.52%. Pre-training our CNN as a SCAE also reduces training time marginally. The emotion recognition model developed in this work will form the basis of a real-time empathic robot system.
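The greedy layer-wise pre-training described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: small fully connected layers stand in for convolutional layers, the data are synthetic, and every name (`encode`, `train_autoencoder`, `pretrain_stack`, `toy_data`) and hyperparameter is an assumption chosen for illustration. Each layer is trained as its own auto-encoder on the output of the already trained, frozen layers beneath it, and only the encoder half is kept.

```python
import math
import random


def encode(x, W, b):
    """Encoder half of one layer: h = tanh(W x + b)."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + bj)
            for row, bj in zip(W, b)]


def train_autoencoder(data, n_hidden, epochs=200, lr=0.05, seed=0):
    """Train one tanh-encoder / linear-decoder auto-encoder with plain SGD.

    Returns the encoder weights, encoder biases, and the mean
    reconstruction loss per epoch. The decoder is discarded.
    """
    rng = random.Random(seed)
    n_in = len(data[0])
    W = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
    b = [0.0] * n_hidden
    V = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_in)]
    c = [0.0] * n_in
    losses = []
    for _ in range(epochs):
        total = 0.0
        for x in data:
            h = encode(x, W, b)
            # Linear decoder: r = V h + c
            r = [sum(v * hj for v, hj in zip(row, h)) + ci
                 for row, ci in zip(V, c)]
            dr = [ri - xi for ri, xi in zip(r, x)]  # d(loss)/d(r)
            total += 0.5 * sum(d * d for d in dr)
            # Back-propagate through the decoder and tanh before updating V.
            dz = [sum(V[i][j] * dr[i] for i in range(n_in)) * (1.0 - h[j] ** 2)
                  for j in range(n_hidden)]
            for i in range(n_in):
                c[i] -= lr * dr[i]
                for j in range(n_hidden):
                    V[i][j] -= lr * dr[i] * h[j]
            for j in range(n_hidden):
                b[j] -= lr * dz[j]
                for k in range(n_in):
                    W[j][k] -= lr * dz[j] * x[k]
        losses.append(total / len(data))
    return W, b, losses


def pretrain_stack(data, layer_sizes, **train_kwargs):
    """Greedy layer-wise pre-training: one auto-encoder per layer, bottom up."""
    layers, histories, features = [], [], data
    for n_hidden in layer_sizes:
        W, b, losses = train_autoencoder(features, n_hidden, **train_kwargs)
        layers.append((W, b))
        histories.append(losses)
        # The trained encoder is frozen; its outputs feed the next layer.
        features = [encode(x, W, b) for x in features]
    return layers, histories


# Synthetic 8-dimensional data lying on a 2-dimensional subspace (assumption).
rng = random.Random(1)
toy_data = []
for _ in range(30):
    t, s = rng.uniform(-1, 1), rng.uniform(-1, 1)
    toy_data.append([0.5 * (t * math.cos(k) + s * math.sin(k)) for k in range(8)])

layers, histories = pretrain_stack(toy_data, layer_sizes=[4, 2])
for depth, losses in enumerate(histories, start=1):
    print(f"layer {depth}: reconstruction loss {losses[0]:.4f} -> {losses[-1]:.4f}")
```

In a full pipeline the returned encoder weights would initialize the matching layers of the classifier before supervised fine-tuning; that step is omitted from this sketch.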

Original language: English
Title of host publication: 2017 International Joint Conference on Neural Networks, IJCNN 2017 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1586-1593
Number of pages: 8
Volume: 2017-May
ISBN (Electronic): 9781509061815, 9781509061822
ISBN (Print): 9781509061839
DOIs: https://doi.org/10.1109/IJCNN.2017.7966040
Publication status: Published - 3 Jul 2017
Event: 2017 International Joint Conference on Neural Networks - Anchorage, United States
Duration: 14 May 2017 to 19 May 2017

Conference

Conference: 2017 International Joint Conference on Neural Networks
Abbreviated title: IJCNN 2017
Country: United States
City: Anchorage
Period: 14/05/17 to 19/05/17

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

  • Cite this

    Ruiz-Garcia, A., Elshaw, M., Altahhan, A., & Palade, V. (2017). Stacked deep convolutional auto-encoders for emotion recognition from facial expressions. In 2017 International Joint Conference on Neural Networks, IJCNN 2017 - Proceedings (Vol. 2017-May, pp. 1586-1593). [7966040] Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IJCNN.2017.7966040