Abstract
Emotion recognition is critical for everyday living and is essential for meaningful interaction. If we are to progress towards human and machine interaction that is engaging the human user, the machine should be able to recognize the emotional state of the user. Deep Convolutional Neural Networks (CNN) have proven to be efficient in emotion recognition problems. The good degree of performance achieved by these classifiers can be attributed to their ability to self-learn a down-sampled feature vector that retains spatial information through filter kernels in convolutional layers. Given the view that random initialization of weights can lead to convergence to non-optimal local minima, in this paper we explore the impact of training the initial weights in an unsupervised manner. We study the effect of pre-training a Deep CNN as a Stacked Convolutional Auto-Encoder (SCAE) in a greedy layer-wise unsupervised fashion for emotion recognition using facial expression images. When trained with randomly initialized weights, our CNN emotion recognition model achieves a performance rate of 91.16% on the Karolinska Directed Emotional Faces (KDEF) dataset. In contrast, when each layer of the model, including the hidden layer, is pre-trained as an Auto-Encoder, the performance increases to 92.52%. Pre-training our CNN as a SCAE also reduces training time marginally. The emotion recognition model developed in this work will form the basis of a real-time empathic robot system.
| Original language | English |
|---|---|
| Title of host publication | 2017 International Joint Conference on Neural Networks, IJCNN 2017 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1586-1593 |
| Number of pages | 8 |
| Volume | 2017-May |
| ISBN (Electronic) | 9781509061815, 9781509061822 |
| ISBN (Print) | 9781509061839 |
| DOIs | |
| Publication status | Published - 3 Jul 2017 |
| Event | 2017 International Joint Conference on Neural Networks - Anchorage, United States Duration: 14 May 2017 → 19 May 2017 |
Conference
| Conference | 2017 International Joint Conference on Neural Networks |
|---|---|
| Abbreviated title | IJCNN 2017 |
| Country/Territory | United States |
| City | Anchorage |
| Period | 14/05/17 → 19/05/17 |
Funding
The authors would like to thank the funders of the Barry Gidden Fund, which partially funded the development of this work. The authors would also like to thank Coventry University and acknowledge it as the main funding body of this work.
ASJC Scopus subject areas
- Software
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Stacked deep convolutional auto-encoders for emotion recognition from facial expressions'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS