Efficient DenseNet Model with Fusion of Channel and Spatial Attention for Facial Expression Recognition

Facial expressions are a fundamental component of human communication, and Facial Expression Recognition (FER) has numerous potential applications. Convolutional neural networks, particularly those employing advanced architectures such as Densely Connected Networks (DenseNets), have demonstrated remarkable success in FER. Attention mechanisms have also been used to enhance feature extraction by focusing on critical image regions, which can yield more efficient models for image classification. This study introduces an efficient DenseNet model that uses a fusion of channel and spatial attention for FER, capitalizing on the respective strengths of the two mechanisms to enhance feature extraction while reducing model complexity in terms of parameter count. The model is evaluated on five popular datasets: JAFFE, CK+, OuluCASIA, KDEF, and RAF-DB. The results indicate an accuracy of at least 99.94% on the four lab-controlled datasets, surpassing the accuracy of all other compared methods. On the real-world RAF-DB dataset, the model achieves an accuracy of 83.18% when trained from scratch.

eISSN: 1314-4081
Language: English

Publication frequency: 4 issues per year
Journal subjects: Computer Science, Information Technology


Published online: 23 March 2024

Page range: 171-189

Received: 06 Nov 2023

Accepted: 22 Jan 2024

DOI: https://doi.org/10.2478/cait-2024-0010

Keywords: Convolutional neural networks, Dense connected network architectures, Channel and spatial attention mechanisms, Facial expression recognition

© 2024 Duong Thang Long, published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
