File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: SFFNet: Multi-Scale Sparse Focus Feature Cue-Aware for Efficient Facial Expression Recognition

TitleSFFNet: Multi-Scale Sparse Focus Feature Cue-Aware for Efficient Facial Expression Recognition
Authors
KeywordsFacial expression recognition
Sparse focus features
Swin Transformer
Issue Date25-Mar-2025
Abstract

Facial expression recognition is an important subtask in the field of computer vision. Accurate recognition of facial expressions plays a critical role in applications such as human-computer interaction, emotion analysis, and intelligent surveillance. In this work, we propose a novel approach for learning multi-scale sparse focus features by combining CNN and Swin Transformer architectures, which enables the effective capture of both local and global patterns in facial expression images. Additionally, we introduce two key modules: Adaptive Hierarchical Feature Construction (AHFC) and Dual Contextual Attention Synthesis Unit (DCASU). Specifically, the AHFC module leverages Swin Transformer to extract multi-scale facial expression features while mining the relationships among sparse focus features. The DCASU module, on the other hand, extracts local features and utilizes attention mechanisms to focus on crucial regions of the face. Extensive experiments demonstrate that our method achieves state-of-the-art performance on the RAF-DB (89.18%) and KDEF (95.81%) datasets, validating the effectiveness and superiority of our approach.


Persistent Identifierhttp://hdl.handle.net/10722/362492

 

DC FieldValueLanguage
dc.contributor.authorDeng, Liqian-
dc.contributor.authorLiu, Tingting-
dc.contributor.authorWang, Minhong-
dc.contributor.authorLiu, Hai-
dc.contributor.authorZhang, Zhaoli-
dc.contributor.authorLi, Youfu-
dc.date.accessioned2025-09-24T00:51:58Z-
dc.date.available2025-09-24T00:51:58Z-
dc.date.issued2025-03-25-
dc.identifier.urihttp://hdl.handle.net/10722/362492-
dc.description.abstract<p>Facial expression recognition is an important subtask in the field of computer vision. Accurate recognition of facial expressions plays a critical role in applications such as human-computer interaction, emotion analysis, and intelligent surveillance. In this work, we propose a novel approach for learning multi-scale sparse focus features by combining CNN and Swin Transformer architectures, which enables the effective capture of both local and global patterns in facial expression images. Additionally, we introduce two key modules: Adaptive Hierarchical Feature Construction (AHFC) and Dual Contextual Attention Synthesis Unit (DCASU). Specifically, the AHFC module leverages Swin Transformer to extract multi-scale facial expression features while mining the relationships among sparse focus features. The DCASU module, on the other hand, extracts local features and utilizes attention mechanisms to focus on crucial regions of the face. Extensive experiments demonstrate that our method achieves state-of-the-art performance on the RAF-DB (89.18%) and KDEF (95.81%) datasets, validating the effectiveness and superiority of our approach.<br></p>-
dc.languageeng-
dc.relation.ispartof2024 4th International Conference on Electronic Information Engineering and Computer Communication (EIECC) (27/12/2024-29/12/2024, Wuhan)-
dc.subjectFacial expression recognition-
dc.subjectSparse focus features-
dc.subjectSwin Transformer-
dc.titleSFFNet: Multi-Scale Sparse Focus Feature Cue-Aware for Efficient Facial Expression Recognition-
dc.typeConference_Paper-
dc.identifier.doi10.1109/EIECC64539.2024.10929596-
dc.identifier.scopuseid_2-s2.0-105002343433-
dc.identifier.spage582-
dc.identifier.epage585-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats