File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Vision Transformer for Adaptive Image Transmission over MIMO Channels

TitleVision Transformer for Adaptive Image Transmission over MIMO Channels
Authors
Keywordsimage transmission
Joint source channel coding
MIMO
semantic communications
vision transformer
Issue Date2023
Citation
IEEE International Conference on Communications, 2023, v. 2023-May, p. 3702-3707 How to Cite?
AbstractThis paper presents a vision transformer (ViT) based joint source and channel coding (JSCC) scheme for wireless image transmission over multiple-input multiple-output (MIMO) systems, called ViT-MIMO. The proposed ViT-MIMO architecture, in addition to outperforming separation-based benchmarks, can flexibly adapt to different channel conditions without requiring retraining. Specifically, exploiting the self-attention mechanism of the ViT enables the proposed ViT-MIMO model to adaptively learn the feature mapping and power allocation based on the source image and channel conditions. Numerical experiments show that ViT-MIMO can significantly improve the transmission quality across a large variety of scenarios, including varying channel conditions, making it an attractive solution for emerging semantic communication systems.
Persistent Identifierhttp://hdl.handle.net/10722/363540
ISSN

 

DC FieldValueLanguage
dc.contributor.authorWu, Haotian-
dc.contributor.authorShao, Yulin-
dc.contributor.authorBian, Chenghong-
dc.contributor.authorMikolajczyk, Krystian-
dc.contributor.authorGündüz, Deniz-
dc.date.accessioned2025-10-10T07:47:38Z-
dc.date.available2025-10-10T07:47:38Z-
dc.date.issued2023-
dc.identifier.citationIEEE International Conference on Communications, 2023, v. 2023-May, p. 3702-3707-
dc.identifier.issn1550-3607-
dc.identifier.urihttp://hdl.handle.net/10722/363540-
dc.description.abstractThis paper presents a vision transformer (ViT) based joint source and channel coding (JSCC) scheme for wireless image transmission over multiple-input multiple-output (MIMO) systems, called ViT-MIMO. The proposed ViT-MIMO architecture, in addition to outperforming separation-based benchmarks, can flexibly adapt to different channel conditions without requiring retraining. Specifically, exploiting the self-attention mechanism of the ViT enables the proposed ViT-MIMO model to adaptively learn the feature mapping and power allocation based on the source image and channel conditions. Numerical experiments show that ViT-MIMO can significantly improve the transmission quality across a large variety of scenarios, including varying channel conditions, making it an attractive solution for emerging semantic communication systems.-
dc.languageeng-
dc.relation.ispartofIEEE International Conference on Communications-
dc.subjectimage transmission-
dc.subjectJoint source channel coding-
dc.subjectMIMO-
dc.subjectsemantic communications-
dc.subjectvision transformer-
dc.titleVision Transformer for Adaptive Image Transmission over MIMO Channels-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/ICC45041.2023.10278812-
dc.identifier.scopuseid_2-s2.0-85160194002-
dc.identifier.volume2023-May-
dc.identifier.spage3702-
dc.identifier.epage3707-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats