Vision Transformer for Adaptive Image Transmission over MIMO Channels

Wu, Haotian; Shao, Yulin; Bian, Chenghong; Mikolajczyk, Krystian; Gündüz, Deniz

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/ICC45041.2023.10278812
Scopus: eid_2-s2.0-85160194002
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Electrical & Electronic Engineering: Conference papers

Conference Paper: Vision Transformer for Adaptive Image Transmission over MIMO Channels

Title	Vision Transformer for Adaptive Image Transmission over MIMO Channels
Authors	Wu, Haotian Shao, Yulin Bian, Chenghong Mikolajczyk, Krystian Gündüz, Deniz
Keywords	image transmission Joint source channel coding MIMO semantic communications vision transformer
Issue Date	2023
Citation	IEEE International Conference on Communications, 2023, v. 2023-May, p. 3702-3707 How to Cite? DOI: http://dx.doi.org/10.1109/ICC45041.2023.10278812
Abstract	This paper presents a vision transformer (ViT) based joint source and channel coding (JSCC) scheme for wireless image transmission over multiple-input multiple-output (MIMO) systems, called ViT-MIMO. The proposed ViT-MIMO architecture, in addition to outperforming separation-based benchmarks, can flexibly adapt to different channel conditions without requiring retraining. Specifically, exploiting the self-attention mechanism of the ViT enables the proposed ViT-MIMO model to adaptively learn the feature mapping and power allocation based on the source image and channel conditions. Numerical experiments show that ViT-MIMO can significantly improve the transmission quality across a large variety of scenarios, including varying channel conditions, making it an attractive solution for emerging semantic communication systems.
Persistent Identifier	http://hdl.handle.net/10722/363540
ISSN	1550-3607

DC Field	Value	Language
dc.contributor.author	Wu, Haotian	-
dc.contributor.author	Shao, Yulin	-
dc.contributor.author	Bian, Chenghong	-
dc.contributor.author	Mikolajczyk, Krystian	-
dc.contributor.author	Gündüz, Deniz	-
dc.date.accessioned	2025-10-10T07:47:38Z	-
dc.date.available	2025-10-10T07:47:38Z	-
dc.date.issued	2023	-
dc.identifier.citation	IEEE International Conference on Communications, 2023, v. 2023-May, p. 3702-3707	-
dc.identifier.issn	1550-3607	-
dc.identifier.uri	http://hdl.handle.net/10722/363540	-
dc.description.abstract	This paper presents a vision transformer (ViT) based joint source and channel coding (JSCC) scheme for wireless image transmission over multiple-input multiple-output (MIMO) systems, called ViT-MIMO. The proposed ViT-MIMO architecture, in addition to outperforming separation-based benchmarks, can flexibly adapt to different channel conditions without requiring retraining. Specifically, exploiting the self-attention mechanism of the ViT enables the proposed ViT-MIMO model to adaptively learn the feature mapping and power allocation based on the source image and channel conditions. Numerical experiments show that ViT-MIMO can significantly improve the transmission quality across a large variety of scenarios, including varying channel conditions, making it an attractive solution for emerging semantic communication systems.	-
dc.language	eng	-
dc.relation.ispartof	IEEE International Conference on Communications	-
dc.subject	image transmission	-
dc.subject	Joint source channel coding	-
dc.subject	MIMO	-
dc.subject	semantic communications	-
dc.subject	vision transformer	-
dc.title	Vision Transformer for Adaptive Image Transmission over MIMO Channels	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/ICC45041.2023.10278812	-
dc.identifier.scopus	eid_2-s2.0-85160194002	-
dc.identifier.volume	2023-May	-
dc.identifier.spage	3702	-
dc.identifier.epage	3707	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Vision Transformer for Adaptive Image Transmission over MIMO Channels

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats