File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICC45041.2023.10278812
- Scopus: eid_2-s2.0-85160194002
- Find via

Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Vision Transformer for Adaptive Image Transmission over MIMO Channels
| Title | Vision Transformer for Adaptive Image Transmission over MIMO Channels |
|---|---|
| Authors | |
| Keywords | image transmission Joint source channel coding MIMO semantic communications vision transformer |
| Issue Date | 2023 |
| Citation | IEEE International Conference on Communications, 2023, v. 2023-May, p. 3702-3707 How to Cite? |
| Abstract | This paper presents a vision transformer (ViT) based joint source and channel coding (JSCC) scheme for wireless image transmission over multiple-input multiple-output (MIMO) systems, called ViT-MIMO. The proposed ViT-MIMO architecture, in addition to outperforming separation-based benchmarks, can flexibly adapt to different channel conditions without requiring retraining. Specifically, exploiting the self-attention mechanism of the ViT enables the proposed ViT-MIMO model to adaptively learn the feature mapping and power allocation based on the source image and channel conditions. Numerical experiments show that ViT-MIMO can significantly improve the transmission quality across a large variety of scenarios, including varying channel conditions, making it an attractive solution for emerging semantic communication systems. |
| Persistent Identifier | http://hdl.handle.net/10722/363540 |
| ISSN |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Wu, Haotian | - |
| dc.contributor.author | Shao, Yulin | - |
| dc.contributor.author | Bian, Chenghong | - |
| dc.contributor.author | Mikolajczyk, Krystian | - |
| dc.contributor.author | Gündüz, Deniz | - |
| dc.date.accessioned | 2025-10-10T07:47:38Z | - |
| dc.date.available | 2025-10-10T07:47:38Z | - |
| dc.date.issued | 2023 | - |
| dc.identifier.citation | IEEE International Conference on Communications, 2023, v. 2023-May, p. 3702-3707 | - |
| dc.identifier.issn | 1550-3607 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/363540 | - |
| dc.description.abstract | This paper presents a vision transformer (ViT) based joint source and channel coding (JSCC) scheme for wireless image transmission over multiple-input multiple-output (MIMO) systems, called ViT-MIMO. The proposed ViT-MIMO architecture, in addition to outperforming separation-based benchmarks, can flexibly adapt to different channel conditions without requiring retraining. Specifically, exploiting the self-attention mechanism of the ViT enables the proposed ViT-MIMO model to adaptively learn the feature mapping and power allocation based on the source image and channel conditions. Numerical experiments show that ViT-MIMO can significantly improve the transmission quality across a large variety of scenarios, including varying channel conditions, making it an attractive solution for emerging semantic communication systems. | - |
| dc.language | eng | - |
| dc.relation.ispartof | IEEE International Conference on Communications | - |
| dc.subject | image transmission | - |
| dc.subject | Joint source channel coding | - |
| dc.subject | MIMO | - |
| dc.subject | semantic communications | - |
| dc.subject | vision transformer | - |
| dc.title | Vision Transformer for Adaptive Image Transmission over MIMO Channels | - |
| dc.type | Conference_Paper | - |
| dc.description.nature | link_to_subscribed_fulltext | - |
| dc.identifier.doi | 10.1109/ICC45041.2023.10278812 | - |
| dc.identifier.scopus | eid_2-s2.0-85160194002 | - |
| dc.identifier.volume | 2023-May | - |
| dc.identifier.spage | 3702 | - |
| dc.identifier.epage | 3707 | - |
