Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ULTSYM.2010.5935689
- Scopus: eid_2-s2.0-80054073904
- Appears in Collections:
Conference Paper: Real-time GPU-based software beamformer designed for advanced imaging methods research
Title | Real-time GPU-based software beamformer designed for advanced imaging methods research |
---|---|
Authors | Yiu, BYS; Tsang, IKH; Yu, ACH |
Keywords | graphics processing units; parallel processing; plane wave imaging; software beamformer; synthetic aperture imaging |
Issue Date | 2010 |
Publisher | IEEE. |
Citation | The 2010 IEEE International Ultrasonics Symposium, San Diego, CA., 11-14 October 2010. In Proceedings of IEEE IUS, 2010, p. 1920-1923 |
Abstract | High computational demand is known to be a technical hurdle for real-time implementation of advanced methods like synthetic aperture imaging (SAI) and plane wave imaging (PWI) that work with the pre-beamform data of each array element. In this paper, we present the development of a software beamformer for SAI and PWI with real-time parallel processing capacity. Our beamformer design comprises a pipelined group of graphics processing units (GPU) that are hosted within the same computer workstation. During operation, each available GPU is assigned to perform demodulation and beamforming for one frame of pre-beamform data acquired from one transmit firing (e.g. point firing for SAI). To facilitate parallel computation, the GPUs have been programmed to treat the calculation of depth pixels from the same image scanline as a block of processing threads that can be executed concurrently, and it would repeat this process for all scanlines to obtain the entire frame of image data i.e. low-resolution image (LRI). To reduce processing latency due to repeated access of each GPU's global memory, we have made use of each thread block's fast-shared memory (to store an entire line of pre-beamform data during demodulation), created texture memory pointers, and utilized global memory caches (to stream repeatedly used data samples during beamforming). Based on this beamformer architecture, a prototype platform has been implemented for SAI and PWI, and its LRI processing throughput has been measured for test datasets with 40 MHz sampling rate, 32 receive channels, and imaging depths between 5-15 cm. When using two Fermi-class GPUs (GTX-470), our beamformer can compute LRIs of 512-by-255 pixels at over 3200 fps and 1300 fps respectively for imaging depths of 5 cm and 15 cm. This processing throughput is roughly 3.2 times higher than a Tesla-class GPU (GTX-275). © 2010 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/129641 |
ISBN | 978-1-4577-0381-2 |
ISSN | 1051-0117 |
References |
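
The thread mapping described in the abstract (one block of threads per image scanline, one thread per depth pixel) can be illustrated with a plain delay-and-sum loop. The following is only a minimal CPU sketch under stated assumptions, not the authors' GPU implementation: it assumes a single zero-angle plane-wave firing, a speed of sound of 1540 m/s, nearest-sample delays, and real-valued channel data, and it omits the demodulation step and all GPU memory optimizations. The function name `beamform_lri`, the data layout, and the 8-element, 0.3 mm pitch array in the usage lines are hypothetical.

```python
import math

SPEED_OF_SOUND = 1540.0   # m/s, assumed soft-tissue value (not stated in the abstract)
SAMPLING_RATE = 40e6      # Hz, the 40 MHz sampling rate quoted in the abstract


def beamform_lri(channel_data, element_x, scanline_x, depths):
    """Delay-and-sum one low-resolution image (LRI) from one plane-wave firing.

    channel_data[ch][n] is sample n of receive channel ch (hypothetical layout).
    element_x, scanline_x and depths are positions in metres.
    """
    image = []
    for x in scanline_x:                  # one scanline, analogous to one thread block
        line = []
        for z in depths:                  # one depth pixel, analogous to one thread
            t_tx = z / SPEED_OF_SOUND     # zero-angle plane wave arrives at depth z
            acc = 0.0
            for ch, xe in enumerate(element_x):
                # Echo travel time from the pixel back to this array element.
                t_rx = math.hypot(x - xe, z) / SPEED_OF_SOUND
                n = round((t_tx + t_rx) * SAMPLING_RATE)   # nearest-sample delay
                if 0 <= n < len(channel_data[ch]):
                    acc += channel_data[ch][n]
            line.append(acc)
        image.append(line)
    return image


# Usage with synthetic data: an ideal point scatterer at x = 0, z = 10 mm.
element_x = [(i - 3.5) * 0.3e-3 for i in range(8)]   # assumed 8-element array
channel_data = [[0.0] * 2048 for _ in element_x]
for ch, xe in enumerate(element_x):
    delay = 0.01 / SPEED_OF_SOUND + math.hypot(0.0 - xe, 0.01) / SPEED_OF_SOUND
    channel_data[ch][round(delay * SAMPLING_RATE)] = 1.0
lri = beamform_lri(channel_data, element_x, scanline_x=[0.0], depths=[0.01, 0.02])
```

In this sketch the two outer loops are independent per pixel, which is what makes the scanline-per-block, pixel-per-thread decomposition possible; the pixel at the scatterer position sums coherently across all channels, while the deeper pixel picks up nothing.
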
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yiu, BYS | en_HK |
dc.contributor.author | Tsang, IKH | en_HK |
dc.contributor.author | Yu, ACH | en_HK |
dc.date.accessioned | 2010-12-23T08:40:38Z | - |
dc.date.available | 2010-12-23T08:40:38Z | - |
dc.date.issued | 2010 | en_HK |
dc.identifier.citation | The 2010 IEEE International Ultrasonics Symposium, San Diego, CA., 11-14 October 2010. In Proceedings of IEEE IUS, 2010, p. 1920-1923 | en_HK |
dc.identifier.isbn | 978-1-4577-0381-2 | - |
dc.identifier.issn | 1051-0117 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/129641 | - |
dc.description.abstract | High computational demand is known to be a technical hurdle for real-time implementation of advanced methods like synthetic aperture imaging (SAI) and plane wave imaging (PWI) that work with the pre-beamform data of each array element. In this paper, we present the development of a software beamformer for SAI and PWI with real-time parallel processing capacity. Our beamformer design comprises a pipelined group of graphics processing units (GPU) that are hosted within the same computer workstation. During operation, each available GPU is assigned to perform demodulation and beamforming for one frame of pre-beamform data acquired from one transmit firing (e.g. point firing for SAI). To facilitate parallel computation, the GPUs have been programmed to treat the calculation of depth pixels from the same image scanline as a block of processing threads that can be executed concurrently, and it would repeat this process for all scanlines to obtain the entire frame of image data i.e. low-resolution image (LRI). To reduce processing latency due to repeated access of each GPU's global memory, we have made use of each thread block's fast-shared memory (to store an entire line of pre-beamform data during demodulation), created texture memory pointers, and utilized global memory caches (to stream repeatedly used data samples during beamforming). Based on this beamformer architecture, a prototype platform has been implemented for SAI and PWI, and its LRI processing throughput has been measured for test datasets with 40 MHz sampling rate, 32 receive channels, and imaging depths between 5-15 cm. When using two Fermi-class GPUs (GTX-470), our beamformer can compute LRIs of 512-by-255 pixels at over 3200 fps and 1300 fps respectively for imaging depths of 5 cm and 15 cm. This processing throughput is roughly 3.2 times higher than a Tesla-class GPU (GTX-275). © 2010 IEEE. | en_HK |
dc.language | eng | en_US |
dc.publisher | IEEE. | - |
dc.relation.ispartof | Proceedings of the IEEE International Ultrasonics Symposium, IEEE IUS 2010 | en_HK |
dc.rights | ©2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | graphics processing units | en_HK |
dc.subject | parallel processing | en_HK |
dc.subject | plane wave imaging | en_HK |
dc.subject | software beamformer | en_HK |
dc.subject | synthetic aperture imaging | en_HK |
dc.title | Real-time GPU-based software beamformer designed for advanced imaging methods research | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=978-1-4577-0381-2&volume=&spage=1920&epage=1923&date=2010&atitle=Real-time+GPU-based+software+beamformer+designed+for+advanced+imaging+methods+research | - |
dc.identifier.email | Yu, ACH:alfred.yu@hku.hk | en_HK |
dc.identifier.authority | Yu, ACH=rp00657 | en_HK |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1109/ULTSYM.2010.5935689 | en_HK |
dc.identifier.scopus | eid_2-s2.0-80054073904 | en_HK |
dc.identifier.hkuros | 176828 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-80054073904&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 1920 | en_HK |
dc.identifier.epage | 1923 | en_HK |
dc.publisher.place | United States | en_HK |
dc.description.other | The 2010 IEEE International Ultrasonics Symposium, San Diego, CA., 11-14 October 2010. In Proceedings of IEEE IUS, 2010, p. 1920-1923 | - |
dc.identifier.scopusauthorid | Yiu, BYS=26657783600 | en_HK |
dc.identifier.scopusauthorid | Tsang, IKH=26657657600 | en_HK |
dc.identifier.scopusauthorid | Yu, ACH=8699317700 | en_HK |
dc.identifier.issnl | 1051-0117 | - |