Real-time GPU-based software beamformer designed for advanced imaging methods research

Yiu, BYS; Tsang, IKH; Yu, ACH

Publisher Website: 10.1109/ULTSYM.2010.5935689
Scopus: eid_2-s2.0-80054073904
Appears in Collections:
- Electrical & Electronic Engineering: Conference papers

Conference Paper: Real-time GPU-based software beamformer designed for advanced imaging methods research

Title	Real-time GPU-based software beamformer designed for advanced imaging methods research
Authors	Yiu, BYS; Tsang, IKH; Yu, ACH
Keywords	graphics processing units; parallel processing; plane wave imaging; software beamformer; synthetic aperture imaging
Issue Date	2010
Publisher	IEEE.
Citation	The 2010 IEEE International Ultrasonics Symposium, San Diego, CA., 11-14 October 2010. In Proceedings of IEEE IUS, 2010, p. 1920-1923. DOI: http://dx.doi.org/10.1109/ULTSYM.2010.5935689
Abstract	High computational demand is known to be a technical hurdle for real-time implementation of advanced methods like synthetic aperture imaging (SAI) and plane wave imaging (PWI) that work with the pre-beamform data of each array element. In this paper, we present the development of a software beamformer for SAI and PWI with real-time parallel processing capacity. Our beamformer design comprises a pipelined group of graphics processing units (GPU) that are hosted within the same computer workstation. During operation, each available GPU is assigned to perform demodulation and beamforming for one frame of pre-beamform data acquired from one transmit firing (e.g. point firing for SAI). To facilitate parallel computation, the GPUs have been programmed to treat the calculation of depth pixels from the same image scanline as a block of processing threads that can be executed concurrently, and it would repeat this process for all scanlines to obtain the entire frame of image data, i.e. low-resolution image (LRI). To reduce processing latency due to repeated access of each GPU's global memory, we have made use of each thread block's fast-shared memory (to store an entire line of pre-beamform data during demodulation), created texture memory pointers, and utilized global memory caches (to stream repeatedly used data samples during beamforming). Based on this beamformer architecture, a prototype platform has been implemented for SAI and PWI, and its LRI processing throughput has been measured for test datasets with 40 MHz sampling rate, 32 receive channels, and imaging depths between 5-15 cm. When using two Fermi-class GPUs (GTX-470), our beamformer can compute LRIs of 512-by-255 pixels at over 3200 fps and 1300 fps respectively for imaging depths of 5 cm and 15 cm. This processing throughput is roughly 3.2 times higher than a Tesla-class GPU (GTX-275). © 2010 IEEE.
Persistent Identifier	http://hdl.handle.net/10722/129641
ISBN	978-1-4577-0381-2
ISSN	1051-0117
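
The thread mapping described in the abstract (one depth pixel per thread, one scanline per thread block) can be illustrated with a plain delay-and-sum loop. The following is only a minimal CPU sketch in Python, not the authors' GPU code: it assumes a 0-degree plane-wave transmit, a sound speed of 1540 m/s, nearest-sample lookup, and real-valued channel data with the demodulation step omitted. All function and parameter names are hypothetical.

```python
import math

SPEED_OF_SOUND = 1540.0  # m/s; assumed soft-tissue value, not given in the abstract
SAMPLING_RATE = 40e6     # Hz; the 40 MHz rate quoted in the abstract


def samples_for_depth(depth_m, c=SPEED_OF_SOUND, fs=SAMPLING_RATE):
    """Samples per channel needed to cover the round trip to depth_m."""
    return math.ceil(2.0 * depth_m / c * fs)


def beamform_pixel(rf, element_x, x, z, c=SPEED_OF_SOUND, fs=SAMPLING_RATE):
    """Delay-and-sum for one pixel at (x, z); plays the role of one thread."""
    total = 0.0
    for channel, ex in zip(rf, element_x):
        # Assumed 0-degree plane wave: it reaches depth z after z / c, then
        # the echo travels back to the receive element located at (ex, 0).
        delay = (z + math.hypot(x - ex, z)) / c
        idx = int(round(delay * fs))  # nearest sample, no interpolation
        if 0 <= idx < len(channel):
            total += channel[idx]
    return total


def beamform_scanline(rf, element_x, x, depths):
    """All depth pixels of one scanline; plays the role of one thread block."""
    return [beamform_pixel(rf, element_x, x, z) for z in depths]


def beamform_frame(rf, element_x, line_x, depths):
    """Repeat the scanline step for every lateral position to form one LRI."""
    return [beamform_scanline(rf, element_x, x, depths) for x in line_x]


if __name__ == "__main__":
    # Hypothetical 32-element array with 0.3 mm pitch, centred on x = 0.
    element_x = [(i - 15.5) * 0.0003 for i in range(32)]
    rf = [[0.0] * samples_for_depth(0.05) for _ in element_x]
    lri = beamform_frame(rf, element_x, [0.0], [0.01, 0.02, 0.03])
    print(len(lri), len(lri[0]))
```

Because every pixel is computed independently of the others, the inner list comprehension is the part that maps naturally onto concurrently executing threads. Under the assumed sound speed, `samples_for_depth` gives roughly 2,600 samples per channel at 5 cm and 7,800 at 15 cm, which indicates how the per-frame data volume grows with imaging depth.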

DC Field	Value	Language
dc.contributor.author	Yiu, BYS	en_HK
dc.contributor.author	Tsang, IKH	en_HK
dc.contributor.author	Yu, ACH	en_HK
dc.date.accessioned	2010-12-23T08:40:38Z	-
dc.date.available	2010-12-23T08:40:38Z	-
dc.date.issued	2010	en_HK
dc.identifier.citation	The 2010 IEEE International Ultrasonics Symposium, San Diego, CA., 11-14 October 2010. In Proceedings of IEEE IUS, 2010, p. 1920-1923	en_HK
dc.identifier.isbn	978-1-4577-0381-2	-
dc.identifier.issn	1051-0117	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/129641	-
dc.description.abstract	High computational demand is known to be a technical hurdle for real-time implementation of advanced methods like synthetic aperture imaging (SAI) and plane wave imaging (PWI) that work with the pre-beamform data of each array element. In this paper, we present the development of a software beamformer for SAI and PWI with real-time parallel processing capacity. Our beamformer design comprises a pipelined group of graphics processing units (GPU) that are hosted within the same computer workstation. During operation, each available GPU is assigned to perform demodulation and beamforming for one frame of pre-beamform data acquired from one transmit firing (e.g. point firing for SAI). To facilitate parallel computation, the GPUs have been programmed to treat the calculation of depth pixels from the same image scanline as a block of processing threads that can be executed concurrently, and it would repeat this process for all scanlines to obtain the entire frame of image data, i.e. low-resolution image (LRI). To reduce processing latency due to repeated access of each GPU's global memory, we have made use of each thread block's fast-shared memory (to store an entire line of pre-beamform data during demodulation), created texture memory pointers, and utilized global memory caches (to stream repeatedly used data samples during beamforming). Based on this beamformer architecture, a prototype platform has been implemented for SAI and PWI, and its LRI processing throughput has been measured for test datasets with 40 MHz sampling rate, 32 receive channels, and imaging depths between 5-15 cm. When using two Fermi-class GPUs (GTX-470), our beamformer can compute LRIs of 512-by-255 pixels at over 3200 fps and 1300 fps respectively for imaging depths of 5 cm and 15 cm. This processing throughput is roughly 3.2 times higher than a Tesla-class GPU (GTX-275). © 2010 IEEE.	en_HK
dc.language	eng	en_US
dc.publisher	IEEE.	-
dc.relation.ispartof	Proceedings of the IEEE International Ultrasonics Symposium, IEEE IUS 2010	en_HK
dc.rights	©2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.	-
dc.subject	graphics processing units	en_HK
dc.subject	parallel processing	en_HK
dc.subject	plane wave imaging	en_HK
dc.subject	software beamformer	en_HK
dc.subject	synthetic aperture imaging	en_HK
dc.title	Real-time GPU-based software beamformer designed for advanced imaging methods research	en_HK
dc.type	Conference_Paper	en_HK
dc.identifier.openurl	http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=978-1-4577-0381-2&volume=&spage=1920&epage=1923&date=2010&atitle=Real-time+GPU-based+software+beamformer+designed+for+advanced+imaging+methods+research	-
dc.identifier.email	Yu, ACH:alfred.yu@hku.hk	en_HK
dc.identifier.authority	Yu, ACH=rp00657	en_HK
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1109/ULTSYM.2010.5935689	en_HK
dc.identifier.scopus	eid_2-s2.0-80054073904	en_HK
dc.identifier.hkuros	176828	en_US
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-80054073904&selection=ref&src=s&origin=recordpage	en_HK
dc.identifier.spage	1920	en_HK
dc.identifier.epage	1923	en_HK
dc.publisher.place	United States	en_HK
dc.description.other	The 2010 IEEE International Ultrasonics Symposium, San Diego, CA., 11-14 October 2010. In Proceedings of IEEE IUS, 2010, p. 1920-1923	-
dc.identifier.scopusauthorid	Yiu, BYS=26657783600	en_HK
dc.identifier.scopusauthorid	Tsang, IKH=26657657600	en_HK
dc.identifier.scopusauthorid	Yu, ACH=8699317700	en_HK
dc.identifier.issnl	1051-0117	-
