High-Throughput Line Buffer Microarchitecture for Arbitrary Sized Streaming Image Processing

SHI, R; Wong, JS; So, HKH

File Download

content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.3390/jimaging5030034
Scopus: eid_2-s2.0-85067639240
WOS: WOS:000464312500001
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Electrical & Electronic Engineering: Journal/Magazine Articles

Article: High-Throughput Line Buffer Microarchitecture for Arbitrary Sized Streaming Image Processing

Title	High-Throughput Line Buffer Microarchitecture for Arbitrary Sized Streaming Image Processing
Authors	SHI, R Wong, JS So, HKH
Keywords	streaming architecture low-latency high-throughput FPGA D-SWIM
Issue Date	2019
Publisher	MDPI AG. The Journal's web site is located at http://www.mdpi.com/journal/jimaging
Citation	Journal of Imaging, 2019, v. 5 n. 3, p. 34 How to Cite? DOI: http://dx.doi.org/10.3390/jimaging5030034
Abstract	Parallel hardware designed for image processing promotes vision-guided intelligent applications. With the advantages of high-throughput and low-latency, streaming architecture on FPGA is especially attractive to real-time image processing. Notably, many real-world applications, such as region of interest (ROI) detection, demand the ability to process images continuously at different sizes and resolutions in hardware without interruptions. FPGA is especially suitable for implementation of such flexible streaming architecture, but most existing solutions require run-time reconfiguration, and hence cannot achieve seamless image size-switching. In this paper, we propose a dynamically-programmable buffer architecture (D-SWIM) based on the Stream-Windowing Interleaved Memory (SWIM) architecture to realize image processing on FPGA for image streams at arbitrary sizes defined at run time. D-SWIM redefines the way that on-chip memory is organized and controlled, and the hardware adapts to arbitrary image size with sub-100 ns delay that ensures minimum interruptions to the image processing at a high frame rate. Compared to the prior SWIM buffer for high-throughput scenarios, D-SWIM achieved dynamic programmability with only a slight overhead on logic resource usage, but saved up to 56% of the BRAM resource. The D-SWIM buffer achieves a max operating frequency of 329.5 MHz and reduction in power consumption by 45.7% comparing with the SWIM scheme. Real-world image processing applications, such as 2D-Convolution and the Harris Corner Detector, have also been used to evaluate D-SWIM’s performance, where a pixel throughput of 4.5 Giga Pixel/s and 4.2 Giga Pixel/s were achieved respectively in each case. Compared to the implementation with prior streaming frameworks, the D-SWIM-based design not only realizes seamless image size-switching, but also improves hardware efficiency up to 30×.
Persistent Identifier	http://hdl.handle.net/10722/275024
ISSN	2313-433X 2023 Impact Factor: 2.7 2023 SCImago Journal Rankings: 0.717
ISI Accession Number ID	WOS:000464312500001

DC Field	Value	Language
dc.contributor.author	SHI, R	-
dc.contributor.author	Wong, JS	-
dc.contributor.author	So, HKH	-
dc.date.accessioned	2019-09-10T02:33:53Z	-
dc.date.available	2019-09-10T02:33:53Z	-
dc.date.issued	2019	-
dc.identifier.citation	Journal of Imaging, 2019, v. 5 n. 3, p. 34	-
dc.identifier.issn	2313-433X	-
dc.identifier.uri	http://hdl.handle.net/10722/275024	-
dc.description.abstract	Parallel hardware designed for image processing promotes vision-guided intelligent applications. With the advantages of high-throughput and low-latency, streaming architecture on FPGA is especially attractive to real-time image processing. Notably, many real-world applications, such as region of interest (ROI) detection, demand the ability to process images continuously at different sizes and resolutions in hardware without interruptions. FPGA is especially suitable for implementation of such flexible streaming architecture, but most existing solutions require run-time reconfiguration, and hence cannot achieve seamless image size-switching. In this paper, we propose a dynamically-programmable buffer architecture (D-SWIM) based on the Stream-Windowing Interleaved Memory (SWIM) architecture to realize image processing on FPGA for image streams at arbitrary sizes defined at run time. D-SWIM redefines the way that on-chip memory is organized and controlled, and the hardware adapts to arbitrary image size with sub-100 ns delay that ensures minimum interruptions to the image processing at a high frame rate. Compared to the prior SWIM buffer for high-throughput scenarios, D-SWIM achieved dynamic programmability with only a slight overhead on logic resource usage, but saved up to 56% of the BRAM resource. The D-SWIM buffer achieves a max operating frequency of 329.5 MHz and reduction in power consumption by 45.7% comparing with the SWIM scheme. Real-world image processing applications, such as 2D-Convolution and the Harris Corner Detector, have also been used to evaluate D-SWIM’s performance, where a pixel throughput of 4.5 Giga Pixel/s and 4.2 Giga Pixel/s were achieved respectively in each case. Compared to the implementation with prior streaming frameworks, the D-SWIM-based design not only realizes seamless image size-switching, but also improves hardware efficiency up to 30×.	-
dc.language	eng	-
dc.publisher	MDPI AG. The Journal's web site is located at http://www.mdpi.com/journal/jimaging	-
dc.relation.ispartof	Journal of Imaging	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject	streaming architecture	-
dc.subject	low-latency	-
dc.subject	high-throughput	-
dc.subject	FPGA	-
dc.subject	D-SWIM	-
dc.title	High-Throughput Line Buffer Microarchitecture for Arbitrary Sized Streaming Image Processing	-
dc.type	Article	-
dc.identifier.email	Wong, JS: jsjwong@hku.hk	-
dc.identifier.email	So, HKH: hso@eee.hku.hk	-
dc.identifier.authority	So, HKH=rp00169	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.3390/jimaging5030034	-
dc.identifier.scopus	eid_2-s2.0-85067639240	-
dc.identifier.hkuros	304139	-
dc.identifier.volume	5	-
dc.identifier.issue	3	-
dc.identifier.spage	34	-
dc.identifier.epage	34	-
dc.identifier.isi	WOS:000464312500001	-
dc.publisher.place	Switzerland	-
dc.identifier.issnl	2313-433X	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: High-Throughput Line Buffer Microarchitecture for Arbitrary Sized Streaming Image Processing

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats