File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/FPT.2010.5681425
- Scopus: eid_2-s2.0-79551565216
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Design space exploration for sparse matrix-matrix multiplication on FPGAs
Title | Design space exploration for sparse matrix-matrix multiplication on FPGAs |
---|---|
Authors | |
Keywords | Architecture designs Block sizes Computational architecture Dense matrices Design space exploration |
Issue Date | 2010 |
Publisher | IEEE, Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000290 |
Citation | The 2010 International Conference on Field-Programmable Technology (FPT 2010), Beijing, China 8-10 December 2010. In IEEE International Conference on FieId-Programmable Technology Proceedings, 2010, p. 369-372 How to Cite? |
Abstract | The design and implementation of a sparse matrix-matrix multiplication architecture on FPGAs is presented. Performance of the design, in terms of computational latency, as well as the associated power-delay and energy-delay tradeoff are studied. Taking advantage of the sparsity of the input matrices, the proposed design allows user-tunable power-delay and energy-delay tradeoffs by employing different number of processing elements (PEs) in the architecture design and different block size in the blocking decomposition. Such ability allows designers to employ different on-chip computational architecture for different system power-delay and energy-delay requirements. It is in contrast to conventional dense matrix-matrix multiplication architectures that always favor the maximum number of PEs and largest block size. In our implementation, the better energy consumption and power-delay product favors less PEs and smaller block size for the 90%-sparsity matrix-matrix multiplications. While in order to achieve better energy-delay product, more PEs and larger block size are preferred. © 2010 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/129664 |
ISBN | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lin, CY | en_HK |
dc.contributor.author | Zhang, Z | en_HK |
dc.contributor.author | Wong, N | en_HK |
dc.contributor.author | So, HKH | en_HK |
dc.date.accessioned | 2010-12-23T08:40:50Z | - |
dc.date.available | 2010-12-23T08:40:50Z | - |
dc.date.issued | 2010 | en_HK |
dc.identifier.citation | The 2010 International Conference on Field-Programmable Technology (FPT 2010), Beijing, China 8-10 December 2010. In IEEE International Conference on FieId-Programmable Technology Proceedings, 2010, p. 369-372 | en_HK |
dc.identifier.isbn | 978-1-4244-8980-0 | - |
dc.identifier.uri | http://hdl.handle.net/10722/129664 | - |
dc.description.abstract | The design and implementation of a sparse matrix-matrix multiplication architecture on FPGAs is presented. Performance of the design, in terms of computational latency, as well as the associated power-delay and energy-delay tradeoff are studied. Taking advantage of the sparsity of the input matrices, the proposed design allows user-tunable power-delay and energy-delay tradeoffs by employing different number of processing elements (PEs) in the architecture design and different block size in the blocking decomposition. Such ability allows designers to employ different on-chip computational architecture for different system power-delay and energy-delay requirements. It is in contrast to conventional dense matrix-matrix multiplication architectures that always favor the maximum number of PEs and largest block size. In our implementation, the better energy consumption and power-delay product favors less PEs and smaller block size for the 90%-sparsity matrix-matrix multiplications. While in order to achieve better energy-delay product, more PEs and larger block size are preferred. © 2010 IEEE. | en_HK |
dc.language | eng | en_US |
dc.publisher | IEEE, Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000290 | - |
dc.relation.ispartof | Proceedings - 2010 International Conference on Field-Programmable Technology, FPT'10 | en_HK |
dc.rights | IEEE International Conference on FieId-Programmable Technology Proceedings. Copyright © IEEE, Computer Society. | - |
dc.rights | ©2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | Architecture designs | - |
dc.subject | Block sizes | - |
dc.subject | Computational architecture | - |
dc.subject | Dense matrices | - |
dc.subject | Design space exploration | - |
dc.title | Design space exploration for sparse matrix-matrix multiplication on FPGAs | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Wong, N:nwong@eee.hku.hk | en_HK |
dc.identifier.email | So, HKH:hso@eee.hku.hk | en_HK |
dc.identifier.authority | Wong, N=rp00190 | en_HK |
dc.identifier.authority | So, HKH=rp00169 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/FPT.2010.5681425 | en_HK |
dc.identifier.scopus | eid_2-s2.0-79551565216 | en_HK |
dc.identifier.hkuros | 178004 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-79551565216&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 369 | en_HK |
dc.identifier.epage | 372 | en_HK |
dc.publisher.place | United States | - |
dc.description.other | The 2010 International Conference on Field-Programmable Technology (FPT 2010), Beijing, China 8-10 December 2010. In IEEE International Conference on FieId-Programmable Technology Proceedings, 2010, p. 369-372 | - |
dc.identifier.scopusauthorid | Lin, CY=35177986900 | en_HK |
dc.identifier.scopusauthorid | Zhang, Z=35390468200 | en_HK |
dc.identifier.scopusauthorid | Wong, N=35235551600 | en_HK |
dc.identifier.scopusauthorid | So, HKH=10738896800 | en_HK |