File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/3316781.3317813
- Scopus: eid_2-s2.0-85067788769
- WOS: WOS:000482058200182
Supplementary
- Citations:
- Appears in Collections:
Conference Paper: E-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System
Title | E-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System |
---|---|
Authors | |
Issue Date | 2019 |
Publisher | ACM Press. The Proceedings' web site is located at https://dl.acm.org/citation.cfm?id=3316781&picked=prox |
Citation | Proceedings of the 56th Annual Design Automation Conference 2019 (DAC '19), Las Vegas, NV, USA, 2-6 June 2019, article no. 182:1--182:6 How to Cite? |
Abstract | Various models with Long Short-Term Memory (LSTM) network have demonstrated prior art performances in sequential information processing. Previous LSTM-specific architectures set large on-chip memory for weight storage to alleviate the memory-bound issue and facilitate the LSTM inference in cloud computing. In this paper, E-LSTM is proposed for embedded scenarios with the consideration of the chip-area and limited data-access bandwidth. The heterogeneous hardware in E-LSTM tightly couples an LSTM co-processor with an embedded RISC-V CPU. The eSELL format is developed to represent the sparse weight matrix. With the proposed cell fusion optimization based on the inherent sparsity in computation, E-LSTM achieves up to 2.2× speedup of processing throughput. |
Persistent Identifier | http://hdl.handle.net/10722/275274 |
ISBN | |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Shi, R | - |
dc.contributor.author | Liu, J | - |
dc.contributor.author | So, HKH | - |
dc.contributor.author | Wang, S | - |
dc.contributor.author | Liang, Y | - |
dc.date.accessioned | 2019-09-10T02:39:11Z | - |
dc.date.available | 2019-09-10T02:39:11Z | - |
dc.date.issued | 2019 | - |
dc.identifier.citation | Proceedings of the 56th Annual Design Automation Conference 2019 (DAC '19), Las Vegas, NV, USA, 2-6 June 2019, article no. 182:1--182:6 | - |
dc.identifier.isbn | 9781450367257 | - |
dc.identifier.uri | http://hdl.handle.net/10722/275274 | - |
dc.description.abstract | Various models with Long Short-Term Memory (LSTM) network have demonstrated prior art performances in sequential information processing. Previous LSTM-specific architectures set large on-chip memory for weight storage to alleviate the memory-bound issue and facilitate the LSTM inference in cloud computing. In this paper, E-LSTM is proposed for embedded scenarios with the consideration of the chip-area and limited data-access bandwidth. The heterogeneous hardware in E-LSTM tightly couples an LSTM co-processor with an embedded RISC-V CPU. The eSELL format is developed to represent the sparse weight matrix. With the proposed cell fusion optimization based on the inherent sparsity in computation, E-LSTM achieves up to 2.2× speedup of processing throughput. | - |
dc.language | eng | - |
dc.publisher | ACM Press. The Proceedings' web site is located at https://dl.acm.org/citation.cfm?id=3316781&picked=prox | - |
dc.relation.ispartof | Proceedings of the 56th Annual Design Automation Conference 2019 (DAC '19) | - |
dc.title | E-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System | - |
dc.type | Conference_Paper | - |
dc.identifier.email | So, HKH: hso@eee.hku.hk | - |
dc.identifier.authority | So, HKH=rp00169 | - |
dc.identifier.doi | 10.1145/3316781.3317813 | - |
dc.identifier.scopus | eid_2-s2.0-85067788769 | - |
dc.identifier.hkuros | 304176 | - |
dc.identifier.spage | 182:1 | - |
dc.identifier.epage | 182:6 | - |
dc.identifier.isi | WOS:000482058200182 | - |
dc.publisher.place | New York, NY | - |