File Download
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Dynamic dictionary matching and compressed suffix trees
Title | Dynamic dictionary matching and compressed suffix trees |
---|---|
Authors | |
Keywords | Mathematics computers |
Issue Date | 2005 |
Publisher | Society for Industrial and Applied Mathematics. |
Citation | Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2005), Vancouver, BC, 23-25 January 2005. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2005, p. 13-22 How to Cite? |
Abstract | Recent breakthrough in compressed indexing data structures has reduced the space for indexing a text (or a collection of texts) of length n from O(n log n) bits to O(n) bits, while allowing very efficient pattern matching. Yet the compressed nature of such indices also makes them difficult to update dynamically. This paper presents the first O(n)-bit representation of a suffix tree for a dynamic collection of texts whose total length is n, which supports insertion and deletion of a text T in O(|T| log2 n) time, as well as all suffix tree traversal operations, including forward and backward suffix links. This work can be regarded as a generalization of the compressed representation of static texts. Our new suffix tree representation serves as a core part in a compact solution for the dynamic dictionary matching problem, i.e., providing an O(d)-bit data structure for a dynamic collection of patterns of total length d that can support the dictionary matching query efficiently. When compared with the O(d log d)-bit suffix tree based solution of Amir et al., the compact solution increases the query time by roughly a factor of log d only. In the study of the above results, we also derive the first O(n)-bit representation for maintaining n pairs of balanced parentheses in O(log n/log log n) time per operation, matching the time complexity of the previous O(n log n)-bit solution. |
Persistent Identifier | http://hdl.handle.net/10722/45528 |
ISSN | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chan, HL | en_HK |
dc.contributor.author | Hon, WK | en_HK |
dc.contributor.author | Lam, TW | en_HK |
dc.contributor.author | Sadakane, K | en_HK |
dc.date.accessioned | 2007-10-30T06:28:31Z | - |
dc.date.available | 2007-10-30T06:28:31Z | - |
dc.date.issued | 2005 | en_HK |
dc.identifier.citation | Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2005), Vancouver, BC, 23-25 January 2005. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2005, p. 13-22 | en_HK |
dc.identifier.issn | 1071-9040 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/45528 | - |
dc.description.abstract | Recent breakthrough in compressed indexing data structures has reduced the space for indexing a text (or a collection of texts) of length n from O(n log n) bits to O(n) bits, while allowing very efficient pattern matching. Yet the compressed nature of such indices also makes them difficult to update dynamically. This paper presents the first O(n)-bit representation of a suffix tree for a dynamic collection of texts whose total length is n, which supports insertion and deletion of a text T in O(|T| log2 n) time, as well as all suffix tree traversal operations, including forward and backward suffix links. This work can be regarded as a generalization of the compressed representation of static texts. Our new suffix tree representation serves as a core part in a compact solution for the dynamic dictionary matching problem, i.e., providing an O(d)-bit data structure for a dynamic collection of patterns of total length d that can support the dictionary matching query efficiently. When compared with the O(d log d)-bit suffix tree based solution of Amir et al., the compact solution increases the query time by roughly a factor of log d only. In the study of the above results, we also derive the first O(n)-bit representation for maintaining n pairs of balanced parentheses in O(log n/log log n) time per operation, matching the time complexity of the previous O(n log n)-bit solution. | en_HK |
dc.format.extent | 951964 bytes | - |
dc.format.extent | 4181 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | text/plain | - |
dc.language | eng | en_HK |
dc.publisher | Society for Industrial and Applied Mathematics. | en_HK |
dc.relation.ispartof | Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms | en_HK |
dc.rights | © 2005 Society for Industrial and Applied Mathematics. First Published in Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms in 2005, published by the Society for Industrial and Applied Mathematics (SIAM). | - |
dc.subject | Mathematics computers | en_HK |
dc.title | Dynamic dictionary matching and compressed suffix trees | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Chan, HL:hlchan@cs.hku.hk | en_HK |
dc.identifier.email | Lam, TW:twlam@cs.hku.hk | en_HK |
dc.identifier.authority | Chan, HL=rp01310 | en_HK |
dc.identifier.authority | Lam, TW=rp00135 | en_HK |
dc.description.nature | published_or_final_version | en_HK |
dc.identifier.scopus | eid_2-s2.0-20744440148 | en_HK |
dc.identifier.hkuros | 102695 | - |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-20744440148&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 13 | en_HK |
dc.identifier.epage | 22 | en_HK |
dc.identifier.scopusauthorid | Chan, HL=7403402384 | en_HK |
dc.identifier.scopusauthorid | Hon, WK=7004282818 | en_HK |
dc.identifier.scopusauthorid | Lam, TW=7202523165 | en_HK |
dc.identifier.scopusauthorid | Sadakane, K=7005716583 | en_HK |
dc.identifier.issnl | 1071-9040 | - |