Approximate string matching using compressed suffix arrays

Huynh, TND; Hon, WK; Lam, TW; Sung, WK

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.tcs.2005.11.022
Scopus: eid_2-s2.0-32644436921
WOS: WOS:000235826900018
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: Approximate string matching using compressed suffix arrays

Title	Approximate string matching using compressed suffix arrays
Authors	Huynh, TND Hon, WK Lam, TW Sung, WK
Issue Date	2006
Publisher	Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/tcs
Citation	Theoretical Computer Science, 2006, v. 352 n. 1-3, p. 240-249 How to Cite? DOI: http://dx.doi.org/10.1016/j.tcs.2005.11.022
Abstract	Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabet A. The k-difference (k-mismatch, respectively) problem is to find all occurrences of P in T that have edit distance (Hamming distance, respectively) at most k from P. In this paper we investigate a well-studied case in which T is fixed and preprocessed into an indexing data structure so that any pattern query can be answered faster. We give a solution using an O(nlogn) bits indexing data structure with O(\|A\|kmk·max(k,logn) +occ) query time, where occ is the number of occurrences. The best previous result requires O(nlogn) bits indexing data structure and gives O(\|A\|kmk+2+occ) query time. Our solution also allows us to exploit compressed suffix arrays to reduce the indexing space to O(n) bits, while increasing the query time by an O(logn) factor only. © 2005 Elsevier B.V. All right reserved.
Persistent Identifier	http://hdl.handle.net/10722/152330
ISSN	0304-3975 2023 Impact Factor: 0.9 2023 SCImago Journal Rankings: 0.570
ISI Accession Number ID	WOS:000235826900018
References	References in Scopus

DC Field	Value	Language
dc.contributor.author	Huynh, TND	en_US
dc.contributor.author	Hon, WK	en_US
dc.contributor.author	Lam, TW	en_US
dc.contributor.author	Sung, WK	en_US
dc.date.accessioned	2012-06-26T06:37:14Z	-
dc.date.available	2012-06-26T06:37:14Z	-
dc.date.issued	2006	en_US
dc.identifier.citation	Theoretical Computer Science, 2006, v. 352 n. 1-3, p. 240-249	en_US
dc.identifier.issn	0304-3975	en_US
dc.identifier.uri	http://hdl.handle.net/10722/152330	-
dc.description.abstract	Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabet A. The k-difference (k-mismatch, respectively) problem is to find all occurrences of P in T that have edit distance (Hamming distance, respectively) at most k from P. In this paper we investigate a well-studied case in which T is fixed and preprocessed into an indexing data structure so that any pattern query can be answered faster. We give a solution using an O(nlogn) bits indexing data structure with O(\|A\|kmk·max(k,logn) +occ) query time, where occ is the number of occurrences. The best previous result requires O(nlogn) bits indexing data structure and gives O(\|A\|kmk+2+occ) query time. Our solution also allows us to exploit compressed suffix arrays to reduce the indexing space to O(n) bits, while increasing the query time by an O(logn) factor only. © 2005 Elsevier B.V. All right reserved.	en_US
dc.language	eng	en_US
dc.publisher	Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/tcs	en_US
dc.relation.ispartof	Theoretical Computer Science	en_US
dc.title	Approximate string matching using compressed suffix arrays	en_US
dc.type	Article	en_US
dc.identifier.email	Lam, TW:twlam@cs.hku.hk	en_US
dc.identifier.authority	Lam, TW=rp00135	en_US
dc.description.nature	link_to_subscribed_fulltext	en_US
dc.identifier.doi	10.1016/j.tcs.2005.11.022	en_US
dc.identifier.scopus	eid_2-s2.0-32644436921	en_US
dc.identifier.hkuros	130777	-
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-32644436921&selection=ref&src=s&origin=recordpage	en_US
dc.identifier.volume	352	en_US
dc.identifier.issue	1-3	en_US
dc.identifier.spage	240	en_US
dc.identifier.epage	249	en_US
dc.identifier.isi	WOS:000235826900018	-
dc.publisher.place	Netherlands	en_US
dc.identifier.scopusauthorid	Huynh, TND=8846778300	en_US
dc.identifier.scopusauthorid	Hon, WK=7004282818	en_US
dc.identifier.scopusauthorid	Lam, TW=7202523165	en_US
dc.identifier.scopusauthorid	Sung, WK=13310059700	en_US
dc.identifier.issnl	0304-3975	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Approximate string matching using compressed suffix arrays

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats