File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1145/1267066.1267068
- Scopus: eid_2-s2.0-34548236885
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: Mining periodic patterns with gap requirement from sequences
Title | Mining periodic patterns with gap requirement from sequences |
---|---|
Authors | |
Keywords | Gap Requirement Periodic Pattern Sequence Mining |
Issue Date | 2007 |
Publisher | Association for Computing Machinery, Inc. The Journal's web site is located at http://tkdd.cs.uiuc.edu |
Citation | Acm Transactions On Knowledge Discovery From Data, 2007, v. 1 n. 2 How to Cite? |
Abstract | We study a problem of mining frequently occurring periodic patterns with a gap requirement from sequences. Given a character sequence S of length L and a pattern P of length l, we consider P a frequently occurring pattern in S if the probability of observing P given a randomly picked length-l subsequence of S exceeds a certain threshold. In many applications, particularly those related to bioinformatics, interesting patterns are periodic with a gap requirement. That is to say, the characters in P should match subsequences of S in such a way that the matching characters in S are separated by gaps of more or less the same size. We show the complexity of the mining problem and discuss why traditional mining algorithms are computationally infeasible. We propose practical algorithms for solving the problem and study their characteristics. We also present a case study in which we apply our algorithms on some DNA sequences. We discuss some interesting patterns obtained from the case study. © 2007 ACM. |
Persistent Identifier | http://hdl.handle.net/10722/152360 |
ISSN | 2023 Impact Factor: 4.0 2023 SCImago Journal Rankings: 1.303 |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhang, M | en_US |
dc.contributor.author | Kao, B | en_US |
dc.contributor.author | Cheung, DW | en_US |
dc.contributor.author | Yip, KY | en_US |
dc.date.accessioned | 2012-06-26T06:37:37Z | - |
dc.date.available | 2012-06-26T06:37:37Z | - |
dc.date.issued | 2007 | en_US |
dc.identifier.citation | Acm Transactions On Knowledge Discovery From Data, 2007, v. 1 n. 2 | en_US |
dc.identifier.issn | 1556-4681 | en_US |
dc.identifier.uri | http://hdl.handle.net/10722/152360 | - |
dc.description.abstract | We study a problem of mining frequently occurring periodic patterns with a gap requirement from sequences. Given a character sequence S of length L and a pattern P of length l, we consider P a frequently occurring pattern in S if the probability of observing P given a randomly picked length-l subsequence of S exceeds a certain threshold. In many applications, particularly those related to bioinformatics, interesting patterns are periodic with a gap requirement. That is to say, the characters in P should match subsequences of S in such a way that the matching characters in S are separated by gaps of more or less the same size. We show the complexity of the mining problem and discuss why traditional mining algorithms are computationally infeasible. We propose practical algorithms for solving the problem and study their characteristics. We also present a case study in which we apply our algorithms on some DNA sequences. We discuss some interesting patterns obtained from the case study. © 2007 ACM. | en_US |
dc.language | eng | en_US |
dc.publisher | Association for Computing Machinery, Inc. The Journal's web site is located at http://tkdd.cs.uiuc.edu | en_US |
dc.relation.ispartof | ACM Transactions on Knowledge Discovery from Data | en_US |
dc.subject | Gap Requirement | en_US |
dc.subject | Periodic Pattern | en_US |
dc.subject | Sequence Mining | en_US |
dc.title | Mining periodic patterns with gap requirement from sequences | en_US |
dc.type | Article | en_US |
dc.identifier.email | Kao, B:kao@cs.hku.hk | en_US |
dc.identifier.email | Cheung, DW:dcheung@cs.hku.hk | en_US |
dc.identifier.authority | Kao, B=rp00123 | en_US |
dc.identifier.authority | Cheung, DW=rp00101 | en_US |
dc.description.nature | link_to_subscribed_fulltext | en_US |
dc.identifier.doi | 10.1145/1267066.1267068 | en_US |
dc.identifier.scopus | eid_2-s2.0-34548236885 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-34548236885&selection=ref&src=s&origin=recordpage | en_US |
dc.identifier.volume | 1 | en_US |
dc.identifier.issue | 2 | en_US |
dc.publisher.place | United States | en_US |
dc.identifier.scopusauthorid | Zhang, M=20434954000 | en_US |
dc.identifier.scopusauthorid | Kao, B=35221592600 | en_US |
dc.identifier.scopusauthorid | Cheung, DW=34567902600 | en_US |
dc.identifier.scopusauthorid | Yip, KY=7101909946 | en_US |
dc.identifier.issnl | 1556-4681 | - |