Article: Detection of generic spaced motifs using submotif pattern mining
| Title | Detection of generic spaced motifs using submotif pattern mining |
|---|---|
| Authors | Wijaya, E1 4 Rajaraman, K1 Yiu, SM2 Sung, WK3 4 |
| Issue Date | 2007 |
| Publisher | Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/ |
| Citation | Bioinformatics, 2007, v. 23 n. 12, p. 1476-1485 [How to Cite?] DOI: http://dx.doi.org/10.1093/bioinformatics/btm118 |
| Abstract | Motivation: Identification of motifs is one of the critical stages in studying the regulatory interactions of genes. Motifs can have complicated patterns. In particular, spaced motifs, an important class of motifs, consist of several short segments separated by spacers of different lengths. Locating spaced motifs is not trivial. Existing motif-finding algorithms are either designed for monad motifs (short contiguous patterns with some mismatches) or have assumptions on the spacer lengths or can only handle at most two segments. An effective motif finder for generic spaced motifs is highly desirable. Results: This article proposes a novel approach for identifying spaced motifs with any number of spacers of different lengths. We introduce the notion of submotifs to capture the segments in the spaced motif and formulate the motif-finding problem as a frequent submotif mining problem. We provide an algorithm called SPACE to solve the problem. Based on experiments on real biological datasets, synthetic datasets and the motif assessment benchmarks by Tompa et al., we show that our algorithm performs better than existing tools for spaced motifs with improvements in both sensitivity and specificity and for monads, SPACE performs as good as other tools. © The Author 2007. Published by Oxford University Press. All rights reserved. |
| ISSN | 1367-4803 2011 Impact Factor: 5.468 2011 SCImago Journal Rankings: 1.118 |
| DOI | http://dx.doi.org/10.1093/bioinformatics/btm118 |
| References | References in Scopus |
| dc.contributor.author | Wijaya, E |
|---|---|
| dc.contributor.author | Rajaraman, K |
| dc.contributor.author | Yiu, SM |
| dc.contributor.author | Sung, WK |
| dc.date.accessioned | 2010-09-06T09:50:45Z |
| dc.date.available | 2010-09-06T09:50:45Z |
| dc.date.issued | 2007 |
| dc.description.abstract | Motivation: Identification of motifs is one of the critical stages in studying the regulatory interactions of genes. Motifs can have complicated patterns. In particular, spaced motifs, an important class of motifs, consist of several short segments separated by spacers of different lengths. Locating spaced motifs is not trivial. Existing motif-finding algorithms are either designed for monad motifs (short contiguous patterns with some mismatches) or have assumptions on the spacer lengths or can only handle at most two segments. An effective motif finder for generic spaced motifs is highly desirable. Results: This article proposes a novel approach for identifying spaced motifs with any number of spacers of different lengths. We introduce the notion of submotifs to capture the segments in the spaced motif and formulate the motif-finding problem as a frequent submotif mining problem. We provide an algorithm called SPACE to solve the problem. Based on experiments on real biological datasets, synthetic datasets and the motif assessment benchmarks by Tompa et al., we show that our algorithm performs better than existing tools for spaced motifs with improvements in both sensitivity and specificity and for monads, SPACE performs as good as other tools. © The Author 2007. Published by Oxford University Press. All rights reserved. |
| dc.description.nature | link_to_OA_fulltext |
| dc.identifier.citation | Bioinformatics, 2007, v. 23 n. 12, p. 1476-1485 [How to Cite?] DOI: http://dx.doi.org/10.1093/bioinformatics/btm118 |
| dc.identifier.citeulike | 1365971 |
| dc.identifier.doi | http://dx.doi.org/10.1093/bioinformatics/btm118 |
| dc.identifier.epage | 1485 |
| dc.identifier.hkuros | 161322 |
| dc.identifier.isi | WOS:000248271700006 |
| dc.identifier.issn | 1367-4803 2011 Impact Factor: 5.468 2011 SCImago Journal Rankings: 1.118 |
| dc.identifier.issue | 12 |
| dc.identifier.openurl | ![]() |
| dc.identifier.pmid | 17483509 |
| dc.identifier.scopus | eid_2-s2.0-34547840183 |
| dc.identifier.spage | 1476 |
| dc.identifier.uri | http://hdl.handle.net/10722/88970 |
| dc.identifier.volume | 23 |
| dc.language | eng |
| dc.publisher | Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/ |
| dc.publisher.place | United Kingdom |
| dc.relation.ispartof | Bioinformatics |
| dc.relation.references | References in Scopus |
| dc.subject.mesh | Amino Acid Motifs |
| dc.subject.mesh | Computational Biology - methods |
| dc.subject.mesh | Pattern Recognition, Automated |
| dc.subject.mesh | Protein Structure, Tertiary |
| dc.subject.mesh | Transcription Factors/genetics |
| dc.title | Detection of generic spaced motifs using submotif pattern mining |
| dc.type | Article |
Author Affiliations
- Institute for Infocomm Research, A-Star, Singapore
- The University of Hong Kong
- Genome Institute of Singapore
- National University of Singapore


