Article: MotifVoter: A novel ensemble method for fine-grained integration of generic motif finders
| Title | MotifVoter: A novel ensemble method for fine-grained integration of generic motif finders | ||||||
|---|---|---|---|---|---|---|---|
| Authors | Wijaya, E1 4 Yiu, SM2 Son, NT4 Kanagasabai, R1 Sung, WK3 4 | ||||||
| Issue Date | 2008 | ||||||
| Publisher | Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/ | ||||||
| Citation | Bioinformatics, 2008, v. 24 n. 20, p. 2288-2295 [How to Cite?] DOI: http://dx.doi.org/10.1093/bioinformatics/btn420 | ||||||
| Abstract | Motivation: Locating transcription factor binding sites (motifs) is a key step in understanding gene regulation. Based on Tompa's benchmark study, the performance of current de novo motif finders is far from satisfactory (with sensitivity ≤0.222 and precision ≤0.307). The same study also shows that no motif finder performs consistently well over all datasets. Hence, it is not clear which finder one should use for a given dataset. To address this issue, a class of algorithms called ensemble methods have been proposed. Though the existing ensemble methods overall perform better than stand-alone motif finders, the improvement gained is not substantial. Our study reveals that these methods do not fully exploit the information obtained from the results of individual finders, resulting in minor improvement in sensitivity and poor precision. Results: In this article, we identify several key observations on how to utilize the results from individual finders and design a novel ensemble method, MotifVoter, to predict the motifs and binding sites. Evaluations on 186 datasets show that MotifVoter can locate more than 95% of the binding sites found by its component motif finders. In terms of sensitivity and precision, MotifVoter outperforms stand-alone motif finders and ensemble methods significantly on Tompa's benchmark, Escherichia coli, and ChIP-Chip datasets. MotifVoter is available online via a web server with several biologist-friendly features. © The Author 2008. Published by Oxford University Press. All rights reserved. | ||||||
| ISSN | 1367-4803 2011 Impact Factor: 5.468 2011 SCImago Journal Rankings: 1.118 | ||||||
| DOI | http://dx.doi.org/10.1093/bioinformatics/btn420 | ||||||
| ISI Accession Number ID | WOS:000259973500003
Funding Information: National University of Singapore (grant R-252-000326-112); Research Output Prize (Faculty of Engineering) of the University of HongKong to S.M.Y. | ||||||
| References | References in Scopus |
| dc.contributor.author | Wijaya, E | ||||||
|---|---|---|---|---|---|---|---|
| dc.contributor.author | Yiu, SM | ||||||
| dc.contributor.author | Son, NT | ||||||
| dc.contributor.author | Kanagasabai, R | ||||||
| dc.contributor.author | Sung, WK | ||||||
| dc.date.accessioned | 2010-05-31T04:14:45Z | ||||||
| dc.date.available | 2010-05-31T04:14:45Z | ||||||
| dc.date.issued | 2008 | ||||||
| dc.description.abstract | Motivation: Locating transcription factor binding sites (motifs) is a key step in understanding gene regulation. Based on Tompa's benchmark study, the performance of current de novo motif finders is far from satisfactory (with sensitivity ≤0.222 and precision ≤0.307). The same study also shows that no motif finder performs consistently well over all datasets. Hence, it is not clear which finder one should use for a given dataset. To address this issue, a class of algorithms called ensemble methods have been proposed. Though the existing ensemble methods overall perform better than stand-alone motif finders, the improvement gained is not substantial. Our study reveals that these methods do not fully exploit the information obtained from the results of individual finders, resulting in minor improvement in sensitivity and poor precision. Results: In this article, we identify several key observations on how to utilize the results from individual finders and design a novel ensemble method, MotifVoter, to predict the motifs and binding sites. Evaluations on 186 datasets show that MotifVoter can locate more than 95% of the binding sites found by its component motif finders. In terms of sensitivity and precision, MotifVoter outperforms stand-alone motif finders and ensemble methods significantly on Tompa's benchmark, Escherichia coli, and ChIP-Chip datasets. MotifVoter is available online via a web server with several biologist-friendly features. © The Author 2008. Published by Oxford University Press. All rights reserved. | ||||||
| dc.description.nature | link_to_OA_fulltext | ||||||
| dc.identifier.citation | Bioinformatics, 2008, v. 24 n. 20, p. 2288-2295 [How to Cite?] DOI: http://dx.doi.org/10.1093/bioinformatics/btn420 | ||||||
| dc.identifier.citeulike | 3132697 | ||||||
| dc.identifier.doi | http://dx.doi.org/10.1093/bioinformatics/btn420 | ||||||
| dc.identifier.epage | 2295 | ||||||
| dc.identifier.hkuros | 161318 | ||||||
| dc.identifier.isi | WOS:000259973500003
Funding Information: National University of Singapore (grant R-252-000326-112); Research Output Prize (Faculty of Engineering) of the University of HongKong to S.M.Y. | ||||||
| dc.identifier.issn | 1367-4803 2011 Impact Factor: 5.468 2011 SCImago Journal Rankings: 1.118 | ||||||
| dc.identifier.issue | 20 | ||||||
| dc.identifier.openurl | ![]() | ||||||
| dc.identifier.pmid | 18697768 | ||||||
| dc.identifier.scopus | eid_2-s2.0-53749085875 | ||||||
| dc.identifier.spage | 2288 | ||||||
| dc.identifier.uri | http://hdl.handle.net/10722/60600 | ||||||
| dc.identifier.volume | 24 | ||||||
| dc.language | eng | ||||||
| dc.publisher | Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/ | ||||||
| dc.publisher.place | United Kingdom | ||||||
| dc.relation.ispartof | Bioinformatics | ||||||
| dc.relation.references | References in Scopus | ||||||
| dc.rights | Bioinformatics. Copyright © Oxford University Press. | ||||||
| dc.subject.mesh | Computational Biology - methods | ||||||
| dc.subject.mesh | Regulatory Elements, Transcriptional | ||||||
| dc.subject.mesh | Transcription Factors - chemistry - metabolism | ||||||
| dc.subject.mesh | Protein Structure, Tertiary | ||||||
| dc.title | MotifVoter: A novel ensemble method for fine-grained integration of generic motif finders | ||||||
| dc.type | Article |
Author Affiliations
- Institute for Infocomm Research, A-Star, Singapore
- The University of Hong Kong
- Genome Institute of Singapore
- National University of Singapore


