Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1186/1471-2105-9-S12-S3
- Scopus: eid_2-s2.0-57649234892
- PMID: 19091026
- WOS: WOS:000262154300003
Conference Paper: Filtering of false positive microRNA candidates by a clustering-based approach
Title | Filtering of false positive microRNA candidates by a clustering-based approach |
---|---|
Authors | Leung, WS; Lin, MCM; Cheung, DW; Yiu, SM |
Issue Date | 2008 |
Publisher | BioMed Central Ltd. The Journal's web site is located at http://www.biomedcentral.com/bmcbioinformatics/ |
Citation | BMC Bioinformatics, 2008, v. 9 Suppl. 12 |
Abstract | Background: MicroRNAs are small non-coding RNA gene products that play diverse roles across species. The explosive growth of microRNA research in recent years demonstrates the importance of microRNAs in biological systems, and microRNAs are believed to have valuable therapeutic potential in human diseases. Continued efforts are therefore required to locate and verify unknown microRNAs in various genomes. Because many miRNAs are arranged in clusters, that is, in close proximity to neighboring miRNAs, we are interested in applying the concept of microRNA clustering to computational microRNA prediction. Results: We first validate the microRNA clustering phenomenon in the human, mouse and rat genomes, where 45.45%, 51.86% and 48.67% of all miRNAs, respectively, are clustered. We then conduct sequence and secondary structure similarity analyses among clustered miRNAs, non-clustered miRNAs, neighboring sequences of clustered miRNAs and random sequences, and find that clustered miRNAs are structurally more similar to one another and that the RNAdistance score can be used to assess the structural similarity between two sequences. We therefore design a clustering-based approach that uses this observation to filter false positives from a list of candidates generated by a selected microRNA prediction program, and raise the positive predictive value by 15.23% to 23.19% in the human, mouse and rat genomes while keeping a reasonably high sensitivity. Conclusion: Our clustering-based approach can increase the effectiveness of currently available microRNA prediction programs by raising the positive predictive value while maintaining a high sensitivity, and hence can serve as a filtering step. We believe that it is worthwhile to carry out further experiments and tests with our approach using data from other genomes and other prediction software tools. Better results may be achieved with fine-tuning of parameters. © 2008 Leung et al; licensee BioMed Central Ltd. |
Description | B M C Bioinformatics |
Persistent Identifier | http://hdl.handle.net/10722/61686 |
ISSN | 1471-2105 (2023 Impact Factor: 2.9; 2023 SCImago Journal Rankings: 1.005) |
PubMed Central ID | PMC2638143 |
ISI Accession Number ID | WOS:000262154300003 |
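
The filtering idea summarized in the abstract (keep a predicted candidate only if it lies near another miRNA and is structurally similar to it, then compare positive predictive value and sensitivity before and after) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract gives neither a genomic distance threshold nor an RNAdistance cutoff, so `max_gap`, `max_dist`, the `Hairpin` record, and the sample data are all hypothetical, and `toy_distance` is a simple position-mismatch count standing in for a real structural score such as RNAdistance.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Hairpin:
    """Hypothetical record for a hairpin with a genomic position."""
    name: str
    chrom: str
    start: int        # genomic start coordinate
    structure: str    # dot-bracket secondary structure
    is_mirna: bool    # ground-truth label, used only for evaluation


def toy_distance(a: str, b: str) -> int:
    """Toy stand-in for a structural distance (NOT RNAdistance):
    number of mismatched positions plus the length difference."""
    return sum(x != y for x, y in zip(a, b)) + abs(len(a) - len(b))


def cluster_filter(
    candidates: List[Hairpin],
    known: List[Hairpin],
    max_gap: int,
    max_dist: int,
    distance: Callable[[str, str], int] = toy_distance,
) -> List[Hairpin]:
    """Keep candidates that are close to a known miRNA on the same
    chromosome AND structurally similar to it. Thresholds are assumed."""
    kept = []
    for c in candidates:
        for k in known:
            close = c.chrom == k.chrom and abs(c.start - k.start) <= max_gap
            if close and distance(c.structure, k.structure) <= max_dist:
                kept.append(c)
                break  # one supporting neighbor is enough
    return kept


def ppv_and_sensitivity(predicted: List[Hairpin], total_true: int) -> Tuple[float, float]:
    """PPV = TP / predicted positives; sensitivity = TP / all true miRNAs."""
    tp = sum(1 for c in predicted if c.is_mirna)
    ppv = tp / len(predicted) if predicted else 0.0
    sensitivity = tp / total_true if total_true else 0.0
    return ppv, sensitivity


# Invented example data, for illustration only.
known = [Hairpin("known_1", "chr1", 1000, "((((....))))", True)]
candidates = [
    Hairpin("cand_A", "chr1", 1500, "((((....))))", True),    # near and similar
    Hairpin("cand_B", "chr1", 2000, "............", False),   # near, dissimilar
    Hairpin("cand_C", "chr2", 1200, "((((....))))", False),   # other chromosome
    Hairpin("cand_D", "chr1", 90000, "(((......)))", True),   # too far away
]
total_true = sum(1 for c in candidates if c.is_mirna)

kept = cluster_filter(candidates, known, max_gap=3000, max_dist=2)
print("before:", ppv_and_sensitivity(candidates, total_true))
print("after: ", ppv_and_sensitivity(kept, total_true))
```

In this toy data the filter discards both false positives but also one isolated true miRNA, so the positive predictive value rises while sensitivity falls. That is the trade-off the abstract refers to when it reports a higher positive predictive value "while keeping a reasonably high sensitivity".
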
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Leung, WS | en_HK |
dc.contributor.author | Lin, MCM | en_HK |
dc.contributor.author | Cheung, DW | en_HK |
dc.contributor.author | Yiu, SM | en_HK |
dc.date.accessioned | 2010-07-13T03:45:06Z | - |
dc.date.available | 2010-07-13T03:45:06Z | - |
dc.date.issued | 2008 | en_HK |
dc.identifier.citation | Bmc Bioinformatics, 2008, v. 9 SUPPL. 12 | en_HK |
dc.identifier.issn | 1471-2105 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/61686 | - |
dc.description | B M C Bioinformatics | en_HK |
dc.language | eng | en_HK |
dc.publisher | BioMed Central Ltd. The Journal's web site is located at http://www.biomedcentral.com/bmcbioinformatics/ | en_HK |
dc.relation.ispartof | BMC Bioinformatics | en_HK |
dc.rights | B M C Bioinformatics. Copyright © BioMed Central Ltd. | en_HK |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.mesh | Algorithms | en_HK |
dc.subject.mesh | Animals | en_HK |
dc.subject.mesh | Cluster Analysis | en_HK |
dc.subject.mesh | Computational Biology - methods | en_HK |
dc.subject.mesh | Computer Simulation | en_HK |
dc.subject.mesh | False Positive Reactions | en_HK |
dc.subject.mesh | Genome | en_HK |
dc.subject.mesh | Humans | en_HK |
dc.subject.mesh | Mice | en_HK |
dc.subject.mesh | MicroRNAs - chemistry - genetics | en_HK |
dc.subject.mesh | Predictive Value of Tests | en_HK |
dc.subject.mesh | Rats | en_HK |
dc.subject.mesh | Software | en_HK |
dc.title | Filtering of false positive microRNA candidates by a clustering-based approach | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1471-2105&volume=9&issue=Supp 12&spage=S3&epage=&date=2008&atitle=Filtering+of+False+Positive+MicroRNA+Candidates+by+a+Clustering-based+Approach | en_HK |
dc.identifier.email | Lin, MCM:mcllin@hkucc.hku.hk | en_HK |
dc.identifier.email | Cheung, DW:dcheung@cs.hku.hk | en_HK |
dc.identifier.email | Yiu, SM:smyiu@cs.hku.hk | en_HK |
dc.identifier.authority | Lin, MCM=rp00746 | en_HK |
dc.identifier.authority | Cheung, DW=rp00101 | en_HK |
dc.identifier.authority | Yiu, SM=rp00207 | en_HK |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1186/1471-2105-9-S12-S3 | en_HK |
dc.identifier.pmid | 19091026 | - |
dc.identifier.pmcid | PMC2638143 | - |
dc.identifier.scopus | eid_2-s2.0-57649234892 | en_HK |
dc.identifier.hkuros | 154196 | en_HK |
dc.identifier.hkuros | 157535 | - |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-57649234892&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 9 | en_HK |
dc.identifier.issue | SUPPL. 12 | en_HK |
dc.identifier.isi | WOS:000262154300003 | - |
dc.publisher.place | United Kingdom | en_HK |
dc.identifier.scopusauthorid | Leung, WS=14322103600 | en_HK |
dc.identifier.scopusauthorid | Lin, MCM=7404816359 | en_HK |
dc.identifier.scopusauthorid | Cheung, DW=34567902600 | en_HK |
dc.identifier.scopusauthorid | Yiu, SM=7003282240 | en_HK |
dc.identifier.citeulike | 3874780 | - |
dc.identifier.issnl | 1471-2105 | - |