File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1093/bioinformatics/btr216
- Scopus: eid_2-s2.0-79959422558
- PMID: 21685107
- WOS: WOS:000291752600012
- Find via
Supplementary
-
Bookmarks:
- CiteULike: 14
- Citations:
- Appears in Collections:
Conference Paper: Meta-IDBA: A de Novo assembler for metagenomic data
Title | Meta-IDBA: A de Novo assembler for metagenomic data |
---|---|
Authors | |
Issue Date | 2011 |
Publisher | Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/ |
Citation | The 19th Annual International Conference on Intelligent Systems for Molecular Biology and 10th European Conference on Computational Biology (ISMB/ECCB 2011), Vienna, Austria, 17-19 july 2011. In Bioinformatics, 2011, v. 27 n. 13, p. i94-i101, article no. btr216 How to Cite? |
Abstract | Motivation: Next-generation sequencing techniques allow us to generate reads from a microbial environment in order to analyze the microbial community. However, assembling of a set of mixed reads from different species to form contigs is a bottleneck of metagenomic research. Although there are many assemblers for assembling reads from a single genome, there are no assemblers for assembling reads in metagenomic data without reference genome sequences. Moreover, the performances of these assemblers on metagenomic data are far from satisfactory, because of the existence of common regions in the genomes of subspecies and species, which make the assembly problem much more complicated. Results: We introduce the Meta-IDBA algorithm for assembling reads in metagenomic data, which contain multiple genomes from different species. There are two core steps in Meta-IDBA. It first tries to partition the de Bruijn graph into isolated components of different species based on an important observation. Then, for each component, it captures the slight variants of the genomes of subspecies from the same species by multiple alignments and represents the genome of one species, using a consensus sequence. Comparison of the performances of Meta-IDBA and existing assemblers, such as Velvet and Abyss for different metagenomic datasets shows that Meta-IDBA can reconstruct longer contigs with similar accuracy. © The Author(s) 2011. Published by Oxford University Press. |
Persistent Identifier | http://hdl.handle.net/10722/140006 |
ISSN | 2023 Impact Factor: 4.4 2023 SCImago Journal Rankings: 2.574 |
PubMed Central ID | |
ISI Accession Number ID | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Peng, Y | en_HK |
dc.contributor.author | Leung, HCM | en_HK |
dc.contributor.author | Yiu, SM | en_HK |
dc.contributor.author | Chin, FYL | en_HK |
dc.date.accessioned | 2011-09-23T06:04:37Z | - |
dc.date.available | 2011-09-23T06:04:37Z | - |
dc.date.issued | 2011 | en_HK |
dc.identifier.citation | The 19th Annual International Conference on Intelligent Systems for Molecular Biology and 10th European Conference on Computational Biology (ISMB/ECCB 2011), Vienna, Austria, 17-19 july 2011. In Bioinformatics, 2011, v. 27 n. 13, p. i94-i101, article no. btr216 | en_HK |
dc.identifier.issn | 1367-4803 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/140006 | - |
dc.description.abstract | Motivation: Next-generation sequencing techniques allow us to generate reads from a microbial environment in order to analyze the microbial community. However, assembling of a set of mixed reads from different species to form contigs is a bottleneck of metagenomic research. Although there are many assemblers for assembling reads from a single genome, there are no assemblers for assembling reads in metagenomic data without reference genome sequences. Moreover, the performances of these assemblers on metagenomic data are far from satisfactory, because of the existence of common regions in the genomes of subspecies and species, which make the assembly problem much more complicated. Results: We introduce the Meta-IDBA algorithm for assembling reads in metagenomic data, which contain multiple genomes from different species. There are two core steps in Meta-IDBA. It first tries to partition the de Bruijn graph into isolated components of different species based on an important observation. Then, for each component, it captures the slight variants of the genomes of subspecies from the same species by multiple alignments and represents the genome of one species, using a consensus sequence. Comparison of the performances of Meta-IDBA and existing assemblers, such as Velvet and Abyss for different metagenomic datasets shows that Meta-IDBA can reconstruct longer contigs with similar accuracy. © The Author(s) 2011. Published by Oxford University Press. | en_HK |
dc.language | eng | en_US |
dc.publisher | Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/ | en_HK |
dc.relation.ispartof | Bioinformatics | en_HK |
dc.subject.mesh | Algorithms | - |
dc.subject.mesh | Escherichia coli - classification - genetics - isolation and purification | - |
dc.subject.mesh | Genome, Bacterial | - |
dc.subject.mesh | Metagenomics - methods | - |
dc.subject.mesh | Software | - |
dc.title | Meta-IDBA: A de Novo assembler for metagenomic data | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Leung, HCM:cmleung2@cs.hku.hk | en_HK |
dc.identifier.email | Yiu, SM:smyiu@cs.hku.hk | en_HK |
dc.identifier.email | Chin, FYL:chin@cs.hku.hk | en_HK |
dc.identifier.authority | Leung, HCM=rp00144 | en_HK |
dc.identifier.authority | Yiu, SM=rp00207 | en_HK |
dc.identifier.authority | Chin, FYL=rp00105 | en_HK |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.doi | 10.1093/bioinformatics/btr216 | en_HK |
dc.identifier.pmid | 21685107 | - |
dc.identifier.pmcid | PMC3117360 | - |
dc.identifier.scopus | eid_2-s2.0-79959422558 | en_HK |
dc.identifier.hkuros | 196172 | en_US |
dc.identifier.hkuros | 187808 | - |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-79959422558&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 27 | en_HK |
dc.identifier.issue | 13 | en_HK |
dc.identifier.spage | i94 | en_HK |
dc.identifier.epage | i101 | en_HK |
dc.identifier.eissn | 1460-2059 | - |
dc.identifier.isi | WOS:000291752600012 | - |
dc.publisher.place | United Kingdom | en_HK |
dc.description.other | The 19th Annual International Conference on Intelligent Systems for Molecular Biology and 10th European Conference on Computational Biology (ISMB/ECCB 2011), Vienna, Austria, 17-19 july 2011. In Bioinformatics, 2011, v. 27 n. 13, p. i94-i101, article no. btr216 | - |
dc.identifier.scopusauthorid | Peng, Y=54393903900 | en_HK |
dc.identifier.scopusauthorid | Leung, HCM=35233742700 | en_HK |
dc.identifier.scopusauthorid | Yiu, SM=7003282240 | en_HK |
dc.identifier.scopusauthorid | Chin, FYL=7005101915 | en_HK |
dc.identifier.citeulike | 9424946 | - |