Meta-IDBA: A de Novo assembler for metagenomic data

Peng, Y; Leung, HCM; Yiu, SM; Chin, FYL

File Download

re01.htm

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1093/bioinformatics/btr216
Scopus: eid_2-s2.0-79959422558
PMID: 21685107
WOS: WOS:000291752600012
Find via

Supplementary

Bookmarks:
- CiteULike: 14
Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Computer Science: Conference papers

See more details

Conference Paper: Meta-IDBA: A de Novo assembler for metagenomic data

Title	Meta-IDBA: A de Novo assembler for metagenomic data
Authors	Peng, Y Leung, HCM Yiu, SM Chin, FYL
Issue Date	2011
Publisher	Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/
Citation	The 19th Annual International Conference on Intelligent Systems for Molecular Biology and 10th European Conference on Computational Biology (ISMB/ECCB 2011), Vienna, Austria, 17-19 july 2011. In Bioinformatics, 2011, v. 27 n. 13, p. i94-i101, article no. btr216 How to Cite? DOI: http://dx.doi.org/10.1093/bioinformatics/btr216
Abstract	Motivation: Next-generation sequencing techniques allow us to generate reads from a microbial environment in order to analyze the microbial community. However, assembling of a set of mixed reads from different species to form contigs is a bottleneck of metagenomic research. Although there are many assemblers for assembling reads from a single genome, there are no assemblers for assembling reads in metagenomic data without reference genome sequences. Moreover, the performances of these assemblers on metagenomic data are far from satisfactory, because of the existence of common regions in the genomes of subspecies and species, which make the assembly problem much more complicated. Results: We introduce the Meta-IDBA algorithm for assembling reads in metagenomic data, which contain multiple genomes from different species. There are two core steps in Meta-IDBA. It first tries to partition the de Bruijn graph into isolated components of different species based on an important observation. Then, for each component, it captures the slight variants of the genomes of subspecies from the same species by multiple alignments and represents the genome of one species, using a consensus sequence. Comparison of the performances of Meta-IDBA and existing assemblers, such as Velvet and Abyss for different metagenomic datasets shows that Meta-IDBA can reconstruct longer contigs with similar accuracy. © The Author(s) 2011. Published by Oxford University Press.
Persistent Identifier	http://hdl.handle.net/10722/140006
ISSN	1367-4803 2023 Impact Factor: 4.4 2023 SCImago Journal Rankings: 2.574
PubMed Central ID	PMC3117360
ISI Accession Number ID	WOS:000291752600012
References	References in Scopus

DC Field	Value	Language
dc.contributor.author	Peng, Y	en_HK
dc.contributor.author	Leung, HCM	en_HK
dc.contributor.author	Yiu, SM	en_HK
dc.contributor.author	Chin, FYL	en_HK
dc.date.accessioned	2011-09-23T06:04:37Z	-
dc.date.available	2011-09-23T06:04:37Z	-
dc.date.issued	2011	en_HK
dc.identifier.citation	The 19th Annual International Conference on Intelligent Systems for Molecular Biology and 10th European Conference on Computational Biology (ISMB/ECCB 2011), Vienna, Austria, 17-19 july 2011. In Bioinformatics, 2011, v. 27 n. 13, p. i94-i101, article no. btr216	en_HK
dc.identifier.issn	1367-4803	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/140006	-
dc.description.abstract	Motivation: Next-generation sequencing techniques allow us to generate reads from a microbial environment in order to analyze the microbial community. However, assembling of a set of mixed reads from different species to form contigs is a bottleneck of metagenomic research. Although there are many assemblers for assembling reads from a single genome, there are no assemblers for assembling reads in metagenomic data without reference genome sequences. Moreover, the performances of these assemblers on metagenomic data are far from satisfactory, because of the existence of common regions in the genomes of subspecies and species, which make the assembly problem much more complicated. Results: We introduce the Meta-IDBA algorithm for assembling reads in metagenomic data, which contain multiple genomes from different species. There are two core steps in Meta-IDBA. It first tries to partition the de Bruijn graph into isolated components of different species based on an important observation. Then, for each component, it captures the slight variants of the genomes of subspecies from the same species by multiple alignments and represents the genome of one species, using a consensus sequence. Comparison of the performances of Meta-IDBA and existing assemblers, such as Velvet and Abyss for different metagenomic datasets shows that Meta-IDBA can reconstruct longer contigs with similar accuracy. © The Author(s) 2011. Published by Oxford University Press.	en_HK
dc.language	eng	en_US
dc.publisher	Oxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/	en_HK
dc.relation.ispartof	Bioinformatics	en_HK
dc.subject.mesh	Algorithms	-
dc.subject.mesh	Escherichia coli - classification - genetics - isolation and purification	-
dc.subject.mesh	Genome, Bacterial	-
dc.subject.mesh	Metagenomics - methods	-
dc.subject.mesh	Software	-
dc.title	Meta-IDBA: A de Novo assembler for metagenomic data	en_HK
dc.type	Conference_Paper	en_HK
dc.identifier.email	Leung, HCM:cmleung2@cs.hku.hk	en_HK
dc.identifier.email	Yiu, SM:smyiu@cs.hku.hk	en_HK
dc.identifier.email	Chin, FYL:chin@cs.hku.hk	en_HK
dc.identifier.authority	Leung, HCM=rp00144	en_HK
dc.identifier.authority	Yiu, SM=rp00207	en_HK
dc.identifier.authority	Chin, FYL=rp00105	en_HK
dc.description.nature	link_to_OA_fulltext	-
dc.identifier.doi	10.1093/bioinformatics/btr216	en_HK
dc.identifier.pmid	21685107	-
dc.identifier.pmcid	PMC3117360	-
dc.identifier.scopus	eid_2-s2.0-79959422558	en_HK
dc.identifier.hkuros	196172	en_US
dc.identifier.hkuros	187808	-
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-79959422558&selection=ref&src=s&origin=recordpage	en_HK
dc.identifier.volume	27	en_HK
dc.identifier.issue	13	en_HK
dc.identifier.spage	i94	en_HK
dc.identifier.epage	i101	en_HK
dc.identifier.eissn	1460-2059	-
dc.identifier.isi	WOS:000291752600012	-
dc.publisher.place	United Kingdom	en_HK
dc.description.other	The 19th Annual International Conference on Intelligent Systems for Molecular Biology and 10th European Conference on Computational Biology (ISMB/ECCB 2011), Vienna, Austria, 17-19 july 2011. In Bioinformatics, 2011, v. 27 n. 13, p. i94-i101, article no. btr216	-
dc.identifier.scopusauthorid	Peng, Y=54393903900	en_HK
dc.identifier.scopusauthorid	Leung, HCM=35233742700	en_HK
dc.identifier.scopusauthorid	Yiu, SM=7003282240	en_HK
dc.identifier.scopusauthorid	Chin, FYL=7005101915	en_HK
dc.identifier.citeulike	9424946	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Meta-IDBA: A de Novo assembler for metagenomic data

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats