File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Hybrid-denovo: A de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags

TitleHybrid-denovo: A de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags
Authors
Keywords16S rRNA
Microbiome
OTU picking
Issue Date2018
Citation
GigaScience, 2018, v. 7, n. 3, p. 1-7 How to Cite?
AbstractBackground: Illumina paired-end sequencing has been increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. However, the reverse read (R2) often has significant low base quality, and a large proportion of R2s will be discarded after quality control, resulting in a mixture of paired-end and single-end reads. A typical 16S analysis pipeline usually processes either paired-end or single-end reads but not a mixture. Thus, the quantification accuracy and statistical power will be reduced due to the loss of a large amount of reads. As a result, rare taxa may not be detectable with the paired-end approach, or low taxonomic resolution will result in a single-end approach. Results: To have both the higher phylogenetic resolution provided by paired-end reads and the higher sequence coverage by single-end reads, we propose a novel OTU-picking pipeline, hybrid-denovo, that can process a hybrid of single-end and paired-end reads. Using high-quality paired-end reads as a gold standard, we show that hybrid-denovo achieved the highest correlation with the gold standard and performed better than the approaches based on paired-end or single-end reads in terms of quantifying the microbial diversity and taxonomic abundances. By applying our method to a rheumatoid arthritis (RA) data set, we demonstrated that hybrid-denovo captured more microbial diversity and identified more RA-associated taxa than a paired-end or single-end approach. Conclusions: Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data.
Persistent Identifierhttp://hdl.handle.net/10722/324500
PubMed Central ID
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorChen, Xianfeng-
dc.contributor.authorJohnson, Stephen-
dc.contributor.authorJeraldo, Patricio-
dc.contributor.authorWang, Junwen-
dc.contributor.authorChia, Nicholas-
dc.contributor.authorKocher, Jean Pierre A.-
dc.contributor.authorChen, Jun-
dc.date.accessioned2023-02-03T07:03:30Z-
dc.date.available2023-02-03T07:03:30Z-
dc.date.issued2018-
dc.identifier.citationGigaScience, 2018, v. 7, n. 3, p. 1-7-
dc.identifier.urihttp://hdl.handle.net/10722/324500-
dc.description.abstractBackground: Illumina paired-end sequencing has been increasingly popular for 16S rRNA gene-based microbiota profiling. It provides higher phylogenetic resolution than single-end reads due to a longer read length. However, the reverse read (R2) often has significant low base quality, and a large proportion of R2s will be discarded after quality control, resulting in a mixture of paired-end and single-end reads. A typical 16S analysis pipeline usually processes either paired-end or single-end reads but not a mixture. Thus, the quantification accuracy and statistical power will be reduced due to the loss of a large amount of reads. As a result, rare taxa may not be detectable with the paired-end approach, or low taxonomic resolution will result in a single-end approach. Results: To have both the higher phylogenetic resolution provided by paired-end reads and the higher sequence coverage by single-end reads, we propose a novel OTU-picking pipeline, hybrid-denovo, that can process a hybrid of single-end and paired-end reads. Using high-quality paired-end reads as a gold standard, we show that hybrid-denovo achieved the highest correlation with the gold standard and performed better than the approaches based on paired-end or single-end reads in terms of quantifying the microbial diversity and taxonomic abundances. By applying our method to a rheumatoid arthritis (RA) data set, we demonstrated that hybrid-denovo captured more microbial diversity and identified more RA-associated taxa than a paired-end or single-end approach. Conclusions: Hybrid-denovo utilizes both paired-end and single-end 16S sequencing reads and is recommended for 16S rRNA gene targeted paired-end sequencing data.-
dc.languageeng-
dc.relation.ispartofGigaScience-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject16S rRNA-
dc.subjectMicrobiome-
dc.subjectOTU picking-
dc.titleHybrid-denovo: A de novo OTU-picking pipeline integrating single-end and paired-end 16S sequence tags-
dc.typeArticle-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1093/gigascience/gix129-
dc.identifier.pmid29267858-
dc.identifier.pmcidPMC5841375-
dc.identifier.scopuseid_2-s2.0-85045207122-
dc.identifier.volume7-
dc.identifier.issue3-
dc.identifier.spage1-
dc.identifier.epage7-
dc.identifier.eissn2047-217X-
dc.identifier.isiWOS:000427171400001-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats