File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: A random forest-based framework for genotyping and accuracy assessment of copy number variations

TitleA random forest-based framework for genotyping and accuracy assessment of copy number variations
Authors
Issue Date2020
PublisherOxford University Press: Open Access Journals. The Journal's web site is located at https://academic.oup.com/nargab
Citation
NAR Genomics and Bioinformatics, 2020, v. 2 n. 3, p. article no. lqaa071 How to Cite?
AbstractDetection of copy number variations (CNVs) is essential for uncovering genetic factors underlying human diseases. However, CNV detection by current methods is prone to error, and precisely identifying CNVs from paired-end whole genome sequencing (WGS) data is still challenging. Here, we present a framework, CNV-JACG, for Judging the Accuracy of CNVs and Genotyping using paired-end WGS data. CNV-JACG is based on a random forest model trained on 21 distinctive features characterizing the CNV region and its breakpoints. Using the data from the 1000 Genomes Project, Genome in a Bottle Consortium, the Human Genome Structural Variation Consortium and in-house technical replicates, we show that CNV-JACG has superior sensitivity over the latest genotyping method, SV2, particularly for the small CNVs (≤1 kb). We also demonstrate that CNV-JACG outperforms SV2 in terms of Mendelian inconsistency in trios and concordance between technical replicates. Our study suggests that CNV-JACG would be a useful tool in assessing the accuracy of CNVs to meet the ever-growing needs for uncovering the missing heritability linked to CNVs.
Persistent Identifierhttp://hdl.handle.net/10722/295279
ISSN
PubMed Central ID
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorZHUANG, X-
dc.contributor.authorYE, R-
dc.contributor.authorSo, MT-
dc.contributor.authorLam, WY-
dc.contributor.authorKARIM, A-
dc.contributor.authorYu, M-
dc.contributor.authorNgo, ND-
dc.contributor.authorCherny, SS-
dc.contributor.authorTam, PKH-
dc.contributor.authorGarcia-Barcelo, MM-
dc.contributor.authorTang, CSM-
dc.contributor.authorSham, PC-
dc.date.accessioned2021-01-11T13:57:52Z-
dc.date.available2021-01-11T13:57:52Z-
dc.date.issued2020-
dc.identifier.citationNAR Genomics and Bioinformatics, 2020, v. 2 n. 3, p. article no. lqaa071-
dc.identifier.issn2631-9268-
dc.identifier.urihttp://hdl.handle.net/10722/295279-
dc.description.abstractDetection of copy number variations (CNVs) is essential for uncovering genetic factors underlying human diseases. However, CNV detection by current methods is prone to error, and precisely identifying CNVs from paired-end whole genome sequencing (WGS) data is still challenging. Here, we present a framework, CNV-JACG, for Judging the Accuracy of CNVs and Genotyping using paired-end WGS data. CNV-JACG is based on a random forest model trained on 21 distinctive features characterizing the CNV region and its breakpoints. Using the data from the 1000 Genomes Project, Genome in a Bottle Consortium, the Human Genome Structural Variation Consortium and in-house technical replicates, we show that CNV-JACG has superior sensitivity over the latest genotyping method, SV2, particularly for the small CNVs (≤1 kb). We also demonstrate that CNV-JACG outperforms SV2 in terms of Mendelian inconsistency in trios and concordance between technical replicates. Our study suggests that CNV-JACG would be a useful tool in assessing the accuracy of CNVs to meet the ever-growing needs for uncovering the missing heritability linked to CNVs.-
dc.languageeng-
dc.publisherOxford University Press: Open Access Journals. The Journal's web site is located at https://academic.oup.com/nargab-
dc.relation.ispartofNAR Genomics and Bioinformatics-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.titleA random forest-based framework for genotyping and accuracy assessment of copy number variations-
dc.typeArticle-
dc.identifier.emailSo, MT: jaymtso@hku.hk-
dc.identifier.emailLam, WY: wyslam@hku.hk-
dc.identifier.emailCherny, SS: cherny@hku.hk-
dc.identifier.emailTam, PKH: paultam@hku.hk-
dc.identifier.emailGarcia-Barcelo, MM: mmgarcia@hku.hk-
dc.identifier.emailTang, CSM: claratang@hku.hk-
dc.identifier.emailSham, PC: pcsham@hku.hk-
dc.identifier.authorityCherny, SS=rp00232-
dc.identifier.authorityTam, PKH=rp00060-
dc.identifier.authorityGarcia-Barcelo, MM=rp00445-
dc.identifier.authorityTang, CSM=rp02105-
dc.identifier.authoritySham, PC=rp00459-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1093/nargab/lqaa071-
dc.identifier.pmid33575619-
dc.identifier.pmcidPMC7671382-
dc.identifier.scopuseid_2-s2.0-85113271334-
dc.identifier.hkuros320790-
dc.identifier.volume2-
dc.identifier.issue3-
dc.identifier.spagearticle no. lqaa071-
dc.identifier.epagearticle no. lqaa071-
dc.identifier.isiWOS:000645609500013-
dc.publisher.placeUnited Kingdom-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats