File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Appears in Collections:
Conference Paper: A new machine-learning based method to accurately assess copy number variants from whole genome sequencing data and its application on the analysis of the Hirschsprung disease genome
Title | A new machine-learning based method to accurately assess copy number variants from whole genome sequencing data and its application on the analysis of the Hirschsprung disease genome |
---|---|
Authors | |
Keywords | Copy number/structural variation Gastrointestinal system Genome-wide association Identification of disease genes Bioinformatics |
Issue Date | 2019 |
Publisher | American Society of Human Genetics. |
Citation | American Society of Human Genetics (ASHG) 2019 Annual Meeting, Houston, USA, 15-19 October 2019 How to Cite? |
Abstract | Copy Number Variants (CNVs) are defined as DNA segments larger than 50 base-pairs with copy number changes. CNVs discovery is essential for uncovering genes/risk factors for a wide range of diseases, including Hirschsprung disease(HSCR; colon aganglionosis). Incidentally, gross deletions encompassing RETor EDNRBlead to the discovery of these two main HSCR genes. Importantly, our previous CNV analysis of HSCR patients identified the association of CNVs encompassing NRG3with HSCR, therefore vindicating a role for CNVs in HSCR.
As CNV calling is still a challenge, we have devised and developed a new machine-learning based method CNV-JACG to accurately assess CNVs. Initially, in order to get the most comprehensive CNVs, we used four complementary CNV discovery methods namely CNVnator, Delly, Lumpy and Seeksv to call CNVs. We used 9 in-house trios to produce training dataset, 11 pairs of in-house duplicated samples for validation, and 2 trios from 1000 Genomes Project for evaluation. For deletions, after CNV-JACG assessment, the concordance between each pair of 11 duplicated samples increases from 63% to 88%, and the Mendelian inconsistent rate of the 2 trios decreases from 25% to 6%. In the benchmark sample NA12878, 84.7% of CNV-JACG predicted CNVs are consistent with previously published result.
CNV-JACG has been used for the CNV analysis of whole genome sequencing data (pair-end 150bp, ~30X) generated from 443 HSCR patients and 493 matched controls. Here we will present the performance of our new program together with the genome-wide association results. |
Description | Poster presentation - Session: Bioinformatics and Computational Approaches - no. PgmNr 1470 |
Persistent Identifier | http://hdl.handle.net/10722/284264 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhuang, X | - |
dc.contributor.author | Ye, R | - |
dc.contributor.author | Sham, PC | - |
dc.contributor.author | Garcia-Barcelo, MM | - |
dc.contributor.author | Tang, SM | - |
dc.contributor.author | Tam, PKH | - |
dc.date.accessioned | 2020-07-20T05:57:21Z | - |
dc.date.available | 2020-07-20T05:57:21Z | - |
dc.date.issued | 2019 | - |
dc.identifier.citation | American Society of Human Genetics (ASHG) 2019 Annual Meeting, Houston, USA, 15-19 October 2019 | - |
dc.identifier.uri | http://hdl.handle.net/10722/284264 | - |
dc.description | Poster presentation - Session: Bioinformatics and Computational Approaches - no. PgmNr 1470 | - |
dc.description.abstract | Copy Number Variants (CNVs) are defined as DNA segments larger than 50 base-pairs with copy number changes. CNVs discovery is essential for uncovering genes/risk factors for a wide range of diseases, including Hirschsprung disease(HSCR; colon aganglionosis). Incidentally, gross deletions encompassing RETor EDNRBlead to the discovery of these two main HSCR genes. Importantly, our previous CNV analysis of HSCR patients identified the association of CNVs encompassing NRG3with HSCR, therefore vindicating a role for CNVs in HSCR. As CNV calling is still a challenge, we have devised and developed a new machine-learning based method CNV-JACG to accurately assess CNVs. Initially, in order to get the most comprehensive CNVs, we used four complementary CNV discovery methods namely CNVnator, Delly, Lumpy and Seeksv to call CNVs. We used 9 in-house trios to produce training dataset, 11 pairs of in-house duplicated samples for validation, and 2 trios from 1000 Genomes Project for evaluation. For deletions, after CNV-JACG assessment, the concordance between each pair of 11 duplicated samples increases from 63% to 88%, and the Mendelian inconsistent rate of the 2 trios decreases from 25% to 6%. In the benchmark sample NA12878, 84.7% of CNV-JACG predicted CNVs are consistent with previously published result. CNV-JACG has been used for the CNV analysis of whole genome sequencing data (pair-end 150bp, ~30X) generated from 443 HSCR patients and 493 matched controls. Here we will present the performance of our new program together with the genome-wide association results. | - |
dc.language | eng | - |
dc.publisher | American Society of Human Genetics. | - |
dc.relation.ispartof | American Society of Human Genetics (ASHG) 2019 Annual Meeting | - |
dc.subject | Copy number/structural variation | - |
dc.subject | Gastrointestinal system | - |
dc.subject | Genome-wide association | - |
dc.subject | Identification of disease genes | - |
dc.subject | Bioinformatics | - |
dc.title | A new machine-learning based method to accurately assess copy number variants from whole genome sequencing data and its application on the analysis of the Hirschsprung disease genome | - |
dc.type | Conference_Paper | - |
dc.identifier.email | Sham, PC: pcsham@hku.hk | - |
dc.identifier.email | Garcia-Barcelo, MM: mmgarcia@hku.hk | - |
dc.identifier.email | Tang, SM: claratang@hku.hk | - |
dc.identifier.email | Tam, PKH: paultam@hku.hk | - |
dc.identifier.authority | Sham, PC=rp00459 | - |
dc.identifier.authority | Garcia-Barcelo, MM=rp00445 | - |
dc.identifier.authority | Tang, SM=rp02105 | - |
dc.identifier.authority | Tam, PKH=rp00060 | - |
dc.identifier.hkuros | 311104 | - |
dc.publisher.place | United States | - |