File Download
Supplementary

Conference Paper: A powerful approach to efficiently combine p-values for gene-based association analysis

TitleA powerful approach to efficiently combine p-values for gene-based association analysis
Authors
Issue Date2010
PublisherThe American Society of Human Genetics.
Citation
The 60th Annual Meeting of the American Society of Human Genetics (ASHG 2010), Washington D.C., 2-6 November 2010. How to Cite?
AbstractShifting from Single-nucleotide polymorphism (SNP)-based association analysis to gene-based analysis may be a promising solution for increasing power of genome-wide association studies (GWAS). Methodology for efficient evaluation of gene-wise association significance is still far from developed and does not suffice for the demand to handle large genome-wide dataset. Available methods resort to time-consuming permutation to account for the gene-size and linkage disequilibrium problems. We proposed a powerful association approach to efficiently combine p-values of SNPs within a gene to produce a gene-based p-value. This method employed the gene as an analysis unit. The p-value is used to evaluate gene-level association significance. We theoretically proved that the gene-based p-value is uniformly distributed U(0,1) under null hypothesis for independent SNPs. More importantly, this method could easily integrate prior weights of SNPs. Through a series of computer simulation, we demonstrated that the gene-based method was more powerful than the original SNP-based test and immune to gene size and linkage disequilibrium between markers. Moreover, our test outperformed three other alternatives including Bonferroni correction, Sidak combination test and Fisher combination test. It was even more powerful than the Logistic regression in some situations. In a test, this method (after implemented) only spent less than 1 minute in performing a genome-wide gene-based scan for 2,543,885 SNPs on an ordinary desktop computer, Intel Core™ 2 CPU 2.66GHz, RAM 1.97GB, and 32-bit Windows XP™ Professional Version 2002. To conceptually assess its performance in real data, we applied it to re-analyze three datasets of published GWAS. Our gene-based association method reported 6 more significant genes than the SNP-level association method did (FDR=0.005) in a GWAS dataset for Crohn's disease. Among the 6 genes, the two were convincingly replicated in independent samples conducted the original studies. It reported 4 more significant genes (FDR=0.01) than the SNP-level association test in GWAS for Psoriasis and Schizophrenia respectively. In conclusion, we provided a powerful method to efficient evaluate the gene-wise association significance. This method is expected to identify more disease susceptibility genes of complex diseases and will also facilitate advanced bioinformatics analysis like pathway and protein-protein interaction network enrichment analyses.
DescriptionPoster Presentation: abstract 2815/W
Persistent Identifierhttp://hdl.handle.net/10722/136024

 

DC FieldValueLanguage
dc.contributor.authorLi, Men_US
dc.contributor.authorSham, PC-
dc.contributor.authorCherny, SS-
dc.date.accessioned2011-07-27T02:01:42Z-
dc.date.available2011-07-27T02:01:42Z-
dc.date.issued2010en_US
dc.identifier.citationThe 60th Annual Meeting of the American Society of Human Genetics (ASHG 2010), Washington D.C., 2-6 November 2010.en_US
dc.identifier.urihttp://hdl.handle.net/10722/136024-
dc.descriptionPoster Presentation: abstract 2815/W-
dc.description.abstractShifting from Single-nucleotide polymorphism (SNP)-based association analysis to gene-based analysis may be a promising solution for increasing power of genome-wide association studies (GWAS). Methodology for efficient evaluation of gene-wise association significance is still far from developed and does not suffice for the demand to handle large genome-wide dataset. Available methods resort to time-consuming permutation to account for the gene-size and linkage disequilibrium problems. We proposed a powerful association approach to efficiently combine p-values of SNPs within a gene to produce a gene-based p-value. This method employed the gene as an analysis unit. The p-value is used to evaluate gene-level association significance. We theoretically proved that the gene-based p-value is uniformly distributed U(0,1) under null hypothesis for independent SNPs. More importantly, this method could easily integrate prior weights of SNPs. Through a series of computer simulation, we demonstrated that the gene-based method was more powerful than the original SNP-based test and immune to gene size and linkage disequilibrium between markers. Moreover, our test outperformed three other alternatives including Bonferroni correction, Sidak combination test and Fisher combination test. It was even more powerful than the Logistic regression in some situations. In a test, this method (after implemented) only spent less than 1 minute in performing a genome-wide gene-based scan for 2,543,885 SNPs on an ordinary desktop computer, Intel Core™ 2 CPU 2.66GHz, RAM 1.97GB, and 32-bit Windows XP™ Professional Version 2002. To conceptually assess its performance in real data, we applied it to re-analyze three datasets of published GWAS. Our gene-based association method reported 6 more significant genes than the SNP-level association method did (FDR=0.005) in a GWAS dataset for Crohn's disease. Among the 6 genes, the two were convincingly replicated in independent samples conducted the original studies. It reported 4 more significant genes (FDR=0.01) than the SNP-level association test in GWAS for Psoriasis and Schizophrenia respectively. In conclusion, we provided a powerful method to efficient evaluate the gene-wise association significance. This method is expected to identify more disease susceptibility genes of complex diseases and will also facilitate advanced bioinformatics analysis like pathway and protein-protein interaction network enrichment analyses.-
dc.languageengen_US
dc.publisherThe American Society of Human Genetics.-
dc.relation.ispartofAnnual Meeting of the American Society of Human Genetics, ASHG 2010,en_US
dc.titleA powerful approach to efficiently combine p-values for gene-based association analysisen_US
dc.typeConference_Paperen_US
dc.identifier.emailSham, PC: pcsham@hku.hken_US
dc.identifier.emailCherny, SS: cherny@hku.hk-
dc.identifier.authoritySham, PC=rp00459en_US
dc.description.naturelink_to_OA_fulltext-
dc.identifier.hkuros188444en_US
dc.publisher.placeUnited States-
dc.description.otherThe 60th Annual Meeting of the American Society of Human Genetics (ASHG 2010), Washington D.C., 2-6 November 2010.-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats