Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies

So, HC; Yip, BHK; Sham, PC

File Download

pone.0013898.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1371/journal.pone.0013898
Scopus: eid_2-s2.0-78649512740
PMID: 21103334
WOS: WOS:000284327800002
Find via

Supplementary

Bookmarks:
- CiteULike: 2
Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Psychiatry: Journal/Magazine Articles
- Centre for Genomic Sciences: Journal/Magazine Articles

Article: Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies

Title

Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies

Authors

So, HC Yip, BHK Sham, PC

Issue Date

2010

Publisher

Public Library of Science. The Journal's web site is located at http://www.plosone.org/home.action

Citation

Plos One, 2010, v. 5 n. 11 How to Cite?

DOI: http://dx.doi.org/10.1371/journal.pone.0013898

Abstract

Recently genome-wide association studies (GWAS) have identified numerous susceptibility variants for complex diseases. In this study we proposed several approaches to estimate the total number of variants underlying these diseases. We assume that the variance explained by genetic markers (Vg) follow an exponential distribution, which is justified by previous studies on theories of adaptation. Our aim is to fit the observed distribution of Vg from GWAS to its theoretical distribution. The number of variants is obtained by the heritability divided by the estimated mean of the exponential distribution. In practice, due to limited sample sizes, there is insufficient power to detect variants with small effects. Therefore the power was taken into account in fitting. Besides considering the most significant variants, we also tried to relax the significance threshold, allowing more markers to be fitted. The effects of false positive variants were removed by considering the local false discovery rates. In addition, we developed an alternative approach by directly fitting the z-statistics from GWAS to its theoretical distribution. In all cases, the ''winner's curse'' effect was corrected analytically. Confidence intervals were also derived. Simulations were performed to compare and verify the performance of different estimators (which incorporates various means of winner's curse correction) and the coverage of the proposed analytic confidence intervals. Our methodology only requires summary statistics and is able to handle both binary and continuous traits. Finally we applied the methods to a few real disease examples (lipid traits, type 2 diabetes and Crohn's disease) and estimated that hundreds to nearly a thousand variants underlie these traits. © 2010 So et al.

Persistent Identifier

http://hdl.handle.net/10722/137517

ISSN

1932-6203

2023 Impact Factor: 2.9

2023 SCImago Journal Rankings: 0.839

PubMed Central ID

PMC2984437

ISI Accession Number ID

WOS:000284327800002

Funding Agency	Grant Number
Hong Kong Research Grants Council	HKU 766906M HKU 774707M
University of Hong Kong Strategic Research Theme of Genomics
Croucher Foundation

Funding Information:

The work was supported by the Hong Kong Research Grants Council General Research Fund HKU 766906M and HKU 774707M and the University of Hong Kong Strategic Research Theme of Genomics. HCS was supported by a Croucher Foundation Scholarship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

References in Scopus

Grants

Genome-wide association study of schizophrenia

DC Field	Value	Language
dc.contributor.author	So, HC	en_HK
dc.contributor.author	Yip, BHK	en_HK
dc.contributor.author	Sham, PC	en_HK
dc.date.accessioned	2011-08-26T14:26:53Z	-
dc.date.available	2011-08-26T14:26:53Z	-
dc.date.issued	2010	en_HK
dc.identifier.citation	Plos One, 2010, v. 5 n. 11	en_HK
dc.identifier.issn	1932-6203	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/137517	-
dc.description.abstract	Recently genome-wide association studies (GWAS) have identified numerous susceptibility variants for complex diseases. In this study we proposed several approaches to estimate the total number of variants underlying these diseases. We assume that the variance explained by genetic markers (Vg) follow an exponential distribution, which is justified by previous studies on theories of adaptation. Our aim is to fit the observed distribution of Vg from GWAS to its theoretical distribution. The number of variants is obtained by the heritability divided by the estimated mean of the exponential distribution. In practice, due to limited sample sizes, there is insufficient power to detect variants with small effects. Therefore the power was taken into account in fitting. Besides considering the most significant variants, we also tried to relax the significance threshold, allowing more markers to be fitted. The effects of false positive variants were removed by considering the local false discovery rates. In addition, we developed an alternative approach by directly fitting the z-statistics from GWAS to its theoretical distribution. In all cases, the ''winner's curse'' effect was corrected analytically. Confidence intervals were also derived. Simulations were performed to compare and verify the performance of different estimators (which incorporates various means of winner's curse correction) and the coverage of the proposed analytic confidence intervals. Our methodology only requires summary statistics and is able to handle both binary and continuous traits. Finally we applied the methods to a few real disease examples (lipid traits, type 2 diabetes and Crohn's disease) and estimated that hundreds to nearly a thousand variants underlie these traits. © 2010 So et al.	en_HK
dc.language	eng	en_US
dc.publisher	Public Library of Science. The Journal's web site is located at http://www.plosone.org/home.action	en_HK
dc.relation.ispartof	PLoS ONE	en_HK
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.mesh	Algorithms	-
dc.subject.mesh	Diabetes Mellitus, Type 2 - genetics	-
dc.subject.mesh	Genetic Predisposition to Disease - genetics	-
dc.subject.mesh	Genome-Wide Association Study - methods	-
dc.subject.mesh	Polymorphism, Single Nucleotide	-
dc.title	Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies	en_HK
dc.type	Article	en_HK
dc.identifier.email	Sham, PC: pcsham@hku.hk	en_HK
dc.identifier.authority	Sham, PC=rp00459	en_HK
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1371/journal.pone.0013898	en_HK
dc.identifier.pmid	21103334	-
dc.identifier.pmcid	PMC2984437	-
dc.identifier.scopus	eid_2-s2.0-78649512740	en_HK
dc.identifier.hkuros	189825	en_US
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-78649512740&selection=ref&src=s&origin=recordpage	en_HK
dc.identifier.volume	5	en_HK
dc.identifier.issue	11	en_HK
dc.identifier.spage	e13898	en_US
dc.identifier.epage	e13898	en_US
dc.identifier.isi	WOS:000284327800002	-
dc.publisher.place	United States	en_HK
dc.relation.project	Genome-wide association study of schizophrenia	-
dc.identifier.scopusauthorid	So, HC=37031934700	en_HK
dc.identifier.scopusauthorid	Yip, BHK=16685586100	en_HK
dc.identifier.scopusauthorid	Sham, PC=34573429300	en_HK
dc.identifier.citeulike	8691792	-
dc.identifier.issnl	1932-6203	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats