A genetic ensemble approach for gene-gene interaction identification

Yang, Pengyi; Ho, Joshua W.K.; Zomaya, Albert Y.; Zhou, Bing B.

File Download

Content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1186/1471-2105-11-524
Scopus: eid_2-s2.0-77958048688
PMID: 20961462
WOS: WOS:000283844800001

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Biomedical Sciences: Journal/Magazine Articles

Article: A genetic ensemble approach for gene-gene interaction identification

Title	A genetic ensemble approach for gene-gene interaction identification
Authors	Yang, Pengyi Ho, Joshua W.K.Zomaya, Albert Y.Zhou, Bing B.
Issue Date	2010
Citation	BMC Bioinformatics, 2010, v. 11 How to Cite? DOI: http://dx.doi.org/10.1186/1471-2105-11-524
Abstract	Background: It has now become clear that gene-gene interactions and gene-environment interactions are ubiquitous and fundamental mechanisms for the development of complex diseases. Though a considerable effort has been put into developing statistical models and algorithmic strategies for identifying such interactions, the accurate identification of those genetic interactions has been proven to be very challenging.Methods: In this paper, we propose a new approach for identifying such gene-gene and gene-environment interactions underlying complex diseases. This is a hybrid algorithm and it combines genetic algorithm (GA) and an ensemble of classifiers (called genetic ensemble). Using this approach, the original problem of SNP interaction identification is converted into a data mining problem of combinatorial feature selection. By collecting various single nucleotide polymorphisms (SNP) subsets as well as environmental factors generated in multiple GA runs, patterns of gene-gene and gene-environment interactions can be extracted using a simple combinatorial ranking method. Also considered in this study is the idea of combining identification results obtained from multiple algorithms. A novel formula based on pairwise double fault is designed to quantify the degree of complementarity.Conclusions: Our simulation study demonstrates that the proposed genetic ensemble algorithm has comparable identification power to Multifactor Dimensionality Reduction (MDR) and is slightly better than Polymorphism Interaction Analysis (PIA), which are the two most popular methods for gene-gene interaction identification. More importantly, the identification results generated by using our genetic ensemble algorithm are highly complementary to those obtained by PIA and MDR. Experimental results from our simulation studies and real world data application also confirm the effectiveness of the proposed genetic ensemble algorithm, as well as the potential benefits of combining identification results from different algorithms. © 2010 Yang et al; licensee BioMed Central Ltd.
Persistent Identifier	http://hdl.handle.net/10722/262635
ISI Accession Number ID	WOS:000283844800001

DC Field	Value	Language
dc.contributor.author	Yang, Pengyi	-
dc.contributor.author	Ho, Joshua W.K.	-
dc.contributor.author	Zomaya, Albert Y.	-
dc.contributor.author	Zhou, Bing B.	-
dc.date.accessioned	2018-10-08T02:46:35Z	-
dc.date.available	2018-10-08T02:46:35Z	-
dc.date.issued	2010	-
dc.identifier.citation	BMC Bioinformatics, 2010, v. 11	-
dc.identifier.uri	http://hdl.handle.net/10722/262635	-
dc.description.abstract	Background: It has now become clear that gene-gene interactions and gene-environment interactions are ubiquitous and fundamental mechanisms for the development of complex diseases. Though a considerable effort has been put into developing statistical models and algorithmic strategies for identifying such interactions, the accurate identification of those genetic interactions has been proven to be very challenging.Methods: In this paper, we propose a new approach for identifying such gene-gene and gene-environment interactions underlying complex diseases. This is a hybrid algorithm and it combines genetic algorithm (GA) and an ensemble of classifiers (called genetic ensemble). Using this approach, the original problem of SNP interaction identification is converted into a data mining problem of combinatorial feature selection. By collecting various single nucleotide polymorphisms (SNP) subsets as well as environmental factors generated in multiple GA runs, patterns of gene-gene and gene-environment interactions can be extracted using a simple combinatorial ranking method. Also considered in this study is the idea of combining identification results obtained from multiple algorithms. A novel formula based on pairwise double fault is designed to quantify the degree of complementarity.Conclusions: Our simulation study demonstrates that the proposed genetic ensemble algorithm has comparable identification power to Multifactor Dimensionality Reduction (MDR) and is slightly better than Polymorphism Interaction Analysis (PIA), which are the two most popular methods for gene-gene interaction identification. More importantly, the identification results generated by using our genetic ensemble algorithm are highly complementary to those obtained by PIA and MDR. Experimental results from our simulation studies and real world data application also confirm the effectiveness of the proposed genetic ensemble algorithm, as well as the potential benefits of combining identification results from different algorithms. © 2010 Yang et al; licensee BioMed Central Ltd.	-
dc.language	eng	-
dc.relation.ispartof	BMC Bioinformatics	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.title	A genetic ensemble approach for gene-gene interaction identification	-
dc.type	Article	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1186/1471-2105-11-524	-
dc.identifier.pmid	20961462	-
dc.identifier.scopus	eid_2-s2.0-77958048688	-
dc.identifier.volume	11	-
dc.identifier.spage	null	-
dc.identifier.epage	null	-
dc.identifier.eissn	1471-2105	-
dc.identifier.isi	WOS:000283844800001	-
dc.identifier.issnl	1471-2105	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: A genetic ensemble approach for gene-gene interaction identification

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats