
Conference Paper: Gene-gene interaction filtering with ensemble of filters

Title: Gene-gene interaction filtering with ensemble of filters
Authors: Yang, Pengyi; Ho, Joshua W.K.; Yang, Yee H.; Zhou, Bing B.
Issue Date: 2011
Citation: Ninth Asia Pacific Bioinformatics Conference (APBC 2011), Inchon, Korea, 11-14 January 2011. In BMC Bioinformatics, 2011, v. 12, n. SUPPL. 1
Abstract:
Background: Complex diseases are commonly caused by multiple genes and their interactions with each other. Genome-wide association (GWA) studies provide us the opportunity to capture those disease associated genes and gene-gene interactions through panels of SNP markers. However, a proper filtering procedure is critical to reduce the search space prior to the computationally intensive gene-gene interaction identification step. In this study, we show that two commonly used SNP-SNP interaction filtering algorithms, ReliefF and tuned ReliefF (TuRF), are sensitive to the order of the samples in the dataset, giving rise to unstable and suboptimal results. However, we observe that the 'unstable' results from multiple runs of these algorithms can provide valuable information about the dataset. We therefore hypothesize that aggregating results from multiple runs of the algorithm may improve the filtering performance.
Results: We propose a simple and effective ensemble approach in which the results from multiple runs of an unstable filter are aggregated based on the general theory of ensemble learning. The ensemble versions of the ReliefF and TuRF algorithms, referred to as ReliefF-E and TuRF-E, are robust to sample order dependency and enable a more informative investigation of data characteristics. Using simulated and real datasets, we demonstrate that both the ensemble of ReliefF and the ensemble of TuRF can generate a much more stable SNP ranking than the original algorithms. Furthermore, the ensemble of TuRF achieved the highest success rate in comparison to many state-of-the-art algorithms as well as traditional χ²-test and odds ratio methods in terms of retaining gene-gene interactions. © 2011 Yang et al; licensee BioMed Central Ltd.
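The ensemble idea described in the abstract (run an order-sensitive filter several times on shuffled copies of the dataset, then aggregate the per-SNP scores) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' ReliefF-E or TuRF-E implementation: `toy_relief` is a hypothetical, heavily simplified Relief-style scorer with one nearest hit and one nearest miss, `ensemble_scores` and `rank_features` are hypothetical helper names, and aggregating by the mean score is an assumption made here for simplicity.

```python
import random


def toy_relief(X, y):
    """Hypothetical simplified Relief-style scorer (one nearest hit/miss).

    Ties between equally distant neighbours are resolved by sample order,
    which is what makes the scores depend on the order of the rows.
    """
    n, m = len(X), len(X[0])
    w = [0.0] * m
    for i in range(n):
        hit = miss = None
        hit_d = miss_d = m + 1  # larger than any possible mismatch count
        for j in range(n):
            if i == j:
                continue
            # Mismatch count between two genotype vectors
            d = sum(a != b for a, b in zip(X[i], X[j]))
            if y[j] == y[i]:
                if d < hit_d:  # strict '<' keeps the first tied neighbour
                    hit, hit_d = j, d
            elif d < miss_d:
                miss, miss_d = j, d
        if hit is None or miss is None:
            continue
        for f in range(m):
            # Reward features that differ from the miss, penalise those
            # that differ from the hit
            w[f] += (X[i][f] != X[miss][f]) - (X[i][f] != X[hit][f])
    return [v / n for v in w]


def ensemble_scores(filter_fn, X, y, n_runs=20, seed=0):
    """Average filter scores over runs on randomly reordered samples."""
    rng = random.Random(seed)
    total = [0.0] * len(X[0])
    idx = list(range(len(X)))
    for _ in range(n_runs):
        rng.shuffle(idx)  # new sample order for this run
        Xp = [X[i] for i in idx]
        yp = [y[i] for i in idx]
        scores = filter_fn(Xp, yp)
        total = [t + s for t, s in zip(total, scores)]
    return [t / n_runs for t in total]


def rank_features(scores):
    """Feature indices sorted from highest to lowest score."""
    return sorted(range(len(scores)), key=lambda f: scores[f], reverse=True)


if __name__ == "__main__":
    # Tiny made-up dataset: rows are samples, columns are SNP genotypes (0/1/2)
    labels = [i % 2 for i in range(12)]
    data = [[labels[i], (i // 2) % 3, (i * 7) % 3, (i // 3) % 2] for i in range(12)]
    print(rank_features(ensemble_scores(toy_relief, data, labels)))
```

Passing the filter as a function keeps the aggregation step independent of the particular scorer, so the same wrapper could in principle be applied to any filter whose output varies with sample order.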
Persistent Identifier: http://hdl.handle.net/10722/262638
ISI Accession Number ID: WOS:000290221000011

 

DC Field | Value | Language
dc.contributor.author | Yang, Pengyi | -
dc.contributor.author | Ho, Joshua W.K. | -
dc.contributor.author | Yang, Yee H. | -
dc.contributor.author | Zhou, Bing B. | -
dc.date.accessioned | 2018-10-08T02:46:36Z | -
dc.date.available | 2018-10-08T02:46:36Z | -
dc.date.issued | 2011 | -
dc.identifier.citation | Ninth Asia Pacific Bioinformatics Conference (APBC 2011), Inchon, Korea, 11-14 January 2011. In BMC Bioinformatics, 2011, v. 12, n. SUPPL. 1 | -
dc.identifier.uri | http://hdl.handle.net/10722/262638 | -
dc.description.abstract | Background: Complex diseases are commonly caused by multiple genes and their interactions with each other. Genome-wide association (GWA) studies provide us the opportunity to capture those disease associated genes and gene-gene interactions through panels of SNP markers. However, a proper filtering procedure is critical to reduce the search space prior to the computationally intensive gene-gene interaction identification step. In this study, we show that two commonly used SNP-SNP interaction filtering algorithms, ReliefF and tuned ReliefF (TuRF), are sensitive to the order of the samples in the dataset, giving rise to unstable and suboptimal results. However, we observe that the 'unstable' results from multiple runs of these algorithms can provide valuable information about the dataset. We therefore hypothesize that aggregating results from multiple runs of the algorithm may improve the filtering performance. Results: We propose a simple and effective ensemble approach in which the results from multiple runs of an unstable filter are aggregated based on the general theory of ensemble learning. The ensemble versions of the ReliefF and TuRF algorithms, referred to as ReliefF-E and TuRF-E, are robust to sample order dependency and enable a more informative investigation of data characteristics. Using simulated and real datasets, we demonstrate that both the ensemble of ReliefF and the ensemble of TuRF can generate a much more stable SNP ranking than the original algorithms. Furthermore, the ensemble of TuRF achieved the highest success rate in comparison to many state-of-the-art algorithms as well as traditional χ²-test and odds ratio methods in terms of retaining gene-gene interactions. © 2011 Yang et al; licensee BioMed Central Ltd. | -
dc.language | eng | -
dc.relation.ispartof | BMC Bioinformatics | -
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | -
dc.title | Gene-gene interaction filtering with ensemble of filters | -
dc.type | Conference_Paper | -
dc.description.nature | published_or_final_version | -
dc.identifier.doi | 10.1186/1471-2105-12-S1-S10 | -
dc.identifier.pmid | 21342539 | -
dc.identifier.scopus | eid_2-s2.0-79951526454 | -
dc.identifier.volume | 12 | -
dc.identifier.issue | SUPPL. 1 | -
dc.identifier.spage | null | -
dc.identifier.epage | null | -
dc.identifier.eissn | 1471-2105 | -
dc.identifier.isi | WOS:000290221000011 | -
dc.identifier.issnl | 1471-2105 | -
