File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: On gene selection and classification for cancer microarray data using multi-step clustering and sparse representation

TitleOn gene selection and classification for cancer microarray data using multi-step clustering and sparse representation
Authors
Keywordsclassification
clustering
Gene selection
Lasso
cancer prediction
Issue Date2011
Citation
Advances in Adaptive Data Analysis, 2011, v. 3, n. 1-2, p. 127-148 How to Cite?
AbstractMicroarray data profiles gene expression on a whole genome scale, and provides a good way to study associations between gene expression and occurrence or progression of cancer disease. Many researchers realized that microarray data is useful to predict cancer cases. However, the high dimension of gene expressions, which is significantly larger than the sample size, makes this task very difficult. It is very important to identify the significant genes causing cancer. Many feature selection algorithms have been proposed focusing on improving cancer predictive accuracy at the expense of ignoring the correlations between the features. In this work, a novel framework (named by SGS) is presented for significant genes selection and efficient cancer case classification. The proposed framework first performs a clustering algorithm to find the gene groups where genes in each group have higher correlation coefficient, and then selects (1) the significant (2) genes in each group using the Bayesian Lasso method and important gene groups using the group Lasso method, and finally builds a prediction model based on the shrinkage gene space with efficient classification algorithm (such as support vector machine (SVM), 1NN, and regression). Experimental results on public available microarray data show that the proposed framework often outperforms the existing feature selection and prediction methods such as SAM, information gain (IG), and Lasso-type prediction models. © 2011 World Scientific Publishing Company.
Persistent Identifierhttp://hdl.handle.net/10722/276905
ISSN
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorJing, Liping-
dc.contributor.authorNg, Michael K.-
dc.contributor.authorZeng, Tieyong-
dc.date.accessioned2019-09-18T08:35:00Z-
dc.date.available2019-09-18T08:35:00Z-
dc.date.issued2011-
dc.identifier.citationAdvances in Adaptive Data Analysis, 2011, v. 3, n. 1-2, p. 127-148-
dc.identifier.issn1793-5369-
dc.identifier.urihttp://hdl.handle.net/10722/276905-
dc.description.abstractMicroarray data profiles gene expression on a whole genome scale, and provides a good way to study associations between gene expression and occurrence or progression of cancer disease. Many researchers realized that microarray data is useful to predict cancer cases. However, the high dimension of gene expressions, which is significantly larger than the sample size, makes this task very difficult. It is very important to identify the significant genes causing cancer. Many feature selection algorithms have been proposed focusing on improving cancer predictive accuracy at the expense of ignoring the correlations between the features. In this work, a novel framework (named by SGS) is presented for significant genes selection and efficient cancer case classification. The proposed framework first performs a clustering algorithm to find the gene groups where genes in each group have higher correlation coefficient, and then selects (1) the significant (2) genes in each group using the Bayesian Lasso method and important gene groups using the group Lasso method, and finally builds a prediction model based on the shrinkage gene space with efficient classification algorithm (such as support vector machine (SVM), 1NN, and regression). Experimental results on public available microarray data show that the proposed framework often outperforms the existing feature selection and prediction methods such as SAM, information gain (IG), and Lasso-type prediction models. © 2011 World Scientific Publishing Company.-
dc.languageeng-
dc.relation.ispartofAdvances in Adaptive Data Analysis-
dc.subjectclassification-
dc.subjectclustering-
dc.subjectGene selection-
dc.subjectLasso-
dc.subjectcancer prediction-
dc.titleOn gene selection and classification for cancer microarray data using multi-step clustering and sparse representation-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1142/S1793536911000763-
dc.identifier.scopuseid_2-s2.0-80052635806-
dc.identifier.volume3-
dc.identifier.issue1-2-
dc.identifier.spage127-
dc.identifier.epage148-
dc.identifier.eissn1793-7175-
dc.identifier.isiWOS:000216764200008-
dc.identifier.issnl1793-7175-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats