Article: Strategies for identifying statistically significant dense regions in microarray data

Title: Strategies for identifying statistically significant dense regions in microarray data
Authors: Yip, Andy M.; Ng, Michael K.; Wu, Edmond H.; Chan, Tony F.
Keywords: Microarray; Gene expression; Dense region; Coexpressed genes; Bicluster; Categorical data; Clustering
Issue Date: 2007
Citation: IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2007, v. 4, n. 3, p. 415-428
Abstract: We propose and study the notion of dense regions for the analysis of categorized gene expression data and present some searching algorithms for discovering them. The algorithms can be applied to any categorical data matrices derived from gene expression level matrices. We demonstrate that dense regions are simple but useful and statistically significant patterns that can be used to 1) identify genes and/or samples of interest and 2) eliminate genes and/or samples corresponding to outliers, noise, or abnormalities. Some theoretical studies on the properties of the dense regions are presented which allow us to characterize dense regions into several classes and to derive tailor-made algorithms for different classes of regions. Moreover, an empirical simulation study on the distribution of the size of dense regions is carried out which is then used to assess the significance of dense regions and to derive effective pruning methods to speed up the searching algorithms. Real microarray data sets are employed to test our methods. Comparisons with six other well-known clustering algorithms using synthetic and real data are also conducted which confirm the superiority of our methods in discovering dense regions. The DRIFT code and a tutorial are available as supplemental material, which can be found on the Computer Society Digital Library at http://computer.org/tcbb/archives.htm. © 2007 IEEE.
Persistent Identifier: http://hdl.handle.net/10722/276814
ISSN: 1545-5963
2023 Impact Factor: 3.6
2023 SCImago Journal Rankings: 0.794
ISI Accession Number ID: WOS:000248414700008

 

DC Field | Value | Language
dc.contributor.author | Yip, Andy M. | -
dc.contributor.author | Ng, Michael K. | -
dc.contributor.author | Wu, Edmond H. | -
dc.contributor.author | Chan, Tony F. | -
dc.date.accessioned | 2019-09-18T08:34:44Z | -
dc.date.available | 2019-09-18T08:34:44Z | -
dc.date.issued | 2007 | -
dc.identifier.citation | IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2007, v. 4, n. 3, p. 415-428 | -
dc.identifier.issn | 1545-5963 | -
dc.identifier.uri | http://hdl.handle.net/10722/276814 | -
dc.description.abstract | We propose and study the notion of dense regions for the analysis of categorized gene expression data and present some searching algorithms for discovering them. The algorithms can be applied to any categorical data matrices derived from gene expression level matrices. We demonstrate that dense regions are simple but useful and statistically significant patterns that can be used to 1) identify genes and/or samples of interest and 2) eliminate genes and/or samples corresponding to outliers, noise, or abnormalities. Some theoretical studies on the properties of the dense regions are presented which allow us to characterize dense regions into several classes and to derive tailor-made algorithms for different classes of regions. Moreover, an empirical simulation study on the distribution of the size of dense regions is carried out which is then used to assess the significance of dense regions and to derive effective pruning methods to speed up the searching algorithms. Real microarray data sets are employed to test our methods. Comparisons with six other well-known clustering algorithms using synthetic and real data are also conducted which confirm the superiority of our methods in discovering dense regions. The DRIFT code and a tutorial are available as supplemental material, which can be found on the Computer Society Digital Library at http://computer.org/tcbb/archives.htm. © 2007 IEEE. | -
dc.language | eng | -
dc.relation.ispartof | IEEE/ACM Transactions on Computational Biology and Bioinformatics | -
dc.subject | Microarray | -
dc.subject | Gene expression | -
dc.subject | Dense region | -
dc.subject | Coexpressed genes | -
dc.subject | Bicluster | -
dc.subject | Categorical data | -
dc.subject | Clustering | -
dc.title | Strategies for identifying statistically significant dense regions in microarray data | -
dc.type | Article | -
dc.description.nature | link_to_subscribed_fulltext | -
dc.identifier.doi | 10.1109/TCBB.2007.1022 | -
dc.identifier.pmid | 17666761 | -
dc.identifier.scopus | eid_2-s2.0-34547973950 | -
dc.identifier.volume | 4 | -
dc.identifier.issue | 3 | -
dc.identifier.spage | 415 | -
dc.identifier.epage | 428 | -
dc.identifier.isi | WOS:000248414700008 | -
dc.identifier.issnl | 1545-5963 | -
