SNP and gene networks construction and analysis from classification of copy number variations data

Liu, Yang; Lee, Yiu F.; Ng, Michael K.

File Download

Content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1186/1471-2105-12-S5-S4
Scopus: eid_2-s2.0-79960713932
PMID: 21989070
WOS: WOS:000303930900004
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Mathematics: Journal/Magazine Articles

Article: SNP and gene networks construction and analysis from classification of copy number variations data

Title	SNP and gene networks construction and analysis from classification of copy number variations data
Authors	Liu, Yang Lee, Yiu F.Ng, Michael K.
Issue Date	2011
Citation	BMC Bioinformatics, 2011, v. 12, suppl. 5, article no. S4 How to Cite? DOI: http://dx.doi.org/10.1186/1471-2105-12-S5-S4
Abstract	Background: Detection of genomic DNA copy number variations (CNVs) can provide a complete and more comprehensive view of human disease. It is interesting to identify and represent relevant CNVs from a genome-wide data due to high data volume and the complexity of interactions.Results: In this paper, we incorporate the DNA copy number variation data derived from SNP arrays into a computational shrunken model and formalize the detection of copy number variations as a case-control classification problem. More than 80% accuracy can be obtained using our classification model and by shrinkage, the number of relevant CNVs to disease can be determined. In order to understand relevant CNVs, we study their corresponding SNPs in the genome and a statistical software PLINK is employed to compute the pair-wise SNP-SNP interactions, and identify SNP networks based on their P-values. Our selected SNP networks are statistically significant compared with random SNP networks and play a role in the biological process. For the unique genes that those SNPs are located in, a gene-gene similarity value is computed using GOSemSim and gene pairs that have similarity values being greater than a threshold are selected to construct gene networks. A gene enrichment analysis show that our gene networks are functionally important.Experimental results demonstrate that our selected SNP and gene networks based on the selected CNVs contain some functional relationships directly or indirectly to disease study.Conclusions: Two datasets are given to demonstrate the effectiveness of the introduced method. Some statistical and biological analysis show that this shrunken classification model is effective in identifying CNVs from genome-wide data and our proposed framework has a potential to become a useful analysis tool for SNP data sets. © 2011 Liu et al; licensee BioMed Central Ltd.
Persistent Identifier	http://hdl.handle.net/10722/276901
ISSN	1471-2105 2021 Impact Factor: 3.307 2020 SCImago Journal Rankings: 1.567
PubMed Central ID	PMC3226254
ISI Accession Number ID	WOS:000303930900004

DC Field	Value	Language
dc.contributor.author	Liu, Yang	-
dc.contributor.author	Lee, Yiu F.	-
dc.contributor.author	Ng, Michael K.	-
dc.date.accessioned	2019-09-18T08:35:00Z	-
dc.date.available	2019-09-18T08:35:00Z	-
dc.date.issued	2011	-
dc.identifier.citation	BMC Bioinformatics, 2011, v. 12, suppl. 5, article no. S4	-
dc.identifier.issn	1471-2105	-
dc.identifier.uri	http://hdl.handle.net/10722/276901	-
dc.description.abstract	Background: Detection of genomic DNA copy number variations (CNVs) can provide a complete and more comprehensive view of human disease. It is interesting to identify and represent relevant CNVs from a genome-wide data due to high data volume and the complexity of interactions.Results: In this paper, we incorporate the DNA copy number variation data derived from SNP arrays into a computational shrunken model and formalize the detection of copy number variations as a case-control classification problem. More than 80% accuracy can be obtained using our classification model and by shrinkage, the number of relevant CNVs to disease can be determined. In order to understand relevant CNVs, we study their corresponding SNPs in the genome and a statistical software PLINK is employed to compute the pair-wise SNP-SNP interactions, and identify SNP networks based on their P-values. Our selected SNP networks are statistically significant compared with random SNP networks and play a role in the biological process. For the unique genes that those SNPs are located in, a gene-gene similarity value is computed using GOSemSim and gene pairs that have similarity values being greater than a threshold are selected to construct gene networks. A gene enrichment analysis show that our gene networks are functionally important.Experimental results demonstrate that our selected SNP and gene networks based on the selected CNVs contain some functional relationships directly or indirectly to disease study.Conclusions: Two datasets are given to demonstrate the effectiveness of the introduced method. Some statistical and biological analysis show that this shrunken classification model is effective in identifying CNVs from genome-wide data and our proposed framework has a potential to become a useful analysis tool for SNP data sets. © 2011 Liu et al; licensee BioMed Central Ltd.	-
dc.language	eng	-
dc.relation.ispartof	BMC Bioinformatics	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.title	SNP and gene networks construction and analysis from classification of copy number variations data	-
dc.type	Article	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1186/1471-2105-12-S5-S4	-
dc.identifier.pmid	21989070	-
dc.identifier.pmcid	PMC3226254	-
dc.identifier.scopus	eid_2-s2.0-79960713932	-
dc.identifier.volume	12	-
dc.identifier.issue	suppl. 5	-
dc.identifier.spage	article no. S4	-
dc.identifier.epage	article no. S4	-
dc.identifier.isi	WOS:000303930900004	-
dc.identifier.issnl	1471-2105	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: SNP and gene networks construction and analysis from classification of copy number variations data

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats