Collective prediction of protein functions from protein-protein interaction networks

Wu, Qingyao; Ye, Yunming; Ng, Michael K.; Ho, Shen Shyang; Shi, Ruichao

File Download

Content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1186/1471-2105-15-S2-S9
Scopus: eid_2-s2.0-84901268258
PMID: 24564855
WOS: WOS:000330688000009
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Mathematics: Journal/Magazine Articles

Article: Collective prediction of protein functions from protein-protein interaction networks

Title	Collective prediction of protein functions from protein-protein interaction networks
Authors	Wu, Qingyao Ye, Yunming Ng, Michael K.Ho, Shen Shyang Shi, Ruichao
Issue Date	2014
Citation	BMC Bioinformatics, 2014, v. 15, suppl. 2, article no. S9 How to Cite? DOI: http://dx.doi.org/10.1186/1471-2105-15-S2-S9
Abstract	© 2014 Wu et al.; licensee BioMed Central Ltd. Background: Automated assignment of functions to unknown proteins is one of the most important task in computational biology. The development of experimental methods for genome scale analysis of molecular interaction networks offers new ways to infer protein function from protein-protein interaction (PPI) network data. Existing techniques for collective classification (CC) usually increase accuracy for network data, wherein instances are interlinked with each other, using a large amount of labeled data for training. However, the labeled data are timeconsuming and expensive to obtain. On the other hand, one can easily obtain large amount of unlabeled data. Thus, more sophisticated methods are needed to exploit the unlabeled data to increase prediction accuracy for protein function prediction. Results: In this paper, we propose an effective Markov chain based CC algorithm (ICAM) to tackle the label deficiency problem in CC for interrelated proteins from PPI networks. Our idea is to model the problem using two distinct Markov chain classifiers to make separate predictions with regard to attribute features from protein data and relational features from relational information. The ICAM learning algorithm combines the results of the two classifiers to compute the ranks of labels to indicate the importance of a set of labels to an instance, and uses an ICA framework to iteratively refine the learning models for improving performance of protein function prediction from PPI networks in the paucity of labeled data. Conclusion: Experimental results on the real-world Yeast protein-protein interaction datasets show that our proposed ICAM method is better than the other ICA-type methods given limited labeled training data. This approach can serve as a valuable tool for the study of protein function prediction from PPI networks.
Persistent Identifier	http://hdl.handle.net/10722/276682
ISSN	1471-2105 2023 Impact Factor: 2.9 2023 SCImago Journal Rankings: 1.005
PubMed Central ID	PMC4015526
ISI Accession Number ID	WOS:000330688000009

DC Field	Value	Language
dc.contributor.author	Wu, Qingyao	-
dc.contributor.author	Ye, Yunming	-
dc.contributor.author	Ng, Michael K.	-
dc.contributor.author	Ho, Shen Shyang	-
dc.contributor.author	Shi, Ruichao	-
dc.date.accessioned	2019-09-18T08:34:21Z	-
dc.date.available	2019-09-18T08:34:21Z	-
dc.date.issued	2014	-
dc.identifier.citation	BMC Bioinformatics, 2014, v. 15, suppl. 2, article no. S9	-
dc.identifier.issn	1471-2105	-
dc.identifier.uri	http://hdl.handle.net/10722/276682	-
dc.description.abstract	© 2014 Wu et al.; licensee BioMed Central Ltd. Background: Automated assignment of functions to unknown proteins is one of the most important task in computational biology. The development of experimental methods for genome scale analysis of molecular interaction networks offers new ways to infer protein function from protein-protein interaction (PPI) network data. Existing techniques for collective classification (CC) usually increase accuracy for network data, wherein instances are interlinked with each other, using a large amount of labeled data for training. However, the labeled data are timeconsuming and expensive to obtain. On the other hand, one can easily obtain large amount of unlabeled data. Thus, more sophisticated methods are needed to exploit the unlabeled data to increase prediction accuracy for protein function prediction. Results: In this paper, we propose an effective Markov chain based CC algorithm (ICAM) to tackle the label deficiency problem in CC for interrelated proteins from PPI networks. Our idea is to model the problem using two distinct Markov chain classifiers to make separate predictions with regard to attribute features from protein data and relational features from relational information. The ICAM learning algorithm combines the results of the two classifiers to compute the ranks of labels to indicate the importance of a set of labels to an instance, and uses an ICA framework to iteratively refine the learning models for improving performance of protein function prediction from PPI networks in the paucity of labeled data. Conclusion: Experimental results on the real-world Yeast protein-protein interaction datasets show that our proposed ICAM method is better than the other ICA-type methods given limited labeled training data. This approach can serve as a valuable tool for the study of protein function prediction from PPI networks.	-
dc.language	eng	-
dc.relation.ispartof	BMC Bioinformatics	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.title	Collective prediction of protein functions from protein-protein interaction networks	-
dc.type	Article	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1186/1471-2105-15-S2-S9	-
dc.identifier.pmid	24564855	-
dc.identifier.pmcid	PMC4015526	-
dc.identifier.scopus	eid_2-s2.0-84901268258	-
dc.identifier.volume	15	-
dc.identifier.issue	suppl. 2	-
dc.identifier.spage	article no. S9	-
dc.identifier.epage	article no. S9	-
dc.identifier.isi	WOS:000330688000009	-
dc.identifier.issnl	1471-2105	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Collective prediction of protein functions from protein-protein interaction networks

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats