File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Plus ça change - evolutionary sequence divergence predicts protein subcellular localization signals

TitlePlus ça change - evolutionary sequence divergence predicts protein subcellular localization signals
Authors
Issue Date2014
Citation
BMC Genomics, 2014, v. 15, n. 1 How to Cite?
AbstractBackground: Protein subcellular localization is a central problem in understanding cell biology and has been the focus of intense research. In order to predict localization from amino acid sequence a myriad of features have been tried: including amino acid composition, sequence similarity, the presence of certain motifs or domains, and many others. Surprisingly, sequence conservation of sorting motifs has not yet been employed, despite its extensive use for tasks such as the prediction of transcription factor binding sites.Results: Here, we flip the problem around, and present a proof of concept for the idea that the lack of sequence conservation can be a novel feature for localization prediction. We show that for yeast, mammal and plant datasets, evolutionary sequence divergence alone has significant power to identify sequences with N-terminal sorting sequences. Moreover sequence divergence is nearly as effective when computed on automatically defined ortholog sets as on hand curated ones. Unfortunately, sequence divergence did not necessarily increase classification performance when combined with some traditional sequence features such as amino acid composition. However a post-hoc analysis of the proteins in which sequence divergence changes the prediction yielded some proteins with atypical (i.e. not MPP-cleaved) matrix targeting signals as well as a few misannotations.Conclusion: We report the results of the first quantitative study of the effectiveness of evolutionary sequence divergence as a feature for protein subcellular localization prediction. We show that divergence is indeed useful for prediction, but it is not trivial to improve overall accuracy simply by adding this feature to classical sequence features. Nevertheless we argue that sequence divergence is a promising feature and show anecdotal examples in which it succeeds where other features fail. © 2014 Fukasawa et al.; licensee BioMed Central Ltd.
Persistent Identifierhttp://hdl.handle.net/10722/222151
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorFukasawa, Yoshinori-
dc.contributor.authorLeung, Ross K K-
dc.contributor.authorTsui, Stephen K W-
dc.contributor.authorHorton, Paul-
dc.date.accessioned2015-12-21T06:48:56Z-
dc.date.available2015-12-21T06:48:56Z-
dc.date.issued2014-
dc.identifier.citationBMC Genomics, 2014, v. 15, n. 1-
dc.identifier.urihttp://hdl.handle.net/10722/222151-
dc.description.abstractBackground: Protein subcellular localization is a central problem in understanding cell biology and has been the focus of intense research. In order to predict localization from amino acid sequence a myriad of features have been tried: including amino acid composition, sequence similarity, the presence of certain motifs or domains, and many others. Surprisingly, sequence conservation of sorting motifs has not yet been employed, despite its extensive use for tasks such as the prediction of transcription factor binding sites.Results: Here, we flip the problem around, and present a proof of concept for the idea that the lack of sequence conservation can be a novel feature for localization prediction. We show that for yeast, mammal and plant datasets, evolutionary sequence divergence alone has significant power to identify sequences with N-terminal sorting sequences. Moreover sequence divergence is nearly as effective when computed on automatically defined ortholog sets as on hand curated ones. Unfortunately, sequence divergence did not necessarily increase classification performance when combined with some traditional sequence features such as amino acid composition. However a post-hoc analysis of the proteins in which sequence divergence changes the prediction yielded some proteins with atypical (i.e. not MPP-cleaved) matrix targeting signals as well as a few misannotations.Conclusion: We report the results of the first quantitative study of the effectiveness of evolutionary sequence divergence as a feature for protein subcellular localization prediction. We show that divergence is indeed useful for prediction, but it is not trivial to improve overall accuracy simply by adding this feature to classical sequence features. Nevertheless we argue that sequence divergence is a promising feature and show anecdotal examples in which it succeeds where other features fail. © 2014 Fukasawa et al.; licensee BioMed Central Ltd.-
dc.languageeng-
dc.relation.ispartofBMC Genomics-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.titlePlus ça change - evolutionary sequence divergence predicts protein subcellular localization signals-
dc.typeArticle-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1186/1471-2164-15-46-
dc.identifier.pmid24438075-
dc.identifier.scopuseid_2-s2.0-84892449284-
dc.identifier.volume15-
dc.identifier.issue1-
dc.identifier.eissn1471-2164-
dc.identifier.isiWOS:000331096300001-
dc.identifier.issnl1471-2164-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats