Predicting Protein Cellular Localization Using a Domain Projection Method
AUTOR(ES)
Mott, Richard
FONTE
Cold Spring Harbor Laboratory Press
RESUMO
We investigate the co-occurrence of domain families in eukaryotic proteins to predict protein cellular localization. Approximately half (300) of SMART domains form a “small-world network”, linked by no more than seven degrees of separation. Projection of the domains onto two-dimensional space reveals three clusters that correspond to cellular compartments containing secreted, cytoplasmic, and nuclear proteins. The projection method takes into account the existence of “bridging” domains, that is, instances where two domains might not occur with each other but frequently co-occur with a third domain; in such circumstances the domains are neighbors in the projection. While the majority of domains are specific to a compartment (“locale”), and hence may be used to localize any protein that contains such a domain, a small subset of domains either are present in multiple locales or occur in transmembrane proteins. Comparison with previously annotated proteins shows that SMART domain data used with this approach can predict, with 92% accuracy, the localizations of 23% of eukaryotic proteins. The coverage and accuracy will increase with improvements in domain database coverage. This method is complementary to approaches that use amino-acid composition or identify sorting sequences; these methods may be combined to further enhance prediction accuracy.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=186639Documentos Relacionados
- Predicting Subcellular Localization via Protein Motif Co-Occurrence
- A weighted projection centering method
- Predicting Protein Complex Membership Using Probabilistic Network Reliability
- Cellular localization of the Escherichia coli SpoT protein.
- Predicting co-complexed protein pairs using genomic and proteomic data integration