News & Topics

Protein Subcellular Localization Prediction (April 2005)



CBRC: Paul Horton, Keun-Joon Park (from April 2005: Tokyo University Human Genome Center)
Tokyo Institute of Technology: Takeshi Obayashi
Tokyo University Human Genome Center: Kenta Nakai


WoLF PSORT predicts the subcellular localization sites of proteins from their amino acid sequences. The method, a major extension of the venerable PSORTII program, bases its predictions on both known sorting signal motifs and correlative sequence features such as amino acid content. Like PSORT and PSORTII, WoLF PSORT reports the sorting signals it detects, which helps users judge the reliability of a prediction in specific cases.
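To make the phrase "amino acid content" concrete, the following is a minimal sketch of how such a feature could be computed from a sequence. It is an illustration only and is not taken from the WoLF PSORT code: the function name amino_acid_content, the restriction to the 20 standard residues, and the example peptide are all assumptions made here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: non-standard
# residues and ambiguity codes such as X or B are simply ignored).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_content(sequence):
    """Return the fraction of each standard amino acid in a sequence.

    Hypothetical helper for illustration; WoLF PSORT's actual feature
    definitions are not described in this article.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        # No standard residues found: return an all-zero vector.
        return {aa: 0.0 for aa in AMINO_ACIDS}
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


if __name__ == "__main__":
    # Made-up peptide, used only to show the call.
    content = amino_acid_content("MKKLLA")
    print(content["K"], content["L"])
```

A vector like this could be combined with motif-based features before classification, but how WoLF PSORT actually weights or selects its features is not stated here.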

Our experiments (paper in preparation) show that the overall prediction accuracy of WoLF PSORT is over 80%. For common localization sites (e.g. cytosol, nucleus, mitochondria), WoLF PSORT predicts better than a majority classifier, even for queries that lack strong sequence similarity to any sequence in the dataset. WoLF PSORT is therefore a useful complement to similarity-search tools such as BLAST. The dataset currently used to train WoLF PSORT contains over 12,000 animal sequences, and more than 2,000 sequences each for plants and fungi. It was gathered mainly from UniProt, but several hundred Arabidopsis thaliana sequences from the Gene Ontology database were also included.
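For readers unfamiliar with the term, a majority classifier always predicts the most frequent class in the training data, so its accuracy is the baseline any useful predictor has to beat. The sketch below shows one way that baseline could be computed. The function name majority_baseline_accuracy and the toy labels are assumptions for illustration and do not come from the WoLF PSORT dataset or experiments.

```python
from collections import Counter


def majority_baseline_accuracy(train_labels, test_labels):
    """Return (majority label, accuracy of always predicting that label).

    Hypothetical helper for illustration only.
    """
    if not train_labels or not test_labels:
        raise ValueError("both label lists must be non-empty")
    # The most frequent localization site in the training labels.
    majority_label, _ = Counter(train_labels).most_common(1)[0]
    # Fraction of test items for which that single label is correct.
    hits = sum(1 for label in test_labels if label == majority_label)
    return majority_label, hits / len(test_labels)


if __name__ == "__main__":
    # Toy labels, invented for this example.
    train = ["cytosol", "nucleus", "nucleus", "mitochondria"]
    test = ["nucleus", "cytosol", "nucleus", "nucleus"]
    print(majority_baseline_accuracy(train, test))
```

A predictor is informative for a given set of queries only if its accuracy on them exceeds this figure.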
