Sunday, 26 August 2012

Jinjiang's WebWatcher on Biology (9) - Subcellular Localization


Subcellular Localization


Databases


eSLDB is a database of protein subcellular localization annotation for eukaryotic organisms. It contains experimental annotations derived from primary protein databases, homology based annotations and computational predictions.

http://gpcr.biocomp.unibo.it/esldb/


DBSubLoc is a database of protein subcellular localization. This database contains proteins from primary protein database SWISS-PROT and PIR.

http://www.bioinfo.tsinghua.edu.cn/dbsubloc.html


A database of organelle proteins, and subcellular structures / complexes

http://organelledb.lsi.umich.edu/


LOCATE is a curated database that houses data describing the membrane organization and subcellular localization of proteins from the RIKEN FANTOM4 mouse and human protein sequence set.

http://locate.imb.uq.edu.au/


LocDB is a expert curated database that collects experimental annotations for the subcellular localization of proteins in Human (Homo sapiens) and Weed (Arabidopsis thaliana).

http://www.rostlab.org/services/locDB/

General Prediction


Predicts the subcellular location of eukaryotic proteins (use results of ChloroP and SignalP). Secretory signal peptides, mitochondrial targeting peptides and chloroplast transit peptides in eukaryotes.

http://www.cbs.dtu.dk/services/TargetP/


Protein Subcellular Localization Prediction (plant, animal,fungi)

http://wolfpsort.org/


bacterial protein subcellular localization prediction

http://www.psort.org/psortb/


The SecretomeP 2.0 server produces ab initio predictions of non-classical i.e. not signal peptide triggered protein secretion. The method queries a large number of other feature prediction servers to obtain information on various post-translational and localizational aspects of the protein, which are integrated into the final secretion prediction.

http://www.cbs.dtu.dk/services/SecretomeP/


Twin-arginine translocation signal peptides in bacteria.

http://www.cbs.dtu.dk/services/TatP/


BaCelLo is a predictor for the subcellular localization of proteins in eukaryotes. It is based on a decision tree of several support vector machines (SVMs), it classifies up to four localizations for Fungi and Metazoan proteins and five localizations for Plant ones

http://gpcr.biocomp.unibo.it/bacello/


Subcellular Localisation Predictor (locations in plants and in other eukaryotes (TargetP data))..

http://pprowler.imb.uq.edu.au/


Subcellular localization predictor; CELLO is a multi-class SVM classification system.

http://cello.life.nctu.edu.tw/


The Proteome Analyst Specialized Subcellular Localization Server (PA-SUB) is part of Proteome Analyst (PA). PA is a web server built to predict protein properties, such as general function, in a high-throughput fashion. PA-SUB is specialized to predict the subcellular localization of proteins using established machine learning techniques.

http://www.cs.ualberta.ca/~bioinfo/PA/Sub/


Subcellular location in plants, other eukaryotes, fungi.

http://www-bs.informatik.uni-tuebingen.de/Services/MultiLoc/


PSLpred is a SVM based method to predict 5 major subcelullar localization (cytoplam, inner-membrane, outermembrane, extracellular, and periplasm) of Gram-negative bacteria.

http://www.imtech.res.in/raghava/pslpred/


pTARGET is a computational method to predict the subcellular localization of only eukaryotic proteins from animal species that include fungi and metazoans. Predictions are carried out based on the occurrence patterns of protein functional domains and the amino acid compositional differences in proteins from different subcellular locations. This method can predict proteins targeted to nine distinct subcellular locations that include cytoplasm, endoplasmic reticulum, extracellular/secreted, Golgi, lysosomes, mitochondria, nucleus, peroxysomes and plasma membrane.

http://bioapps.rit.albany.edu/pTARGET//


SubLoc is a prediction system for protein subcellular localization based on amino acid composition alone.

http://www.bioinfo.tsinghua.edu.cn/SubLoc/

Protein terminii


ProTeus (PROtein TErminUS) is a tool for identification of short linear signatures in protein termini. It is based on a positional-based search method for revealing short significant signatures in termini of proteins.

http://www.proteus.cs.huji.ac.il/

Signal Peptides


SignalP 3.0 server predicts the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms: Gram-positive prokaryotes, Gram-negative prokaryotes, and eukaryotes. The method incorporates a prediction of cleavage sites and a signal peptide/non-signal peptide prediction based on a combination of several artificial neural networks and hidden Markov models.

http://www.cbs.dtu.dk/services/SignalP/


PrediSi (PREDIction of SIgnal peptides) is a software tool for predicting signal peptide sequences and their cleavage positions in bacterial and eukaryotic proteins.

http://www.predisi.de/


Signal-3L is an automated method for predicting signal peptide sequences and their cleavage sites in eukaryotic and bacterial protein sequences that uses a 3-layer approach for predicting signal peptides.

http://www.csbio.sjtu.edu.cn/bioinf/Signal-3L/


Signal Peptide Prediction.

http://bmbpcu36.leeds.ac.uk/prot_analysis/Signal.html


Online tool to find and list homologies of published signal sequences with the input DNA sequence.

http://www-bimas.cit.nih.gov/molbio/signal/


iPSORT is a subcellular localization site predictor for N-terminal sorting signals. Given a protein sequence , it will predict whether it contains a Signal Peptide (SP), Mitochondrial Targeting Peptide (mTP), or Chloroplast Transit Peptide (cTP).

http://hc.ims.u-tokyo.ac.jp/iPSORT/


Prediction of Signal Peptides in Archaea with Hidden Markov Models.

http://bioinformatics.biol.uoa.gr/PRED-SIGNAL/


Prediction of Lipoprotein and Secretory Signal Peptides in Gram-positive Bacteria with Hidden Markov Models.

http://bioinformatics.biol.uoa.gr/PRED-LIPO/


High performance system for signal peptide prediction.

http://bp.nuap.nagoya-u.ac.jp/sosui/sosuisignal/sosuisignal_submit.html

Mitochondrial & ER Localization


MITOPROT: Prediction of mitochondrial targeting sequences

http://ihg.gsf.de/ihg/mitoprot.html


Prediction of N-terminal sequence for mitochondrial, plastid and ER targeting sequences

http://urgi.versailles.inra.fr/predotar/predotar.html

Choroplast localization


Prediction of chloroplast transit peptides

http://www.cbs.dtu.dk/services/ChloroP/

Golgi Localization


Prediction of Golgi Type II membrane proteins based on their transmembrane domains.

http://ccb.imb.uq.edu.au/golgi/golgi_predictor.shtml

Lipoprotein Signal Peptides


Prediction of lipoproteins and signal peptides in Gram negative bacteria.

http://www.cbs.dtu.dk/services/LipoP/


Prediction of Lipoprotein and Secretory Signal Peptides in Gram-positive Bacteria with Hidden Markov Models.

http://bioinformatics.biol.uoa.gr/PRED-LIPO/

Peroxisomal Targeting Signals


Prediction of peroxisomal signal

http://mendel.imp.ac.at/mendeljsp/sat/pts1/PTS1predictor.jsp


Predicts peroxisomal proteins and Pfam domains

http://www.bioinfo.se/PeroxiP/

Nuclear Localization


Predicting Nuclear Localization of Proteins

http://www.sbc.su.se/~maccallr/nucpred/


NetNES 1.1 server predicts leucine-rich nuclear export signals (NES) in eukaryotic proteins using a combination of neural networks and hidden Markov models.

http://www.cbs.dtu.dk/services/NetNES-1.1/index.php

GPI-Anchor Prediction


GPI lipid anchor predictor in animals

http://mendel.imp.ac.at/sat/gpi/gpi_server.html


GPI lipid anchor predictor in plants

http://mendel.imp.ac.at/sat/gpi/plant_server.html


GPI lipid anchor predictor in fungi

http://mendel.imp.ac.at/sat/gpi/fungi_server.html


Identification of GPI-anchor signals by a Kohonen Self Organizing Map

http://gpi.unibe.ch/


 

Prediction Tools


  • Cell-PLoc: A package of web-servers for predicting subcellular localization of proteins in various organisms.[2]
  • BaCelLo: Prediction of eukaryotic protein subcellular localization. Unlike other methods, the predictions are balanced among different classes and all the localizations that are predicted are considered as equiprobable, to avoid mispredictions.[9]
  • CELLO: CELLO uses a two-level Support Vector Machine system to assign localizations to both prokaryotic and eukaryotic proteins.[10][11]
  • ClubSub-P: ClubSub-P is a database of cluster-based subcellular localization (SCL) predictions for Archaea and Gram negative bacteria. [12]
  • Euk-mPLoc 2.0: Predicting the subcellular localization of eukaryotic proteins with both single and multiple sites.[13]
  • CoBaltDB: CoBaltDB is a novel powerful platform that provides easy access to the results of multiple localization tools and support for predicting prokaryotic protein localizations.[14]
  • HSLpred: This method allow to predict subcellular localization of human proteins. This method combines power of composition based SVM models and similarity search techniques PSI-BLAST.[15]
  • KnowPredsite: A knowledge-based approach to predict the localization site(s) of both single-localized and multi-localized proteins for all eukaryotes.[16]
  • LOCtree: Prediction based on mimicking the cellular sorting mechanism using a hierarchical implementation of support vector machines. LOCtree is a comprehensive predictor incorporating predictions based on PROSITE/PFAM signatures as well as SwissProt keywords.[17]
  • MultiLoc: An SVM-based prediction engine for a wide range of subcellular locations.[18]
  • PSORT: The first widely used method for protein subcellular localization prediction, developed under the leadership of Kenta Nakai.[19] Now researchers are also encouraged to use other PSORT programs such as WoLF PSORT and PSORTb for making predictions for certain types of organisms (see below). PSORT prediction performances are lower than those of recently developed predictors.
  • PSORTb: Prediction of bacterial protein localization.[20][21]
  • MetaLocGramN: Meta subcellular localization predictor of Gram-negative protein.[22] (submitted)
  • PredictNLS: Prediction of nuclear localization signals.[23]
  • Proteome Analyst: Prediction of protein localization for both prokaryotes and eukaryotes using a text mining approach.[24]
  • SecretomeP: Prediction of eukaryotic proteins that are secreted via a non-traditional secretory mechanism.[25]
  • SherLoc: An SVM-based predictor combining MultiLoc with text-based features derived from PubMed abstracts.[26]
  • TargetP: Prediction of N-terminal sorting signals.[27]
  • WoLF PSORT: An updated version of PSORT/PSORT II for the prediction of eukaryotic sequences.[28]


 

Tools for Prediction of Protein Localization
ChloroP:Predicts presence of chloroplast transit peptides (cTP)
MITOPROT:Prediction of mitochondrial targeting sequences
PSORT:Prediction of protein-sorting signals and localization sites
SignalP:predicts the presence and location of signal peptide cleavage sites
TargetP:predicts the subcellular location of eukaryotic protein sequences

Protein sorting


ChloroP » ChloroP | Download software
Chloroplast transit peptides and their cleavage sites in plant proteins

LipoP » LipoP | Download software | SOAP WS
Signal peptidase I & II cleavage sites in gram- bacteria

NetNES » NetNES | SOAP WS
Leucine-rich nuclear export signals (NES) in eukaryotic proteins

SecretomeP » SecretomeP | Download software
Non-classical and leaderless secretion of proteins

SignalP » SignalP | Download software | SOAP WS
Signal peptide and cleavage sites in gram+, gram- and eukaryotic amino acid sequences

TargetP » TargetP | Download software | SOAP WS
Subcellular location of proteins: mitochondrial, chloroplastic, secretory pathway, or other

TatP » TatP | SOAP WS
Twin-arginine signal peptides

Post-translational modifications of proteins


DictyOGlyc
O-(alpha)-GlcNAc glycosylation sites (trained on Dictyostelium discoideum proteins)

NetAcet
N-terminal acetylation in eukaryotic proteins

NetCGlyc » NetCGlyc | Download software
C-mannosylation sites in mammalian proteins

NetCorona
Coronavirus 3C-like proteinase cleavage sites in proteins

NetGlycate » NetGlycate | SOAP WS
Glycation of ε amino groups of lysines in mammalian proteins

NetNGlyc » NetNGlyc | Download software | SOAP WS
N-linked glycosylation sites in human proteins

NetNGlyc » NetNGlyc | Download software | SOAP WS
N-linked glycosylation sites in human proteins

NetOGlyc » NetOGlyc | Download software | SOAP WS
O-GalNAc (mucin type) glycosylation sites in mammalian proteins

NetOGlyc » NetOGlyc | Download software | SOAP WS
O-GalNAc (mucin type) glycosylation sites in mammalian proteins

NetPhorest
Linear motif atlas for phosphorylation-dependent signaling

NetPhos » NetPhos | Download software | SOAP WS
Generic phosphorylation sites in eukaryotic proteins

NetPhosBac
Generic phosphorylation sites in bacterial proteins

NetPhosK
Kinase specific phosphorylation sites in eukaryotic proteins

NetPhosYeast
Serine and threonine phosphorylation sites in yeast proteins

NetPicoRNA
Posttranslational cleavage by picornaviral proteases

NetworKIN
In vivo kinase-substrate relationships

ProP » ProP | Download software | SOAP WS
Arginine and lysine propeptide cleavage sites in eukaryotic protein sequences

YinOYang » YinOYang | Download software
O-(beta)-GlcNAc glycosylation and Yin-Yang sites (intracellular/nuclear proteins)

Post-translational modification prediction
  • ChloroP - Prediction of chloroplast transit peptides
  • LipoP - Prediction of lipoproteins and signal peptides in Gram negative bacteria
  • MITOPROT - Prediction of mitochondrial targeting sequences
  • PATS - Prediction of apicoplast targeted sequences
  • PlasMit - Prediction of mitochondrial transit peptides in Plasmodium falciparum
  • Predotar - Prediction of mitochondrial and plastid targeting sequences
  • PTS1 - Prediction of peroxisomal targeting signal 1 containing proteins
  • SignalP - Prediction of signal peptide cleavage sites
  • DictyOGlyc - Prediction of GlcNAc O-glycosylation sites in Dictyostelium
  • NetCGlyc - C-mannosylation sites in mammalian proteins
  • NetOGlyc - Prediction of O-GalNAc (mucin type) glycosylation sites in mammalian proteins
  • NetGlycate - Glycation of epsilon amino groups of lysines in mammalian proteins
  • NetNGlyc - Prediction of N-glycosylation sites in human proteins
  • OGPET - Prediction of O-GalNAc (mucin-type) glycosylation sites in eukaryotic (non-protozoan) proteins
  • YinOYang - O-beta-GlcNAc attachment sites in eukaryotic protein sequences
  • big-PI Predictor - GPI Modification Site Prediction
  • GPI-SOM - Identification of GPI-anchor signals by a Kohonen Self Organizing Map
  • Myristoylator - Prediction of N-terminal myristoylation by neural networks
  • NMT - Prediction of N-terminal N-myristoylation
  • CSS-Palm - Palmitoylation site prediction with CSS
  • PrePS - Prenylation Prediction Suite
  • NetAcet - Prediction of N-acetyltransferase A (NatA) substrates (in yeast and mammalian proteins)
  • NetPhos - Prediction of Ser, Thr and Tyr phosphorylation sites in eukaryotic proteins
  • NetPhosK - Kinase specific phosphorylation sites in eukaryotic proteins
  • NetPhosYeast - Serine and threonine phosphorylation sites in yeast proteins
  • GPS - Prediction of kinase-specific phosphorylation sites for 408 human protein kinases in hierarchy
  • Sulfinator - Prediction of tyrosine sulfation sites
  • SulfoSite - Prediction of tyrosine sulfation sites
  • SUMOplot - Prediction of SUMO protein attachment sites
  • SUMOsp - Prediction of sumoylation sites
  • TermiNator - Prediction of N-terminal modification (version 3)
  • NetPicoRNA - Prediction of protease cleavage sites in picornaviral proteins
  • NetCorona - Coronavirus 3C-like proteinase cleavage sites in proteins
  • ProP - Arginine and lysine propeptide cleavage sites in eukaryotic protein sequences