CAMEO contact prediction (CAMEO CP) is intended to continuously evaluate the performance of local contact prediction programs.
CAMEO-CP participant groups can register their methods either as web-server or as standalone package. The reliability of the participating contact prediction tools is then evaluated by comparing the predicted contacts with the observed contacts in the experimental reference structure using various scoring methods developed by the community.
If you are interested in registering your method to CAMEO CP, please validate your server's compliance with the submission mechanisms and prediction format.
Targets to be evaluated are provided as amino acid sequences in FASTA format.
Predicted contacts should be send back by email adhering to the CAMEO CP prediction format.
As a baseline for the contact prediction, we use mutual information (MI) to predict the most likely contacts from a multiple sequence alignment (MSA) obtained with HHBlits. Specifically the MSA for the target protein is obtained from a search against the UniProt nr20 database. For our calculation of MI, we use the small number correction introduced by Buslje et al. [Bulsje et al, Bioinformatics 25:1125-1131, 2009], which simply consists in adding a small number (here 0.05) to the number of observations of any residue pair when calculating the probabilities. Gaps are not counted as an amino acid type and are therefore not included in the calculation of MI, whereas we always use the total number of sequences in the alignment as normalisation factor in the calculation of the probabilities. This effectively penalises columns with gaps in the calculation of MI.