You are here
Structural descriptor database: a new tool for sequence-based functional site prediction.
Title | Structural descriptor database: a new tool for sequence-based functional site prediction. |
Publication Type | Journal Article |
Year of Publication | 2008 |
Authors | Bernardes, JS, Fernandez, JH, Vasconcelos, ATereza R |
Journal | BMC Bioinformatics |
Volume | 9 |
Pagination | 492 |
Date Published | 2008 Nov 25 |
ISSN | 1471-2105 |
Keywords | Algorithms, Artificial Intelligence, Binding Sites, Catalytic Domain, Databases, Protein, Information Storage and Retrieval, Internet, Pattern Recognition, Automated, Protein Conformation, Proteins, Proteomics, Sequence Analysis, Protein, Software, Structure-Activity Relationship |
Abstract | BACKGROUND: The Structural Descriptor Database (SDDB) is a web-based tool that predicts the function of proteins and functional site positions based on the structural properties of related protein families. Structural alignments and functional residues of a known protein set (defined as the training set) are used to build special Hidden Markov Models (HMM) called HMM descriptors. SDDB uses previously calculated and stored HMM descriptors for predicting active sites, binding residues, and protein function. The database integrates biologically relevant data filtered from several databases such as PDB, PDBSUM, CSA and SCOP. It accepts queries in fasta format and predicts functional residue positions, protein-ligand interactions, and protein function, based on the SCOP database. RESULTS: To assess the SDDB performance, we used different data sets. The Trypsion-like Serine protease data set assessed how well SDDB predicts functional sites when curated data is available. The SCOP family data set was used to analyze SDDB performance by using training data extracted from PDBSUM (binding sites) and from CSA (active sites). The ATP-binding experiment was used to compare our approach with the most current method. For all evaluations, significant improvements were obtained with SDDB. CONCLUSION: SDDB performed better when trusty training data was available. SDDB worked better in predicting active sites rather than binding sites because the former are more conserved than the latter. Nevertheless, by using our prediction method we obtained results with precision above 70%. |
DOI | 10.1186/1471-2105-9-492 |
Alternate Journal | BMC Bioinformatics |
PubMed ID | 19032768 |
PubMed Central ID | PMC2612011 |