Abstract
A statistical analysis was performed to determine to what extent an amino acid determines the identity of its neighbors and to what extent this is determined by the structural environment. Log-linear analysis was used to discriminate chance occurrence from statistically meaningful correlations. The classification of structures was arbitrary, but was also tested for significance. A list of statistically significant interaction types was selected and then ranked according to apparent importance for applications such as protein design. This showed that, in general, nonlocal, through- space interactions were more important than those between residues near in the protein sequence. The highest ranked nonlocal interactions involved residues in β-sheet structures. Of the local interactions, those between residues i and i + 2 were the most important in both α-helices and β- strands. Some surprisingly strong correlations were discovered within β- sheets between residues and sites sequentially near to their bridging partners. The results have a clear bearing on protein engineering studies, but also have implications for the construction of knowledge-based force fields.
Original language | English |
---|---|
Pages (from-to) | 175-189 |
Number of pages | 15 |
Journal | Proteins: Structure, Function and Bioinformatics |
Volume | 32 |
Issue number | 2 |
DOIs | |
Publication status | Published - 1 Aug 1998 |
Keywords
- Nonlocal interactions
- Pairwise statistics
- Secondary structure