| Results |
 |
| Robustness of the Delaunay Tessellation
in proteins: We show below how the
number of AD tetrahedra seems to decrease as we go from random points
to chains to decoys to proteins with similar sequence length and
packing density. Also, plotting the number of tetrahedra added at
low thresholds in (e) indicates that the DT of proteins changes
less than that of random points for a given perturbation. |
|
| |
|
|
 |
 |

|
| Comparing the AD tetrahedral threshold distributions,
with error bars for standard deviation, of (a) totally random
points (b) random lattice chains folded using MJ potential
(c) decoys of 2cro (d) proteins represented by C-αs.
(e) shows the stability of the DT in proteins.
|
|
|
 |
| Stability of SNAPP scores: We developed
variants of the SNAPP scores using almost-Delaunay tetrahedra with
positive threshold (AD-SNAPP), Delaunay and AD tetrahedra (D+AD-SNAPP)
and using Delaunay probabilities to weight each tetrahedron's score
(Delaunay-probability SNAPP). The average scores turn out to be
well correlated and equally good as averaged SNAPP scores at distinguishing
decoys. The total SNAPP scores are better at making this distinction,
and Delaunay-probability SNAPP scores are equally good as
SNAPP scores, as seen below in the second figure. |
|
| |
 |
| Comparison of average SNAPP scores using Delaunay
and almost-Delaunay tetrahedra (left, upper figure), and total
SNAPP scores against Delaunay-probability SNAPP scores (left,
lower), for the protein 2CRO (left, upper) and its decoys,
all represented by sidechain-centroids. |
|
 |
|
| |
|
 |
| Detection of structural motifs:
We are able to use the extra information available
from AD thresholds to improve the discrimination of secondary structural
elements (α-helices and β-sheets) over previous methods
based on the Delaunay tessellation. |
|
| |
 |
The histogram distribution of
AD thresholds in a synthetic α-helix is striking; apart from
the Delaunay tetrahedra that follow the patterns i+(1,2,3,4)
and i+(1,2,4,5) as known in the literature, there are three
characteristic α-helical peaks at ε=0.3, 0.7 and
1.2. The table at left shows the patterns and their corresponding
threshold values. These patterns and peaks are seen in the histograms
of all α-helical proteins represented by C-αs, and can
be used for quantitative estimation of α-helix content as well
as to estimate packing (by counting non-pattern tetrahedra).
Decoys of α-helical proteins show sharp peaks, but they can
be distinguished by the relative lack of non-pattern tetrahedra.
Proteins represented by sidechain centroids do not show peaks, since
the centroids don't have the regular structure seen in backbone
coordinates. |
|
| |
 |
| |
 |
synthetic
α-helix |
|
 |
α-helical
protein (PXR) |
|
 |
β-sheet
protein (chymotrypsin) |
|
 |
C-αs |
|
|
|
|
| |
 |
| |
 |
Patterns based on sequence interval (and
in fact all the patterns that we have tried) do not show
a clear signature of values of the AD threshold,
in proteins with β-sheets. The pattern we currently
use for detecting β-sheets is that almost-Delaunay
tetrahedra that straddle two neighboring β-strands
tend have similar values of the maximum gap in sequence
(for parallel strands) or values distributed in an interval
(for antiparallel strands). Thus we look for peaks and plateaus
in the histogram of the maximum sequence gap, and among
the tetrahedra that fall in these areas, we determine the
neighbors in adjacent β-strands by a mutual maximum
frequency of occurrence search, and cluster parallel and
antiparallel sequences of residues to complete the β-sheet.
|
|
|
| |
 |
| |
 |
| Above: Here we compare the results
of α-helix and β-sheet determination using our methods
with those from DSSP
for a set of proteins from different structural families in
CATH.
The individual α-helices are accurate to +/- 2 residues
at each end, and the total to within 10%. The β-sheet
prediction is less accurate than α-helix, since our method
currently does not use the value of the AD threshold as a
part of the pattern, so the results are not always specific
to the geometry of the β-sheet. We are working on methods
for detecting β-turns and more complex motifs. |
|
|
 |