117
Automated Protein NMR Structure Determination in Solution
For using the routine REFINE the following parameters have
to be set:
1. Set parameters as for RELAX.
2. Set parameters for the error calculation. Here, one can choose
whether minimal error bounds plus an optional user error
should be selected (recommended for final cycles of automated
structure calculation, or fully user defined error bounds, such as
a certain percentage of the restraint distance, should be used).
AUREMOL provides an interface for performing structure calcu-
lations. Necessary input files are automatically created by
AUREMOL. The structure calculations itself are done by exter-
nal programs such as CYANA or CNS, whereas the analysis of the
resulting structures is again performed within AUREMOL.
One of the most important steps in any structure determination
project is the validation of the final and/or intermediate struc-
tures. Often the quality of an NMR structure is mainly judged by
factors such as RMSD values or the quality of the Ramachandran
plot. However, these methods do not provide a direct measure of
how well the calculated structures fit the experimental data.
Therefore, we have implemented the program RFAC (61, 77) in
AUREMOL, which automatically calculates R-factors for protein
NMR structures to provide such a measure. The automated R-factor
analysis envisaged here consists, in principle, of two separate
parts: (1) a comparison of the experimental NOESY spectrum
with the NOESY spectrum back-calculated from a given struc-
ture, and (2) the calculation of the R-factor(s) from the data. In
the first part, the NOESY spectrum has to be calculated from the
trial structure or a bundle of trial structures using the resonance
line assignments of the side- and main-chain atoms. For the algo-
rithm to work properly, these assignments have to be complete or
almost complete. In our implementation, we use the full relax-
ation matrix approach of the AUREMOL module RELAX to
obtain accurate simulated peaks defined by their positions, inten-
sities, and line shapes. The corresponding experimental NOESY
spectrum is as described above automatically peak picked and
integrated in the preprocessing stage of AUREMOL. In addition,
the probabilities p
i
of the peaks i to be true NMR signals and not
noise or artifact peaks are also calculated according to Bayes’ the-
orem and are used as weighting factors during the calculation of
the R-factors. For the purpose of R-factor calculation, the experi-
mental data are automatically assigned based on the correspond-
ing simulated spectrum and the sequential resonance line
assignment. Note that in difference to KNOWNOE only assign-
ments are made that could be expected from the trial structure.
The AUREMOL routine SHIFTOPT (78) is used in this process
3.10. Structure
Calculation
3.11. Structure
Validation by NMR-
R-Factor Calculations