104 Gronwald and Kalbitzer
subsequent stages. A simple method for significantly reducing the
number of noise and artifact peaks is the exclusion of areas from
the peak search where no meaningful resonances can be expected.
Such spectral areas include regions outside the spectral range of
the molecule under investigation and spectral regions where reso-
nance peaks cannot be separated from artifact peaks (e.g., near
the water t
1
-ridge). In programs such as AURELIA (50) and
AUREMOL, these spectral regions can be defined interactively by
the user. (2) Additional information can be derived from the line
shape itself. With a segmentation procedure, the n-dimensional
line widths can be determined and peaks with very small line
widths (i.e., noise spikes) or very large line widths (ridges and
baseline rolls) can be automatically removed (51). (3) A Bayesian
approach coupled to a multivariate linear discriminant analysis of
the data (52) can be used as a generally applicable method for the
automated classification of multidimensional NMR peaks. The
analysis relies on the assumption that different signal classes have
different distributions of specific properties such as line shapes,
line widths, and intensities. In addition, a nonlocal feature is
included that takes into account the similarities of peak shapes in
symmetry-related positions. The calculated probabilities for the
different signal class memberships are realistic and reliable with a
high efficiency of discriminating between peaks that are true signals
and those that are not (53) (see also Notes 11–13).
The basis for macromolecular structure determination in solution
is still given by distance information from multidimensional NOE
data. As a consequence, automated routines for NOE integration
are required. Accurate integration of spectral cross-peaks demands
a reliable definition of the cross-peak area. However, such a
definition is always a compromise between requirements that the
integration area be as large as possible so that a complete integra-
tion is obtained, and also, as small as possible to reduce the
influence from artifacts associated with baseline rolls and tails of
other peaks. A similar approach defines the peak integration area
using an iterative “region-growing” algorithm (44, 51, 54), which
recognizes all data points that are part of a given cross-peak; the
integration can be performed based on a user-defined threshold
level. In AUREMOL, this threshold is defined relative to the
maximum value of the peak to ensure that the relative volumes are
directly proportional to the strength of interaction. This automatic
integration procedure works surprisingly well even for overlapping
peaks as long as the peak maxima are separately visible and there-
fore recognizable by the peak picking procedure. In a different
approach, peaks are fitted by a set of reference peaks defined by the
user (48, 55). This approach is probably best suited in cases where
peaks strongly overlap; however, it demands a careful selection of
the reference peaks by the user and is therefore not applicable for
fully automated applications (see also Notes 14–16).
3.4.2. Signal Integration