chap-04 4/6/2004 17: 22 page 98
98 GEOMETRIC MORPHOMETRICS FOR BIOLOGISTS
We can take this example a step further by specifying that all the chairs are located along
walls of the room, with every chair touching the wall. Now, the X- and Y-coordinates
can be replaced by the distance (L) around the perimeter of the room from the door to the
notebook, and the direction of the measurement (clockwise or counter-clockwise). If we
agree that distances around a perimeter are always measured in the same direction, then the
value of L is sufficient to describe the location of the notebook. The additional constraints
(chairs against the wall, perimeter measured in clockwise direction) have reduced the
degrees of freedom from two (X and Y) to one (L). We have not actually eliminated
either X or Y; rather, we have merely replaced that pair by L. Nor have we lost any
information; given L, and the direction in which L is measured, as well as the height of
the chairs, we can reconstruct the original three Cartesian coordinates (X, Y, and Z)of
the notebook.
In the case of two-dimensional shapes, we start out with K landmarks in two dimensions,
so we have 2K coordinates, which constitute 2K independent measurements (because each
coordinate is independent of the others, in principle). In the course of superimposing the
shapes on the reference form, we perform three operations: (1) we center the matrix on the
centroid, thereby losing two degrees of freedom; (2) we set centroid size to one, thereby
losing another; and (3) we compute the angle through which to rotate the specimen, thereby
losing one more. By the end, we have lost four degrees of freedom as a consequence of
applying these constraints to the data. However, unlike the notebook example, we still
have 2K variable coordinates in our data matrix; none of them have been removed or
constrained. We have not lost degrees of freedom by removing coordinates, because the
loss of degrees of freedom is shared by all coordinates – each coordinate has lost some
fraction of a degree of freedom because each is partially constrained by the operations of
centering, scaling and rotation. Consequently, we have too many variable coordinates for
the degrees of freedom. The primary advantage of the thin-plate spline methods (discussed
in Chapter 6) is that we can work with 2K −4 variables, so that the number of variables
and the number of degrees of freedom are the same.
Summary
Because there are several different morphometric spaces and distances, some with only
slightly different names, we summarize them below.
The configuration space is the set of all matrices representing landmark configurations
that have the same number of landmarks and coordinates. This space has K ×M
dimensions, where K is the number of landmarks and M is the number of coordinates.
The pre-shape space is the set of all K ×M configurations with a centroid size of one,
centered at the origin. This space is the surface of a hypersphere of radius one. Because of
the centering, configurations that differ only in position are represented as the same point in
pre-shape space. Similarly, because of the scaling, configurations that differ only in centroid
size are represented by the same point in pre-shape space. Consequently, this space has
KM –(M +1) dimensions; M dimensions are lost due to centering, and one dimension is
lost due to scaling. In pre-shape space, the set of all configurations that may be converted
into one another by rotation lies along a circular arc called a fiber, which lies on the surface
of the pre-shape hypersphere. The distance between shapes in pre-shape space is the length
of the shortest arc across the surface connecting the fibers representing those shapes, and