
17.10 Weighted Ridge Regression 587
Fig. 17.9. Movements in the least square and damped least square i terative process
away from the actual answer. But convergence is very slow near the a ctual
answer. Ridge regression has qualities of both the approaches i.e., it converges
very fast near the actual answer and it’s radius of convergence is reasonably
high. It means even if the initial guess is poor i.e., the distance between the
m
Prior
and m
true
is high, r idge regression can drag the model towards the
actual answer. Larger the number of parameters, lesser will be the radius of
convergence. Data inadequacy and data inaccuracy has direct relation with
the radius o f convergence. Choice of the value of K is dependent upon the
interpreter. Starting value of K can be anything between 10.0, 1.00, 0.01,
0.001 as suggested by Marquardt (1963). But as the iterative solution con-
verges, the value of K must be successively lowered down till its value becomes
negligible. Many interpreters used variance – covariance values instead o f a
pure number as Marquardt’s coefficient (Tarantola 1987, Menke, 1984).
17.10 Weighted Ridge Regression
In most of the scientific work we see that some of the experimental data in any
experiment are less reliable than the others. This is quite common in geophys-
ical field data analysis. It means that the data variances are not all equal. In
other words the matrix Var (ε)(Variance(ε) is not in the form of Iσ
2
where I
is the identity matrix and σ
2
is the variance (square of the standard deviation)
in the data. But Var (ε) is diagonally dominated matrix with unequal diago-
nal elements. It happens in some problems that the off diagonal elements of
Var (ε) are not zero, i.e., the observations are correlated. When either or both
of these occur, the general least squares estimator (17.104) is not valid and it
is necessary to change the procedure for obtaining the estimator. Draper and
Smith (1968 ) suggested that one has to transfer the observation
Y=Xβ + ε (17.105)
to another variable Z in a different plane which do satisfy the basic conditions
of linear regression and one can write