162 5. Scaling in Financial Data and in Physics
In order to appreciate the subsequent discussion, let us look at two un-
correlated time series δs
(1)
(t)andδs
(2)
(t), each of length T (and zero mean,
unit variance, of course). From (5.117), we have
C(1, 2) =
1
T
T
t=1
δs
(1)
(t)δs
(2)
(t) . (5.120)
C(1, 2) is the sum of T random variables with zero mean. Despite the ab-
sence of correlations (by construction) between the two time series, for finite
T , C(1, 2) is a random variable itself and different from zero. C(1, 2) is drawn
from a distribution with zero mean and a standard deviation decreasing as
1/
√
T . Only in the limit T →∞will C(1, 2) → 0, as is appropriate for uncor-
related random variables. The finite time scale T , over which the correlations
between the two time series are determined, produces a noise dressing of the
correlation coefficient. More specifically, for two independent time series of
length T of normally distributed random numbers ε
i
(t) with zero mean and
unit variance, the correlation coefficient again is a random number [120]
ε
i
(t)ε
j
(t) = δ
ij
+
1+δ
ij
T
ε
i
(t) . (5.121)
The finite-length autocorrelation is a random normally distributed variable
with mean unity and variance 2/T , and the cross-correlation is a random
normally distributed variable with zero mean and variance 1/T.
For correlation matrices where many time series enter, noise dressing may
be a severe effect. N time series with T entries each may be grouped into an
N ×T random matrix M, and the correlation matrix is written as C = T
−1
M · M where M is the transpose of M. In the same way as noise dressing
for finite T produced an artificial finite random value for C(1, 2), for finite T ,
noise dressing will produce artificial finite random entries C(γ, δ) in the corre-
lation matrix. Figure 5.26 demonstrates this effect: the correlation matrix C
of 40 uncorrelated time series is random when the time series is only 10 steps
long (left panel). The absence of correlations C(γ,δ)=δ
γ,δ
is well visible for
1000 time steps (right panel). The two panels of Fig. 5.26 are consistent with
(5.121). For T = 10, the autocorrelation is a Gaussian variable with mean
unity and standard deviation 0.48, and the cross-correlation coefficients are
Gaussians with mean zero and standard deviations of 0.32. For T = 1000,
the mean values are the same but standard deviations have decreased by
one order of magnitude. Roughly, for N time series, T N time steps are
required in the series in order to produce statistically significant correlation
matrices.
Random matrix theory predicts the spectrum of eigenvalues λ of a ran-
dom matrix (of the type appropriate for financial markets [121, 122]) to be
bounded and distributed according to a density
ρ(λ)=
Q
2πσ
2
(λ
max
− λ)(λ − λ
min)
λ
,