The Variogram: a spatial decomposition of variance

Suppose that we have measured the variables z at N locations, whose geographical positions are denoted x¯i. Consider all pairs of data locations, of which there are Np=N(N-1)2.

Now each pair of data locations is placed into a "lag class", determined by a vector h¯ (the "lag"), which means that the two points are separated by (roughly) h¯. Define Nc classes, each described by a set Ph¯ of pairs of indices, such that (i,j)Ph¯x¯i-x¯jh¯.

The univariate variogram is defined as

γ(h¯)=12E[(z(x¯+h¯)-z(x¯))2]
and the estimator of the variogram function for lag h¯ is
γ(h¯)=12Nh¯(i,j)Ph¯Nh¯(zi-zj)2,
where Nh¯ is the number of distinct pairs of data values, placed in the set Ph¯ (pairs displaced by the vector h¯). γ(h¯) is sometimes called the semi-variogram, because of the factor of 1/2 in its definition.

Now the sample variance, s2, can be written as a weighted sum

s2=c=1Nc(Nh¯Np)(12Nh¯(i,j)Ph¯Nh¯(zi-zj)2),
or
s2=c=1Ncγ(h¯)(Nh¯Np)
It is this equation which leads me to declare that

The sample variogram is the spatial decomposition of the sample variance.