Optimal Spatial Prediction with Kriging

Robert Nishihara · April 21, 2013

Suppose we are modeling a spatial process (for instance, the amount of rainfall around the world, the distribution of natural resources, or the population density of an endangered species). We've measured the latent function Z at some locations s1,,sN, and we'd like to predict the function's value at some new location s0. Kriging is a technique for extrapolating our measurements to arbitrary locations. For an in-depth discussion, see Cressie and Wikle (2011). Here I derive Kriging in a simplified case.

I will assume that Z is an intrinsically stationary process. In other words, there exists some semivariogram γ(h) such that

var[Z(s+h)Z(s)]=2γ(h).

Furthermore, I will assume that the process is isotropic, (i.e. that γ(h) is a function only of ||h||). As Andy described here, the existence of a covariance function implies intrinsic stationarity. In addition, I will assume that the process has a constant mean, E[Z(s)]=μ. We would like to estimate Z(s) with a linear combination of our current observations. Our estimator will be

Z^=n=1NλnZ(sn),

where the weights λn can be positive or negative. We further require that nλn=1 so that our estimate is unbiased. We would like to choose the weights λn so as to minimize the mean-squared predictive error

MSPE(λ1,,λN)=E[(Z^Z(s))2].

Let γnm denote γ(snsm). Expanding the expression for the mean-squared predictive error, we get

n,mλnλmE[Z(sn)Z(sm)]+E[Z(s0)2]2nλnE[Z(sn)Z(s0)].

Adding and subtracting nλnE[Z(sn)2], this expression breaks into A+B, where

A=nλnE[Z(sn)2]+n,mλnλmE[Z(sn)Z(sm)]=12n,mλnλmE[(Z(sn)Z(sm))2]=n,mλnλmγnm,

and

B=E[Z(s0)2]2nλnE[Z(sn)Z(s0)]+nλnE[Z(sn)2]=nλnE[(Z(s0)Z(sn))2]=2nλnγ0n.

We minimize the quantity A+B subject to the constraint nλn=1 using Lagrange multipliers. To simplify the notation, define the matrix Γ by Γnm=γnm, the vector λ=(λ1,,λN)T, the vector γ0=(γ01,,γ0N)T, and the vector 1=(1,,1)T. Then the mean-squared predictive error is given by

λTΓλ+2λTγ0.

Incorporating the Lagrange multiplier constraint, we have the quantity

Φ(λ,α)=λTΓλ+2λTγ0α(1Tλ1).

Differentiating Φ with respect to α gives back our constraint. Differentiating with respect to each λn and concatenating the resulting equations into matrix form gives

2Γλ+2γ0=α1λ=Γ1(γ0α1/2).

Incorporating the constraint gives

1=1Tλ=1TΓ1(γ0α1/2).

Solving for α and plugging this back into our formula for λ, we find that

λ=Γ1(γ01TΓ1γ011TΓ111).

This gives us our optimal Kriging predictor.

Twitter, Facebook