Deriving conditional multivariate normal distribution

Conditional distribution when multivariate normal random vector is conditioned on its partition.

Author

Eunki Chung

Published

May 19, 2023

Let random vector x follows a multivariate normal distribution xN(μ,Σ). Consider the conditional distribution of x1|x2 where x1 and x2 are partitions of the random vector x. We first create partitions as follows.

x=[x1x2]μ=[μ1μ2]Σ=[Σ11Σ12Σ21Σ22]

Then, x1|x2N(μ~,Σ~) where μ~ and Σ~ is given by,

μ~=μ1+Σ12Σ221(x2μ2) Σ~=Σ11Σ12Σ221Σ21

Proof

It is known that conditional distribution of multivaritate normal, conditioned on a subset of variables are again multivariate normal. Thus, we can verify the above theorem by deriving the conditional mean and covariace. Assuming Σ22 is invertible, define a new variable z=x1+Ax2, where A=Σ12Σ221.

First notice that z and x2 has zero covariance, Cov(z, x2)=Cov(x1+Ax2, x2)=Cov(x1, x2)+Cov(Ax2,x2)=Cov(x1, x2)+ACov(x2,x2)=Σ12+AΣ22=Σ12+Σ12Σ221Σ22=Σ12Σ12=0

Since z and x are jointly normal and uncorrelated, they are independent by the property of mulitvariate normal. Now we can easily find the conditional mean as follows.

E(x1|x2)=E(zAx2|x2)=E(z|x2)AE(x2|x2)=E(z)Ax2(x2z)=E(x1+Ax2)Ax2=E(x1)+AE(x2)Ax2=μ1+Aμ2Ax2=μ1A(x2μ2)=μ1+Σ12Σ221(x2μ2)=μ~

Before jumping into conditional variance. Recall the following facts for random vectors x, y and non-random matrix A,B,

Var(x+y)=Var(x)+Var(y)+Cov(x, y)+Cov(y, x)Cov(Ax,By)=ACov(x,y)BT

Using the above fact, we can expand the conditional variance as follows.

Var(x1|x2)=Var(zAx2|x2)=Var(z|x2)+Var(Ax2|x2)+Cov(z, Ax2|x2)+Cov(Ax2, z|x2)

Since Ax2|x2 is not random anymore given x2, we are only left with the first term.

=Var(z|x2)=Var(z)(x2z)=Var(x1+Ax2)=Var(x1)+Var(Ax2)+Cov(x1, Ax2)+Cov(Ax2, x1)=Var(x1)+AVar(x2)AT+Cov(x1, x2)AT+ACov(x2, x1)=Σ11+AΣ22AT+Σ12AT+AΣ21=Σ11AΣ22(Σ12Σ221)T+AΣ21+AΣ21=Σ11AΣ22Σ221Σ21+2AΣ21=Σ11AΣ21+2AΣ21=Σ11+AΣ21=Σ11Σ12Σ221Σ21=Σ~

Therefore, the proof is done.

References

(https://stats.stackexchange.com/users/4856/macro), Macro. n.d. “Deriving the Conditional Distributions of a Multivariate Normal Distribution.” Cross Validated. https://stats.stackexchange.com/q/30600.