[R] Results of MCD estimators in MASS and rrcov

Valentin Todorov valentin.todorov at chello.at
Thu Jan 27 22:08:09 CET 2005


The two implementations use different consistency factors as well as
different small sample correction factors.

1. The search parts of both implementations produce the same result -
compare rrcov.mcd$best and mass.mcd$best.

2. The raw MCD covariance matrix is corrected as follows:

MASS:
 - Rousseeuw and Leroy (1987), p.259 (eq. 1.26)
 - Marazzi (1993) (or may be Rousseeuw and van Zomeren (1900) p.638 (eq A.9)

rrcov:
 - Croux and Haesbroeck (1999), Pison et.al. p. 337
 - Pison et.al. (2002), p.338

3. The reweighted (final) covariance matrix is corrected as follows:

MASS: no correction
rrcov: Pison et.al. (2002) p. 339

This explains the different covariance matrices.
As far as the location is concerned, in this particular case the raw MCD
estimates in MASS identify one additional outlier - observation 53, which is
discarded from the computation of the reweighted estimates.
Look at the following plots and judge yourself if this is an outlier or not:

  covPlot(hbk, mcd=rrcov.mcd, which="distance", id.n=15)
  covPlot(hbk, mcd=mass.mcd, which="distance", id.n=15)

valentin




More information about the R-help mailing list