Multivariate analysis by data depth: descriptive statistics, graphics and inference, (with discussion and a rejoinder by Liu and Singh)
Open Access
- 1 June 1999
- journal article
- Published by Institute of Mathematical Statistics in The Annals of Statistics
- Vol. 27 (3) , 783-858
- https://doi.org/10.1214/aos/1018031260
Abstract
A data depth can be used to measure the “depth” or “outlyingness” of a given multivariate sample with respect to its underlying distribution. This leads to a natural center-outward ordering of the sample points. Based on this ordering, quantitative and graphical methods are introduced for analyzing multivariate distributional characteristics such as location, scale, bias, skewness and kurtosis, as well as for comparing inference methods. All graphs are one-dimensional curves in the plane and can be easily visualized and interpreted. A “sunburst plot” is presented as a bivariate generalization of the box-plot. DD-(depth versus depth) plots are proposed and examined as graphical inference tools. Some new diagnostic tools for checking multivariate normality are introduced. One of them monitors the exact rate of growth of the maximum deviation from the mean, while the others examine the ratio of the overall dispersion to the dispersion of a certain central region. The affine invariance property of a data depth also leads to appropriate invariance properties for the proposed statistics and methods.Keywords
This publication has 57 references indexed in Scilit:
- Skewness for multivariate distributions: two approachesThe Annals of Statistics, 1997
- Multivariate density estimation by probing depthPublished by Institute of Mathematical Statistics ,1997
- Breakdown Properties of Location Estimates Based on Halfspace Depth and Projected OutlyingnessThe Annals of Statistics, 1992
- Hyperdimensional Data Analysis Using Parallel CoordinatesJournal of the American Statistical Association, 1990
- A Multivariate Generalization of Quantile-Quantile PlotsJournal of the American Statistical Association, 1990
- On a Notion of Data Depth Based on Random SimplicesThe Annals of Statistics, 1990
- Descriptive statistics for multivariate distributionsStatistics & Probability Letters, 1983
- Descriptive Statistics for Nonparametric Models I. IntroductionThe Annals of Statistics, 1975
- The Use of Faces to Represent Points in k-Dimensional Space GraphicallyJournal of the American Statistical Association, 1973
- A General Definition of the Lorenz CurveEconometrica, 1971