Displaying the Important Features of Large Collections of Similar Curves

1 May 1992

journal article
research article
Published by Taylor & Francis in The American Statistician

Vol. 46 (2) , 140-145
https://doi.org/10.1080/00031305.1992.10475870

Abstract

Naively displaying a large collection of curves by superimposing them one on another all on the same graph is largely uninformative and aesthetically unappealing. We propose that a simple principal component analysis be used to identify important modes of variation among the curves and that principal component scores be used to identify particular curves which clearly demonstrate the form and extent of that variation. As a result, we obtain a small number of figures on which are plotted a very few “representative” curves from the original collection; these successfully convey the major information present in sets of “similar” curves in a clear and attractive manner. Useful adjunct displays, including the plotting of principal component scores against covariates, are also described. Two examples—one concerning a data-based bandwidth selection procedure for kernel density estimation, the other involving ozone level curve data—illustrate the ideas.

Keywords

This publication has 10 references indexed in Scilit:

Estimating the Mean and Covariance Structure Nonparametrically When the Data are Curves
Journal of the Royal Statistical Society Series B: Statistical Methodology, 1991
Some Tools for Functional Data Analysis
Journal of the Royal Statistical Society Series B: Statistical Methodology, 1991
On optimal data-based bandwidth selection in kernel density estimation
Biometrika, 1991
Hyperdimensional Data Analysis Using Parallel Coordinates
Journal of the American Statistical Association, 1990
Some Implementations of the Boxplot
The American Statistician, 1989
Principal component analysis and interpolation of stochastic processes: methods and simulation
Journal of Applied Statistics, 1987
Principal Modes of Variation for Processes with Continuous Sample Curves
Technometrics, 1986
Principal Components Analysis of Sampled Functions
Psychometrika, 1986
Principal Component Analysis
Published by Springer Nature ,1986
Density Estimation for Statistics and Data Analysis
Published by Springer Nature ,1400