Displaying the Important Features of Large Collections of Similar Curves
- 1 May 1992
- journal article
- research article
- Published by Taylor & Francis in The American Statistician
- Vol. 46 (2) , 140-145
- https://doi.org/10.1080/00031305.1992.10475870
Abstract
Naively displaying a large collection of curves by superimposing them one on another all on the same graph is largely uninformative and aesthetically unappealing. We propose that a simple principal component analysis be used to identify important modes of variation among the curves and that principal component scores be used to identify particular curves which clearly demonstrate the form and extent of that variation. As a result, we obtain a small number of figures on which are plotted a very few “representative” curves from the original collection; these successfully convey the major information present in sets of “similar” curves in a clear and attractive manner. Useful adjunct displays, including the plotting of principal component scores against covariates, are also described. Two examples—one concerning a data-based bandwidth selection procedure for kernel density estimation, the other involving ozone level curve data—illustrate the ideas.Keywords
This publication has 10 references indexed in Scilit:
- Estimating the Mean and Covariance Structure Nonparametrically When the Data are CurvesJournal of the Royal Statistical Society Series B: Statistical Methodology, 1991
- Some Tools for Functional Data AnalysisJournal of the Royal Statistical Society Series B: Statistical Methodology, 1991
- On optimal data-based bandwidth selection in kernel density estimationBiometrika, 1991
- Hyperdimensional Data Analysis Using Parallel CoordinatesJournal of the American Statistical Association, 1990
- Some Implementations of the BoxplotThe American Statistician, 1989
- Principal component analysis and interpolation of stochastic processes: methods and simulationJournal of Applied Statistics, 1987
- Principal Modes of Variation for Processes with Continuous Sample CurvesTechnometrics, 1986
- Principal Components Analysis of Sampled FunctionsPsychometrika, 1986
- Principal Component AnalysisPublished by Springer Nature ,1986
- Density Estimation for Statistics and Data AnalysisPublished by Springer Nature ,1400