Multiple Regimes in Northern Hemisphere Height Fields via MixtureModel Clustering*
- 1 November 1999
- journal article
- research article
- Published by American Meteorological Society in Journal of the Atmospheric Sciences
- Vol. 56 (21) , 3704-3723
- https://doi.org/10.1175/1520-0469(1999)056<3704:mrinhh>2.0.co;2
Abstract
A mixture model is a flexible probability density estimation technique, consisting of a linear combination of k component densities. Such a model is applied to estimate clustering in Northern Hemisphere (NH) 700-mb geopotential height anomalies. A key feature of this approach is its ability to estimate a posterior probability distribution for k, the number of clusters, given the data and the model. The number of clusters that is most likely to fit the data is thus determined objectively. A dataset of 44 winters of NH 700-mb fields is projected onto its two leading empirical orthogonal functions (EOFs) and analyzed using mixtures of Gaussian components. Cross-validated likelihood is used to determine the best value of k, the number of clusters. The posterior probability so determined peaks at k = 3 and thus yields clear evidence for three clusters in the NH 700-mb data. The three-cluster result is found to be robust with respect to variations in data preprocessing and data analysis parameters. The... Abstract A mixture model is a flexible probability density estimation technique, consisting of a linear combination of k component densities. Such a model is applied to estimate clustering in Northern Hemisphere (NH) 700-mb geopotential height anomalies. A key feature of this approach is its ability to estimate a posterior probability distribution for k, the number of clusters, given the data and the model. The number of clusters that is most likely to fit the data is thus determined objectively. A dataset of 44 winters of NH 700-mb fields is projected onto its two leading empirical orthogonal functions (EOFs) and analyzed using mixtures of Gaussian components. Cross-validated likelihood is used to determine the best value of k, the number of clusters. The posterior probability so determined peaks at k = 3 and thus yields clear evidence for three clusters in the NH 700-mb data. The three-cluster result is found to be robust with respect to variations in data preprocessing and data analysis parameters. The...This publication has 11 references indexed in Scilit:
- Low-Frequency Variability in a GCM: Three-Dimensional Flow Regimes and Their DynamicsJournal of Climate, 1997
- The Probability Density Distribution of the Planetary-Scale Atmospheric Wave Amplitude RevisitedJournal of the Atmospheric Sciences, 1995
- Weather Regimes: Recurrence and Quasi StationarityJournal of the Atmospheric Sciences, 1995
- Is There Evidence of Multiple Equilibria in Planetary Wave Amplitude Statistics?Journal of the Atmospheric Sciences, 1994
- Cluster Analysis of the Northern Hemisphere Wintertime 500-hPa Height Field: Spatial PatternsJournal of the Atmospheric Sciences, 1993
- Multiple Flow Regimes in the Northern Hemisphere Winter. Part II: Sectorial Regimes and Preferred TransitionsJournal of the Atmospheric Sciences, 1993
- Multiple Flow Regimes in the Northern Hemisphere Winter. Part I: Methodology and Hemispheric RegimesJournal of the Atmospheric Sciences, 1993
- Cluster analysis of multiple planetary flow regimesJournal of Geophysical Research: Atmospheres, 1988
- Tropical Cyclone Forecast Errors and the Multimodal Bivariate Normal DistributionJournal of Applied Meteorology, 1982
- Blocking Action in the Middle Troposphere and its Effect upon Regional ClimateTellus, 1950