Multiple Regimes in Northern Hemisphere Height Fields via MixtureModel Clustering*

1 November 1999

journal article
research article
Published by American Meteorological Society in Journal of the Atmospheric Sciences

Vol. 56 (21) , 3704-3723
https://doi.org/10.1175/1520-0469(1999)056<3704:mrinhh>2.0.co;2

Abstract

A mixture model is a flexible probability density estimation technique, consisting of a linear combination of k component densities. Such a model is applied to estimate clustering in Northern Hemisphere (NH) 700-mb geopotential height anomalies. A key feature of this approach is its ability to estimate a posterior probability distribution for k, the number of clusters, given the data and the model. The number of clusters that is most likely to fit the data is thus determined objectively. A dataset of 44 winters of NH 700-mb fields is projected onto its two leading empirical orthogonal functions (EOFs) and analyzed using mixtures of Gaussian components. Cross-validated likelihood is used to determine the best value of k, the number of clusters. The posterior probability so determined peaks at k = 3 and thus yields clear evidence for three clusters in the NH 700-mb data. The three-cluster result is found to be robust with respect to variations in data preprocessing and data analysis parameters. The... Abstract A mixture model is a flexible probability density estimation technique, consisting of a linear combination of k component densities. Such a model is applied to estimate clustering in Northern Hemisphere (NH) 700-mb geopotential height anomalies. A key feature of this approach is its ability to estimate a posterior probability distribution for k, the number of clusters, given the data and the model. The number of clusters that is most likely to fit the data is thus determined objectively. A dataset of 44 winters of NH 700-mb fields is projected onto its two leading empirical orthogonal functions (EOFs) and analyzed using mixtures of Gaussian components. Cross-validated likelihood is used to determine the best value of k, the number of clusters. The posterior probability so determined peaks at k = 3 and thus yields clear evidence for three clusters in the NH 700-mb data. The three-cluster result is found to be robust with respect to variations in data preprocessing and data analysis parameters. The...

This publication has 11 references indexed in Scilit:

Low-Frequency Variability in a GCM: Three-Dimensional Flow Regimes and Their Dynamics
Journal of Climate, 1997
The Probability Density Distribution of the Planetary-Scale Atmospheric Wave Amplitude Revisited
Journal of the Atmospheric Sciences, 1995
Weather Regimes: Recurrence and Quasi Stationarity
Journal of the Atmospheric Sciences, 1995
Is There Evidence of Multiple Equilibria in Planetary Wave Amplitude Statistics?
Journal of the Atmospheric Sciences, 1994
Cluster Analysis of the Northern Hemisphere Wintertime 500-hPa Height Field: Spatial Patterns
Journal of the Atmospheric Sciences, 1993
Multiple Flow Regimes in the Northern Hemisphere Winter. Part II: Sectorial Regimes and Preferred Transitions
Journal of the Atmospheric Sciences, 1993
Multiple Flow Regimes in the Northern Hemisphere Winter. Part I: Methodology and Hemispheric Regimes
Journal of the Atmospheric Sciences, 1993
Cluster analysis of multiple planetary flow regimes
Journal of Geophysical Research: Atmospheres, 1988
Tropical Cyclone Forecast Errors and the Multimodal Bivariate Normal Distribution
Journal of Applied Meteorology, 1982
Blocking Action in the Middle Troposphere and its Effect upon Regional Climate
Tellus, 1950