Preparatory Data Analysis
- 15 April 2003
- book chapter
- Published by Wiley
Abstract
Preparatory data analyses (data screening) are conducted before a main analysis to assess the fit between the data and the assumptions of that main analysis. Different main analyses have different assumptions that vary in importance; violation of some assumptions can lead to the wrong inferential conclusion (and a potential failure of replication) while violation of others yields an analysis that is correct as far as it goes, but misses certain additional relationships in the data. Assumptions that are often relevant for continuous variables are normality of sampling distributions, pairwise linearity, absence of outliers and collinearity, independence of errors, and homoscedasticity; these are evaluated by both graphical and statistical methods. When violation is detected, variables are often transformed or an alternative analytic strategy is employed. Relevant issues in the choice of when and how to screen are the level of measurement of the variables, whether the design produces grouped or ungrouped data, whether cases provide a single response or more than one response, and whether the variables themselves or the residuals of analysis are screened.
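As an illustration of the statistical side of screening described in the abstract, the following is a minimal sketch for a single continuous variable, not the chapter's own procedure. The function names, the z-score cutoff of 3.29, the skewness limit of 1.0, and the choice of a base-10 logarithm as the transformation are all assumptions made for this example.

```python
import math
import statistics


def skewness(values):
    """Skewness as the mean cubed z-score (population standard deviation)."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)


def z_score_outliers(values, cutoff=3.29):
    """Return the values whose absolute z-score exceeds the cutoff.

    The default cutoff is an assumed convention, not taken from the abstract.
    """
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)
    return [v for v in values if abs((v - mean) / sd) > cutoff]


def screen(values, cutoff=3.29, skew_limit=1.0):
    """Screen one variable and try a log transform if it is right-skewed.

    skew_limit is a hypothetical threshold chosen for illustration.
    """
    report = {
        "skew": skewness(values),
        "outliers": z_score_outliers(values, cutoff),
    }
    # A log transform is only defined for strictly positive values.
    if report["skew"] > skew_limit and min(values) > 0:
        transformed = [math.log10(v) for v in values]
        report["skew_after_log"] = skewness(transformed)
    return report
```

Calling `screen(scores)` on a list of positive scores returns the skewness, any flagged cases, and, where a transformation was tried, the skewness afterwards. In small samples the largest attainable z-score is bounded by the sample size, so a lower cutoff may be needed; graphical checks such as histograms and residual plots would complement these numbers.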