Treating grouped data as continuous in alcohol consumption measures

1 June 1997

journal article
Published by Wiley in Addiction

Vol. 92 (6) , 667-672
https://doi.org/10.1111/j.1360-0443.1997.tb02926.x

Abstract

In quantity‐frequency methods used for self‐report measurement of alcohol intake (or other exposures), respondents mark the appropriate ranges, e.g. ‘5 to 8 drinks’, ‘5 or 6 times per week’. To calculate average consumption only single values, not ranges, can be multiplied, and midpoints are commonly used. This results in bias if the range lies in the tail of a distribution, as often happens with drinks per occasion. The same bias occurs when risk, for example, is plotted against consumption levels, which inevitably are grouped into ranges. Consequently, estimates of aggregate consumption can be exaggerated and curves of risk against exposure level can be misleading. A method is described to calculate a relatively unbiased representative value for a range, requiring only knowledge of the normal distribution table, the log‐normal distribution, and basic arithmetic. Part of the procedure is also useful for estimating percentile points in data that have been grouped differently, suck as income in dollar groups.

This publication has 4 references indexed in Scilit:

Risk of cardiovascular mortality in alcohol drinkers, ex-drinkers and nondrinkers
The American Journal of Cardiology, 1990
Agreement between two measures of alcohol consumption.
Journal of Studies on Alcohol, 1987
Reliability and Validity of Adolescent Self-Reported Drug Use in a Family-Based Study: A Methodological Report
International Journal of the Addictions, 1983
Is Alcohol Consumption Lognormally Distributed?
British Journal of Addiction, 1980