Test?retest errors and the apparent heterogeneity of training response

Abstract
Published reports have shown large apparent inter-individual differences of gains in maximal oxygen intake (O2max) in response to a standard 20-week programme of aerobic conditioning that progressed to 75% of the individual’s initialO2max. The observed gains ofO2max ranged from 0 to 1,000 ml min−1, with a coefficient of variation (CV) of 8.4%. The present analysis evaluates the potential contribution of test–retest errors to these apparent large inter-individual differences in training response. The 2-day test–retest CV forO2max readings in 742 healthy adults was initially 5.0%, dropping to 4.1% after training. Published training responses were estimated from the mean of paired measurements obtained before and after training if readings agreed by 5%. Taking account of the relative proportions of single and paired observations, the weightedO2max data for the entire sample had an effective 2-day CV of 4.3% before and 3.4% after training. Assumption 1: if the 20-week test–retest error remained similar to the 2-day figure, measurement error would contribute a CV of 5.5% to apparent training responses, or (for the stated initial meanO2max of 2,409 ml min−1) an SD of 132 ml min−1. Assumption 2: if the 20-week CV was similar to that in other long-term studies (~5%), measurement error would contribute a CV of 6.1%, or a SD of 146 ml min−1. The published data show a total SD of 202 ml min−1 for apparent inter-individual differences in training response, with age, gender, race and baselineO2max accounting for only 11% of this variance. After estimating the likely effect of test–retest measurement errors, the SD due to inter-individual differences would decrease to 138 ml minO2max (assumption 1) or 123 ml min−1 (assumption 2). We conclude that when estimating the extent of inter-individual differences in training response, allowance must be made not only for the minor effects of recognized covariates (age, gender, race and initial fitness), but also for the larger influence of test-retest measurement errors. Nevertheless, substantial inter-individual differences persist after making such adjustments. The most likely explanation of these differences is a familial aggregation of training responses.