Use of Geocoding and Surname Analysis to Estimate Race and Ethnicity
- 5 May 2006
- journal article
- review article
- Published by Wiley in Health Services Research
- Vol. 41 (4p1) , 1482-1500
- https://doi.org/10.1111/j.1475-6773.2006.00551.x
Abstract
Objective. To review two indirect methods, geocoding and surname analysis, for estimating race/ethnicity as a means for health plans to assess disparities in care. Study Design. Review of published articles and unpublished data on the use of geocoding and surname analyses. Principal Findings. Few published studies have evaluated use of geocoding to estimate racial and ethnic characteristics of a patient population or to assess disparities in health care. Three of four studies showed similar estimates of the proportion of blacks and one showed nearly identical estimates of racial disparities, regardless of whether indirect or more direct measures (e.g., death certificate or CMS data) were used. However, accuracy depended on racial segregation levels in the population and region assessed and geocoding was unreliable for identifying Hispanics and Asians/Pacific Islanders. Similarly, several studies suggest surname analyses produces reasonable estimates of whether an enrollee is Hispanic or Asian/Pacific Islander and can identify disparities in care. However, accuracy depends on the concentrations of Asians or Hispanics in areas assessed. It is less accurate for women and more acculturated and higher SES persons due intermarriage, name changes, and adoption. Surname analysis is not accurate for identifying African Americans. Recent unpublished analyses suggest plans can successfully use a combined geocoding/surname analyses approach to identify disparities in care in most regions. Refinements based on Bayesian methods may make geocoding/surname analyses appropriate for use in areas where the accuracy is currently poor, but validation of these preliminary results is needed. Conclusions. Geocoding and surname analysis show promise for estimating racial/ethnic health plan composition of enrollees when direct data on major racial and ethnic groups are lacking. These data can be used to assess disparities in care, pending availability of self‐reported race/ethnicity data.Keywords
This publication has 54 references indexed in Scilit:
- A New Method for Estimating Race/Ethnicity and Associated Disparities Where Administrative Records Lack Self‐Reported Race/EthnicityHealth Services Research, 2008
- Health Care Organizations’ Use Of Race/Ethnicity Data To Address Quality DisparitiesHealth Affairs, 2005
- Limitations and potential uses of census-based data on ethnicity in a diverse communityAnnals of Epidemiology, 2004
- Monitoring Socioeconomic Inequalities in Sexually Transmitted Infections, Tuberculosis, and Violence: Geocoding and Choice of Area-Based Socioeconomic Measures—The Public Health Disparities Geocoding Project (US)Public Health Reports®, 2003
- Development and validation of a computerized South Asian Names and Group Recognition Algorithm (SANGRA) for use in British health-related studiesJournal of Public Health, 2001
- The Distribution of Survey Contact and Participation in the United States: Constructing a Survey-Based EstimateJournal of Marketing Research, 1999
- Classifying ethnicity utilizing the Canadian mortality data baseEthnicity & Health, 1997
- Surname analysis for estimating local concentration of Hispanics and AsiansPopulation Research and Policy Review, 1994
- Telephone Directory Listings of Presumptive Chinese SurnamesEpidemiology, 1990
- The classification of ethnic status using name information.Journal of Epidemiology and Community Health, 1988