Positional error in automated geocoding of residential addresses
Open Access
- 1 January 2003
- journal article
- Published by Springer Nature in International Journal of Health Geographics
- Vol. 2 (1) , 10
- https://doi.org/10.1186/1476-072X-2-10
Abstract
Public health applications using geographic information system (GIS) technology are steadily increasing. Many of these rely on the ability to locate where people live with respect to areas of exposure from environmental contaminants. Automated geocoding is a method used to assign geographic coordinates to an individual based on their street address. This method often relies on street centerline files as a geographic reference. Such a process introduces positional error in the geocoded point. Our study evaluated the positional error caused during automated geocoding of residential addresses and how this error varies between population densities. We also evaluated an alternative method of geocoding using residential property parcel data. Positional error was determined for 3,000 residential addresses using the distance between each geocoded point and its true location as determined with aerial imagery. Error was found to increase as population density decreased. In rural areas of an upstate New York study area, 95 percent of the addresses geocoded to within 2,872 m of their true location. Suburban areas revealed less error where 95 percent of the addresses geocoded to within 421 m. Urban areas demonstrated the least error where 95 percent of the addresses geocoded to within 152 m of their true location. As an alternative to using street centerline files for geocoding, we used residential property parcel points to locate the addresses. In the rural areas, 95 percent of the parcel points were within 195 m of the true location. In suburban areas, this distance was 39 m while in urban areas 95 percent of the parcel points were within 21 m of the true location. Researchers need to determine if the level of error caused by a chosen method of geocoding may affect the results of their project. As an alternative method, property data can be used for geocoding addresses if the error caused by traditional methods is found to be unacceptable.Keywords
This publication has 15 references indexed in Scilit:
- Childhood cancer incidence rates and hazardous air pollutants in California: an exploratory analysis.Environmental Health Perspectives, 2003
- Geocoding and Monitoring of US Socioeconomic Inequalities in Mortality and Cancer Incidence: Does the Choice of Area-based Measure and Geographic Level Matter?: The Public Health Disparities Geocoding ProjectAmerican Journal of Epidemiology, 2002
- Zip Code Caveat: Bias Due to Spatiotemporal Mismatches Between Zip Codes and US Census–Defined Geographic Areas—The Public Health Disparities Geocoding ProjectAmerican Journal of Public Health, 2002
- Locational uncertainty in georeferencing public health datasetsJournal of Exposure Science & Environmental Epidemiology, 2001
- Evaluation of spatial filters to create smoothed maps of health dataStatistics in Medicine, 2000
- Examining associations between childhood asthma and traffic flow using a geographic information system.Environmental Health Perspectives, 1999
- Breast Cancer Risk and Residence near Industry or Traffic in Nassau and Suffolk Counties, Long Island, New YorkArchives of environmental health, 1996
- EXPLORATORY SPATIAL ANALYSIS OF BIRTH DEFECT RATES IN AN URBAN POPULATIONStatistics in Medicine, 1996
- Spatial disease clusters: Detection and inferenceStatistics in Medicine, 1995
- Risk of Congenital Malformations Associated with Proximity to Hazardous Waste SitesAmerican Journal of Epidemiology, 1992