Accuracy and repeatability of commercial geocoding

Eric A. Whitsel, Kathryn M. Rose, Joy L. Wood, Amenda C. Henley, Duanping Liao, Gerardo Heiss

Research output: Contribution to journalArticle

70 Citations (Scopus)

Abstract

The authors estimated accuracy and repeatability of commercial geocoding to guide vendor selection in the Life Course Socioeconomic Status, Social Context and Cardiovascular Disease study (2001-2002). They submitted 1,032 participant addresses (97% in Maryland, Minnesota, Mississippi, or North Carolina) to vendor A twice over 9 months and measured repeatability as agreement between levels of address matching, discordance (%) between statistical tabulation areas, and median distance (d, in meters) and bearing (θ, in degrees) between coordinates assigned on each occasion (Hoi = 1 → ni/n] = 180°). They also submitted 75 addresses of nearby air pollution monitors (77% urban/suburban; 69% residential/commercial) to vendors A and B and then measured accuracy by comparing vendor- and US Environmental Protection Agency (EPA)-assigned geocodes using the above measures. Repeatability of geocodes assigned by vendor A was high (kappa = 0.90; census block group discordance = 5%; d < 1 m; θ = 177°). The match rate for EPA monitor addresses was higher for vendor B versus A (88% vs. 76%), but discordance at census block group, tract, and county levels also was, respectively, 1.4-, 1.9-, and 5.0-fold higher for vendor B. Moreover, coordinates assigned by vendor B were further from those assigned by the EPA (d = 212 m vs. 149 m; θ = 131° vs. 171°). These findings suggest that match rates, repeatability, and accuracy should be used to guide vendor selection.

Original languageEnglish (US)
Pages (from-to)1023-1029
Number of pages7
JournalAmerican journal of epidemiology
Volume160
Issue number10
DOIs
StatePublished - Nov 15 2004

Fingerprint

Geographic Mapping
United States Environmental Protection Agency
Censuses
Mississippi
Air Pollution
Social Class
Cardiovascular Diseases

All Science Journal Classification (ASJC) codes

  • Epidemiology

Cite this

Whitsel, E. A., Rose, K. M., Wood, J. L., Henley, A. C., Liao, D., & Heiss, G. (2004). Accuracy and repeatability of commercial geocoding. American journal of epidemiology, 160(10), 1023-1029. https://doi.org/10.1093/aje/kwh310
Whitsel, Eric A. ; Rose, Kathryn M. ; Wood, Joy L. ; Henley, Amenda C. ; Liao, Duanping ; Heiss, Gerardo. / Accuracy and repeatability of commercial geocoding. In: American journal of epidemiology. 2004 ; Vol. 160, No. 10. pp. 1023-1029.
@article{a86a37d51adf4f0eb9de71d2c9023deb,
title = "Accuracy and repeatability of commercial geocoding",
abstract = "The authors estimated accuracy and repeatability of commercial geocoding to guide vendor selection in the Life Course Socioeconomic Status, Social Context and Cardiovascular Disease study (2001-2002). They submitted 1,032 participant addresses (97{\%} in Maryland, Minnesota, Mississippi, or North Carolina) to vendor A twice over 9 months and measured repeatability as agreement between levels of address matching, discordance ({\%}) between statistical tabulation areas, and median distance (d, in meters) and bearing (θ, in degrees) between coordinates assigned on each occasion (Ho:Σ i = 1 → n[θi/n] = 180°). They also submitted 75 addresses of nearby air pollution monitors (77{\%} urban/suburban; 69{\%} residential/commercial) to vendors A and B and then measured accuracy by comparing vendor- and US Environmental Protection Agency (EPA)-assigned geocodes using the above measures. Repeatability of geocodes assigned by vendor A was high (kappa = 0.90; census block group discordance = 5{\%}; d < 1 m; θ = 177°). The match rate for EPA monitor addresses was higher for vendor B versus A (88{\%} vs. 76{\%}), but discordance at census block group, tract, and county levels also was, respectively, 1.4-, 1.9-, and 5.0-fold higher for vendor B. Moreover, coordinates assigned by vendor B were further from those assigned by the EPA (d = 212 m vs. 149 m; θ = 131° vs. 171°). These findings suggest that match rates, repeatability, and accuracy should be used to guide vendor selection.",
author = "Whitsel, {Eric A.} and Rose, {Kathryn M.} and Wood, {Joy L.} and Henley, {Amenda C.} and Duanping Liao and Gerardo Heiss",
year = "2004",
month = "11",
day = "15",
doi = "10.1093/aje/kwh310",
language = "English (US)",
volume = "160",
pages = "1023--1029",
journal = "American Journal of Epidemiology",
issn = "0002-9262",
publisher = "Oxford University Press",
number = "10",

}

Whitsel, EA, Rose, KM, Wood, JL, Henley, AC, Liao, D & Heiss, G 2004, 'Accuracy and repeatability of commercial geocoding', American journal of epidemiology, vol. 160, no. 10, pp. 1023-1029. https://doi.org/10.1093/aje/kwh310

Accuracy and repeatability of commercial geocoding. / Whitsel, Eric A.; Rose, Kathryn M.; Wood, Joy L.; Henley, Amenda C.; Liao, Duanping; Heiss, Gerardo.

In: American journal of epidemiology, Vol. 160, No. 10, 15.11.2004, p. 1023-1029.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Accuracy and repeatability of commercial geocoding

AU - Whitsel, Eric A.

AU - Rose, Kathryn M.

AU - Wood, Joy L.

AU - Henley, Amenda C.

AU - Liao, Duanping

AU - Heiss, Gerardo

PY - 2004/11/15

Y1 - 2004/11/15

N2 - The authors estimated accuracy and repeatability of commercial geocoding to guide vendor selection in the Life Course Socioeconomic Status, Social Context and Cardiovascular Disease study (2001-2002). They submitted 1,032 participant addresses (97% in Maryland, Minnesota, Mississippi, or North Carolina) to vendor A twice over 9 months and measured repeatability as agreement between levels of address matching, discordance (%) between statistical tabulation areas, and median distance (d, in meters) and bearing (θ, in degrees) between coordinates assigned on each occasion (Ho:Σ i = 1 → n[θi/n] = 180°). They also submitted 75 addresses of nearby air pollution monitors (77% urban/suburban; 69% residential/commercial) to vendors A and B and then measured accuracy by comparing vendor- and US Environmental Protection Agency (EPA)-assigned geocodes using the above measures. Repeatability of geocodes assigned by vendor A was high (kappa = 0.90; census block group discordance = 5%; d < 1 m; θ = 177°). The match rate for EPA monitor addresses was higher for vendor B versus A (88% vs. 76%), but discordance at census block group, tract, and county levels also was, respectively, 1.4-, 1.9-, and 5.0-fold higher for vendor B. Moreover, coordinates assigned by vendor B were further from those assigned by the EPA (d = 212 m vs. 149 m; θ = 131° vs. 171°). These findings suggest that match rates, repeatability, and accuracy should be used to guide vendor selection.

AB - The authors estimated accuracy and repeatability of commercial geocoding to guide vendor selection in the Life Course Socioeconomic Status, Social Context and Cardiovascular Disease study (2001-2002). They submitted 1,032 participant addresses (97% in Maryland, Minnesota, Mississippi, or North Carolina) to vendor A twice over 9 months and measured repeatability as agreement between levels of address matching, discordance (%) between statistical tabulation areas, and median distance (d, in meters) and bearing (θ, in degrees) between coordinates assigned on each occasion (Ho:Σ i = 1 → n[θi/n] = 180°). They also submitted 75 addresses of nearby air pollution monitors (77% urban/suburban; 69% residential/commercial) to vendors A and B and then measured accuracy by comparing vendor- and US Environmental Protection Agency (EPA)-assigned geocodes using the above measures. Repeatability of geocodes assigned by vendor A was high (kappa = 0.90; census block group discordance = 5%; d < 1 m; θ = 177°). The match rate for EPA monitor addresses was higher for vendor B versus A (88% vs. 76%), but discordance at census block group, tract, and county levels also was, respectively, 1.4-, 1.9-, and 5.0-fold higher for vendor B. Moreover, coordinates assigned by vendor B were further from those assigned by the EPA (d = 212 m vs. 149 m; θ = 131° vs. 171°). These findings suggest that match rates, repeatability, and accuracy should be used to guide vendor selection.

UR - http://www.scopus.com/inward/record.url?scp=8444239741&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=8444239741&partnerID=8YFLogxK

U2 - 10.1093/aje/kwh310

DO - 10.1093/aje/kwh310

M3 - Article

C2 - 15522859

AN - SCOPUS:8444239741

VL - 160

SP - 1023

EP - 1029

JO - American Journal of Epidemiology

JF - American Journal of Epidemiology

SN - 0002-9262

IS - 10

ER -

Whitsel EA, Rose KM, Wood JL, Henley AC, Liao D, Heiss G. Accuracy and repeatability of commercial geocoding. American journal of epidemiology. 2004 Nov 15;160(10):1023-1029. https://doi.org/10.1093/aje/kwh310