Modified mixed generalized ordered response model to handle misclassification in injury severity data

Lacramioara Balan, Rajesh Paleti

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Traditional crash databases that record police-reported injury severity data are prone to misclassification errors. Ignoring these errors in discrete ordered response models used for analyzing injury severity can lead to biased and inconsistent parameter estimates. In this study, a mixed generalized ordered response (MGOR) model that quantifies misclassification rates in the injury severity variable and adjusts the bias in parameter estimates associated with misclassification was developed. The proposed model does this by considering the observed injury severity outcome as a realization from a discrete random variable that depends on true latent injury severity that is unobservable to the analyst. The model was used to analyze misclassification rates in police-reported injury severity in the 2014 General Estimates System (GES) data. The model found that only 68.23% and 62.75% of possible and non-incapacitating injuries were correctly recorded in the GES data. Moreover, comparative analysis with the MGOR model that ignores misclassification not only has lower data fit but also considerable bias in both the parameter and elasticity estimates. The model developed in this study can be used to analyze misclassification errors in ordinal response variables in other empirical contexts.

Original languageEnglish (US)
Pages (from-to)53-63
Number of pages11
JournalTransportation Research Record
Volume2672
Issue number30
DOIs
StatePublished - Jan 1 2018

Fingerprint

Law enforcement
Random variables
Elasticity

All Science Journal Classification (ASJC) codes

  • Civil and Structural Engineering
  • Mechanical Engineering

Cite this

@article{43125143fe764bd38a965d67aa112f24,
title = "Modified mixed generalized ordered response model to handle misclassification in injury severity data",
abstract = "Traditional crash databases that record police-reported injury severity data are prone to misclassification errors. Ignoring these errors in discrete ordered response models used for analyzing injury severity can lead to biased and inconsistent parameter estimates. In this study, a mixed generalized ordered response (MGOR) model that quantifies misclassification rates in the injury severity variable and adjusts the bias in parameter estimates associated with misclassification was developed. The proposed model does this by considering the observed injury severity outcome as a realization from a discrete random variable that depends on true latent injury severity that is unobservable to the analyst. The model was used to analyze misclassification rates in police-reported injury severity in the 2014 General Estimates System (GES) data. The model found that only 68.23{\%} and 62.75{\%} of possible and non-incapacitating injuries were correctly recorded in the GES data. Moreover, comparative analysis with the MGOR model that ignores misclassification not only has lower data fit but also considerable bias in both the parameter and elasticity estimates. The model developed in this study can be used to analyze misclassification errors in ordinal response variables in other empirical contexts.",
author = "Lacramioara Balan and Rajesh Paleti",
year = "2018",
month = "1",
day = "1",
doi = "10.1177/0361198118796352",
language = "English (US)",
volume = "2672",
pages = "53--63",
journal = "Transportation Research Record",
issn = "0361-1981",
publisher = "US National Research Council",
number = "30",

}

Modified mixed generalized ordered response model to handle misclassification in injury severity data. / Balan, Lacramioara; Paleti, Rajesh.

In: Transportation Research Record, Vol. 2672, No. 30, 01.01.2018, p. 53-63.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Modified mixed generalized ordered response model to handle misclassification in injury severity data

AU - Balan, Lacramioara

AU - Paleti, Rajesh

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Traditional crash databases that record police-reported injury severity data are prone to misclassification errors. Ignoring these errors in discrete ordered response models used for analyzing injury severity can lead to biased and inconsistent parameter estimates. In this study, a mixed generalized ordered response (MGOR) model that quantifies misclassification rates in the injury severity variable and adjusts the bias in parameter estimates associated with misclassification was developed. The proposed model does this by considering the observed injury severity outcome as a realization from a discrete random variable that depends on true latent injury severity that is unobservable to the analyst. The model was used to analyze misclassification rates in police-reported injury severity in the 2014 General Estimates System (GES) data. The model found that only 68.23% and 62.75% of possible and non-incapacitating injuries were correctly recorded in the GES data. Moreover, comparative analysis with the MGOR model that ignores misclassification not only has lower data fit but also considerable bias in both the parameter and elasticity estimates. The model developed in this study can be used to analyze misclassification errors in ordinal response variables in other empirical contexts.

AB - Traditional crash databases that record police-reported injury severity data are prone to misclassification errors. Ignoring these errors in discrete ordered response models used for analyzing injury severity can lead to biased and inconsistent parameter estimates. In this study, a mixed generalized ordered response (MGOR) model that quantifies misclassification rates in the injury severity variable and adjusts the bias in parameter estimates associated with misclassification was developed. The proposed model does this by considering the observed injury severity outcome as a realization from a discrete random variable that depends on true latent injury severity that is unobservable to the analyst. The model was used to analyze misclassification rates in police-reported injury severity in the 2014 General Estimates System (GES) data. The model found that only 68.23% and 62.75% of possible and non-incapacitating injuries were correctly recorded in the GES data. Moreover, comparative analysis with the MGOR model that ignores misclassification not only has lower data fit but also considerable bias in both the parameter and elasticity estimates. The model developed in this study can be used to analyze misclassification errors in ordinal response variables in other empirical contexts.

UR - http://www.scopus.com/inward/record.url?scp=85060895007&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85060895007&partnerID=8YFLogxK

U2 - 10.1177/0361198118796352

DO - 10.1177/0361198118796352

M3 - Article

AN - SCOPUS:85060895007

VL - 2672

SP - 53

EP - 63

JO - Transportation Research Record

JF - Transportation Research Record

SN - 0361-1981

IS - 30

ER -