TY - JOUR
T1 - Bias Analysis for Misclassification Errors in both the Response Variable and Covariate
AU - Liu, Juxin
AU - Afful, Annshirley
AU - Mansell, Holly
AU - Ma, Yanyuan
N1 - Funding Information:
The first author’s work is supported by the Natural Sciences and Engineering Research Council of Canada. The second author’s work was supported by the trainee scholarship from the Saskatchewan Centre for Patient-Oriented Research. The last author’s work is partially supported by the National Science Foundation and National Institute of Health.
Publisher Copyright:
© 2022 American Statistical Association.
PY - 2022
Y1 - 2022
N2 - Abstract–Much literature has focused on statistical inference for misclassified response variables or misclassified covariates. However, misclassification in both the response variable and the covariate has received very limited attention within applied fields and the statistics community. In situations where the response variable and the covariate are simultaneously subject to misclassification errors, an assumption of independent misclassification errors is often used for convenience without justification. This article aims to show the harmful consequences of inappropriate adjustment for joint misclassification errors. In particular, we focus on the wrong adjustment by ignoring the dependence between the misclassification process of the response variable and the covariate. In this article, the dependence of misclassification in both variables is characterized by covariance-type parameters. We extend the original definition of dependence parameters to a more general setting. We discover a single quantity that governs the dependence of the two misclassification processes. Moreover, we propose likelihood ratio tests to check the nondifferential/independent misclassification assumption in main study/internal validation study designs. Our simulation studies indicate that ignoring the dependent error structure can be even worse than ignoring all the misclassification errors when the validation data size is relatively small. The methodology is illustrated by a real data example.
AB - Abstract–Much literature has focused on statistical inference for misclassified response variables or misclassified covariates. However, misclassification in both the response variable and the covariate has received very limited attention within applied fields and the statistics community. In situations where the response variable and the covariate are simultaneously subject to misclassification errors, an assumption of independent misclassification errors is often used for convenience without justification. This article aims to show the harmful consequences of inappropriate adjustment for joint misclassification errors. In particular, we focus on the wrong adjustment by ignoring the dependence between the misclassification process of the response variable and the covariate. In this article, the dependence of misclassification in both variables is characterized by covariance-type parameters. We extend the original definition of dependence parameters to a more general setting. We discover a single quantity that governs the dependence of the two misclassification processes. Moreover, we propose likelihood ratio tests to check the nondifferential/independent misclassification assumption in main study/internal validation study designs. Our simulation studies indicate that ignoring the dependent error structure can be even worse than ignoring all the misclassification errors when the validation data size is relatively small. The methodology is illustrated by a real data example.
UR - http://www.scopus.com/inward/record.url?scp=85130446946&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130446946&partnerID=8YFLogxK
U2 - 10.1080/00031305.2022.2066725
DO - 10.1080/00031305.2022.2066725
M3 - Article
AN - SCOPUS:85130446946
SN - 0003-1305
VL - 76
SP - 353
EP - 362
JO - American Statistician
JF - American Statistician
IS - 4
ER -