TY - JOUR
T1 - Statistical identifiability and sample size calculations for serial seroepidemiology
AU - Vinh, Dao Nguyen
AU - Boni, Maciej F.
N1 - Funding Information:
Thanks to Neil Ferguson for critical feedback on this manuscript. Thanks to Tran Tinh Hien, Jeremy Farrar, Cameron Simmons, Huynh Le Anh Huy for help with the initial study setup. D.N.V. is supported by the Wellcome Trust ( 089276/B/09/7 ). M.F.B. is a Wellcome Trust/Royal Society Sir Henry Dale Fellow ( 098511/Z/12/Z ). Computing and hardware costs were funded by the Li Ka Shing Foundation – University of Oxford Global Health Program ( LG05 ). We are grateful to the British Medical Association for the initial funding for this project (HC Roscoe 2011).
Publisher Copyright:
© 2015 The Authors.
PY - 2015/9/1
Y1 - 2015/9/1
N2 - Inference on disease dynamics is typically performed using case reporting time series of symptomatic disease. The inferred dynamics will vary depending on the reporting patterns and surveillance system for the disease in question, and the inference will miss mild or underreported epidemics. To eliminate the variation introduced by differing reporting patterns and to capture asymptomatic or subclinical infection, inferential methods can be applied to serological data sets instead of case reporting data. To reconstruct complete disease dynamics, one would need to collect a serological time series. In the statistical analysis presented here, we consider a particular kind of serological time series with repeated, periodic collections of population-representative serum. We refer to this study design as a serial seroepidemiology (SSE) design, and we base the analysis on our epidemiological knowledge of influenza. We consider a study duration of three to four years, during which a single antigenic type of influenza would be circulating, and we evaluate our ability to reconstruct disease dynamics based on serological data alone. We show that the processes of reinfection, antibody generation, and antibody waning confound each other and are not always statistically identifiable, especially when dynamics resemble a non-oscillating endemic equilibrium behavior. We introduce some constraints to partially resolve this confounding, and we show that transmission rates and basic reproduction numbers can be accurately estimated in SSE study designs. Seasonal forcing is more difficult to identify as serology-based studies only detect oscillations in antibody titers of recovered individuals, and these oscillations are typically weaker than those observed for infected individuals. To accurately estimate the magnitude and timing of seasonal forcing, serum samples should be collected every two months and 200 or more samples should be included in each collection; this sample size estimate is sensitive to the antibody waning rate and the assumed level of seasonal forcing.
AB - Inference on disease dynamics is typically performed using case reporting time series of symptomatic disease. The inferred dynamics will vary depending on the reporting patterns and surveillance system for the disease in question, and the inference will miss mild or underreported epidemics. To eliminate the variation introduced by differing reporting patterns and to capture asymptomatic or subclinical infection, inferential methods can be applied to serological data sets instead of case reporting data. To reconstruct complete disease dynamics, one would need to collect a serological time series. In the statistical analysis presented here, we consider a particular kind of serological time series with repeated, periodic collections of population-representative serum. We refer to this study design as a serial seroepidemiology (SSE) design, and we base the analysis on our epidemiological knowledge of influenza. We consider a study duration of three to four years, during which a single antigenic type of influenza would be circulating, and we evaluate our ability to reconstruct disease dynamics based on serological data alone. We show that the processes of reinfection, antibody generation, and antibody waning confound each other and are not always statistically identifiable, especially when dynamics resemble a non-oscillating endemic equilibrium behavior. We introduce some constraints to partially resolve this confounding, and we show that transmission rates and basic reproduction numbers can be accurately estimated in SSE study designs. Seasonal forcing is more difficult to identify as serology-based studies only detect oscillations in antibody titers of recovered individuals, and these oscillations are typically weaker than those observed for infected individuals. To accurately estimate the magnitude and timing of seasonal forcing, serum samples should be collected every two months and 200 or more samples should be included in each collection; this sample size estimate is sensitive to the antibody waning rate and the assumed level of seasonal forcing.
UR - http://www.scopus.com/inward/record.url?scp=84940789117&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84940789117&partnerID=8YFLogxK
U2 - 10.1016/j.epidem.2015.02.005
DO - 10.1016/j.epidem.2015.02.005
M3 - Article
C2 - 26342240
AN - SCOPUS:84940789117
SN - 1755-4365
VL - 12
SP - 30
EP - 39
JO - Epidemics
JF - Epidemics
ER -