Statistical identifiability and sample size calculations for serial seroepidemiology

Dao Nguyen Vinh, Maciej F. Boni

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

Inference on disease dynamics is typically performed using case reporting time series of symptomatic disease. The inferred dynamics will vary depending on the reporting patterns and surveillance system for the disease in question, and the inference will miss mild or underreported epidemics. To eliminate the variation introduced by differing reporting patterns and to capture asymptomatic or subclinical infection, inferential methods can be applied to serological data sets instead of case reporting data. To reconstruct complete disease dynamics, one would need to collect a serological time series. In the statistical analysis presented here, we consider a particular kind of serological time series with repeated, periodic collections of population-representative serum. We refer to this study design as a serial seroepidemiology (SSE) design, and we base the analysis on our epidemiological knowledge of influenza. We consider a study duration of three to four years, during which a single antigenic type of influenza would be circulating, and we evaluate our ability to reconstruct disease dynamics based on serological data alone. We show that the processes of reinfection, antibody generation, and antibody waning confound each other and are not always statistically identifiable, especially when dynamics resemble a non-oscillating endemic equilibrium behavior. We introduce some constraints to partially resolve this confounding, and we show that transmission rates and basic reproduction numbers can be accurately estimated in SSE study designs. Seasonal forcing is more difficult to identify as serology-based studies only detect oscillations in antibody titers of recovered individuals, and these oscillations are typically weaker than those observed for infected individuals. To accurately estimate the magnitude and timing of seasonal forcing, serum samples should be collected every two months and 200 or more samples should be included in each collection; this sample size estimate is sensitive to the antibody waning rate and the assumed level of seasonal forcing.

Original languageEnglish (US)
Pages (from-to)30-39
Number of pages10
JournalEpidemics
Volume12
DOIs
StatePublished - Sep 1 2015

Fingerprint

Sample Size
Asymptomatic Infections
Antibodies
Human Influenza
Basic Reproduction Number
Serology
Serum
Research Design
Population

All Science Journal Classification (ASJC) codes

  • Parasitology
  • Epidemiology
  • Microbiology
  • Public Health, Environmental and Occupational Health
  • Infectious Diseases
  • Virology

Cite this

@article{046ff2be32004a91866ab124f036c559,
title = "Statistical identifiability and sample size calculations for serial seroepidemiology",
abstract = "Inference on disease dynamics is typically performed using case reporting time series of symptomatic disease. The inferred dynamics will vary depending on the reporting patterns and surveillance system for the disease in question, and the inference will miss mild or underreported epidemics. To eliminate the variation introduced by differing reporting patterns and to capture asymptomatic or subclinical infection, inferential methods can be applied to serological data sets instead of case reporting data. To reconstruct complete disease dynamics, one would need to collect a serological time series. In the statistical analysis presented here, we consider a particular kind of serological time series with repeated, periodic collections of population-representative serum. We refer to this study design as a serial seroepidemiology (SSE) design, and we base the analysis on our epidemiological knowledge of influenza. We consider a study duration of three to four years, during which a single antigenic type of influenza would be circulating, and we evaluate our ability to reconstruct disease dynamics based on serological data alone. We show that the processes of reinfection, antibody generation, and antibody waning confound each other and are not always statistically identifiable, especially when dynamics resemble a non-oscillating endemic equilibrium behavior. We introduce some constraints to partially resolve this confounding, and we show that transmission rates and basic reproduction numbers can be accurately estimated in SSE study designs. Seasonal forcing is more difficult to identify as serology-based studies only detect oscillations in antibody titers of recovered individuals, and these oscillations are typically weaker than those observed for infected individuals. To accurately estimate the magnitude and timing of seasonal forcing, serum samples should be collected every two months and 200 or more samples should be included in each collection; this sample size estimate is sensitive to the antibody waning rate and the assumed level of seasonal forcing.",
author = "Vinh, {Dao Nguyen} and Boni, {Maciej F.}",
year = "2015",
month = "9",
day = "1",
doi = "10.1016/j.epidem.2015.02.005",
language = "English (US)",
volume = "12",
pages = "30--39",
journal = "Epidemics",
issn = "1755-4365",
publisher = "Elsevier",

}

Statistical identifiability and sample size calculations for serial seroepidemiology. / Vinh, Dao Nguyen; Boni, Maciej F.

In: Epidemics, Vol. 12, 01.09.2015, p. 30-39.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Statistical identifiability and sample size calculations for serial seroepidemiology

AU - Vinh, Dao Nguyen

AU - Boni, Maciej F.

PY - 2015/9/1

Y1 - 2015/9/1

N2 - Inference on disease dynamics is typically performed using case reporting time series of symptomatic disease. The inferred dynamics will vary depending on the reporting patterns and surveillance system for the disease in question, and the inference will miss mild or underreported epidemics. To eliminate the variation introduced by differing reporting patterns and to capture asymptomatic or subclinical infection, inferential methods can be applied to serological data sets instead of case reporting data. To reconstruct complete disease dynamics, one would need to collect a serological time series. In the statistical analysis presented here, we consider a particular kind of serological time series with repeated, periodic collections of population-representative serum. We refer to this study design as a serial seroepidemiology (SSE) design, and we base the analysis on our epidemiological knowledge of influenza. We consider a study duration of three to four years, during which a single antigenic type of influenza would be circulating, and we evaluate our ability to reconstruct disease dynamics based on serological data alone. We show that the processes of reinfection, antibody generation, and antibody waning confound each other and are not always statistically identifiable, especially when dynamics resemble a non-oscillating endemic equilibrium behavior. We introduce some constraints to partially resolve this confounding, and we show that transmission rates and basic reproduction numbers can be accurately estimated in SSE study designs. Seasonal forcing is more difficult to identify as serology-based studies only detect oscillations in antibody titers of recovered individuals, and these oscillations are typically weaker than those observed for infected individuals. To accurately estimate the magnitude and timing of seasonal forcing, serum samples should be collected every two months and 200 or more samples should be included in each collection; this sample size estimate is sensitive to the antibody waning rate and the assumed level of seasonal forcing.

AB - Inference on disease dynamics is typically performed using case reporting time series of symptomatic disease. The inferred dynamics will vary depending on the reporting patterns and surveillance system for the disease in question, and the inference will miss mild or underreported epidemics. To eliminate the variation introduced by differing reporting patterns and to capture asymptomatic or subclinical infection, inferential methods can be applied to serological data sets instead of case reporting data. To reconstruct complete disease dynamics, one would need to collect a serological time series. In the statistical analysis presented here, we consider a particular kind of serological time series with repeated, periodic collections of population-representative serum. We refer to this study design as a serial seroepidemiology (SSE) design, and we base the analysis on our epidemiological knowledge of influenza. We consider a study duration of three to four years, during which a single antigenic type of influenza would be circulating, and we evaluate our ability to reconstruct disease dynamics based on serological data alone. We show that the processes of reinfection, antibody generation, and antibody waning confound each other and are not always statistically identifiable, especially when dynamics resemble a non-oscillating endemic equilibrium behavior. We introduce some constraints to partially resolve this confounding, and we show that transmission rates and basic reproduction numbers can be accurately estimated in SSE study designs. Seasonal forcing is more difficult to identify as serology-based studies only detect oscillations in antibody titers of recovered individuals, and these oscillations are typically weaker than those observed for infected individuals. To accurately estimate the magnitude and timing of seasonal forcing, serum samples should be collected every two months and 200 or more samples should be included in each collection; this sample size estimate is sensitive to the antibody waning rate and the assumed level of seasonal forcing.

UR - http://www.scopus.com/inward/record.url?scp=84940789117&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84940789117&partnerID=8YFLogxK

U2 - 10.1016/j.epidem.2015.02.005

DO - 10.1016/j.epidem.2015.02.005

M3 - Article

VL - 12

SP - 30

EP - 39

JO - Epidemics

JF - Epidemics

SN - 1755-4365

ER -