Heterogeneity in vaccination coverage explains the size and occurrence of measles epidemics in German surveillance data

S. A. HERZOG; M. PAUL; L. HELD

doi:10.1017/S0950268810001664

Published online by Cambridge University Press: 12 July 2010

S. A. HERZOG*: Affiliation:
Institute of Social and Preventive Medicine, University of Bern, Switzerland
M. PAUL: Affiliation:
Biostatistics Unit, Institute of Social and Preventive Medicine, University of Zurich, Switzerland
L. HELD: Affiliation:
Biostatistics Unit, Institute of Social and Preventive Medicine, University of Zurich, Switzerland
* Author for correspondence: Ms. S. Herzog, Institute of Social and Preventive Medicine, University of Bern, Finkenhubelweg 11, 3012 Bern, Switzerland. (Email: sherzog@ispm.unibe.ch)

Summary

The objective of this study was to characterize empirically the association between vaccination coverage and the size and occurrence of measles epidemics in Germany. In order to achieve this we analysed data routinely collected by the Robert Koch Institute, which comprise the weekly number of reported measles cases at all ages as well as estimates of vaccination coverage at the average age of entry into the school system. Coverage levels within each federal state of Germany are incorporated into a multivariate time-series model for infectious disease counts, which captures occasional outbreaks by means of an autoregressive component. The observed incidence pattern of measles for all ages is best described by using the log proportion of unvaccinated school starters in the autoregressive component of the model.

Keywords

Infectious disease epidemiology; measles (rubeola); MMR vaccination; modelling

Type: Original Papers
Information: Epidemiology & Infection, Volume 139, Issue 4, April 2011, pp. 505–515

DOI: https://doi.org/10.1017/S0950268810001664
Copyright: Copyright © Cambridge University Press 2010

INTRODUCTION

Measles is a highly contagious disease and still an important health concern [Reference Muscat1]. Numerous efforts such as routine childhood vaccination programmes or the WHO measles elimination plan have significantly reduced the incidence of measles in Europe. The epidemic pattern has changed from a roughly biennial cycle to an irregular sequence of outbreaks [Reference Wallinga, Heijne and Kretzschmar2]. However, disease has not been eradicated. The incidence of measles varies widely, with large outbreaks in Romania, Germany, UK, Switzerland and Italy in 2006 and 2007, whereas in other countries such as Finland, Slovakia and Hungary almost no cases were reported [Reference Muscat1]. Since most measles cases were unvaccinated or incompletely vaccinated, the differences in incidence are likely to be due to differences in the success of national vaccination programmes [Reference Muscat1, Reference Wallinga, Heijne and Kretzschmar2]. For instance, there have been several outbreaks in some of the 16 federal states of Germany in recent years [Reference Siedler3–Reference Wichmann6]. Detailed investigations of selected outbreaks showed that most cases occurred in unvaccinated individuals [Reference Wichmann4].

National surveillance systems such as that at the Robert Koch Institute (RKI), Germany, typically provide weekly time-series of counts stratified by, for example, region, age or sex. Accordingly, statistical methods for the analysis of multivariate time-series of counts are needed. It is of public health interest to investigate empirically the relationship between vaccination coverage and the occurrence and size of measles epidemics using such data.

Cummings et al. [Reference Cummings7] used a linear model to analyse the sum of measles cases over 5 years in several provinces of Cameroon, including vaccination coverage among other covariates. However, the time-series aspect was not considered. Multivariate time-series methods for counts of infectious diseases have only recently been developed and applied to epidemiological data. However, these models are not able to cope with occasional large outbreaks. For instance, Frank et al. [Reference Frank8] investigated the association between human infection with Shiga toxin-producing Escherichia coli (STEC) and cattle density based on German notification data. A Bayesian Poisson regression model was used to analyse the weekly number of cases in each age group and district of Germany. The model accounted for temporal and seasonal trends, spatial variation and cattle density as explanatory factors. No large STEC gastroenteritis outbreaks occurred in the time period considered. Hens et al. [Reference Hens9] modelled the yearly, age-stratified incidence of hepatitis B in Bulgaria using a log-additive Poisson model, where age and time were modelled as non-parametric functions. The impact of vaccination was taken into account by including indicators for various immunization programmes as covariates. The log-additive Poisson model chosen was justified since the data contained no outbreaks.

If there are outbreaks in the data, a more realistic formulation for (multivariate) time-series of infectious disease counts has been suggested by Held et al. [Reference Held, Höhle and Hofmann10]. The model decomposes the disease incidence into two additive components. One component represents an autoregression on past counts which allows for temporal dependence beyond regular patterns, i.e. epidemic behaviour. The other component accounts for regular, endemic behaviour. However, this method did not consider the inclusion of covariates.

The aim of this paper is to investigate the association between vaccination coverage and the size and occurrence of measles epidemics. We first describe the data about measles incidence [11] and vaccination coverage [12] in Germany obtained from the RKI. The approach of Held et al. [Reference Held, Höhle and Hofmann10] is extended to allow for the inclusion of covariates and applied to the measles data using vaccination coverage as an explanatory variable. Different formulations of the proposed model are compared based on Akaike's Information Criterion (AIC [Reference Lindsey and Jones13]). A simulation study is performed in order to further investigate the ability of AIC to identify the underlying true model.

DATA

Measles incidence

In Germany, introduction of the measles vaccine had reduced the incidence of measles to a historical low of 0·2 cases/100 000 inhabitants in 2004 [Reference Siedler3], before the disease re-emerged due to outbreaks in a few regions. We used measles surveillance data from Germany for the years 2005–2007, which contain weekly counts of cases for all ages in all 16 federal states reported to the RKI [11]. Figure 1 shows the notified measles cases in the years 2005–2007 for six selected federal states to illustrate the different incidence patterns. Large outbreaks occurred in Hesse and Bavaria in 2005 [Reference Siedler3], in North Rhine-Westphalia in 2006 [Reference Wichmann4] and in North Rhine-Westphalia and Bavaria in 2007 [Reference Bernard5]. The majority of cases (~80%) occurred in children and adolescents. About 12% occurred in infants aged <2 years. This pattern was very similar in all three years considered. A brief summary of the number of reported cases in each state is shown in Table 1 together with population numbers at 31 December 2006 obtained from the Federal Statistical Office of Germany [14].

Fig. 1. Number of weekly measles cases in selected German federal states for the years 2005–2007. Note that the y-axis is not the same for all states.

Table 1. Measles cases and estimated vaccination coverage in the 16 federal states of Germany

Population estimated at 31 December 2006; maximum and total number of weekly measles cases from week 1, 2005 to week 52, 2007; coverage at school entry for the first and second dose of MMR vaccine in 2006 estimated from children presenting vaccination cards at school entry examinations; percentage of children with a vaccination card.

Measles-mumps-rubella (MMR) vaccination

Coverage levels of the combined MMR vaccine were derived from vaccination cards presented at medical examinations, which are conducted by local health authorities at school entry [12]. Records include information about receipt of the first and second doses of MMR, but no information about dates or age of the child at vaccination. Age at school entry ranges between states from 4 to 7 years [Reference Kalies15]; the information collected therefore typically refers to vaccinations received 3–5 years previously [Reference Reiter and Poethko-Müller16].

The estimated coverage data do not include any information from children who did not present a vaccination card on the day of the medical examination (5–13% of children attending the school entry examination in different states). Excluding these children is likely to overestimate true coverage, because the vaccination status of children with vaccination cards is generally more complete than in those without a card [Reference Wichmann4, Reference Poethko-Müller17]. However, there are no national data about the degree of overestimation. We adopted an assumption used in a previous German study [Reference Tischer, Siedler and Rasch18]: for each dose, coverage among children without a vaccination card (‘non-card holders’) was half that among ‘card holders’. We applied this adjustment to all analyses and conducted a sensitivity analysis to examine the robustness of the assumption.
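The adjustment can be sketched as a weighted average of card holders and non-card holders. The paper does not print the formula, so the blend below is one natural reading of the assumption rather than the authors' exact calculation; the function name, argument names and example values are hypothetical.

```python
def adjusted_coverage(card_coverage, card_share, factor=0.5):
    """Blend coverage of card holders and non-card holders.

    card_coverage: coverage among children who presented a card (0..1)
    card_share:    fraction of children who presented a card (0..1)
    factor:        assumed ratio of non-card-holder to card-holder coverage
    """
    non_card_coverage = factor * card_coverage
    return card_share * card_coverage + (1.0 - card_share) * non_card_coverage


# Hypothetical state: 94% coverage among card holders, 90% presented a card.
print(round(adjusted_coverage(0.94, 0.90), 4))
```

Setting `factor=1` reproduces the unadjusted card-holder coverage, and `factor=0` treats all non-card holders as unvaccinated, which are the two extremes of the sensitivity analysis.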

Coverage levels for both the first and the second dose were higher in the new, re-established states in East Germany (Brandenburg, Mecklenburg-Western Pomerania, Saxony, Saxony-Anhalt, Thuringia) than in West Germany (Table 1). This might reflect continuing adherence to different childhood vaccination policies before re-unification [Reference Kalies15, Reference Hellenbrand19]. Immunization is voluntary in Germany now, but it was mandatory in the former German Democratic Republic.

METHODS

To investigate a possible association between the occurrence of measles epidemics and MMR vaccination coverage, we first examined the correlation between the number of observed cases in a region and region-specific vaccination coverage. One possibility is to apply the variance-stabilizing transformation for Poisson counts [Reference Palmgren, Armitage and Colton20], i.e. taking the square root of cases, before estimating the empirical correlation coefficient which might improve the goodness of the corresponding confidence intervals. An alternative approach, based on a Poisson regression model [Reference Kirkwood and Sterne21, Reference Kuhn, Davidson and Durkin22], assumes that the sum of cases in region i, aggregated over all three years, has mean

(1)

$\mu_{i} = \exp(\alpha + \beta x_{i}),$

where x_i denotes the coverage in state i. For example, to adjust for regionally varying population numbers, the right-hand side of equation (1) can be multiplied by an offset n_i. Conclusions about the effect β of the covariate x_i in equation (1) remain the same when considering the weekly number of cases instead of the sum of cases, assuming that the weekly counts are independent. However, a multivariate time-series analysis of counts is able to incorporate autocorrelation and provides many more possibilities compared to the analysis of temporally aggregated data.
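The equivalence between the weekly and the aggregated formulation can be made explicit with a short derivation. It assumes T independent weekly counts per region with a common mean and no seasonal terms; the symbols m_i, S_i and α_w are notation introduced here, not taken from the paper.

```latex
% Weekly model with weekly intercept \alpha_w (notation introduced here):
y_{i,t} \sim \mathrm{Po}(m_i), \qquad m_i = \exp(\alpha_w + \beta x_i), \qquad t = 1, \ldots, T
% The sum of T independent Poisson counts is again Poisson:
S_i = \sum_{t=1}^{T} y_{i,t} \sim \mathrm{Po}(T m_i), \qquad
T m_i = \exp(\alpha_w + \log T + \beta x_i)
% Hence \alpha = \alpha_w + \log T in equation (1); \beta is unchanged.
```

Only the intercept shifts by log T, so inference about β is the same in both formulations.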

In the following, y_i,t denotes the number of cases of a specific disease in a defined geographical region i=1, …, I at time t=1, …, T. A fundamental assumption of a Poisson regression model is that the response variables y_i,t are independent given the covariates. Thus the above model is not suited for the analysis of the measles data as the weekly counts are clearly dependent. Regular temporal dependence can easily be accounted for by including covariates for long-term or seasonal trends in the model. For instance, seasonal variation can be modelled parametrically using a superposition of harmonic waves [Reference Held, Höhle and Hofmann10, Reference Paul, Held and Toschke23] or non-parametrically [Reference Hens9, Reference Knorr-Held and Richardson24]. However, such a model may still not adequately capture occasional outbreaks typical for infectious diseases.

A natural way to incorporate temporal dependence beyond seasonal variation is to consider the number of past cases as additional explanatory variables in the model. Held et al. [Reference Held, Höhle and Hofmann10] suggest a Poisson regression model with an identity link, where the (conditional) mean μ_i,t of y_i,t is additively decomposed into two parts

(2)

$\mu_{i,t} = \lambda y_{i,t-1} + \nu_{i,t}.$

The first part with conditional rate λy_i,t−1 is called the ‘epidemic’ component and the second part with rate ν_i,t the ‘endemic’ component. The former component captures occasional (epidemic) outbreaks whereas the latter describes regular (endemic) patterns.

To include region-specific covariate information, we allow the autoregressive parameter λ in equation (2) to vary across regions, i.e. we switch notation from λ to λ_i and model λ_i as a function of these covariates. Furthermore, covariates can also be considered in the other component ν_i,t. Note that the conditional mean μ_i,t needs to be non-negative. This can be ensured by modelling both λ_i and ν_i,t on a log-scale.

Our first model (type A) assumes that the coverage levels in all states, x_i, enter into the epidemic component and the model is given by

(3)

$\log(\lambda_{i}) = \beta_{0} + \beta_{1} x_{i},$

(4)

$\log(\nu_{i,t}) = \alpha_{0} + \{\gamma \sin(2\pi t/f) + \delta \cos(2\pi t/f)\} + \log(n_{i}),$

where β₀ is an intercept and β₁ quantifies the influence of vaccination coverage. The parameter α₀ denotes the intercept of the endemic component and the offset log(n_i) represents population fractions, computed from Table 1. The terms in curly brackets in equation (4) are used to model seasonal variation. The number of data points per season is denoted by f. For instance, for a season of 1 year and weekly data f=52. For ease of interpretation, the seasonal terms can be written equivalently as a sine wave with amplitude A describing the magnitude, and phase difference ϕ describing the onset of the seasonal pattern [Reference Paul, Held and Toschke23]. In the second model, the term β₁x_i is omitted in equation (3) and the coverage levels x_i are included instead in the endemic component with coefficient α₁. Altogether, the model (type B) is given by

(5)

$\log(\lambda_{i}) = \beta_{0},$

(6)

$\log(\nu_{i,t}) = \alpha_{0} + \alpha_{1} x_{i} + \{\gamma \sin(2\pi t/f) + \delta \cos(2\pi t/f)\} + \log(n_{i}).$
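As a minimal sketch of how equations (2)–(6) combine, the following Python functions evaluate the conditional mean for a model of type A, B or C (type C uses equations (4) and (5), i.e. no covariate in either component) and convert the seasonal coefficients to amplitude and phase. The function names, the parameter dictionary and the example values are assumptions made for illustration; they are not taken from the R package used in the paper.

```python
import math


def seasonal(t, gamma, delta, f=52):
    """Seasonal term in curly brackets of equations (4) and (6)."""
    angle = 2.0 * math.pi * t / f
    return gamma * math.sin(angle) + delta * math.cos(angle)


def amplitude_phase(gamma, delta):
    """Rewrite gamma*sin(w) + delta*cos(w) as A*sin(w + phi)."""
    return math.hypot(gamma, delta), math.atan2(delta, gamma)


def conditional_mean(y_prev, t, n, x, p, model="A"):
    """Conditional mean lambda_i * y_{i,t-1} + nu_{i,t}.

    model "A": covariate in the epidemic component (equations 3 and 4)
    model "B": covariate in the endemic component (equations 5 and 6)
    model "C": no covariate (equations 4 and 5)
    p is a dict of parameter values; its keys are names chosen for this sketch.
    """
    beta1 = p["beta1"] if model == "A" else 0.0
    alpha1 = p["alpha1"] if model == "B" else 0.0
    lam = math.exp(p["beta0"] + beta1 * x)
    log_nu = (p["alpha0"] + alpha1 * x
              + seasonal(t, p["gamma"], p["delta"], p.get("f", 52))
              + math.log(n))
    return lam * y_prev + math.exp(log_nu)


# Hypothetical parameter values, for illustration only.
params = {"beta0": math.log(0.5), "beta1": 1.0, "alpha0": 0.0,
          "alpha1": 2.0, "gamma": 0.0, "delta": 0.0}
print(conditional_mean(y_prev=10, t=1, n=0.5, x=-1.0, p=params, model="A"))
```

Modelling both components on the log scale keeps the conditional mean positive for any parameter values, which is why no constraints are needed in the sketch.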

To investigate the impact of the explanatory variable, we also consider a model of type C, given by equations (4) and (5), where no covariate is included. Additionally, a standard log-linear Poisson regression model without the autoregressive component is fitted (model D).

For the model of type A we use the log proportion of unvaccinated school starters as explanatory variable x_i in equation (3), in accordance with the mass action principle [Reference Anderson and May25]. This principle assumes that the rate of disease spread is proportional to the product of the density of susceptibles (unvaccinated school starters) and the density of infected individuals (reported cases). Taking the logarithm of the proportion of unvaccinated school starters produces the multiplicative relation (model A₀). Similarly, the log proportion of all school starters who received at most one dose of MMR vaccine is used as an explanatory variable (model A₁). We used the same covariates in the model of type B (models B₀ and B₁).
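The link between the log-transformed covariate and the mass action principle can be written out as follows. Here u_i denotes the proportion of unvaccinated school starters in state i, a symbol introduced only for this derivation.

```latex
% u_i: proportion of unvaccinated school starters (notation introduced here)
x_i = \log(u_i)
\lambda_i = \exp(\beta_0 + \beta_1 \log u_i) = e^{\beta_0}\, u_i^{\beta_1}
\lambda_i \, y_{i,t-1} = e^{\beta_0}\, u_i^{\beta_1}\, y_{i,t-1}
% For \beta_1 = 1 the epidemic rate is exactly proportional to u_i \cdot y_{i,t-1}.
```

The epidemic component is thus a product of a susceptible proxy and the previous case count, with β₁ allowing departures from strict proportionality.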

Maximum likelihood (ML) estimates of parameters and standard errors (s.e.) are obtained by numerically maximizing the respective Poisson log-likelihood. Standard software for linear Poisson regression cannot be used because of the nonlinearity of the parameters. Therefore, the quasi-Newton BFGS method implemented in the R [26] function optim is used for optimization. The fitting procedure and the measles data are integrated in the R package surveillance ([Reference Höhle27]; http://surveillance.r-forge.r-project.org). Note that models involving more than one covariate, time-varying covariates or additional seasonal terms at higher frequencies [Reference Diggle28] can also be fitted with this function in surveillance.
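To illustrate the objective that is maximized, the sketch below evaluates the negative Poisson log-likelihood of a type A model in Python. It is not the code of the surveillance package: the function name and argument layout are hypothetical, and conditioning on the first observation of each series is an assumption of this sketch.

```python
import math


def neg_log_likelihood(theta, counts, x, n, f=52):
    """Negative conditional Poisson log-likelihood for a type A model.

    theta  : (beta0, beta1, alpha0, gamma, delta)
    counts : counts[i][k] is the count in region i at time k + 1
    x      : region-specific covariate values x_i
    n      : population fractions n_i
    The first observation of each series is conditioned on, a choice
    made for this sketch.
    """
    beta0, beta1, alpha0, gamma, delta = theta
    total = 0.0
    for y, x_i, n_i in zip(counts, x, n):
        lam = math.exp(beta0 + beta1 * x_i)
        for k in range(1, len(y)):
            angle = 2.0 * math.pi * (k + 1) / f
            log_nu = (alpha0 + gamma * math.sin(angle)
                      + delta * math.cos(angle) + math.log(n_i))
            mu = lam * y[k - 1] + math.exp(log_nu)
            total -= y[k] * math.log(mu) - mu - math.lgamma(y[k] + 1)
    return total


# Toy data for a single region, for illustration only.
print(neg_log_likelihood((math.log(0.5), 0.0, 0.0, 0.0, 0.0),
                         counts=[[2, 3, 1, 4]], x=[0.0], n=[1.0]))
```

A function of this form could be handed to any general-purpose quasi-Newton optimizer; standard errors would then typically come from the inverse Hessian at the optimum.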

The models investigated in the Results section are compared using AIC as the model choice criterion. We were particularly interested in the ability of AIC to distinguish between model types A and B, and conducted a simulation study to investigate this (see Appendix).

RESULTS

The sum of cases over the years 2005–2007 in each state is negatively correlated with coverage for both the first and second dose of MMR vaccine (Table 2). Absolute correlation increases slightly when taking the square root of cases. However, the statistical evidence for correlation is weak, since the upper 95% confidence limits are always positive.

Table 2. Estimated Pearson's correlation coefficient, r, with 95% confidence intervals
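A correlation analysis of this kind can be sketched as follows. The paper does not state how the confidence intervals were computed, so the Fisher z-transformation used here is an assumption, and the case counts and coverage values are invented for illustration.

```python
import math


def pearson(u, v):
    """Empirical Pearson correlation coefficient of two equal-length lists."""
    k = len(u)
    mean_u, mean_v = sum(u) / k, sum(v) / k
    s_uv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    s_uu = sum((a - mean_u) ** 2 for a in u)
    s_vv = sum((b - mean_v) ** 2 for b in v)
    return s_uv / math.sqrt(s_uu * s_vv)


def fisher_ci(r, k, z_crit=1.959964):
    """Approximate 95% interval for r from k pairs via Fisher's z-transform."""
    z = math.atanh(r)
    half_width = z_crit / math.sqrt(k - 3)
    return math.tanh(z - half_width), math.tanh(z + half_width)


# Invented totals and coverage levels for five regions.
cases = [120, 15, 400, 8, 60]
coverage = [0.91, 0.96, 0.89, 0.97, 0.93]
sqrt_cases = [math.sqrt(c) for c in cases]  # variance-stabilizing transform
r = pearson(sqrt_cases, coverage)
print(r, fisher_ci(r, len(cases)))
```

With only 16 states the interval is wide, which is consistent with the weak statistical evidence reported for the simple correlation analysis.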

We describe here an analysis of the multivariate time-series of counts to further investigate the measles incidence patterns. The generation time [Reference Anderson and May25] for measles, i.e. the average time between the onset of symptoms in one case and the onset of symptoms in a second case directly infected by the first, is about 10 days [Reference Anderson and May25, Reference Fine and Clarkson29]. We therefore aggregate measles cases in successive bi-weekly periods to better reflect this characteristic time-scale [Reference Cliff and Haggett30, Reference Finkenstädt and Grenfell31]. AIC is used as a model choice criterion. The simulation study, discussed in detail in the Appendix, showed that this criterion is suitable for the comparison of the different model formulations.
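The bi-weekly aggregation amounts to summing successive pairs of weekly counts. A minimal sketch, with a hypothetical function name and the assumption that a trailing unpaired week is dropped:

```python
def aggregate_biweekly(weekly):
    """Sum successive pairs of weekly counts; a trailing odd week is dropped."""
    usable = len(weekly) - len(weekly) % 2
    return [weekly[k] + weekly[k + 1] for k in range(0, usable, 2)]


# Three years of weekly data (156 weeks) become 78 bi-weekly periods.
print(len(aggregate_biweekly([0] * 156)))
```

With bi-weekly data and a season of 1 year, the number of data points per season becomes f=26 rather than 52.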

The results of the analysis of the bi-weekly aggregated measles data are summarized in Table 3. All considered models contain an overall intercept α₀, a seasonal term and population fractions n_i as offset. The last two models in the table contain no covariates. When including only an intercept in the epidemic component (model C), the fit improves substantially compared to a model without autoregression (model D). The ML estimate of λ=exp(β₀) is quite high, $\hat{\lambda}$=0·85 (s.e.=0·02), which indicates a strong dependence on the number of counts at the previous time point after adjustment for seasonal effects. Consequently, the use of a Poisson regression model (without autoregression) seems inappropriate for these data. Indeed, the series of deviance residuals obtained from model D showed considerable autocorrelation compared with model C, which showed almost no autocorrelation.

Table 3. Analysis of bi-weekly aggregated measles data

The log-likelihood is denoted by log(L); p is the number of parameters and Akaike's Information Criterion (AIC)=−2log(L)+2p; lower AIC values indicate better fit. The parameters β₀ and α₀ denote intercepts; β₁ and α₁ denote the effect of the covariate; A and ϕ denote the amplitude and onset of the seasonal pattern. The standard error is denoted by s.e.
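The AIC comparison can be sketched in a few lines. The log-likelihood values and parameter counts below are hypothetical placeholders, not the values of Table 3.

```python
def aic(log_lik, n_params):
    """Akaike's Information Criterion: -2 log(L) + 2 p."""
    return -2.0 * log_lik + 2.0 * n_params


def best_model(fits):
    """fits maps a model name to (log_lik, n_params); lowest AIC wins."""
    return min(fits, key=lambda name: aic(*fits[name]))


# Hypothetical fits, for illustration only.
fits = {"A": (-100.0, 5), "C": (-104.0, 4)}
print(best_model(fits))
```

An extra parameter is only worthwhile under AIC if it raises the log-likelihood by more than one unit.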

In the next step, we investigated the impact of the inclusion of vaccination coverage in either the epidemic or endemic component compared to model C. Inclusion of the log proportion of unvaccinated school starters in the epidemic component (model A₀) leads to a considerably better fit.

The effect of the covariate β₁ in model A₀ is clearly significant (P<0·0001). Note that the estimated coefficients in the endemic component remain similar to those in model C, while the autoregressive parameter now varies across states. Inclusion of the covariate in the endemic component (model B₀) also improves the fit compared to model C, but the fit is worse than that of model A₀ according to AIC.

The above conclusions also hold when including the log proportion of school starters with at most one dose of MMR vaccine (models A₁, B₁). However, the model fit is considerably worse in terms of AIC. All results in Table 3 are based on the assumption that the coverage levels of the non-card holders are half those of card holders (adjustment factor 0·5). We tried several adjustment factors to investigate the robustness of our results. The ranking of the models according to AIC does not change for an adjustment factor <0·6. With regard to AIC an adjustment factor of 0·2 yields the best fit.

Figure 2 shows the estimated parameters λ_i and corresponding 95% confidence intervals for models A₀ and A₁ for each state. There is considerable heterogeneity across states. The ML estimates for the five states in East Germany are markedly lower than estimates for the remaining states. Vaccination coverage is considerably higher in these states. Note that model A₀ which includes the log proportion of unvaccinated school starters in the epidemic component performs better in terms of AIC than a model with the original (untransformed) proportion.

Fig. 2. Estimated autoregressive parameters $\hat{\lambda }_{i}$ and corresponding 95% confidence intervals for models A₀ (•) and A₁ (×). For comparison, the horizontal line denotes the estimated parameter $\hat{\lambda }$ for model C without covariates with the dashed lines representing the corresponding 95% confidence intervals. For definition of state abbreviations see Table 1.

The analysis of the multivariate time-series of measles surveillance counts showed that there is an association between vaccination coverage and the occurrence and size of measles epidemics within states, with model A₀ fitting best. Figure 3 shows the fitted number of cases, decomposed into endemic and epidemic components, for this model in three of the states shown in Figure 1 for illustrative purposes. The estimated mean is clearly dominated by the epidemic component.

Fig. 3. Fitted mean for model A₀, which includes the log proportion of unvaccinated school starters as covariate in the epidemic component, in selected states.

DISCUSSION

We observed a significant association between estimated vaccination coverage at school entry and the overall incidence of measles in the federal states of Germany (Table 3). The inclusion of the log proportion of unvaccinated school starters in the epidemic component of the model is the most suitable formulation to describe the occurrence and size of measles epidemics. This is plausible since the proportion of unvaccinated school starters acts as a proxy for the population of susceptibles, and the number of cases at a future time point depends on the number of infectious cases in the present as well as on the number of individuals susceptible to infection.

A strength of the proposed model is the decomposition of the disease incidence into an endemic and an epidemic component. Compared to a standard log-linear Poisson regression model our formulation is able to account for occasional outbreaks by including an autoregressive component. This is particularly important for the analysis of highly infectious diseases such as measles. In addition, information about vaccination coverage was included to cope with regional heterogeneity.

There are some limitations to this study. The RKI also provides estimates of vaccination coverage at school entry for children aged 4–7 for the years 2005 and 2007. However, the measles data comprise cases of all ages. Thus, changes in age-specific vaccination coverage may lead to shifts in the age distribution of the number of cases, but it will be impossible to discern such shifts from age-aggregated surveillance data. In addition, there is uncertainty about the true vaccination status, when obtained from school entry examinations. Hence small changes in coverage levels in successive years are not expected to be particularly meaningful. Therefore, we used only data for 2006 as an approximate measure of the overall immunization status in each state in all age groups.

We were aware that vaccination coverage was probably overestimated because vaccination uptake in school starters who presented vaccination cards is assumed to be higher [12]. Roughly 10% of school starters did not present vaccination cards and coverage for them is unknown. To assess the sensitivity of the assumed coverage for those without cards (0·5 times that of card holders) we considered values ranging from the same coverage as children who presented cards (corresponding to 1) to all children who did not present cards being unvaccinated (corresponding to 0). In terms of AIC, model B where the covariate is included in the endemic component is not very sensitive with regard to the assumed coverage. In contrast, the AIC for model A where the covariate is included in the epidemic component changes considerably. When coverage for non-card holders is >0·6 times that of card holders, model B is preferred.

Wichmann et al. [Reference Wichmann4] investigated a local outbreak in a school in Duisburg (North Rhine-Westphalia) in 2006. They estimated that receipt of one dose of MMR in the 22% without cards was 75% (significantly lower than the coverage of 95% in students with vaccination cards). This corresponds to a coverage level for non-card holders around 0·8 times that of card holders. However, this investigation involved only one school and no information about uncertainty around the estimated 75% coverage was given. The results are probably not generalizable to data at state level in this study. According to AIC, the measles data in our study are best described assuming coverage in non-card holders of 0·2 times that of card holders and using a model in which the proportion of unvaccinated school starters is incorporated in the epidemic component of the model.

To investigate the ability of AIC to identify the correct type of the model, we conducted a simulation study (Appendix). We used a simple model, comparable to the model of type A, where vaccination coverage influences the epidemic component. The simulation study showed that AIC identifies the true underlying model as long as the influence of vaccination coverage is strong or non-existent.

The proposed model approach allows us to consider infectious disease counts with several time-varying covariates. If quarterly, age-specific vaccination coverage was available, it could also be investigated whether vaccination-related trends in age-specific incidence [Reference Fine and Clarkson32] are observable using such notification data. Another interesting aspect would be to investigate the behaviour of the model where vaccination coverage is simultaneously included as an explanatory variable in both components. In this case, attention should be paid to potential issues related to multicollinearity or identifiability of parameter estimates.

In order to apply the proposed model to data at a finer spatial resolution we would need more detailed information about vaccination coverage because there are great regional and local differences leading to immunization gaps [Reference Wichmann6, Reference Kalies15]. For example, coverage levels for one dose of MMR vaccine ranged from 77·5% to 98% in the 77 health districts of Bavaria at school entry examinations 2005/2006 [33]. At a finer spatial resolution, it might also be necessary to account for spatio-temporal dependence, e.g. due to commuting. This could be done by including the previous number of cases in adjacent regions in the epidemic component [Reference Held, Höhle and Hofmann10, Reference Paul, Held and Toschke23].

Although the data on measles incidence and vaccination coverage have some limitations, clear associations were observed. The pattern observed in the reported measles cases for all ages is best described by including the log proportion of unvaccinated school starters as an explanatory variable in the autoregressive (epidemic) component of the model.

APPENDIX: Simulation study

We investigated whether AIC identifies the correct structure of the model with a simulation study. Multivariate time-series of length T=156 (3 years of weekly data) were simulated based on a model where the number of cases y_i,t in region i at time t is influenced by vaccination coverage as a covariate. Each simulated dataset is analysed with different models and AIC is calculated.

We assumed that vaccination coverage influences the epidemic component, which also contains an intercept. The endemic component contains no seasonal terms, an overall intercept α₀ and population fractions n_i as offset. Four randomly selected regions are used where the population sizes N_i are selected from population data of Germany in 2006 and artificial vaccination coverage levels x_i are attached (see Table 4). The coverage levels x_i differ between the regions and have been transformed with log(1 − x_i) as in the measles analysis. The simulation model corresponds to model A in Table 5 and is similar to model A₀ for the measles data (Table 3).

Table 4. Population sizes (N_i) and corresponding vaccination coverage levels (x_i) used in the simulation study

The states used in the simulation study were selected at random. The population fraction is denoted by n_i.

Table 5. Models for the simulation analysis

We chose different values for the yearly incidence c (10⁻⁴, 10⁻⁵) and the basic level of the epidemic component not influenced by covariates, $\bar{\lambda}$ (0·5, 0·8). Furthermore, we assumed that vaccination coverage has either no (β₁=0), a small (β₁=0·1), or a strong (β₁=0·5) influence. All combinations of these values give 12 different simulation scenarios. For each of these scenarios, 1000 datasets have been simulated. The incidence c and the population size N_i are used to calculate the mean number of cases for the first week μ_i,1 for each region with μ_i,1=cN_i/52. The parameter $\bar{\lambda}$ is used to calculate the intercept β₀ as a basic level

$\beta_{0} = \log(\bar{\lambda}) - \mathrm{mean}(\beta_{1} \log(1 - x_{i})).$

Next, the epidemic component λ_i is calculated as in model A (Table 5) and used for the simulation. The endemic component ν is calculated with the stationary mean equation [Reference Held, Höhle and Hofmann10]

$\nu_{i} = \mu_{i,t}\,\frac{(1 - \bar{\lambda})}{n_{i}} = \frac{cN_{i}}{52}\,\frac{(1 - \bar{\lambda})\sum_{i} N_{i}}{N_{i}} = \frac{c}{52}(1 - \bar{\lambda})\sum_{i} N_{i}$

and is the same for all regions. The cases y_i,t are simulated for each region i and point in time t as follows:

$\begin{array}{ll} y_{i,t} \sim \mathrm{Po}\left(\dfrac{n_{i}\nu}{1 - \lambda_{i}}\right) & (t = 1), \\ y_{i,t} \sim \mathrm{Po}(\lambda_{i} y_{i,t-1} + n_{i}\nu) & (t = 2, \ldots, T). \end{array}$
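The simulation scheme can be sketched in Python as below. The first observation is drawn from the stationary mean, which follows from setting μ = λ_i μ + n_i ν and solving for μ = n_i ν/(1 − λ_i); this requires λ_i < 1. The population sizes and coverage levels are hypothetical stand-ins for Table 4, the function names are chosen for this sketch, and the Poisson sampler uses Knuth's multiplication method, which is adequate only for the moderate means that arise here.

```python
import math
import random


def rpois(mean, rng):
    """Draw a Poisson variate with Knuth's multiplication method."""
    limit = math.exp(-mean)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def simulate(pop, coverage, c, lam_bar, beta1, T=156, seed=1):
    """Simulate one dataset from the epidemic/endemic model without season."""
    rng = random.Random(seed)
    total = sum(pop)
    n = [N / total for N in pop]                  # population fractions n_i
    x = [math.log(1.0 - v) for v in coverage]     # log proportion unvaccinated
    beta0 = math.log(lam_bar) - sum(beta1 * x_i for x_i in x) / len(x)
    lam = [math.exp(beta0 + beta1 * x_i) for x_i in x]
    if any(l >= 1.0 for l in lam):
        raise ValueError("stationary start requires lambda_i < 1")
    nu = c / 52.0 * (1.0 - lam_bar) * total       # same for all regions
    series = []
    for n_i, lam_i in zip(n, lam):
        y = [rpois(n_i * nu / (1.0 - lam_i), rng)]        # t = 1
        for _ in range(1, T):                             # t = 2, ..., T
            y.append(rpois(lam_i * y[-1] + n_i * nu, rng))
        series.append(y)
    return series, lam, nu


# Hypothetical regions; not the values of Table 4.
pop = [2_000_000, 4_000_000, 6_000_000, 8_000_000]
coverage = [0.90, 0.92, 0.94, 0.96]
series, lam, nu = simulate(pop, coverage, c=1e-4, lam_bar=0.5, beta1=0.5)
print([round(l, 3) for l in lam], round(nu, 2))
```

By construction of β₀, the geometric mean of the λ_i equals $\bar{\lambda}$, so β₁ controls only the spread of the autoregressive parameters across regions.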

Each simulated dataset was analysed with the three models listed in Table 5. The models differ in where vaccination coverage enters: in the epidemic component, in the endemic component, or not at all. Note that the values of the covariates used in the analysis are the same as in the simulation.

The results of the analysis are shown in Table 6. In all simulations without an influence of vaccination coverage, the true underlying model C most frequently had the lowest AIC value (i.e. the highest AIC %). When there was a small influence of vaccination coverage in the epidemic component, AIC generally preferred model C (no influence), followed by model A (influence in the epidemic component). When there was a strong influence, model A was clearly preferred. In summary, AIC identified the true underlying model provided that the influence of vaccination coverage was either strong or absent.
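The "AIC %" summary can be illustrated with a short Python sketch. It assumes that each simulated dataset has already been fitted with the three models and that their AIC values are available; the numbers below are hypothetical and purely illustrative, and the labels A, B and C simply stand for the three models of Table 5.

```python
from collections import Counter


def aic(log_likelihood, n_params):
    """Akaike's Information Criterion: AIC = 2k - 2 log L."""
    return 2 * n_params - 2 * log_likelihood


def aic_selection_percent(aic_per_dataset):
    """Percentage of datasets in which each model attains the lowest AIC."""
    winners = Counter(min(row, key=row.get) for row in aic_per_dataset)
    models = sorted({m for row in aic_per_dataset for m in row})
    return {m: 100.0 * winners[m] / len(aic_per_dataset) for m in models}


# Hypothetical AIC values for four simulated datasets (illustrative only).
example = [
    {"A": 410.2, "B": 412.9, "C": 411.0},
    {"A": 398.7, "B": 401.3, "C": 397.5},
    {"A": 405.1, "B": 404.8, "C": 406.0},
    {"A": 420.4, "B": 423.0, "C": 421.9},
]
percent = aic_selection_percent(example)
```

In the study itself this tally would run over the 1000 datasets of each scenario, giving one row of percentages per scenario.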

Table 6. Results for the simulation study

AIC, Akaike's Information Criterion.

Parameter values are shown for the simulations (Sim), the mean number of cases for each region (Reg), and how often each model has the lowest AIC value (AIC % of model).

ACKNOWLEDGEMENTS

We thank J. C. M. Heijne and N. Low for valuable comments and suggestions. Financial support by the Swiss National Science Foundation is gratefully acknowledged.

DECLARATION OF INTEREST

None.

REFERENCES

1. Muscat, M, et al. Measles in Europe: an epidemiological assessment. Lancet 2009; 373: 383–389.

2. Wallinga, J, Heijne, JCM, Kretzschmar, M. A measles epidemic threshold in a highly vaccinated population. PLoS Medicine 2005; 2: e316.

3. Siedler, A, et al. Two outbreaks of measles in Germany 2005. Eurosurveillance 2006; 11: 131–134.

4. Wichmann, O, et al. Large measles outbreak at a German public school, 2006. Pediatric Infectious Disease Journal 2007; 26: 782–786.

5. Bernard, H, et al. An outbreak of measles in Lower Bavaria, Germany, January–June 2007. Eurosurveillance 2007; 12: pii=3278.

6. Wichmann, O, et al. Further efforts needed to achieve measles elimination in Germany: results of an outbreak investigation. Bulletin of the World Health Organization 2009; 87: 108–115.

7. Cummings, DAT, et al. Improved measles surveillance in Cameroon reveals two major dynamic patterns of incidence. International Journal of Infectious Diseases 2006; 10: 148–155.

8. Frank, C, et al. Cattle density and Shiga toxin-producing Escherichia coli infection in Germany: increased risk for most but not all serogroups. Vector-Borne and Zoonotic Diseases 2008; 8: 635–643.

9. Hens, N, et al. Estimating the impact of vaccination using age-time-dependent incidence rates of hepatitis B. Epidemiology and Infection 2008; 136: 341–351.

10. Held, L, Höhle, M, Hofmann, M. A statistical framework for the analysis of multivariate infectious disease surveillance counts. Statistical Modelling 2005; 5: 187–199.

11. Robert Koch Institute. SurvStat (http://www3.rki.de/SurvStat). Accessed 14 October 2009.

12. Robert Koch Institute. On vaccination coverage estimated at school entry examinations in Germany, 2006 [in German]. Epidemiologisches Bulletin 2008; 7: 55–57.

13. Lindsey, JK, Jones, B. Choosing among generalized linear models applied to medical data. Statistics in Medicine 1998; 17: 59–68.

14. Statistisches Bundesamt. 12411-0009 Projections of the resident population for German states on reference date (https://www-genesis.destatis.de/genesis/online/logon). Accessed 21 January 2009.

15. Kalies, H, et al. Immunisation status of children in Germany: temporal trends and regional differences. European Journal of Pediatrics 2006; 165: 30–36.

16. Reiter, S, Poethko-Müller, C. Current vaccination coverage and immunization gaps of children and adolescents in Germany [in German]. Bundesgesundheitsblatt – Gesundheitsforschung – Gesundheitsschutz 2009; 52: 1037–1044.

17. Poethko-Müller, C, et al. Vaccination coverage against measles in German-born and foreign-born children and identification of unvaccinated subgroups in Germany. Vaccine 2009; 27: 2563–2569.

18. Tischer, A, Siedler, A, Rasch, G. Surveillance of measles in Germany [in German]. Gesundheitswesen 2001; 63: 703–709.

19. Hellenbrand, W, et al. Progress toward measles elimination in Germany. Journal of Infectious Diseases 2003; 187 (Suppl. 1): S208–S216.

20. Palmgren, J. Poisson distribution. In: Armitage, P, Colton, T, eds. Encyclopaedia of Biostatistics, 2nd edn. West Sussex: John Wiley and Sons, 2005, pp. 4109–4113.

21. Kirkwood, B, Sterne, J. Essential Medical Statistics, 2nd edn, 2003. Malden: Wiley.

22. Kuhn, L, Davidson, LL, Durkin, MS. Use of Poisson regression and time series analysis for detecting changes over time in rates of child injury following a prevention program. American Journal of Epidemiology 1994; 140: 943–955.

23. Paul, M, Held, L, Toschke, M. Multivariate modelling of infectious disease surveillance data. Statistics in Medicine 2008; 27: 6250–6267.

24. Knorr-Held, L, Richardson, S. A hierarchical model for space-time surveillance data on meningococcal disease incidence. Journal of the Royal Statistical Society Series C 2003; 52: 169–183.

25. Anderson, RM, May, RM. Infectious Diseases of Humans: Dynamics and Control. Oxford: Oxford University Press, 1991.

26. R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing, 2009.

27. Höhle, M. Surveillance: an R package for the monitoring of infectious diseases. Computational Statistics 2007; 22: 571–582.

28. Diggle, PJ. Time Series. A Biostatistical Introduction. New York: Oxford University Press, 1990.

29. Fine, PEM, Clarkson, JA. Measles in England and Wales – I: An analysis of factors underlying seasonal patterns. International Journal of Epidemiology 1982; 11: 5–14.

30. Cliff, AD, Haggett, P. Statistical modelling of measles and influenza outbreaks. Statistical Methods in Medical Research 1993; 2: 43–73.

31. Finkenstädt, BF, Grenfell, BT. Time series modelling of childhood diseases: A dynamical systems approach. Journal of the Royal Statistical Society: Series C 2000; 49: 187–205.

32. Fine, PEM, Clarkson, JA. Measles in England and Wales – II: The impact of the measles vaccination program on the distribution of immunity in the population. International Journal of Epidemiology 1982; 11: 15–25.

33. Bayerisches Landesamt für Gesundheit und Lebensmittelsicherheit. MMR vaccination coverage of children starting school in Bavaria in 2006/07 at a regional level (http://www.lgl.bayern.de/gesundheit/gesundheitsindikatoren/themenfeld07/indikator0714.htm). Accessed 10 June 2009.

Fig. 1. Number of weekly measles cases in selected German federal states for the years 2005–2007. Note that the y-axis is not the same for all states.

Table 1. Measles cases and estimated vaccination coverage in the 16 federal states of Germany

Table 2. Estimated Pearson's correlation coefficient, r, with 95% confidence intervals

Table 3. Analysis of bi-weekly aggregated measles data

Fig. 2. Estimated autoregressive parameters $\hat{\lambda}_i$ and corresponding 95% confidence intervals for models A0 (•) and A1 (×). For comparison, the horizontal line denotes the estimated parameter $\hat{\lambda}$ for model C without covariates, with the dashed lines representing the corresponding 95% confidence intervals. For definition of state abbreviations see Table 1.

Fig. 3. Fitted mean for model A0, which includes the log proportion of unvaccinated school starters as covariate in the epidemic component, in selected states.

