Skip to main content
Advertisement
  • Loading metrics

A simulation-based method to inform serosurvey design for estimating the force of infection using existing blood samples

  • Anna Vicco,

    Roles Data curation, Formal analysis, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Department of Molecular Medicine, University of Padua, Padua, Italy, MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom

  • Clare P. McCormack,

    Roles Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom

  • Belen Pedrique,

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Writing – review & editing

    Affiliation Drugs for Neglected Diseases initiative, Geneva, Switzerland

  • John H. Amuasi,

    Roles Investigation, Project administration, Resources, Writing – review & editing

    Affiliations Department of Global Health, School of Public Health, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana, Global Health and Infectious Diseases Research Group, Kumasi Centre for Collaborative Research in Tropical Medicine, Kumasi, Ghana, Research Group Global One Health, Department of Implementation Research, Bernhard Nocht Institute of Tropical Medicine, Hamburg, Germany, Division for Tropical Medicine, Department of Medicine, University Medical Centre Hamburg-Eppendorf, Hamburg, Germany

  • Anthony Afum-Adjei Awuah,

    Roles Investigation, Resources, Writing – review & editing

    Affiliations Global Health and Infectious Diseases Research Group, Kumasi Centre for Collaborative Research in Tropical Medicine, Kumasi, Ghana, Research Group Global One Health, Department of Implementation Research, Bernhard Nocht Institute of Tropical Medicine, Hamburg, Germany, Department of Molecular Medicine, School of Medicine and Dentistry, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana

  • Christian Obirikorang,

    Roles Investigation, Resources, Writing – review & editing

    Affiliations Global Health and Infectious Diseases Research Group, Kumasi Centre for Collaborative Research in Tropical Medicine, Kumasi, Ghana, Research Group Global One Health, Department of Implementation Research, Bernhard Nocht Institute of Tropical Medicine, Hamburg, Germany, Department of Molecular Medicine, School of Medicine and Dentistry, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana

  • Nicole S. Struck,

    Roles Investigation, Resources, Writing – review & editing

    Affiliations Infectious Disease Epidemiology, Bernhard Nocht Institute for Tropical Medicine, Hamburg, Germany, German Center for Infection Research (DZIF), partner site Hamburg-Borstel-Lübeck-Riems, Germany

  • Eva Lorenz,

    Roles Investigation, Resources, Writing – review & editing

    Affiliations Infectious Disease Epidemiology, Bernhard Nocht Institute for Tropical Medicine, Hamburg, Germany, German Center for Infection Research (DZIF), partner site Hamburg-Borstel-Lübeck-Riems, Germany, Institute of Medical Biostatistics, Epidemiology and Informatics, University Medical Centre of the Johannes Gutenberg University Mainz, Mainz, Germany

  • Jürgen May,

    Roles Investigation, Resources, Writing – review & editing

    Affiliations Infectious Disease Epidemiology, Bernhard Nocht Institute for Tropical Medicine, Hamburg, Germany, German Center for Infection Research (DZIF), partner site Hamburg-Borstel-Lübeck-Riems, Germany, Department of Tropical Medicine I, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany

  • Isabela Ribeiro,

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Writing – review & editing

    Affiliation Drugs for Neglected Diseases initiative, Geneva, Switzerland

  • Gathsaurie Neelika Malavige,

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Writing – review & editing

    Affiliation Drugs for Neglected Diseases initiative, Geneva, Switzerland

  • Christl A. Donnelly,

    Roles Conceptualization, Methodology, Supervision, Visualization, Writing – original draft, Writing – review & editing

    Affiliations MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom, Department of Statistics, University of Oxford, Oxford, United Kingdom, Pandemic Sciences Institute, University of Oxford, Oxford, United Kingdom

  • Ilaria Dorigatti

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Methodology, Project administration, Software, Supervision, Visualization, Writing – original draft, Writing – review & editing

    i.dorigatti@imperial.ac.uk

    Affiliation MRC Centre for Global Infectious Disease Analysis, School of Public Health, Imperial College London, London, United Kingdom

Abstract

The extent to which dengue virus has been circulating globally and especially in Africa is largely unknown. Testing available blood samples from previous cross-sectional serological surveys offers a convenient strategy to investigate past dengue infections, as such serosurveys provide the ideal data to reconstruct the age-dependent immunity profile of the population and to estimate the average per-capita annual risk of infection: the force of infection (FOI), which is a fundamental measure of transmission intensity.

In this study, we present a novel methodological approach to inform the size and age distribution of blood samples to test when samples are acquired from previous surveys. The method was used to inform SERODEN, a dengue seroprevalence survey which is currently being conducted in Ghana among other countries utilizing samples previously collected for a SARS-CoV-2 serosurvey.

The method described in this paper can be employed to determine sample sizes and testing strategies for different diseases and transmission settings.

Author summary

The historical circulation of dengue virus is still poorly understood in many parts of the world, and age-stratified seroprevalence surveys can provide the data to quantify population exposure to dengue and its transmission intensity.

In this work, we developed a simulation-based method that can be used to identify the sample sizes and age-distribution of the samples needed to obtain informative estimates of dengue force of infection from existing blood samples. We apply this method to data obtained from a SARS-CoV-2 serological survey previously conducted in three cities in Ghana.

The methods and code developed in this paper can be used to design serological surveys for dengue and other pathogens when using existing blood samples with accompanying information on age and location.

Introduction

Dengue virus (DENV) is one of the most rapidly spreading vector-borne diseases worldwide and places a large burden on public health [1]. In the last 50 years, dengue incidence has increased more than 30-fold [2], with recent estimates suggesting that up to 105 million people are infected each year [3]. Due to ongoing changes in climate, demography and socioeconomic structure and growing urbanization [4], there are concerns that the geographical range of dengue will expand in the coming decades.

Estimates suggest this potential geographic expansion may affect Sub-Saharan Africa in particular [57], a region where the burden of dengue remains poorly understood and is highly uncertain [8,9]. There are also key knowledge gaps on the rates of asymptomatic infection [4,10] and misclassification [2]. In addition, the lack of a strong surveillance system and insufficient infrastructure, have been identified as key challenges for arbovirus surveillance and disease control across the African region [11].

Cross-sectional serological surveys, which involve the collection of sera samples and the subsequent analysis of antibody level in the sera [12], are the gold standard for assessing individual-level exposure and for understanding the historical circulation of dengue at the population-level [13]. Age-stratified serological surveys provide the necessary data to reconstruct the immunity profile of the population, i.e., the extent of dengue exposure as a function of age, which in turn can be used to estimate the force of infection (FOI), defined as the per capita rate at which susceptible individuals become infected [10,14], the burden of infection and the impact of interventions [3].

However, several challenges, including the need of substantial resources and infrastructure, have posed limitations on the number of serosurveys conducted in low-income countries [12]. In Africa for example, only 17 age-stratified dengue seroprevalence surveys have been conducted to date [3,15]. Optimizing the serosurvey design, particularly the number of samples to be tested, is critical to promoting the use of these surveys, especially when resources are limited. While conducting new seroprevalence surveys provides the opportunity to target optimal age-groups for estimating epidemiological parameters, it also requires extensive resources, including time and operational capacity. In some cases these requirements can be mitigated by leveraging existing blood samples.

Simulation-based analyses are a powerful tool for informing serosurvey design as they enable the testing of different sampling scenarios, which in turn can help determine sample sizes targeted to the outcomes of interest [1618]. For example, reversible catalytic models have been used to optimize the sample sizes required to accurately estimate the seroconversion rate to malaria [16], compartmental dynamic transmission models have been developed to identify the optimal age distribution of blood samples for different pathogens and outcomes of interest [17] and during the COVID-19 pandemic, Bayesian methods were developed to propagate the uncertainty of imperfect testing and determine optimal sample size to minimize the uncertainty in the seroprevalence estimates [18].

In this study, we developed methods to inform serosurvey design when leveraging existing blood samples. We use simple catalytic models to develop a new simulation-based framework for identifying the optimal number and age distribution of available blood samples to obtain accurate setting-specific dengue FOI estimates whist optimising resources and at the same time accounting for imperfect test sensitivity and specificity. The methods described in this study have been used to inform the design of serosurveys aimed to reconstruct dengue FOI estimates in three locations in Ghana [19], utilising blood samples previously collected for SARS-CoV-2 surveillance. Below we provide evidence of how the simulation framework developed in this study can be readily adopted for estimating the FOI of other infectious pathogens across different transmission settings.

Materials and methods

Ethics statement

Ethical approval was obtained from the Committee on Human Research and Publication (CHRPE) of the Kwame Nkrumah University of Science and Technology (KNUST), IRB number CHRPE/AP/218/20. Blood samples were collected from participants or legal guardians who gave informed written consent to participate in the study.

Blood samples

The blood samples available for testing were collected in a SARS-CoV-2 seroprevalence study conducted in three cities in Ghana (Accra, Kumasi and Tamale) between February 2021 and February 2022 by the Bernard Nocht Institute for Tropical Medicine (BNITM) and the Kumasi Centre for Collaborative Research in Tropical Medicine (KCCR) [20]. The study was designed with a two-stage geographical cluster sampling method stratified by age and sex in urban households [20]. The available blood samples included 2,051 samples from participants aged 10 to 94 years; all samples were stored in a biobank at -80°C.

Potential sampling scenarios

We simulated alternative sampling scenarios (Fig 1) to select the sample sizes and their distribution. We denoted the scenario that included testing all available samples as ‘scenario 0’ and explored the following alternative scenarios: testing an equal number of samples in each age-group, using the minimum number of available samples by age-group (scenario A); testing all available samples in the younger age-groups (i.e., the first half of the age-groups), and half of the minimum number of samples across the young age-groups for the older age-groups (scenario B); testing all available samples for the older age-groups and half of the minimum number of samples across the old age-groups for the younger age-groups (scenario C); testing all available samples for the younger age-groups and half of the available number of samples for the older age-groups (scenario D) and testing all available samples for the older age-groups and half of the available number of samples for the younger age-groups (scenario E).

thumbnail
Fig 1. Blood sample distribution across the simulated scenarios for the city of Accra using 5-year age groups.

Panel A shows the distribution of the available samples across the age-groups and compares them with the population age structure of Ghana (World Population Prospect [21]). Panels B to F illustrate the sample distribution under scenarios A to E compared to the available samples in the baseline scenario 0 (which in the panel is indicated as “samples”).

https://doi.org/10.1371/journal.pcbi.1011666.g001

Simulation study & model fitting

In model 1, we assumed a constant in time and age average yearly FOI, and that antibodies do not wane. Using these assumptions, the age-dependent seroprevalence (i.e., the probability of developing measurable antibodies upon exposure) could be modelled as detailed in Eq 1, where ai is the midpoint of age-group i, λ corresponds to the average yearly FOI per serotype, based on the estimates obtained from Cattarino et al. [3] (Table 1), and n is the number of serotypes assumed to be circulating. For Ghana, we assumed n = 2 independently transmitting serotypes, following Bonney et al. [22].

(1)

We assumed that the number of subjects exposed to dengue in each age-group was binomially distributed as a function of the number of samples tested in age-group i, as described in Eq 2 (2) where Xi denotes the number of people testing seropositive in group i, Ni denotes the number of samples tested in age-group i, and z(ai) is the probability of testing seropositive in that age-group.

thumbnail
Table 1. Summary of accuracy metrics obtained for Ghana with 5-year age groups under the optimal scenarios across models and assumptions.

https://doi.org/10.1371/journal.pcbi.1011666.t001

For each sampling scenario we denoted the age-specific sample sizes N1,,Nm in age-groups 1,,m, and simulated the expected age-stratified seroprevalence as shown in Eq 1. For each scenario, we generated 1,000 datasets by sampling the number of seropositive samples I1,,Im from a binomial distribution with parameters N1,…,Nm and z1,…,zm, where z derives from Eq 1, assuming the FOI estimates from Cattarino et al. [3] (Table 1). To account for sensitivity (se) and specificity (sp) of the test respectively, we simulated the number of samples testing positive P1,…, Pm among the I1,…,Im samples as the sum of the true positive TP1,…TPm and false positive FP1,…,FPm samples, which were sampled from a binomial distribution as described in Eqs 35.

(3)(4)(5)

We assumed se = 0.892 and sp = 0.988 [23] and for each simulated dataset we fitted the catalytic model separately, thus obtaining 1,000 FOI estimates, denoted ˜.

We account for imperfect testing by defining the test positivity function (t+(ai)) shown in Eq 6, which identifies the probability that an individual of age ai tests positive according to the seroprevalence, sensitivity (se) and specificity (sp) of the assay. We used a binomial log-likelihood (Eq 7) to infer the posterior distribution of the FOI λ using the Hamiltonian Monte Carlo (HMC) algorithm coded in CmdStanR [24], having assumed a uniform prior distribution between zero and one (Fig A in S1 Text), and sampling 3 chains of 5,000 iterations each, with a warming-up step of 1,000 iterations. Convergence was assessed visually and using the Rhat and effective sample size [24]. In the results we report the median and 95% credible interval of the posterior distribution of the FOI.

(6)(7)

Performance metrics and selection criteria

Three performance indexes were used to compare the FOI estimates with the FOI used to generate the simulated data. Specifically, we calculated the median of the FOI under each scenario and age category, and then calculated the:

  • Bias: the absolute value of the difference between the estimated FOI and the λ used to simulate the data divided by λ, expressed in percentage.
  • Uncertainty: the width of the 95% credible interval (CrI) of the estimated FOI .
  • Coverage: the number of times the FOI λ used to simulate the data falls within 95% CrI of the estimated FOI , across the 1,000 simulations, expressed in percentage.

We selected the optimal sample sizes and age-distributions based on the following criteria. Firstly, we retained only the scenarios with an upper bound of the 95% CrI of the bias within a 15% tolerance above the one obtained with scenario 0. Then, among these scenarios, we retained those with a coverage within 5% of the maximum coverage value across the selected scenarios. Finally, we selected the scenario with the smallest number of samples. We also compared a posteriori the uncertainty of the optimal scenario with the one of scenario 0 to check that the selected scenario did not introduce further uncertainty around the estimate.

Sensitivity analyses

Alternative categorisation used to report the age-specific seroprevalence.

In a sensitivity analysis, we ran the simulation study using 10-year age categories (Fig B in S1 Text) (instead of 5-year age categories as done in the main analysis) to explore the effect that reporting the age-specific seroprevalence over wider age-groups has on the precision of the FOI estimates.

Alternative serocatalytic model formulations.

To adapt our framework to different pathogens and settings, we implemented the simulation framework for the case where antibody protection decays over time at a yearly rate α (model 2) [10]. In this scenario, the seroprevalence can be modelled as shown in Eq 8.

(8)

We also explored scenarios with time-varying FOIs (model 3) [25]. In our simulation study we assumed that the FOI changed in a stepwise fashion over time (Table J in S1 Text), with individuals 0–29 years old (age-groups i < 5) exposed solely to λ1, individuals 30–49 years old (age-groups 5 ≤ i ≤ 8) exposed to λ1 during the last 30 years of life and to λ2 during the first 20 years of life and individuals 50+ years of age (age-groups i > 8) exposed to λ1 during the last 30 years of life, λ2 during the middle 20 years of life and λ3 for the earlier years of their life. In this case the age-stratified seroprevalence can be described as shown in Eq 9: (9) where ai is the midpoint of age-group i, a4 is the mid- point of age-group i = 4 (25–29 years old), and a8 is the mid-point of age-group i = 8 (45–49 years old).

We used models 2 and 3 to generate simulated datasets, and then used the corresponding catalytic model (respectively model 2 and 3) and the likelihood defined in Eq 7 to reconstruct the posterior distribution of the FOI λ and decay rate α in model 2 and of the three FOIs λ1, λ2 and λ3 in model 3. We assumed a normal distribution with mean 0 and standard deviation of 1 for the lambda parameters in both model 2 and 3, and a normal distribution with mean 0.02 (as suggested in the literature [10]) and standard deviation of 0.5 for the decay rate in model 2. We sampled the parameters on the log scale to improve the exploration of the parameter space.

Alternative assumptions on the magnitude of the FOI and decay rate.

In model 1, we tested three different FOI values: 1) λ taken from Cattarino et al. [3]; 2) twice the value taken from Cattarino et al. [3]; and 3) half the value taken from Cattarino et al. [3], as shown in Fig C in S1 Text.

In model 2, we applied our framework to three alternative scenarios exploring different FOI and decay rate values: scenario 1) λ taken from Cattarino et al. [3] and a decay rate α = 0.02 from Imai et al. [10]; scenario 2) λ equal to twice the estimate in Cattarino et al. [3] and α = 0.02; and scenario 3) λ taken from Cattarino et al. [3] and a higher decay rate α = 0.06. Table 1 and Fig D in S1 Text show the values of λ and α used in the three scenarios.

In model 3 we tested three different patterns of transmission, with increasing, decreasing, and non-monotonic FOIs as shown in Fig E in S1 Text. In the selection of the optimal sampling scenario, we calculated the average bias, coverage and uncertainty over the three FOIs.

Results

A total of 2,051 blood samples from a previous seroprevalence study conducted in three cities in Ghana (Accra, Kumasi and Tamale) [20], were available for testing for dengue antibodies (Table 1, scenario 0). Younger age-groups were less represented in the study population than in the general population, with 24.6% of the participants belonging to the 10–24 years age-group, compared to an expected 41.2% from the national age structure published in the 2022 Revision of World Population Prospects (WPP) (Fig 1). The 20–34 years age-group was the most represented (39.3% of participants) [20].

Fig 2 shows the bias, uncertainty, and coverage obtained across all scenarios using the 5-year age categories for the three Ghanian cities under model 1 and the Cattarino et al. FOI estimates [3]. The corresponding figure for the 10-year age categories is presented in the Supporting Information (Fig B in S1 Text). Overall, we found that reducing the sample sizes did not bias the FOI estimates, except for scenario A (which had the smallest number of samples) where the central estimate was consistently larger. On the other hand, as expected, we found that reducing the sample sizes increased the uncertainty of the FOI estimates, depending on the number and distribution of the samples by age. On average, we observed a coverage between 93% and 96% across all analysed scenarios, with some variability depending on the city. Fig 2 shows that reducing the number of tests in certain age-groups (such as in scenario B, highlighted in green) can increase the coverage without biasing the estimates.

thumbnail
Fig 2. Summary of the accuracy metrics obtained for Accra, Kumasi and Tamale with 5-year age category across scenarios.

The four columns represent respectively the bias (in percentage), coverage (in percentage), number of tested samples and uncertainty obtained for each scenario. The scenario highlighted in green indicates the selected scenario. The median bias and uncertainty are reported with their 95% CrI (columns 1 and 4), while the median coverage is reported with its 95% exact binomial CI. The orange line represents the median (columns 1, 2 and 4) and the orange ribbon represents the 95% CrI of the baseline scenario 0 (columns 2 and 4). The pink ribbon in the first column represents the 15% tolerance around the upper bound of the 95% CrI of scenario 0, which was used in the first step of the selection criterion.

https://doi.org/10.1371/journal.pcbi.1011666.g002

In Ghana, using the FOI estimates from Cattarino et al. [3] and the simple catalytic model (model 1), we found that scenarios B and D, which included fewer samples from the older age-groups, showed a higher coverage and a lower uncertainty compared to the baseline scenario (scenario 0). This indicates that selecting the appropriate age distribution of the samples can help improve the accuracy of the FOI estimates, while at the same time reducing the sample sizes. Scenarios B and D provided the best combination of coverage, bias and uncertainty in both the 5-year (Fig 2) and 10-year (Fig B in S1 Text) age categories for the three cities. In these scenarios, the distribution of samples in the two age categories led to a similar uncertainty, but the coverage was generally higher in the 5-year age category, which also required testing fewer samples than with the 10-year group (a total of 1,487 instead of 1,610). As a result, using the proposed selection criterion, the total sample size of the study in Ghana was reduced to 1,487 samples from the available 2,051 samples.

Fig 3 shows the expected seroprevalence profile and model fit obtained from the baseline scenario (scenario 0) versus those obtained from the selected scenario, indicating no significant deviation in model fit caused by the reduced samples sizes and adopted sampling strategy.

thumbnail
Fig 3. Model fit for the three cities under the 5-year age categorization with model 1.

The first row represents the fit for scenario 0, where the median and 95% CrI of the estimated seroprevalence are reported in orange as a line and shaded area, respectively. The second row illustrates the model fit with the selected scenario in green. In both panels, the datasets used for model fitting are shown in black, with the error bars showing the exact binomial 95% Confidence Interval (CI). The third row shows a comparison of the estimated seroprevalence in scenario 0 (orange) and the optimal scenario (green).

https://doi.org/10.1371/journal.pcbi.1011666.g003

The optimal testing scenario changes with the expected FOI

In a sensitivity analysis we tested the impact of assuming alternative FOI values for the three locations, as described in Table 1. Fig 4 summarises the results obtained under a high and low FOI, showing that in higher FOI settings testing younger age-groups is preferrable (scenario B), while in lower FOI settings testing of older age-groups (scenario E) is required to get accurate FOI estimates. In both cases the chosen scenario allowed to estimate the FOI with high precision and low uncertainty, as shown in Table 1. The model also fitted the simulated data, as shown in Fig F in S1 Text.

thumbnail
Fig 4. Model fit for Accra under the 5-year age categorization and a high and low FOI.

The first row represents the seroprevalence using the chosen scenario, where the median and 95% credible interval of the estimated seroprevalence are reported in green as a line and shaded area, while the simulated dataset is reported in black, with the error bars indicating the exact binomial 95% CI. The second row illustrates the age-distributions of the tests with the chosen scenario (in green) for each city, compared to the available sample distribution in orange. The figure reports the results obtained with the high FOI (first column) and low FOI (second column).

https://doi.org/10.1371/journal.pcbi.1011666.g004

The simulation framework works with decaying immunity and across transmission settings

The proposed simulation framework can successfully reconstruct the FOI also in scenarios with antibody decay and temporal changes in the intensity of transmission, including increasing, decreasing and non-monotonic transmission intensity patterns, and allows to identify an optimal testing scenario also when using these alternative assumptions on immunity decay and temporal changes in the FOI.

The selected testing scenarios varied across locations and with the modelled assumptions (Table 1). The simulation-based framework under model 2 could capture the seroprevalence data (Fig G in S1 Text), and produced accurate estimates of the FOIs and decay rates (Fig D in S1 Text), with a slightly larger uncertainty around the decay rate than in the FOI. Scenario A, which includes the minimum number of tests performed across all age-groups, was among the preferred scenarios for the antibody decay model (model 2). However, when assuming a high FOI, the chosen scenario was B, which involves the testing of the younger age-groups.

We found preferential testing of older age-groups (scenarios C and E) when using the time-varying FOI model (model 3), which can be explained by the fact that older age-groups are the only ones providing information to estimate the earliest FOI (λ3) in the time-varying model. The simulation framework can reconstruct the expected seroprevalence with good accuracy (Fig H in S1 Text) and provides precise estimates of all FOIs across the three time-varying scenarios explored, with a slight increase in uncertainty for the earliest FOI (λ3) (Fig E in S1 Text).

Discussion

Estimating the transmission intensity and burden of dengue globally is key to quantifying the current and future risk of infection and disease, and health care demand. In addition, it is critical to assess the potential impact of new interventions, such as vaccination campaigns implemented alone or in combination with vector control interventions [26,27], both prospectively and retrospectively. Across Africa, only 17 seroprevalence surveys have been conducted to date [3,15] and, as such, the African region remains a continent with limited knowledge of the historical circulation and seroprevalence of dengue [8]. Serological surveys are, therefore, invaluable surveillance tools to help understand and estimate dengue transmission intensity and its heterogeneity across the continent [28].

We propose a simulation-based method to guide secondary testing of existing serosurveys for other pathogens. Retargeting existing serosurveys has been previously explored, for example by Carcelen et al. [29], who piggybacked on a HIV national serosurvey performed in Zambia to test for anti-measles and anti-rubella virus IgG antibodies, using almost 10,000 residual sera. While testing existing samples may not be suitable in all circumstances, there are several advantages to secondary testing of existing blood samples and the number of recent serosurveys conducted globally and across Africa during the COVID-19 pandemic provide new opportunities to assess arbovirus transmission [30,31]. Using existing samples from previous well-designed serosurveys reduces costs, resources and logistics, thus providing new opportunities for serosurveillance across multiple locations when the budget is limited. This approach is particularly important, given the limited resources available for public health initiatives in many parts of the world, and our study contributes to the development of new strategies to make best use of available resources and improve our understanding of viral seroprevalence.

The proposed framework can determine the testing scenario under different transmission intensity assumptions (constant FOI, time-varying FOI and with antibody decay) and estimate the FOI with high accuracy under the chosen scenario. Notably, different assumptions on the intensity of transmission as well as on the duration of immunity and on time-varying patterns of transmission influence the choice of the testing scenario. Moreover, we found that the FOI estimates obtained with model 2 (with antibody decay) and model 3 (with time-varying FOI) exhibit larger bias and uncertainty compared to the estimates obtained with model 1, which is expected given the larger number of parameters estimated using the same data.

In the context of the SERODEN survey in Ghana, we used the simplest constant FOI model due to the evidence that dengue antibodies are long lasting across several transmission settings [10]. Under this model, the testing scenarios selected in Ghana were scenario B for Accra and Tamale, and scenario D for Kumasi. Both scenarios require testing of the samples from young age-groups, which demonstrates the importance of recruiting children in seroprevalence studies. In the case of Kumasi, scenario B had a slightly higher bias and lower coverage compared to the baseline scenario, while scenario D not only decreased the bias but also increased the coverage while maintaining a low uncertainty around the estimate which made it the best one for this location. This demonstrates how the chosen testing scenario depends on both the transmission setting and the available number of samples.

One limitation of the simulation approach adopted in this study is the use of imputed FOI estimates from the first global FOI map for dengue developed by Cattarino et al. [3] which, despite the accurate calibration, was validated on a limited number of FOI estimates from the African region. In the absence of pre-existing information that can be used to inform the assumed FOI estimates, one possible strategy to overcome the intrinsic uncertainty associated with model estimates would be to implement serological surveys in two steps, e.g., by (1) testing half of the sample sizes identified in the simulation study according to the optimal age-distribution based on initial calculations, and these results can then be used to update the simulation study and inform the age-distribution, before (2) testing the other half of the samples (Fig 5). In S1 Text we provide an example of how it is possible to adjust the distribution of the second half of the samples in a hypothetical situation where the FOI from the interim analysis (obtained from testing half of the samples in step 1) was different from the assumed one. We provided two examples exploring the case where the FOI estimate after step 1 was half the assumed one, and another where the FOI was twice the value assumed in the planning phase. The results are shown in the Supporting Information (Fig K in S1 Text), and in both cases, our framework and the two-step approach allowed us to efficiently adapt the testing scheme to the new information and re-distribute the remaining samples accordingly to accurately estimate the FOI (Figs K and L in S1 Text).

thumbnail
Fig 5. Conceptual description of the workflow for the implementation of serological surveys in the absence of previous information (such as serosurvey or case-notification data) on the FOI.

Illustration of the two-step process proposed to identify the sample sizes and optimize their age-distribution when the study is conducted in locations with little or no prior information on dengue FOI.

https://doi.org/10.1371/journal.pcbi.1011666.g005

Another limitation of the method we developed is the empirical choice of the tolerance used to determine sample sizes such as the 15% difference from the upper bound of the 95% CrI of the baseline scenario which can be tweaked depending on the targeted accuracy and available budget and resources. Nonetheless, the methodological framework developed in this study provides a rationale that can be applied to secondary testing of existing blood samples, including in locations where there is limited previous information on the historical circulation and transmission intensity of a pathogen, while optimizing the accuracy of the FOI estimates and the use of resources.

Conclusion

Age-stratified serological surveys are an ideal surveillance tool for reconstructing the age-dependent immunity profile of a population against one or more circulating viruses. Because the implementation of new serological surveys requires significant financial and human resources, testing stored blood samples (e.g., from previous serological surveys) can be an alternative strategy to estimate the FOI. In this work, we developed a method to inform the number of samples and age-distribution to test when using existing samples. In the presence of large uncertainties around the accuracy of the FOI estimates, we propose implementing the serological survey in two-steps, including an interim analysis of the results obtained upon testing half of the samples, which informs the ultimate sample distribution using the evidence collected from the transmission setting. The method developed in this paper can be used when testing existing blood samples for antibodies against different diseases with long-lasting or waning immunity as well as with time-varying transmission patterns, including in locations with limited prior information on the historical circulation of the pathogen of interest.

Supporting information

S1 Text. Includes Figs A-H, Tables A-I and additional details on the Two-steps sampling approach.

https://doi.org/10.1371/journal.pcbi.1011666.s001

(PDF)

S1 Code. Incudes the code required to reproduce the analysis and results presented in this paper.

Within the "script" directory, the main analysis using Model 1 can be executed using sequentially numbered files 1 to 6. The analysis pertaining to the antibody decay (model 2) and time-varying FOI (model 3) can be run using the scripts numbered 7 to 10.

https://doi.org/10.1371/journal.pcbi.1011666.s002

(ZIP)

References

  1. 1. Who, Dengue disease, https://www.afro.who.int/health-topics/dengue.
  2. 2. Guzman MG, Gubler DJ, Izquierdo A, Martinez E, Halstead SB. Dengue infection. Nature reviews Disease primers. 2016;2(1):1–25. pmid:27534439
  3. 3. Cattarino L, Rodriguez-Barraquer I, Imai N, Cummings DAT, Ferguson NM. Mapping global variation in dengue transmission intensity. Science translational medicine. 2020;12(528):eaax4144. pmid:31996463
  4. 4. Gainor EM, Harris E, LaBeaud AD. Uncovering the burden of dengue in Africa: considerations on magnitude, misdiagnosis, and ancestry. Viruses. 2022;14(2):233. pmid:35215827
  5. 5. Messina JP, Brady OJ, Golding N, Kraemer MUG, Wint GR, Ray SE, et al. The current and future global distribution and population at risk of dengue. Nature microbiology. 2019;4(9):1508–15. pmid:31182801
  6. 6. Mordecai EA, Ryan SJ, Caldwell JM, Shah MM, LaBeaud AD. Climate change could shift disease burden from malaria to arboviruses in Africa. The Lancet Planetary Health. 2020;4(9):e416–e23. pmid:32918887
  7. 7. Joshua L, Abel WW, Victor M, Oliver K, Furaha M, Jennifer SL. Aedes albopictus invasion across Africa: the time is now for cross-country collaboration and control. The Lancet Global Health. 2023.
  8. 8. Bhatt S, Gething PW, Brady OJ, Messina JP, Farlow AW, Moyes CL, et al. The global distribution and burden of dengue. Nature. 2013;496(7446):504–7. pmid:23563266
  9. 9. Eltom K, Enan K, El Hussein ARM, Elkhidir IM. Dengue Virus Infection in Sub-Saharan Africa Between 2010 and 2020: A Systematic Review and Meta-Analysis. Frontiers in cellular and infection microbiology. 2021;11:678945. pmid:34113579
  10. 10. Imai N, Dorigatti I, Cauchemez S, Ferguson NM. Estimating dengue transmission intensity from sero-prevalence surveys in multiple countries. PLoS neglected tropical diseases. 2015;9(4):e0003719. pmid:25881272
  11. 11. Surveillance and control of arboviral diseases in the WHO African region: assessment of country capacities, author = World Health Organization and UNICEF and others. 2022.
  12. 12. Cutts FT, Hanson M. Seroepidemiology: an underused tool for designing and monitoring vaccination programmes in low- and middle-income countries. Trop Med Int Health. 2016;21(9):1086–98. pmid:27300255
  13. 13. Rodriguez-Barraquer I, Salje H, Cummings DA. Opportunities for improved surveillance and control of dengue from age-specific case data. Elife. 2019;8:e45474. pmid:31120419
  14. 14. O’Driscoll M, Imai N, Ferguson NM, Hadinegoro SR, Satari HI, Tam CC, et al. Spatiotemporal variability in dengue transmission intensity in Jakarta, Indonesia. PLoS neglected tropical diseases. 2020;14(3):e0008102. pmid:32142516
  15. 15. Vicco A, McCormack C, Pedrique B, Ribeiro I, Malavige GN, Dorigatti I. A scoping literature review of global dengue age-stratified seroprevalence data: estimating dengue force of infection in endemic countries. medRxiv. 2023:2023.04.07.23288290.
  16. 16. Sepulveda N, Paulino CD, Drakeley C. Sample size and power calculations for detecting changes in malaria transmission using antibody seroconversion rate. Malar J. 2015;14:529. pmid:26715538
  17. 17. Blaizot S, Herzog SA, Abrams S, Theeten H, Litzroth A, Hens N. Sample size calculation for estimating key epidemiological parameters using serological data and mathematical modelling. BMC Med Res Methodol. 2019;19(1):51. pmid:30845904
  18. 18. Larremore DB, Fosdick BK, Bubar KM, Zhang S, Kissler SM, Metcalf CJE, et al. Estimating SARS-CoV-2 seroprevalence and epidemiological parameters with uncertainty from serological surveys. Elife. 2021;10. pmid:33666169
  19. 19. DNDi, SERODEN. https://dndi.org/research-development/portfolio/dengue-seroprevalence-and-burden-studies/.
  20. 20. Lorenz E, Souares A, Amuasi J, Loag W, Deschermeier C, Fusco D, et al. Seroprevalence of SARS-CoV-2 in urban settings in three sub-Saharan African countries (SeroCoV): a study protocol for a household-based cross-sectional prevalence study using two-stage cluster sampling. BMJ open. 2021;11(12):e056853. pmid:34921091
  21. 21. Desa U. World population prospects 2022. New York: United Nations Department of Economic and Social Affairs, Population Division. https://population.un.org/wpp/
  22. 22. Bonney JHK, Hayashi T, Dadzie S, Agbosu E, Pratt D, Nyarko S, et al. Molecular detection of dengue virus in patients suspected of Ebola virus disease in Ghana. PLoS One. 2018;13(12):e0208907. pmid:30566466
  23. 23. DiazGranados CA, Bonaparte M, Wang H, Zhu M, Lustig Y, Schwartz E, et al. Accuracy and efficacy of pre-dengue vaccination screening for previous dengue infection with five commercially available immunoassays: a retrospective analysis of phase 3 efficacy trials. The Lancet Infectious Diseases. 2021;21(4):529–36. pmid:33212068
  24. 24. Gelman A, Lee D, Guo J. Stan: A probabilistic programming language for Bayesian inference and optimization. Journal of Educational and Behavioral Statistics. 2015;40(5):530–43.
  25. 25. Cox V, O’Driscoll M, Imai N, Prayitno A, Hadinegoro SR, Taurel AF, et al. Estimating dengue transmission intensity from serological data: A comparative analysis using mixture and catalytic models. PLoS Negl Trop Dis. 2022;16(7):e0010592. pmid:35816508
  26. 26. Huang C-H, Tsai Y-T, Wang S-F, Wang W-H, Chen Y-H. Dengue vaccine: an update. Expert Review of Anti-Infective Therapy. 2021;19(12):1495–502. pmid:34182875
  27. 27. Edenborough KM, Flores HA, Simmons CP, Fraser JE. Using Wolbachia to eliminate dengue: Will the virus fight back? Journal of virology. 2021;95(13):e02203–20. pmid:33853965
  28. 28. Fischer C, Jo WK, Haage V, Moreira-Soto A, de Oliveira Filho EF, Drexler JF. Challenges towards serologic diagnostics of emerging arboviruses. Clinical microbiology and infection: the official publication of the European Society of Clinical Microbiology and Infectious Diseases. 2021;27(9):1221–9. pmid:34111589
  29. 29. Carcelen AC, Winter AK, Moss WJ, Chilumba I, Mutale I, Chongwe G, et al. Leveraging a national biorepository in Zambia to assess measles and rubella immunity gaps across age and space. Scientific reports. 2022;12(1):10217. pmid:35715547
  30. 30. Bergeri I, Whelan MG, Ware H, Subissi L, Nardone A, Lewis HC, et al. Global SARS-CoV-2 seroprevalence from January 2020 to April 2022: A systematic review and meta-analysis of standardized population-based studies. PLoS Med. 2022;19(11):e1004107. pmid:36355774
  31. 31. Lewis HC, Ware H, Whelan M, Subissi L, Li Z, Ma X, et al. SARS-CoV-2 infection in Africa: a systematic review and meta-analysis of standardised seroprevalence studies, from January 2020 to December 2021. BMJ Glob Health. 2022;7(8). pmid:35998978