Analysing trends and forecasting malaria epidemics in Madagascar using a sentinel surveillance network: a web-based application
- Florian Girond1, 2Email authorView ORCID ID profile,
- Laurence Randrianasolo1,
- Lea Randriamampionona1, 3,
- Fanjasoa Rakotomanana1,
- Milijaona Randrianarivelojosia1,
- Maherisoa Ratsitorahina1, 3,
- Télesphore Yao Brou2, 4,
- Vincent Herbreteau2,
- Morgan Mangeas2,
- Sixte Zigiumugabe5,
- Judith Hedje5, 6,
- Christophe Rogier1, 7, 8 and
- Patrice Piola1
© The Author(s) 2017
Received: 22 November 2016
Accepted: 8 February 2017
Published: 13 February 2017
The use of a malaria early warning system (MEWS) to trigger prompt public health interventions is a key step in adding value to the epidemiological data routinely collected by sentinel surveillance systems.
This study describes a system using various epidemic thresholds and a forecasting component with the support of new technologies to improve the performance of a sentinel MEWS. Malaria-related data from 21 sentinel sites collected by Short Message Service are automatically analysed to detect malaria trends and malaria outbreak alerts with automated feedback reports.
Roll Back Malaria partners can, through a user-friendly web-based tool, visualize potential outbreaks and generate a forecasting model. The system already demonstrated its ability to detect malaria outbreaks in Madagascar in 2014.
This approach aims to maximize the usefulness of a sentinel surveillance system to predict and detect epidemics in limited-resource environments.
Early detection of outbreaks and rapid control actions are essential to prevent and contain the spread of infectious diseases to reduce morbidity and death. The implementation of an automated early warning system (EWS) is a key step in adding value to the epidemiological data routinely collected by surveillance systems to improve the timeliness of detection of diseases outbreaks. The World Health Organization (WHO) supports the strengthening of existing infectious disease surveillance systems by developing such EWSs . Monitoring of the epidemic risk of malaria may integrate sequential and complementary components, such as an early detection system (EDS), an EWS and long-range forecasting (LRF) .
An increasing number of statistical methods for detecting changes in trends  have been developed, but there is not yet a single reference standard . The absence of a gold standard in past epidemics and the lack of consensus  on outbreak characterization has serious operational implications and can become a stumbling block for EWS implementation.
In Madagascar, malaria burden has decreased in recent decades, mainly due to successful malaria control interventions . Nevertheless, an upsurge of malaria outbreaks in recent years has highlighted the need for a malaria EWS (MEWS) adapted to the Malagasy context. An innovative interactive MEWS with a web-based interface that includes standard, such as Cumulative-Sum or Mean +2 standard deviations [1, 11], and alternative outbreak detection methods could strengthen the national surveillance system. Recent open-source Internet technology allows processing of surveillance data and application of outbreak detection algorithms with automatic and interactive graphical feedback . Thus, current web-based technologies allow user-friendly assessments of an outbreak hypothesis with model comparisons using prospective surveillance data as well as retrospective descriptions of the effects of malaria epidemic. An integrated system for real-time detection and forecasting could also be a pathway for the dissemination and communication of results.
Here, this study describes a system with intertwining of new electronic health (e-Health) technology (i) to assess the benefits of a MEWS including not only early detection but also forecasting based on a sentinel surveillance system, (ii) to maximize the potential of the sentinel surveillance system by innovative but simple explorations of population health data, and (iii) to provide practical examples and suggestions for use in other systems or settings.
Fever sentinel surveillance network
The Institut Pasteur de Madagascar (IPM) and the Malagasy Ministry of Health (MoH) implemented a sentinel surveillance system based on primary health care centers (PHCCs); this system expanded from 13 sites in 2007 to 34 sites in 2011. Expansion was intended to improve geographic coverage and representativeness and to make it possible to monitor epidemiologic trends in different climate zones  (Fig. 1). Sentinel sites include the presence of at least two general practitioners and a mobile phone network available. Participation is entirely voluntary. Sentinel general practitioners (SGPs) serving on a gratuitous, voluntary basis are the backbone of the system, which currently covers about 8% of the Malagasy population. Supervision of sentinel sites is performed twice a year, either by the team of the medical inspector or by the IPM team and the central level of the Ministry. There is an evaluation of the quality of the rapid diagnostic test used by the sentinel sites and a verification of the quality of the data collected during the supervision. The system monitors several potential epidemic diseases: malaria, influenza-like illnesses, suspected arboviral infections and diarrhoeal syndromes. Per national policies, every febrile patient is tested for malaria infection with an RDT. Data are aggregated and reported daily by short message service (SMS). The data are then automatically stored in a PostgreSQL database hosted on a dedicated server at IPM. The data received by SMS include: sentinel site code, date of data collection, total number of outpatient consultations, total number of confirmed malaria cases, total number of ILI cases, total number of dengue-like cases, total number of diarrhoea cases, and the number of consultations by age group. The age groups were those commonly used by the Ministry of Health in Madagascar: less than 1 year, 1–4 years, 5–14 years, 15–24 years, 25 years and over. The reporting system, which is based on mHealth technology, has been improved using Android smartphones and dedicated data entry forms.
Statistical detection methods
There are several different ways to define epidemic alert thresholds, and the three most commonly used methods have been included in the surveillance system: (i) Mean + 2 standard deviation (Mean + 2SD) . The method is based on the weekly mean calculated from previous 5 years plus 2 standard deviations where “Epidemic years” must be excluded; (ii) Cumulative SUM + 2 standard deviations (C-SUM + 2SD) . This method is a derivative of the mean + 2 standard deviations. To improve specificity, expected number of cases used the average for 3 weeks (including the previous and following weeks)  during the previous 5 years plus two standard deviations [13, 14]; and (iii) the weekly slope . The method is defined as a doubling of cases during 3 consecutive weeks. Weekly slope is included in core policy documents  from the Malagasy MOH. A fourth percentile-based method has been specifically developed by IPM using a threshold defined as weekly malaria cases exceeding the 90th percentile value. The percentile value is not seasonal-dependent and calculated over the whole chronological series of a site.
For these four methods, an alert is triggered if the defined threshold is exceeded for the three previous consecutive weeks 3 to increase the specificity of the alert system for any given threshold . These four algorithms were applied on the FSS dataset to determine the operability of signals and to assess the algorithms’ usefulness in the identification of outbreaks. To reduce noise in detected signals, 13 sites with a maximum weekly number of malaria cases lower than 10 were excluded.
Each detection method was applied to each sentinel site over the 52 weeks from 2014-05-26 to 2015-05-18, representing the last complete cycle of malaria seasons, such as low season (LS), moderate season (MS) and high season (HS). All historical datasets since the setting up of sentinel surveillance in 2007, excluding the year (52 weeks) being tested, were used to define the baseline for the Mean + 2 SD and C-SUM + 2 SD methods . “Epidemic years” were not excluded from the base years because there is no standardized method to define them retrospectively and because the MoH has not officially reported any epidemics.
Forecasting may rely on several techniques related to statistics and mathematical modeling or machine-learning methods . The forecasting method used on sentinel dataset is based on a statistical method known as Seasonal Auto-Regressive Integrated Moving Average (SARIMA) , with use of external regressor variables (SARIMAX) [19–21] including satellite weather data and information on transmission-reducing interventions. SARIMA(X) models are designed to account for serial autocorrelation in seasonal time series.
Satellite weather data related to changes in malaria prevalence such as temperature, rainfall and Normalized Difference Vegetation Index (NDVI)  provided by the International Research Institute for Climate and Society (IRI) through a web server  are routinely and automatically acquired by the surveillance system. Historical data up to April 2007 are also downloaded. The data are processed to match epidemiological weeks and are stored in a PostgreSQL database.
Time–space data on Malaria Control Interventions (MCIs) were obtained from national (National Malaria Control Programme) and international (President’s Malaria Initiative) agencies in charge of malaria control in Madagascar. Indoor Residual Spraying (IRS) data have been available only since 2008. Two LLIN mass distribution campaigns were implemented in Madagascar at the end of 2009 and 2012 (coverage ranges from 80 to 94%). Data on LLIN distribution were available at district level on a weekly basis and encoded in the database as a binary variable: weekly absence or presence of distribution.
A retrospective analysis was performed on sentinel sites for which a malaria alert was detected by this system and subsequently confirmed by an epidemiological investigation. The SARIMA(X) model was selected using the forecast package , which is available for the R programming environment . Model selection was automated using the auto.arima function, which performs a stepwise regression on the data and selects the best model based on the Akaike Information Criterion (AIC). The time series were log transformed to induce constant variance. Rainfall and temperature were log transformed and lagged from 0 to 8 weeks to account for the delayed effects of weather on malaria infections . The variable representing interventions against malaria was defined as the time elapsed in weeks since the last intervention and was categorized as (i) less than 1 year (≤52 weeks, reference) (ii) >52 weeks and ≤104 weeks, (iii) more than 2 year (>104 weeks) [Girond et al. pers. comm.]. To select the most pertinent predictors, both forward and backward stepwise methods were used . The entire dataset and all associated variables were used to train the model.
The model was fitted using data from 2009-12-21 to 2015-02-16. The last 52 weeks of the time series were withheld from model fitting and used to make a one-step-ahead forecast. The simulation proceeded by iteratively adding a new week of data, training a new model based on the updated data, and predicting the number of malaria cases for the following weeks (n = 231 for model development, n = 52 for external validation). The pre-dic-tive per-for-mance of the SARIMAX model was estimated using a confusion matrix , showing the proportions of predicted outbreak and detected outbreak based on the percentile-based method over the 52 weeks.
An incidence analysis of signals showed inter- and intra-variability for the detection methods. The frequency of signals was roughly equivalent across transmission periods for C-SUM + 2 SD and Mean + 2 SD. The C-SUM + 2 SD method generated a stable incidence of alerts, with 35, 42 and 34 alerts in the LS, MS and HS respectively. The Mean + 2 SD method was also constant, with 31, 37 and 34 alerts, respectively. The frequency of alerts was progressive for the MOH method across the transmission periods, but the numbers of alerts were low, with 3, 4, and 5 for LS, MS and HS, respectively. Alert frequencies increased gradually across the three periods of transmission with the percentile-based method, with 4 alerts triggered during LS, followed by 38 alerts during MS and 51 alerts during HS. The frequency of alerts started its sharp rise in the middle of MS (Fig. 5b).
The variability of the duration of the detected signals was noteworthy across detection methods. The range of durations of detected signals for Mean + 2 SD and C-SUM + 2 SD varies from 1 to 31 weeks and 29 weeks, respectively. The duration of detected signals by percentile-based method ranges from 1 to 13 weeks. The maximum duration of detected signals for the MoH method did not exceed 1 week (Fig. 5a).
An outbreak in the southeastern part of the country for the sentinel site of Farafangana (Fig. 1) on 2014-10-06 has been detected using the percentile-based method. This detected signal indicated that a historical level of malaria cases was reached 6 weeks before the moderate transmission period and 6 months before the high transmission period.
Retrospective SARIMA evaluation
The SARIMAX model [1, 4] (1, 0, 1) had the lowest AIC value (77.89) for this dataset and therefore was the best-fit model, with root-mean-square deviation (RMSE) = 0.26 and mean absolute scaled error (MASE) = 0.15. Thus, to conclude, the SARIMA [1, 4] (1, 0, 1) 52 model fit the data well.
Prospective SARIMA evaluation
Mhealth and website
The reporting system based on mHealth technology, is using the Android operating system smartphones. This new open-source technology runs through a dedicated application developed by IPM, involving handheld data entry in the national language, a feedback report with automated analysis via charts and maps, and an edutainment-based learning solution. No Internet is required, which avoids the need to cope with patchy Internet coverage; the Android application generates all outgoing SMS messages, which are streamlined into the central surveillance server, and decrypts the “feedback SMS” generated by the surveillance server.
The operational web-based surveillance system includes both an EDS and a forecasting model. The website is accessible to Roll Back Malaria partners . Epidemic threshold detection algorithms are integrated into the website and applied to the sentinel dataset in real time (Fig. 3). The selected detection methods can be easily modified by changing the number of years in the baseline dataset (i), the number of standard deviations (ii), the slope value (iii), the percentile value (iv) and the number of weeks above the threshold (v) by intuitive pick-and-click functionality. The results are instantly displayed both on interactive charts and maps. Users can superimpose additional data such as temperature, rainfall, and NDVI data from satellite Earth observation and also malaria control interventions (LLIN use and IRS).
This sentinel surveillance coupled with a technology platform has yielded positive results, detecting the 2014-10-06 outbreak in the southeastern part of the country. The web-based surveillance system, with automated analysis and timely output, allowed real-time monitoring and communication with RBM partners about this malaria event. The high number of malaria cases reported, and the assumption of the existence of a plasmodium reservoir preceding the rainy season together with limited access to artemisinin-based combination therapy (ACT) in the whole area suggested a worsening malaria situation the following weeks. The affected local public health jurisdiction concomitantly alerted the MoH about excess mortality and morbidity beyond their response capabilities , and this outbreak was subsequently confirmed by an epidemiological investigation.
The use of methods based on several consecutive weeks above the threshold, with the aim of improving the methods’ specificity, accordingly reduced the methods’ ability to detect incipient outbreaks at the earliest stages. An EDS has to be strengthened by a forecasting method to provide lead time benefits . The malaria outbreak on the eastern coast was predicted with a sensitivity of 83% and a specificity of 78% up to 4 weeks in advance (accuracy of 0.80%, 95% CI [0.66, 0.90]). Nevertheless, the model predicted a threshold overrun, but the stochastic behaviour of epidemics limited the prediction of the amplitude. The system can give timely alerts for epidemic control, even if it is unable to provide very accurate predictions of malaria case numbers. The improved lead time of an EDS, however, comes at the expense of a degree of accuracy .
The MEWS is accessible through a user-friendly web-based interface  for both internal use and use by external organizations and donors. This MEWS allows rapid dissemination, interpretation and subsequent action to control any suspected outbreak. Recent open-source technology also allows for the development and improvement of an interactive web-based interface with dedicated analysis and visualization output. Furthermore, based on R language (coupled with Shiny package), its growing popularity in the scientific community makes this technological platform easily modifiable and maintainable and also transferable.
The detection and reporting methods for malaria cases of the sentinel surveillance system remained unchanged since its implementation. The increasing trend observed from 2011 to 2015 (Fig. 4) reflects the malaria situation at national level over the same period . The outbreak thresholds in this analysis were defined based on the absolute number of malaria cases due to the lack of population denominators to calculate the malaria incidence for health facilities. The population size in the catchment area of each facility (denominator) is also unavailable and varies with the availability of health care from the private and informal sectors. Forecasting models and surveillance systems should be improved through the integration of additional covariables, such as the availability of ACT. The resurgence of malaria across most of Madagascar in 2014 occurred in the context of nearly generalized ACT stock-outs. Furthermore, the inclusion of individual data in the surveillance system would allow enhanced description of malaria transmission (i.e., description of the most vulnerable age groups). Such a system might also be reinforced by integration of transmission-reducing interventions on a smaller scale across both time and space. Staffs from the Ministry of Health were involved in the project through regular working group meetings and MoH medical epidemiologists were permanently detached in our team to ensure a constant transfer of knowledge and experience. The Health Monitoring and Disease Surveillance were very supportive of this sentinel project and used several of its successful components to improve their nationwide surveillance system. There are indeed challenges in extending an electronic based surveillance system to an entire country, although it is admitted that the current paper-based surveillance system does not allow a prompt analysis of trends to detect emerging epidemics. A progressive scaling-up of e-surveillance to health centres using affordable technologies is deemed reasonable and efficient, and is currently being promoted by WHO as a way forward. This technology-based approach to surveillance has a great potential for real-time evaluation of malaria control interventions at both the national and the regional levels.
The authors describe an automated malaria outbreak detection system using percentile-based statistical detection method that uses data electronically collected in Madagascar by FSS. The system assesses data as soon as they are made available and disseminates the information by means of the Internet and smartphone to all involved health professionals to help in the rapid interpretation and subsequent action to control any suspected malaria outbreak.
Much still needs to be done and efforts are now focusing on the expansion of the surveillance system with the aim of a progressive and realistic “strengthening” to improve outbreak detection and forecasting system to malaria elimination. This approach, entirely based on free and open-source technology, should also benefit other initiatives aimed at improving surveillance data management in other health care facilities and provides a demonstration project for improving existing systems in Africa.
PP initiated the project. FG drafted the manuscript. LR and LR contributed the data. PP, VH and MM contributed to the interpretation of data. All authors read and approved the final manuscript.
This research was supported by the USAID (Grant No. AID-687-G-13-00003). We thank administration authorities and health authorities from the Ministry of Health, the National Malaria Control Program and the President’s Malaria Initiative. We especially thank all the sentinel surveillance team of the Institut Pasteur de Madagascar. We thank Bienvenue Rahoilijaona, Reziky Mangahasimbola and Stephan Valentini for data management and Android development.
The authors declare that they have no competing interests.
Availability of data and materials
Data are available from the Ministry of Health and from the Institut Pasteur de Madagascar.
Consent for publication
All authors approved the manuscript’s submission for publication.
“The opinions expressed by authors contributing to this journal do not necessarily reflect the opinions of the Centers for Disease Control and Prevention or the institutions with which the authors are affiliated.”
This research was supported by the USAID (Grant No. AID-687-G-13-00003).
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- WHO. Roll back malaria. Malaria early warning system. A Framework for Field Research in Africa: concepts, indicators and partners. Geneva: World Health Organization; 2001.
- WHO. Prevention and control of malaria epidemics. Third meeting of the Technical Support Network. Geneva: World Health Organization; 22002.
- Watkins RE, Eagleson S, Hall RG, Dailey L, Plant AJ. Approaches to the evaluation of outbreak detection methods. BMC Public Health. 2006;6:263.View ArticlePubMedPubMed CentralGoogle Scholar
- Mckelvie WR, Haghdoost AA, Raeisi A. Defining and detecting malaria epidemics in south-east Iran. Malar J. 2012;11:81.View ArticlePubMedPubMed CentralGoogle Scholar
- World Health Organization. New horizons for health through mobile technologies; 2011.
- Randrianasolo L, Raoelina Y, Ratsitorahina M, Ravolomanana L, Andriamandimby S, Heraud J, et al. Sentinel surveillance system for early outbreak detection in Madagascar. BMC Public Health. 2010;10:31.View ArticlePubMedPubMed CentralGoogle Scholar
- WHO. Roll Back Malaria. Disease surveillance for malaria elimination. Geneva: World Health Organization; 2012.
- Brady OJ, Smith DL, Scott TW, Hay SI. Dengue disease outbreak definitions are implicitly variable. Epidemics. 2015;11:92–102.View ArticlePubMedPubMed CentralGoogle Scholar
- CDC. Framework for evaluating public health surveillance systems for early detection of outbreaks recommendations from the CDC Working Group. 2004.
- Ministère de la Santé Publique. Plan stratégique de lutte contre le paludisme 2007–2012—du contrôle vers l’élimination du paludisme à Madagascar. Antananarivo: Ministère de la Santé Publique; 2007. p. 1–54.Google Scholar
- Cullen JR, Chitprarop U, Doberstyn EB, Sombatwattanangkul K. An epidemiological early warning system for malaria control in northern Thailand. Bull World Health Organ. 1984;62:107–14.PubMedPubMed CentralGoogle Scholar
- RStudio. Shiny. http://shiny.rstudio.com/. Accessed 1 Jan 2012.
- Hay SI, Simba M, Busolo M, Noor AM, Guyatt HL, Ochola S, et al. Defining and detecting malaria epidemics in the highlands of western Kenya. Emerg Infect Dis. 2002;8:555–62.View ArticlePubMedPubMed CentralGoogle Scholar
- WHO. Field guide for malaria epidemic assessment and reporting. Geneva: World Health Organization; 2004.
- WHO. Prevention and control of malaria epidemics. Third meeting of the Technical Support Network. Geneva: World Health Organization; 2001.
- Teklehaimanot HD, Schwartz J, Teklehaimanot A, Lipsitch M. Alert threshold algorithms and malaria epidemic detection. Emerg Infect Dis. 2004;10:1220–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Zinszer K, Verma AD, Charland K, Brewer TF, Brownstein JS, Sun Z, et al. A scoping review of malaria forecasting: past work and future directions. BMJ Open. 2012;2:e001992.View ArticlePubMedPubMed CentralGoogle Scholar
- Hu W, Tong S, Mengersen K, Connell D. Weather variability and the incidence of cryptosporidiosis: comparison of time series poisson regression and SARIMA models. Ann Epidemiol. 2007;17:679–88.View ArticlePubMedGoogle Scholar
- Tatem AJ, Goetz SJ, Hay SI. Terra and Aqua: new data for epidemiology and public health. Int J Appl Earth Obs Geoinf. 2004;6:33–46.View ArticlePubMedPubMed CentralGoogle Scholar
- Ceccato P, Connor SJ, Jeanne I, Thomson MC. Application of geographical information systems and remote sensing technologies for assessing and monitoring malaria risk. Parassitologia. 2005;47:81–96.PubMedGoogle Scholar
- Ceccato P, Ghebremeskel T, Jaiteh M, Graves PM, Levy M, Ghebreselassie S, et al. Malaria stratification, climate, and epidemic early warning in Eritrea. Am J Trop Med Hyg. 2007;77(Suppl. 6):61–8.PubMedGoogle Scholar
- International Research Institute for Climate and Society. http://iri.columbia.edu/. Accessed 1 Jan 2013.
- Hyndman RJ. Forecasting functions for time series and linear models to make the computation tractable. https://cran.r-project.org/web/packages/forecast/index.html. Accessed 1 Jan 2014.
- R Core Team. R: A language and environment for statistical computing. Vienna: R foundation for statistical computing. 2012. https://www.r-project.org/. Accessed 1 Jan 2014.
- Midekisa A, Senay G, Henebry GM, Semuniguse P, Wimberly MC. Remote sensing-based time series models for malaria early warning in the highlands of Ethiopia. Malar J. 2012;11:165.View ArticlePubMedPubMed CentralGoogle Scholar
- Adimi F, Soebiyanto RP, Safi N, Kiang R. Towards malaria risk prediction in Afghanistan using remote sensing. Malar J. 2010;9:125.View ArticlePubMedPubMed CentralGoogle Scholar
- Hyndman RJ. Why every statistician should know about cross-validation. 2010. http://robjhyndman.com/hyndsight/crossvalidation/. Accessed 1 Jan 2014.
- Kuhn M. A Short Introduction to the caret Package. R Found Stat Comput. 2015;1–10. cran.r-project. org/web/packages/caret/vignettes/caret.pdf.
- Madagascar sentinel surveillance system. http://sentinel.pasteur.mg. Accessed 1 Jul 2016.
- Albonico M, De Giorgi F, Razanakolona J, Raveloson A, Sabatinelli G, Pietra V, et al. Control of epidemic malaria on the highlands of Madagascar. Parassitologia. 1999;41:373–6.PubMedGoogle Scholar
- Madagascar-tribune.com. Poussée de paludisme à Ampasimanjeva-Manakara. 2014. http://www.madagascar-tribune.com/Epidemie-de-paludisme-a,20453.html. Accessed 9 Nov 2014.
- Teklehaimanot HD, Schwartz J, Teklehaimanot A, Lipsitch M. Weather-based prediction of Plasmodium falciparum malaria in epidemic-prone regions of Ethiopia II. Weather-based prediction systems perform comparably to early detection systems in identifying times for interventions. Malar J. 2004;3:44.View ArticlePubMedPubMed CentralGoogle Scholar
- Global Malaria Programme, WHO. World Malaria Report. Geneva: World Health Organization; 2015.