Research | Open | Published:
Rapid case-based mapping of seasonal malaria transmission risk for strategic elimination planning in Swaziland
Malaria Journalvolume 12, Article number: 61 (2013)
As successful malaria control programmes move towards elimination, they must identify residual transmission foci, target vector control to high-risk areas, focus on both asymptomatic and symptomatic infections, and manage importation risk. High spatial and temporal resolution maps of malaria risk can support all of these activities, but commonly available malaria maps are based on parasite rate, a poor metric for measuring malaria at extremely low prevalence. New approaches are required to provide case-based risk maps to countries seeking to identify remaining hotspots of transmission while managing the risk of transmission from imported cases.
Household locations and travel histories of confirmed malaria patients during 2011 were recorded through routine surveillance by the Swaziland National Malaria Control Programme for the higher transmission months of January to April and the lower transmission months of May to December. Household locations for patients with no travel history to endemic areas were compared against a random set of background points sampled proportionate to population density with respect to a set of variables related to environment, population density, vector control, and distance to the locations of identified imported cases. Comparisons were made separately for the high and low transmission seasons. The Random Forests regression tree classification approach was used to generate maps predicting the probability of a locally acquired case at 100 m resolution across Swaziland for each season.
Results indicated that case households during the high transmission season tended to be located in areas of lower elevation, closer to bodies of water, in more sparsely populated areas, with lower rainfall and warmer temperatures, and closer to imported cases than random background points (all p < 0.001). Similar differences were evident during the low transmission season. Maps from the fit models suggested better predictive ability during the high season. Both models proved useful at predicting the locations of local cases identified in 2012.
The high-resolution mapping approaches described here can help elimination programmes understand the epidemiology of a disappearing disease. Generating case-based risk maps at high spatial and temporal resolution will allow control programmes to direct interventions proactively according to evidence-based measures of risk and ensure that the impact of limited resources is maximized to achieve and maintain malaria elimination.
Recent reductions in malaria coupled with increased funding have resulted in a renewed focus on malaria eradication . Many countries have adopted a goal of national malaria elimination , and the World Health Organization (WHO) recognizes 17 as having pre-elimination or elimination programmes . Achieving elimination is an operationally challenging endeavour requiring a strong evidence base and targeted interventions .
Many of the operational requirements for malaria elimination, including identifying residual transmission foci , targeting vector control and case detection to high risk areas , focusing not only on clinical disease but also asymptomatic infections , and managing importation risk  can be facilitated by accurate and timely creation of malaria risk maps. For example, risk maps may allow proactive deployment of vector control measures to high-risk areas to prevent local transmission, or suggest areas where active case detection may be used to identify and treat remaining parasite reservoirs.
Parasite rate-based maps for malaria are now widely available [8–11], but infection prevalence is a poor metric for measuring malaria at very low levels of endemicity due to the enormous surveys required for precise measurement in such contexts . Additionally, while the Bayesian approaches typically used provide valuable information on spatial uncertainty , they can require substantial resources and computing time to produce, neither of which may be aligned with the needs of an elimination programme. In very low transmission environments, diagnostically confirmed malaria incidence provides a more useful measure than prevalence. Understanding the epidemiology of these confirmed cases requires differentiating between imported and locally acquired cases . As endemic transmission declines, an increasing proportion of incident cases may be attributed to transmission chains traced directly to imported cases , and such imported cases may therefore become increasingly important drivers of local transmission.
Swaziland has achieved its lowest ever recorded malaria prevalence in recent years , and it aims to achieve elimination by 2015 . WHO certification of elimination requires achieving an absence of all local transmission for three years, as well as a sufficiently strong surveillance system to prove that cases would have been identified had they occurred . In sub-Saharan Africa, only Lesotho and the island of Mauritius are considered malaria-free, and only the latter achieved that status through active measures. Achieving this goal in Swaziland will require identifying and interrupting remaining foci of endemic transmission  and preventing onward transmission from the imported cases that will continue to occur from endemic neighbours . Focusing limited resources on hotspots of transmission rather than aiming for untargeted coverage could considerably improve the impact of interventions .
Swaziland instituted a reactive surveillance system in 2009 in which all notifiable diseases, including malaria, identified in health facilities are reported to a central toll-free hotline. Entry of any malaria case into the central database triggers an automated phone short message service (SMS) to the national malaria control programme (NMCP) with basic details on the patient. Surveillance agents then obtain the case patient’s contact details and directions to his or her household from the health facility where the diagnosis was made. The patient is interviewed at the household after identification; the protocol from 2009–2011 was to complete this follow up within seven days of the case report, though the programme now attempts to investigate within 48 hours. Among other information, the interview ascertains travel history, to assist with categorizing the infection as locally acquired or imported, and geocoordinates for the house location. If there is suspicion of local transmission, family members and neighbours living within 1 km of the index case are screened for malaria by rapid diagnostic test, and any individuals who test positive are referred to the nearest health facility for treatment.
Such a surveillance system is a crucial component of an elimination strategy, but achieving and maintaining elimination will require complementing it with proactive case detection to seek out cases that may never come into contact with reporting health facilities . In Swaziland’s 2010 Malaria Indicator Survey, 53.5% of women and 67.4% of children were reported as attending a health facility when febrile , leaving a substantial fraction of potential infections that may be missed by passive surveillance. Furthermore, molecular diagnostic methods have indicated that in low transmission settings such as Swaziland, a majority of infections may be asymptomatic and thus will not be identified by the passive surveillance system . Understanding and investigating all regions where unobserved transmission may be occurring will be required before elimination can be achieved.
Malaria prevalence in Swaziland is too low for standard parasite rate-based mapping to be useful , so individual case-based approaches are required to predict risk across the country. This investigation seeks to generate maps of malaria risk at fine spatial resolution from existing case-based surveillance data, including the locations of imported cases. Transmission risk maps are derived separately for the high and low transmission seasons in case the key determinants of transmission change over the course of the year. Accurate case-based risk mapping of this kind will help Swaziland to target its vector control and surveillance activities most effectively.
This investigation investigated transmission risk in Swaziland based on malaria cases identified during 2011 (Figure 1). Household locations of cases identified by passive or reactive case detection were categorized by the NMCP according to reported travel history. Infected individuals reporting no travel, whether abroad or within Swaziland, in the previous two weeks were assumed to represent locally acquired cases. Infected individuals who reported travel abroad to endemic countries within biologically meaningful windows were assumed to represent imported cases. Those who reported travel to known endemic regions of Swaziland were assumed to have “intraported” infections to their household locations and were grouped together with the imported cases. Cases from 2011 were divided according to whether they occurred during the higher transmission season from January to April or the lower season from May to December. The higher transmission season followed the increase in rains in September as well as an annual peak in malaria importation in January following the end of holiday season travel.
Gridded maps of spatial covariates were collated to describe weather, geography, land cover, population density, vector control and imported infections (Table 1). Rainfall and temperature strongly impact the biology of malaria transmission , while elevation and topography have been demonstrated to influence risk through their effects on temperature and suitability for mosquito breeding . The topographic wetness index, a measure representing the amount of water that should enter a given spatial unit divided by the rate at which the water should flow out of that unit, was calculated from elevation as a measure for suitability for mosquito breeding habitat [21, 22]. Suitability for mosquito habitat was also described using remotely sensed imagery . The normalized difference vegetation index (NDVI)  and the modified normalized difference water index (NDWI)  were calculated from a single Landsat Enhanced Thematic Mapper (ETM) image from March 2009 with spatial resolution 30 m. Images from the beginning of each season could not be identified due to cloud cover and satellite malfunctions. Densely populated areas may face substantially different malaria risks from very sparsely populated, rural areas , and the susceptibility of these populations is influenced by the control measures currently being implemented. The NMCP records the geolocations of all distributed nets but only tallies the number of households receiving indoor residual spraying (IRS) within each of the localities in the country. These IRS data were aggregated to the level of the 55 constituencies since no geographic data on the localities could be identified. Finally, cases classified by the NMCP as imported or intraported based on travel history to endemic areas were also used as predictor variables under the assumption that at the very low prevalence levels observed in Swaziland, infections from other regions play a large role in sparking transmission . Time-varying covariates were generated separately for the high and low seasons where possible as described in Table 1.
Transmission risk modelling
Values for each of the covariates in Table 1 were compared between the locations of the households of patients identified with locally acquired infections and population weighted, randomly selected “background” points from across Swaziland. Background points do not necessarily indicate the absence of transmission, but instead characterize the environment of the country  in the places where people live. A sample of 10,000 background points [39, 40] was selected randomly but proportionately to population density across Swaziland using the Geospatial Modelling Environment v0.6 . The population-based weighting ensured that the territory sampled by the background points was comparable to the locations from which local cases arose . The locations of local case households in the high and low transmission seasons were compared to the locations of the background locations to identify conditions under which local transmission is likely to occur. This comparison was based on the assumption that case household locations were indicative of where transmission occurred. Mean values of each of the predictor variables were compared between case households and the background locations using Satterthwaite t-tests for unequal variance .
Given the potential importance of imported infections for sustaining malaria transmission in Swaziland , a second analysis investigated risk factors associated with whether or not an imported case led to onward transmission. Each imported infection identified in 2011 was classified as to whether or not a locally acquired infection was identified within a space-time cylinder consistent with onward transmission. Local transmission was assumed if the household of a locally acquired infection was identified within 3 km of the imported case household (based upon the typical maximum dispersal range of African vectors ) and three to six weeks after identification of the imported case (assuming transmission would require two to three weeks for parasite development inside the mosquito vector, symptoms would develop in an infected human within the following one to two weeks , and up to an additional week might elapse before the case patient appeared in passive surveillance reports).
A logistic regression mixed model predicting whether or not an imported case was associated with a locally acquired case was fit using the GLIMMIX procedure in SAS software, Version 9.2 of the SAS System for Windows . An exponential structure was used to account for spatial autocorrelation . Predictive variables included in this model were the same as above but additionally included time-varying rainfall variables describing total rainfall two, four, six, and eight weeks prior to identification of the imported case. Initial models were fit to identify the variables from each category (weather, geography, land cover, population density, and vector control) most associated with transmission, and then each of those was entered jointly into a final model. Variables were removed in order of least significance until all remaining in the model were significant at α = 0.10.
The regression tree classification approach ‘Random Forest’  was applied using the R  package ModelMap  to model the probability of a locally acquired case occurring in each 100 sq m location across Swaziland. Regression trees create a series of rules to partition the data into a set of groups that are as homogenous as possible with respect to the outcome . For example, one such rule might differentiate the locations of case households from those of control households based on elevation below a certain threshold, while another rule might further divide the data based on rainfall within specific bounds. In the Random Forest approach, the data are repeatedly split according to many different branching "trees" of this type, and the final prediction is made by averaging across all of the individual trees . The free ModelMap package contains a detailed tutorial and example code for implementing Random Forest in R .
To assess the accuracy of model predictions, eighty percent of the locally-acquired cases observed in 2011 were selected at random for training the algorithm, with the other 20% were used for testing. All of the above predictor variables were included in the fitting step to produce a model predicting the probability of a local case occurring at a particular location as a function of the combined covariates. Model quality was assessed by examining calibration plots  and the area under the curve (AUC) on receiver operating characteristic (ROC) graphs. The fit model was then applied in conjunction with the 100 m spatial resolution gridded datasets of all included predictive variables to generate a map of predicted risk across Swaziland. Models and maps were generated separately for the high and low seasons.
Although the AUC values suggested the ability of the model to predict the locations of local cases not used in the model fitting, these test points were obtained over the same time period as the training points and may thus not be indicative of the true value of the risk map for prospective prediction. To examine the utility of the maps for predicting the occurrence of cases in future transmission seasons, an additional dataset of locally acquired cases identified during 2012 was used for validation of the 2011-based risk map. The predicted risk at the locations of 2012 case households according to the Random Forest maps was compared to predicted risk at other random locations across Swaziland. These "control" locations were selected in two ways: first, 10,000 points were randomly selected from across Swaziland, and second, 10,000 points were selected randomly but proportionally to population density. The high transmission season risk map was used to examine predicted risk at the locations of households of cases occurring from January to April 2012, while the low transmission season risk map was used for subsequent months.
Of the 372 malaria cases investigated during 2011, 191 (51.3%) were classified as locally acquired and 170 were classified as imported (45.7%). Eleven were either not classified or categorized as cryptic  and were excluded from analysis. A total of 314 of these cases had valid coordinates that matched the region of the country in which the case was reported to live on the investigation form and were used in analyses. These cases included 118 locally acquired infections from the higher transmission season of January to April and 44 from the lower transmission season of May to December. There were 152 imported infections during 2011 with valid coordinates, of which 143 (94.1%) originated outside of Swaziland.
Characteristics of the locations of case households and the background points during the two study periods are contrasted in Table 2. Of the 152 imported cases used in analysis, 12 (7.9%) were associated with a locally acquired case occurring within 3 km and after three to six weeks. All 12 originated abroad. The final model predicting the probability of an imported case in 2011 leading to onward transmission is reported in Table 3.
The ROC plot for the high-season Random Forest model suggested very strong model prediction with AUC = 0.94 (Figure 2A). Judging by the mean decrease in accuracy, model predictions were most dependent upon, in order of descending importance, the distance to the areas with highest NDWI, the distance to the nearest imported case, the distance to lakes, the NDVI, and the TWI. Least important variables were coverage with bed nets, and the mean and sum of rainfall. Model fit was poorer for the low-season model, with AUC = 0.89, likely due at least in part to the smaller sample size (Figure 2B). The model was most dependent on the minimum, mean, summed and maximum rainfall, while least important variables included coverage with bed nets and distance to rivers. Figure 3 and Figure 4 depict the maps generated from the predictive models for the high and low seasons respectively.
Twenty-five locally acquired cases were identified by the NMCP from January to April 2012, and 10 from May to October (the most recent data available at the time of analysis). The locations of the 25 high-season cases in 2012 were predicted to have a median risk of 3.4% (interquartile range = 0.4%-12.0%), compared to 0.2% (0.0%-1.2%) for random points from across the country (t = −2.92, p = 0.008) and 0.0% (0.0%-0.2%) for random points sampled proportionately to population density (t = −3.14, p = 0.005) (Figure 5A). The locations of the 10 cases from May to October were predicted to have a median risk of 44.0% (16.8%-64.0%), compared to 0.0% (0.0%-0.2%) for random points from across the country (t = −4.57, p = 0.001) and 0.0% (0.0%-4.8%) for random points sampled proportionately to population density (t = −4.23, p = 0.002) (Figure 5B).
As countries move towards elimination of malaria, ongoing endemic transmission will become limited to residual foci, and the importance of preventing onward transmission from imported infections will increase . The results of this investigation suggest that both of these epidemiological changes are already well underway in Swaziland. The maps generated here can be applied to target surveillance and vector control to eliminate the remaining foci of transmission in Swaziland and minimize the potential for transmission from imported cases elsewhere. Doing so will improve the efficiency of resource use and have greater impact than aiming for universal coverage everywhere .
These risk maps highlight a few areas of Swaziland at very high predicted risk, broad regions at low levels of risk, and many places where risk is estimated to be non-existent. Validation against cases that occurred during 2012 confirm that the areas of predicted risk are the likely locations of future transmission. Accordingly, the areas of highest predicted risk likely represent residual transmission foci where interventions must be targeted to ensure cessation of endemic transmission. The appropriate strategy to minimize the potential for transmission in the low-risk regions identified here will depend upon available resources and Swaziland's risk tolerance. For example, ensuring all areas with any predicted risk are fully covered with vector control interventions would minimize the chance for transmission to occur, but such a strategy may be prohibitively expensive.
Distance to the nearest imported case proved to be one of the most important variables for prediction of transmission risk in Swaziland, second only to the distance to the highest NDWI locations in improving model accuracy during the high season. This result indicates that imported cases from endemic neighbours are playing an important role in sparking transmission during the months of the year with highest burden, and it suggests that ongoing endemic transmission may only be occurring in limited, highly focalized regions where suitability for mosquito breeding is high. Both of these conclusions are consistent with an epidemiological context in which endemic transmission has been interrupted in the great majority of the country, and where the majority of malaria transmission might cease to occur if importation could be substantially reduced. Over 2011, there were 191 case patients with no travel history to endemic regions compared to 170 imported cases, giving a ratio of just over one local case per imported case. Such a result would be expected if RC, the reproductive number under control, averaged approximately 0.5 .
The apparent importance of imported cases for driving high-season transmission in Swaziland today also raises interesting questions about the causes of the observed seasonality in disease. Increases in local transmission occurred following the peak rainy season, and rainfall was found to be an important predictor of risk, particularly during the low season months. However, the peak of the rains also coincided with a peak in imported malaria cases following the return of travellers from endemic areas after the holiday season. Although the relative contribution of these two factors is not yet clear, they suggest the importance of a dual strategy that focuses on reducing importation while ensuring that transmission potential in high risk areas is minimised. The multivariate mixed model predicting whether or not an imported case will lead to local transmission indicates that onward transmission risk may be predictable on the basis of factors including temperature, elevation, wetness and vector control. This result suggests an evidence-based mechanism for prioritizing responses in highest risk regions.
Prediction of areas of risk during the low season produced a weaker fitting model than for the high season. In part, this result may be attributed to the fact that only 44 case patients with no travel history to endemic areas were identified during this period. As more surveillance data become available from future years, improved prediction may become possible. Nevertheless, the low season map proved useful in prospectively predicting areas at risk of local transmission in 2012 (Figure 5B). Interventions may have the greatest impact when implemented during the low transmission season [55, 56], and these maps may provide a useful means for targeting those interventions. Regions at highest predicted risk were roughly consistent between the high and low season map, supporting the theory that malaria transmission in the high season may spread from hotspots that remain during the low season . Understanding whether these higher-risk regions remain consistent from year to year will require further investigation.
Vector control interventions were not found to be important determinants of model accuracy. Coverage with nets and IRS was found to be higher in areas where locally acquired cases were identified, suggesting that these interventions are appropriately targeted to high-risk areas. The models generated here likely reflect a mixed effect where vector control implemented early in the time period has a negative effect on subsequent transmission, but vector control implemented later is targeted to areas where cases have recently been observed. These two effects may cancel out the observed impact of vector control in these models. Making maps with greater temporal resolution - risk over a month, for example - may better capture the effects of these interventions.
This investigation has several important limitations. Only a single usable Landsat image was identified within a similar timeframe as the surveillance data in this analysis. Temporally linked imagery for each season would improve prediction and comparison across seasons. The planned launch of the Landsat Data Continuity Mission  in 2013 should provide new images useful for this purpose, and the availability of processed and composited imagery through the Google Earth Engine  will also improve access. Similarly, high-resolution rainfall data were not available at appropriate temporal resolution. Few cases were identified during the low season, producing too small a sample size for reliable prediction. Future surveillance data may be combined across seasons to overcome this limitation. Hotspot identification using clinical malaria may be limited by the fact that higher immunity in hotspots may actually reduce development of symptoms in these higher transmission areas . Nevertheless, in Swaziland, transmission appears to be so low that it is likely that this problem is minimized. It is likely that the location at which cases were investigated is not always the location at which they were infected, which would introduce error into the model. Selection of the background points was performed proportionately to population density to ensure comparability, but if only a subset of the population tended to seek care at health facilities (for example, those living nearest to clinics), these background points may differ in important ways from the locations of identified cases. Finally, not all confirmed cases were investigated by surveillance workers, and it is likely that not all malaria cases were identified by the passive surveillance system. As the system improves in detecting all malaria cases, these sorts of analyses will become more accurate.
As scale-up of vector control and effective treatment continues, other countries will join Swaziland in reducing malaria to the point where identification and elimination of the final foci of endemic transmission and prevention of onward transmission from imported cases become the goals of anti-malarial efforts. Once malaria incidence has declined to the point that geolocation of case households is operationally feasible, generation of case-based risk maps at high spatial resolution will support control programmes in targeting elimination interventions. Integrating mapping approaches into user-friendly, rapidly updateable tools , potentially linked to dynamic transmission models, will provide strategic, evidence-based guidance for adaptive management of malaria programmes. Efforts to create user-friendly tools based on the models generated here are underway to aid Swaziland's malaria program in rapidly updating risk maps as new data become available. This sort of case-based mapping will help ensure that the impact of limited resources is maximised to achieve and maintain malaria elimination.
Area under the curve
Landsat Enhanced Thematic Mapper
Geographic information system
Indoor residual spraying
Normalized difference vegetation index
Modified normalized difference water index
National Malaria Control Programme
Receiver operator characteristic
System for Automated Geoscientific Analyses
Short message service
Topographic wetness index
World Health Organization
Mendis K, Rietveld A, Warsame M, Bosman A, Greenwood B, Wernsdorfer WH: From malaria control to eradication: the WHO perspective. Trop Med Int Health. 2009, 14: 802-809. 10.1111/j.1365-3156.2009.02287.x.
Feachem R, Phillips AA, Hwang J, Cotter C, Wielgosz B, Greenwood BM, Sabot O, Rodriguez MH, Abeyasinghe RR, Ghebreyesus TA, Snow RW: Shrinking the malaria map: progress and prospects. Lancet. 2010, 376: 1566-1578. 10.1016/S0140-6736(10)61270-6.
World Health Organization: World Malaria Report 2011. 2011, Geneva: World Health Organization
Moonen B, Cohen JM, Snow RW, Slutsker L, Drakeley C, Smith DL, Abeyasinghe RR, Rodriguez MH, Maharaj R, Tanner M, Targett GA: Operational strategies to achieve and maintain malaria elimination. Lancet. 2010, 376: 1592-1603. 10.1016/S0140-6736(10)61269-X.
World Health Organization Regional Office for the Eastern Mediterranean: Guidelines on the elimination of residual foci of malaria transmission. 2007, Geneva: World Health Organization
Bousema T, Griffin JT, Sauerwein RW, Smith DL, Churcher TS, Takken W, Ghani A, Drakeley C, Gosling R: Hitting hotspots: spatial targeting of malaria for control and elimination. PLoS Med. 2012, 9: e1001165-10.1371/journal.pmed.1001165.
Le Menach A, Tatem AJ, Cohen JM, Hay SI, Randell H, Patil AP, Smith DL: Travel risk, malaria importation and malaria transmission in Zanzibar. Sci Rep. 2011, 1: 93-
Hay SI, Guerra CA, Gething PW, Patil AP, Tatem AJ, Noor AM, Kabaria CW, Manh BH, Elyazar IR, Brooker S, Smith DL, Moyeed RA, Snow RW: A world malaria map: Plasmodium falciparum endemicity in 2007. PLoS Med. 2009, 6: e1000048-
Gething PW, Patil AP, Smith DL, Guerra CA, Elyazar IR, Johnston GL, Tatem AJ, Hay SI: A new world malaria map: Plasmodium falciparum endemicity in 2010. Malar J. 2011, 10: 378-10.1186/1475-2875-10-378.
Elyazar IRF, Gething PW, Patil AP, Rogayah H, Kusriastuti R, Wismarini DM, Tarmizi SN, Baird JK, Hay SI: Plasmodium falciparum malaria endemicity in Indonesia in 2010. PLoS One. 2011, 6: e21315-10.1371/journal.pone.0021315.
Noor AM, Clements AC, Gething PW, Moloney G, Borle M, Shewchuk T, Hay SI, Snow RW: Spatial prediction of Plasmodium falciparum prevalence in Somalia. Malar J. 2008, 7: 159-10.1186/1475-2875-7-159.
Hay SI, Smith DL, Snow RW: Measuring malaria endemicity from intense to interrupted transmission. Lancet Infect Dis. 2008, 8: 369-378. 10.1016/S1473-3099(08)70069-0.
Gething PW, Patil AP, Hay SI: Quantifying aggregated uncertainty in Plasmodium falciparum malaria prevalence and populations at risk via efficient space-time geostatistical joint simulation. PLoS Comput Biol. 2010, 6: e1000724-10.1371/journal.pcbi.1000724.
World Health Organization: WHO Expert Committee on Malaria: sixth report. 1957, Geneva: World Health Organization, [World Health Organization Technical Report Series No. 123]
Cohen JM, Moonen B, Snow RW, Smith DL: How absolute is zero? An evaluation of historical and current definitions of malaria elimination. Malar J. 2010, 9: 213-10.1186/1475-2875-9-213.
Swaziland National Malaria Control Programme: Swaziland malaria indicator survey 2010. 2011, Manzini, Swaziland: Swaziland Ministry of Health
Kunene S, Phillips AA, Gosling RD, Kandula D, Novotny JM: A national policy for malaria elimination in Swaziland: a first for sub-Saharan Africa. Malar J. 2011, 10: 313-10.1186/1475-2875-10-313.
Roll Back Malaria Partnership: Eliminating malaria: learning from the past, looking ahead. 2011, Geneva: RBM, [Progress & Impact Series]
Okell LC, Bousema T, Griffin JT, Ouédraogo AL, Ghani AC, Drakeley CJ: Factors determining the occurrence of submicroscopic malaria infections and their relevance for control. Nat Commun. 2012, 3: 1237-
Craig MH, Snow RW, Le Sueur D: A climate-based distribution model of malaria transmission in sub-Saharan Africa. Parasitol Today. 1999, 15: 105-110. 10.1016/S0169-4758(99)01396-4.
Cohen JM, Ernst KC, Lindblade KA, Vulule JM, John CC, Wilson ML: Topography-derived wetness indices are associated with household-level malaria risk in two communities in the western Kenyan highlands. Malar J. 2008, 7: 40-10.1186/1475-2875-7-40.
Cohen J, Ernst K, Lindblade K, Vulule J, John C, Wilson M: Local topographic wetness indices predict household malaria risk better than land-use and land-cover in the western Kenya highlands. Malar J. 2010, 9: 328-10.1186/1475-2875-9-328.
Hay S, Snow R, Rogers D: From predicting mosquito habitat to malaria seasons using remotely sensed data: practice, problems and perspectives. Parasitol Today. 1998, 14: 306-313. 10.1016/S0169-4758(98)01285-X.
Rouse JW, Haas RH, Schell JA, Deering DW: Monitoring vegetation systems in the Great Plains with ERTS. 1974
Xu H: Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery. International Journal of Remote Sensing. 2006, 27: 3025-3033. 10.1080/01431160600589179.
Hay SI, Guerra CA, Tatem AJ, Atkinson PM, Snow RW: Urbanization, malaria transmission and disease burden in Africa. Nat Rev Microbiol. 2005, 3: 81-90. 10.1038/nrmicro1069.
Moonen B, Cohen J, Tatem A, Cohen J, Hay S, Sabot O, Smith D: A framework for assessing the feasibility of malaria elimination. Malar J. 2010, 9: 322-10.1186/1475-2875-9-322.
Package “gstat”: Package “gstat”. [http://cran.r-project.org/web/packages/gstat/gstat.pdf]
Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A: Very high resolution interpolated climate surfaces for global land areas. Int J Climatology. 2005, 25: 1965-1978. 10.1002/joc.1276.
Reuter HI, Nelson A, Jarvis A: An evaluation of void‐filling interpolation methods for SRTM data. International Journal of Geographical Information Science. 2007, 21: 983-1008. 10.1080/13658810601169899.
SRTM 90 m Digital Elevation Data: SRTM 90 m Digital Elevation Data. [srtm.csi.cgiar.org]
Sørensen R, Zinko U, Seibert J: On the calculation of the topographic wetness index: evaluation of different methods based on field observations. Hydrology and Earth System Sciences. 2006, 10: 101-112. 10.5194/hess-10-101-2006.
System for Automated Geoscientific Analyses (SAGA). Hamburg, Germany: Hamburg, Germany,http://www.saga-gis.org,
Food and Agriculture Organization of the United Nations: Aquastat: Food and Agriculture Organization of the United Nations: Aquastat.http://www.fao.org/nr/water/aquastat/water_use_agr/index.stm,
ESRI (Environmental Systems Resource Institute): ArcGIS Desktop 10.0. 2010, Redlands, California: ESRI
Linard C, Gilbert M, Snow RW, Noor AM, Tatem AJ: Population distribution, settlement patterns and accessibility across Africa in 2010. PLoS One. 2012, 7: e31743-10.1371/journal.pone.0031743.
The AfriPop Project.http://www.afripop.org/,
Elith JH, Graham CP, Anderson R, Dudík M, Ferrier S, Guisan AJ, Hijmans R, Huettmann FR, Leathwick J, Lehmann A, Li JG, Lohmann LA, Loiselle B, Manion G, Moritz C, Nakamura M, Nakazawa Y, McC M, Overton J, Townsend Peterson AJ, Phillips S, Richardson K, Scachetti‐Pereira RE, Schapire R, Soberón J, Williams SS, Wisz ME, Zimmermann N: Novel methods improve prediction of species’ distributions from occurrence data. Ecography. 2006, 29: 129-151. 10.1111/j.2006.0906-7590.04596.x.
Phillips SJ, Dudík M: Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography. 2008, 31: 161-175. 10.1111/j.0906-7590.2008.5203.x.
Beyer H: Geospatial Modelling Environment (Version 0.6.0.0). 2012
Phillips SJ, Dudík M, Elith J, Graham CH, Lehmann A, Leathwick J, Ferrier S: Sample selection bias and presence-only distribution models: implications for background and pseudo-absence data. Ecol Appl. 2009, 19: 181-197. 10.1890/07-2153.1.
Satterthwaite FE: An approximate distribution of estimates of variance components. Biometrics. 1946, 2: 110-114. 10.2307/3002019.
Carter R, Mendis KN, Roberts D: Spatial targeting of interventions against malaria. Bull World Health Organ. 2000, 78: 1401-1411.
Molineaux L: The epidemiology of human malaria as an explanation of its distribution, including some implications for its control. Malaria: principles and practice of malariology. Edited by: Wernsdorfer WH, McGregor I. 1988, Edinburgh: Churchill Livingstone, vol. 2
SAS Institute Inc: SAS 9.2 Help and Documentation. 2012, Cary, NC: SAS Institute Inc
Rasmussen S: Modelling of discrete spatial variation in epidemiology with SAS using GLIMMIX. Computer Methods Programs Biomed. 2004, 76: 83-89. 10.1016/j.cmpb.2004.03.003.
Breiman L: Random Forests. Mach Learn. 2001, 45: 5-32. 10.1023/A:1010933404324.
R Development Core Team: R: A language and environment for statistical computing. 2009, Vienna, Austria: R Foundation for Statistical Computing,http://www.R-project.org,
Freeman E, Frescino T: ModelMap: Modeling and Map production using Random Forest and Stochastic Gradient Boosting. USDA Forest Service. 2009, Ogden, UT, USA: Rocky Mountain Research Station
De’ath G, Fabricius KE: Classification and regression trees: a powerful yet simple technique for ecological data analysis. Ecology. 2000, 81: 3178-3192. 10.1890/0012-9658(2000)081[3178:CARTAP]2.0.CO;2.
ModelMap: An R Package for Model Creation and Map Production. [http://cran.r-project.org/web/packages/ModelMap/vignettes/VModelMap.pdf]
Pearce J, Ferrier S: Evaluating the predictive performance of habitat models developed using logistic regression. Ecol Model. 2000, 133: 225-245. 10.1016/S0304-3800(00)00322-7.
World Health Organization: WHO Expert Committee on Malaria: tenth report. 1964, Geneva: World Health Organization, [World Health Organization Technical Report Series No. 272]
Griffin JT, Hollingsworth TD, Okell LC, Churcher TS, White M, Hinsley W, Bousema T, Drakeley CJ, Ferguson NM, Basáñez M-G, Ghani AC: Reducing Plasmodium falciparum malaria transmission in Africa: a model-based evaluation of intervention strategies. PLoS Med. 2010, 7: e1000324-10.1371/journal.pmed.1000324.
Kern SE, Tiono AB, Makanga M, Gbadoé AD, Premji Z, Gaye O, Sagara I, Ubben D, Cousin M, Oladiran F, Sander O, Ogutu B: Community screening and treatment of asymptomatic carriers of Plasmodium falciparum with artemether-lumefantrine to reduce malaria disease burden: a modelling and simulation analysis. Malar J. 2011, 10: 210-10.1186/1475-2875-10-210.
National Aeronautics and Space Administration: Landsat: Data Continuity Mission. [ldcm.nasa.gov]
Google Earth Engine. [http://earthengine.google.org]
Kelly GC, Tanner M, Vallely A, Clements A: Malaria elimination: moving forward with spatial decision support systems. Trends Parasitol. 2012, 28: 297-304. 10.1016/j.pt.2012.04.002.
JMC, JN, and DK acknowledge funding support for this work from the Global Health Group at University of California, San Francisco and the Bill and Melinda Gates Foundation (#1013170). AJT acknowledges funding support from the RAPIDD programme of the Science and Technology Directorate, Department of Homeland Security, and the Fogarty International Center, National Institutes of Health, and is also supported by grants from NIH/NIAID (U19AI089674) and the Bill and Melinda Gates Foundation (#49446 and #1032350).
The authors declare that they have no competing interests.
JMC and AJT conceived of this study. SD and SK oversaw data collection and contributed to its analysis and interpretation. JMC, SD, JMN, and AJT performed the statistical analysis and mapping. All authors contributed to interpretation of results. JC drafted the manuscript, and all authors read and approved the final manuscript.