- Open Access
The impact of urbanization and population density on childhood Plasmodium falciparum parasite prevalence rates in Africa
Malaria Journal volume 16, Article number: 49 (2017)
Although malaria has been traditionally regarded as less of a problem in urban areas compared to neighbouring rural areas, the risk of malaria infection continues to exist in densely populated, urban areas of Africa. Despite the recognition that urbanization influences the epidemiology of malaria, there is little consensus on urbanization relevant for malaria parasite mapping. Previous studies examining the relationship between urbanization and malaria transmission have used products defining urbanization at global/continental scales developed in the early 2000s, that overestimate actual urban extents while the population estimates are over 15 years old and estimated at administrative unit level.
Methods and results
This study sought to discriminate an urbanization definition that is most relevant for malaria parasite mapping using individual level malaria infection data obtained from nationally representative household-based surveys. Boosted regression tree (BRT) modelling was used to determine the effect of urbanization on malaria transmission and if this effect varied with urbanization definition. In addition, the most recent high resolution population distribution data was used to determine whether population density had significant effect on malaria parasite prevalence and if so, could population density replace urban classifications in modelling malaria transmission patterns. The risk of malaria infection was shown to decline from rural areas through peri-urban settlements to urban central areas. Population density was found to be an important predictor of malaria risk. The final boosted regression trees (BRT) model with urbanization and population density gave the best model fit (Tukey test p value <0.05) compared to the models with urbanization only.
Given the challenges in uniformly classifying urban areas across different countries, population density provides a reliable metric to adjust for the patterns of malaria risk in densely populated urban areas. Future malaria risk models can, therefore, be improved by including both population density and urbanization which have both been shown to have significant impact on malaria risk in this study.
Although malaria has been traditionally regarded as less of a problem in urban areas compared to neighbouring rural areas, the risk of malaria infection continues to exist in densely populated, urban areas of Africa. The process of urbanization and accompanying demographic change is associated with decreased risks of infection due to reduction of suitable breeding grounds for malaria vectors through reduction of vegetative cover, water surfaces and other natural surfaces with building structures and other paved surfaces as well as through pollution of available breeding sites. The reduction of breeding sites reduces the number of malaria vectors as well as their species diversity with the dominant malaria vectors in Africa, Anopheles gambiae s.s. and Anopheles funestus, shown not to proliferate well in urban habitats [1–4]. Low entomological inoculation rates (EIR) have been linked to high human population densities in urban areas [5, 6].
Recent efforts to map the intensity of malaria transmission in Africa have used urbanization to “force down” infection risks [1, 7–12]. Omumbo and colleagues  used discriminant analysis to examine impact of urbanization in Kenya, Tanzania and Uganda using investigator-defined urban/rural assignments supplemented by the urbanization definition from the Global Rural Urban Mapping Project (GRUMP). The inclusion of urbanization was found to improve the consistency of predictive malaria prevalence maps when compared with expert opinion maps . However, the effects of urbanization were difficult to define due to discrepancies in urban classification, the coarse spatial resolution of climate data and poor coverage of malaria training data in the study [11, 12]. Hay and colleagues using GRUMP urban extents refined using population density showed urbanization was associated with lower entomological inoculation rates (EIR) compared to peri-urban and rural areas. However, there were inherent uncertainties in translation of EIR into prevalence of malaria infection . Tatem et al.  found that GRUMP urban extents (GRUMP-UE) produced the most accurate match to author-defined urbanization. GRUMP-UE were then combined with population density to discriminate between malaria-relevant urban and ‘peri-urban’. This definition of urbanization has been used in recent global maps of Plasmodium falciparum endemicity as a prior covariate in the space–time geo-statistical model used in predicting parasite prevalence [8–10].
Despite the recognition that urbanization influences the epidemiology of malaria, there is little consensus in defining urbanization for malaria parasite mapping. Large-scale spatial datasets of urbanization developed in the last two decades vary in terms of spatial and temporal resolution of input census data, satellite imagery and spatial population allocation methods which results in differing extents of urban areas . Previous studies examining the relationship between urbanization and malaria transmission have used products defining urbanization at global/continental scales developed in the early 2000s, that overestimate actual urban extents while the population estimates are over 15 years old and estimated at administrative unit level [8–10].
Defining a consistent/accurate gradient between rural and urban settlements is an important factor when defining malaria transmission patterns and has an influence on the estimated impact of urbanization on malaria parasite prevalence. The transition from a rural settlement to one best described as urban does not follow definite boundary gradations but transitions gradually similar to malaria transmission patterns. This study therefore sought to discriminate an urbanization definition that is most relevant for malaria parasite mapping using individual level malaria infection data obtained from nationally representative household-based surveys. Boosted regression tree (BRT) modelling was used to determine the effect of urbanization on malaria transmission and if this effect varied with urbanization definition. In addition, the most recent high resolution population distribution data was used to determine whether population density had significant effect on malaria parasite prevalence and if so, could population density replace urban classifications in modelling malaria transmission patterns.
Malaria prevalence data
Malaria parasite prevalence data was assembled from national cluster sample household surveys: Demographic and Health Surveys (DHS) ; Malaria Indicator Survey (MIS) . Data was assembled from 19 household surveys conducted between 2007 and 2013 in 14 malaria endemic countries across Africa where information on malaria infection prevalence and household geographic coordinates was available (Additional file 1: Table S1). For each child identified from the household survey data, information was also assembled on factors that might influence malaria prevalence at child, household and cluster levels from the household survey datasets. A cluster is a group of adjacent households that serves as the primary sampling unit and is often defined using enumeration areas (EAs) provided by the most recent national population census. The following child related variables were assembled from the survey data: child’s age in months, child’s gender, whether the child tested positive for P. falciparum and the malaria testing method used [microscopy or rapid diagnostic tests (RDT)]. Information was also linked on whether the child slept under an ITN the previous night; whether children who had been febrile in the last 2 weeks had received an anti-malarial drug in the previous 2 weeks. Variables related to the mother (caretaker) of the index child assembled included the mothers’ age and education level. Variables that related to the household where the child lived included the application of IRS during the previous year; the number of nets available in a household and household size. The household wealth index, a composite measure of household cumulative living standards by quintiles (poorest, very poor, middle, fourth, and highest) was also recorded.
Finally, data on extrinsic determinants of malaria transmission related to the cluster where the index child lived was assembled from remotely sensed environmental datasets. Geographic coordinates of each cluster were used to ascribe to every child-record annual mean temperature, temperature suitability index (TSI), precipitation, enhanced vegetation index (EVI), and malaria seasonality index. Two indicators of temperature were used in this study, annual mean temperature and temperature suitability index (TSI). Annual mean temperature was calculated from monthly mean temperature (1950–2000) available at 1 km resolution grids from WorldClim global climatology dataset . TSI was developed from a biological model that accounts for the dependency of the malaria transmission cycle on temperature developed to provide a biologically relevant suitability index . Information on rainfall was extracted from a synoptic annual mean precipitation grid calculated from monthly total precipitation gridded datasets obtained from WorldClim global climatology dataset. EVI values were extracted from Fourier-processed AVHRR data available at 1 km resolution gridded surface . The influence of seasonal variation in malaria transmission was accounted for using a malaria seasonality index. The malaria seasonality index was derived from daily rainfall estimates from the African Rainfall Estimates version 2 (RFE 2.0) dataset developed as a collaborative programme between NOAA’s Climate Prediction centre (CPC) and USAID/Famine Early Systems Network (FEWS). Daily-accumulated rainfall data between 2002 and 2009 was used to identify areas that received 60% of annual rainfall within 3 months which was found to best fit seasonal clinical malaria profiles . All extractions were done using the spatial analyst tool in ArcGIS 10 (ESRI, USA). The average value within 5 km of the cluster centre in rural areas and 2 km in urban areas was calculated to account for household dispersal with a cluster and additionally account for random positional error deliberately introduced for confidentiality purposes in DHS surveys reporting HIV by up to 5 km in rural areas and 2 km or less in urban areas .
The urbanization status of the household was recorded during the DHS and MIS surveys, based on a national classification of urban sample clusters defined by national Central Statistics Offices (CSO). The definitions are often not the same across countries . The different criteria used in defining urban areas in 14 countries included in the study are given in additional information (Additional file 1: Table S2).
More standardized, routinely available urbanization classifications were extracted from two global datasets: Global Rural Urban Mapping Project Urban extents (GRUMP UE) and Moderate Resolution Imaging Spectrometer (MODIS) urban extents. GRUMP UE is a freely available dataset from the Centre for International Earth Science Information Network at 1 km spatial resolution  derived from NOAA’s night-time lights dataset [24, 25] combined with settlements data. Urban extents are defined as contiguous lit cells from the night-time Lights with total population greater than 5000 persons or approximated based on buffered settlement points [26, 27]. MODIS urban extents on the other hand are derived from supervised classification of MODIS satellite imagery data. Contiguous cells with >50% in the built-up class covering an area greater than 1 km2 are defined as urban. In a study comparing eight existing urban area maps, MODIS urban extents were found to be the most accurate global urban dataset . The gridded dataset is freely available to the public at 500 m resolution . A fourth urbanization definition derived from a modification of the GRUMP UE to include a peri-urban class defined by assigning a true urban core with population density ≥1000 people per km2 while the surrounding urban extents with population <1000 defined as peri-urban areas. Population densities were derived from Gridded Population of the World version 3 (GPW3) projected to 2007. In a previous study, GRUMP UE and the GRUMP-modified UE were shown to be the best to use for malaria mapping . A map comparing the four urbanization classifications used in this study is provided in Additional file 1.
The principal source of human population distribution data used to assess the effects of population density on malaria parasite prevalence was the WorldPop surface . The WorldPop dataset provides Africa-wide gridded population distribution estimates at 100 m spatial resolution projected to 2010 . The WorldPop dataset re-sampled to 1 km spatial resolution and projected to each year between 2006 and 2010 was used to extract population density for each cluster in the household survey dataset in their respective year of survey. Population density extraction was done using the spatial analyst tool in ArcGIS 10 (ESRI, USA). To account for household dispersal within a cluster, binomial interpolation technique was used to calculate the average population density within 5 km of the cluster centre.
Statistical modelling: BRT model building
Boosted regression trees (BRT) modelling was used to assess the impact of urbanization and population density on malaria prevalence adjusting for the effect of confounding factors. Boosted regression trees (BRT) is a relatively novel, but increasingly important, method of event distribution modelling in ecology and epidemiology. BRT modelling is increasingly being used in spatial modelling and was found very efficient in predicting species’ spatial distributions based on a set of environmental variables [31–33]. It has been used to produce soil predictive maps , remote sensing applications in land cover classification  and recently modelling land cover change .
Four separate BRT models (Models I–IV) were first constructed using each of the four urbanization classifications described and a set of common independent variables (Table 1). The main outcome was the probability of a child being malaria parasite positive. Random effects between the fourteen countries were controlled for in the model as a factor with unordered levels. A detailed description of BRT model parameterization and optimization are provided in Additional file 1. A fifth BRT model (Model V) was constructed with same set of common independent variables with population density replacing urbanization as the main explanatory variable to determine the impact of population density on falciparum malaria prevalence.
For comparison purposes, a sixth BRT model (Model VI) was constructed using only the set of common independent variables excluding both urbanization and population density. Finally, a seventh BRT model (Model VII) was constructed including population density and urbanization and the common set of independent variables. The models are summarized as follows:
Model I: Common set of covariates (Table 1) + GRUMP UE
Model II: Common set of covariates (Table 1) + CSO urban
Model III: Common set of covariates (Table 1) + modified GRUMP UE
Model IV: Common set of covariates (Table 1) + MODIS urban
Model V: Common set of covariates (Table 1) + population density
Model VI: Common set of covariates (Table 1)
Model VII: Common set of covariates (Table 1) + population density + CSO urban
Partial dependence plots were used to examine the effect of each predictor variable on the response (malaria positivity) after accounting for the average effect of all other variables in the model. Cross-validation techniques were used to evaluate model predictive performance, by randomly separating the dataset into a modelling dataset that was used to fit the model and a testing dataset that was excluded from model fitting and was used for testing the model’s predictive performance. The ratio model set was 50%, which defined the percentage of the data sampled at every run without replacement. This was further improved using bootstrapping techniques with 25 iterations for each of the defined BRT models (Models I–VII) and the predictive power of each model measured using the AUC. To assess which BRT model best predicted the outcome, AUC values were compared and the significance of the differences examined using a Tukey’s honest significant difference test. All BRT models were developed using the R package ‘gbm’ version 1.6–3.2  and the additional functions provided in . All analyses were conducted using R (version 2.15.3).
78,882 records of children aged less than 5 years tested for malaria in close to 6000 clusters were assembled for analysis from 19 household surveys across 14 African countries undertaken between 2007 and 2012. 1110 (0.01%) of the available child records had no geographic positions and were excluded. Of the remaining 77,772 child-records, 17% were positive for P. falciparum malaria where the majority (81%) had been tested for malaria using microscopy. A summary of the characteristics of the child-level and household level predictors that were used in the analysis is given in additional information (Additional file 1: Table S3).
The relationship between the outcome (malaria positivity) and the main predictor variables (urbanization and population density) is shown in Figs. 1 and 2 while the relative contribution of each predictor variables on the outcome (malaria positivity) is summarized in Table 1. In the four separate BRT models constructed to evaluate which of four urbanization datasets best predicted malaria risk, urbanization was found to have an impact on malaria risk with individuals residing in urban areas shown to have decreased malaria risk compared to individuals living in rural or peri-urban areas (Fig. 1). However, no significant difference was observed between the four urbanization models using a Tukey’s honest significant difference test (p value >0.05).
In a separate model (Model V), the impact of population density on malaria infection in childhood was examined. Population density was found to be an important predictor of malaria risk with a relative contribution of 10% (Table 1). The non-linear relationship between population density and the response is shown in Fig. 2. Decreased malaria risk was observed in areas of low population (less than 10 persons per km2). The malaria risk curve rises with increase in population but a significant decrease is observed for population densities greater than 1000 persons per km2. Mean AUC value of the BRT model with population density (Model V) was higher compared to the other four models with different urbanization definitions (Models I, II, III and IV), a result that was found significant using a Tukey’s honest significant difference test (p value <0.001).
Figure 3 shows box plots comparing the performance of the BRT models including population density compared to the four BRT models including urban classifications. For comparison, Model VI and Model VII were constructed to examine the impact of combining/excluding urbanization and population density in malaria prevalence models. Model VI constructed with the common set of variables excluding urbanization and population density resulted in the lowest mean AUC thus least accurate in predicting malaria positivity compared to the other models (Fig. 3) and this difference was found to be significant (Tukey test p value <0.05). On the other hand, Model VII which included both urbanization and population density performed significantly better (Tukey test p value <0.05) than all the other models (Fig. 3). The partial dependence plots of the common set of covariates controlled for in the models are given in Additional file 1.
Children living in urban areas were found to have a decreased risk of malaria infection compared to children residing in rural areas in all the four urbanization datasets used: CSO urban, GRUMP UE, modified GRUMP UE, MODIS urban (Fig. 1). Malaria risk was shown to decline from rural areas through peri-urban settlements to urban central areas using the modified GRUMP UE dataset. These are not new findings and are consistent with previous studies comparing malaria risk according to settlement patterns [2, 3, 11, 12, 38]. In this study however, the modelling procedure used takes into account the average effects of other variables influencing malaria risk. In addition, the BRT models controlled for the interaction between variables and thus was able to tease out the true effects of urbanization on malaria risk.
In general, this study shows that urbanization is an important predictor of malaria risk. The inclusion of urbanization, regardless of definition, significantly improved the predictive performance of all models with average AUC values increasing from 0.75 for the baseline model (Model VI) to more than 0.89 in all models including urbanization (Model I, II, III and IV). Although CSO-defined urbanization had a higher AUC, the difference in predictive performance observed amongst the four urbanization models definitions was not statistically significant.
This study also examines the relationship between human population density and malaria infection risk in children aged less than 5 years. Population density was found to be the third most important predictor of malaria infection risk with an average overall relative contribution of close to 10% to the BRT model. As seen in Fig. 2, malaria transmission is sustained as the density of human hosts increases but a decline in risk is observed in densely populated areas with densities greater than 1000 persons per km2. Previous studies describing malaria transmission have linked high human population densities in urban areas to low entomological inoculation rates (EIR) by reducing the overall chances of a host getting an infective bite when vector densities are low [5, 6].
Replacing urbanization with population density significantly improved the predictive performance of the BRT model (Fig. 3). The BRT model with population density was found to perform significantly better (p values <0.001) in predicting malaria risk when I compared the model’s predictive performance to the other four models with urbanization classification (Fig. 3). Given the challenges in uniformly classifying urban areas across different countries, population density may be a more accurate and reliable metric to adjust for the patterns of malaria risk in densely populated urban areas. Malaria transmission has been shown to progressively decrease from rural to peri-urban areas and from peri-urban areas to urban centre in a review of studies on malaria transmission . With population density, it is also possible to more realistically model the progressive transition in malaria risk from densely populated urban centres to surrounding less densely populated peri-urban areas and on to rural areas. This provides an advantage over urbanization classifications that tend to force artificial limits in malaria risk between urban and rural area most often missing the transition in risk. Previous studies describing malaria transmission have used a single rule to “force down” infection risks in all urban and peri-urban regardless of their location or characteristics of the urban extent [1, 8, 10, 39–41]. However, malaria risk is not uniform between or within urban extents and can vary significantly within a city [2, 42–45]. Including population density in malaria modelling is important because the risk of malaria can be different in high-density cities and low-density cities, yet both would be classified as “urban”.
While previous efforts to map the intensity of malaria transmission in Africa have used only urban classification to “force down” infection risks in urban areas, this study has shown that including both urbanization and population density in the BRT model had a more significant impact in predicting malaria prevalence. Model VI which excludes both urbanization and population density performed poorly compared to other models that included either population density or urbanization. The use of population density contributes significantly to the models’ predictive performance compared to any of the previous models that included urbanization only. This implies that there are aspects of the influence of urbanization on malaria prevalence that cannot be solely explained by high population density or population distribution patterns with the final model (Model VII) that includes both urbanization and population density resulting in the best performing model (Fig. 3) compared to other models that included either or both population density and urbanization.
There are however some limitations to this study. Although the WorldPop dataset used in this study provides the best continent wide estimate of population distribution, it is prone to some under/overestimation as it relies on national census data, which is conducted in varying years for different countries and conducted at wide intervals (10 years or more in some countries). Population estimates for any year between censuses were projected using population growth estimates from the previous census period, which may not always be precise. In addition, the frequency of DHS/MICS household surveys was not sufficient to account for annual variation in malaria risk and thus constant variance was assumed for the period of study. There are also some limitations associated with environmental datasets used in this study. In order to account for the effect of time on environmental determinants of malaria, the environmental covariates used must be matched with the observed data on malaria transmission. However, the environmental covariates are rarely available at time points that correspond with the date of surveys as most are derived from long-term processed remotely sensed satellite imagery or modelled climatic data generated as synoptic estimates that do not represent a specific year [38, 46]. In an ideal experiment, individual-level malaria positivity data with geographic coordinates are household level would be linked to up to date urbanization and population density datasets that are comparable across countries with the effect of environmental variables controlled for using high resolution environmental datasets that can be matched to the period of household survey. In extracting environmental variables, additional bias could be introduced by using a radius around a cluster radius. This is however inherent when using DHS datasets where surveys with spatial information, geographic coordinates are provided for the sampling clusters, consisting of 15–30 households spread over up to 5 km in rural areas and 2 km or less in urban areas . The level of bias introduced can be minimized by using household geographic coordinates to relate household members’ malaria positivity to environmental conditions obtained from equally high resolution environmental datasets. The use of household coordinates may however be limited in surveys reporting HIV due to confidentiality issues.
Additionally, dichotomous classification doesn’t always effectively describe urbanization. The degree of urbanization can be defined based demographic characteristics like population density, economic activities, infrastructure, social and cultural behaviours or a combination of these [23, 47, 48]. Demographic characteristics can be combined with satellite-derived spatial datasets to better describe urbanization as a continuum that incorporated peri-urban growth. In this study, population density was combined with satellite derived urban extents to define peri-urban areas in Model V (Table 1). The application of population density in replacing dichotomous urban classifications in modelling malaria transmission patterns was also evaluated. Population density was shown to more realistically model the continuum transition from densely populated urban centres to surrounding less densely populated peri-urban and rural areas in relation to malaria risk. There is also potential to use other demographic indicators linked to functional definitions of urbanization, often measuring level of poverty and urban-ness, like household electrification, access to improved drinking water and toilet facilities as well as housing quality  to describe the urbanization continuum. However, there still remains a challenge in a universal definition to quantify urbanization with little consensus among national governments and international agencies making comparisons and aggregation across countries difficult [48, 49]. There are on-going efforts to globally map settlement extents at higher resolutions of 10–30 m [50–53]. However, these projects are very recent and still in developing/testing phase so could not be used in this study. The availability of these datasets that reflect the current extents of rapidly growing urban areas especially in Africa can potentially improve the estimation of the impact of urbanization on malaria risk.
In general, this study shows urbanization and human population density influence the predicted risk of malaria infection. This study shows that the risk of infection with malaria does exist in these densely populated urban areas across Africa. Despite a reduction in malaria risk associated with increasing population density, these high-density settlement areas do not have zero risks of infection. As such it is important to recognize that the public health burdens in some settings might be large and significant. Africa’s population is projected to double to close to 2 billion over the next 35 years with most of this growth expected to be concentrated in urban areas. Future malaria risk models can, therefore, be improved by including both population density and urbanization, which have both been shown to have significant impact on malaria risk in this study. Population density is more interpretable while urbanization would account for aspects of urban life not simply reflected by population density.
Hay SI, Guerra CA, Tatem AJ, Atkinson PM, Snow RW. Urbanization, malaria transmission and disease burden in Africa. Nat Rev Microbiol. 2005;3:81–90.
Machault V, Vignolles C, Pages F, Gadiaga L, Gaye A, Sokhna C, et al. Spatial heterogeneity and temporal evolution of malaria transmission risk in Dakar, Senegal, according to remotely sensed environmental data. Malar J. 2010;9:252.
Robert V, Macintyre K, Keating J, Trape JF, Duchemin JB, Warren M, et al. Malaria transmission in urban sub-Saharan Africa. Am J Trop Med Hyg. 2003;68:169–76.
Warren M, Billig P, Bendahmane D, Wijeyaratne P. Malaria in urban and peri-urban areas in sub-Sahara Africa. Environment Health Project, Activity report No 71. 1999.
Oyewole IO, Awolola TS. Impact of urbanisation on bionomics and distribution of malaria vectors in Lagos, southwestern Nigeria. J Vector Borne Dis. 2006;43:173–8.
Smith DL, McKenzie FE, Snow RW, Hay SI. Revisiting the basic reproductive number for malaria and its implications for malaria control. PLoS Biol. 2007;5:e42.
Bhatt S, Weiss DJ, Cameron E, Bisanzio D, Mappin B, Dalrymple U, et al. The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015. Nature. 2015;526:207–11.
Gething PW, Patil AP, Smith DL, Guerra CA, Elyazar IR, Johnston GL, et al. A new world malaria map: Plasmodium falciparum endemicity in 2010. Malar J. 2011;10:378.
Hay SI, Guerra CA, Gething PW, Patil AP, Tatem AJ, Noor AM, et al. A world malaria map: Plasmodium falciparum endemicity in 2007. PLoS Med. 2009;6:e1000048.
Noor AM, Kinyoki DK, Mundia CW, Kabaria CW, Mutua JW, Alegana VA, et al. The changing risk of Plasmodium falciparum malaria infection in Africa: 2000–10: a spatial and temporal analysis of transmission intensity. Lancet. 2014;383:1739–47.
Omumbo JA, Guerra CA, Hay SI, Snow RW. The influence of urbanisation on measures of Plasmodium falciparum infection prevalence in East Africa. Acta Trop. 2005;93:11–21.
Omumbo JA, Hay SI, Snow RW, Tatem AJ, Rogers DJ. Modelling malaria risk in East Africa at high-spatial resolution. Trop Med Int Health. 2005;10:557–66.
Center for International Earth Science Information Network (CIESIN), (IFPRI) IFPRI, Bank TW, Centro International de Agricultura Tropical (CIAT). Global Rural–Urban Mapping Project (GRUMP): Urban Extents. Palisades. 2004.
Tatem AJ, Guerra CA, Kabaria CW, Noor AM, Hay SI. Human population, urban settlement patterns and their impact on Plasmodium falciparum malaria endemicity. Malar J. 2008;7:218.
Potere D, Schneider A, Angel S, Civco DL. Mapping urban areas on a global scale: which of the eight maps now available is more accurate? Int J Remote Sens. 2009;30:6531–58.
Demographic and Health Surveys (DHS). ICF International. https://dhsprogram.com/. Accessed 01 Sep 2014.
Malaria Indicator Surveys (MIS). President’s Malaria Initiative (PMI), (USAID) USAfID. http://www.malariasurveys.org/. Accessed 01 Sep 2014.
Hijmans R, Cameron S, Parra J. WorldClim Global Climate Data. http://www.worldclim.org/. Accessed 03 June 2014.
Gething PW, Van Boeckel TP, Smith DL, Guerra CA, Patil AP, Snow RW, et al. Modelling the global constraints of temperature on transmission of Plasmodium falciparum and P. vivax. Parasite Vectors. 2011;4:92.
Hay SI, Tatem AJ, Graham AJ, Goetz SJ, Rogers DJ. Global environmental data for mapping infectious disease distribution. Adv Parasitol. 2006;62:37–77.
Cairns M, Roca-Feltrer A, Garske T, Wilson AL, Diallo D, Milligan PJ, et al. Estimating the potential public health impact of seasonal malaria chemoprevention in African children. Nat Commun. 2012;3:881.
Measure D. Demographic and health surveys. Calverton: Measure DHS; 2010.
United Nations. World urbanization prospects: the 2011 revision. New York: Department of Economic and Social Affairs, Population Division: United Nations; 2012.
Elvidge C, Baugh K, Hobson V, Kihn E, Kroehl H, Davis E, et al. Satellite inventory of human settlements using nocturnal radiation emissions: a contribution for the global toolchest. Glob Change Biol. 1997;3:387–95.
Small C, Pozzi F, Elvidge CD. Spatial analysis of global urban extent from DMSP-OLS night lights. Remote Sens Environ. 2005;96:277–91.
Balk D, Pozzi F, Yetman G, Deichmann U, Nelson A. The distribution of people and the dimension of place: methodologies to improve global population estimates in urban and rural areas. New York: New York CIESIN, Columbia University; 2004.
Balk D, Pullum T, Storeyguard A, Greenwell F, Neuman M. A spatial analysis of childhood mortality in West Africa. Popul Space Place. 2004;10:175–216.
Schneider A, Friedl MA, Potere D. Mapping global urban areas using MODIS 500-m data: new methods and datasets based on ‘urban ecoregions’. Remote Sens Environ. 2010;114:1733–46.
WorldPop population database. GeoData Institute, University of Southampton. Accessed 01 Sep 2015.
Linard C, Gilbert M, Snow RW, Noor AM, Tatem AJ. Population distribution, settlement patterns and accessibility across Africa in 2010. PLoS ONE. 2012;7:e31743.
Elith J, Graham CH, Anderson RP, Dudík M, Ferrier S, Guisan A, et al. Novel methods improve prediction of species’ distributions from occurrence data. Ecography. 2006;29:129–51.
Martin V, Pfeiffer DU, Zhou X, Xiao X, Prosser DJ, Guo F, et al. Spatial distribution and risk factors of highly pathogenic avian influenza (HPAI) H5N1 in China. PLoS Pathog. 2011;7:e1001308.
Van Boeckel TP, Thanapongtharm W, Robinson T, Biradar CM, Xiao X, Gilbert M. Improving risk models for avian influenza: the role of intensive poultry farming and flooded land during the 2004 Thailand epidemic. PLoS ONE. 2012;7:e49528.
Grinand C, Arrouays D, Laroche B, Martin MP. Extrapolating regional soil landscapes from an existing soil map: sampling intensity, validation procedures, and integration of spatial context. Geoderma. 2008;143:180–90.
Linard C, Tatem AJ, Gilbert M. Modelling spatial patterns of urban growth in Africa. Appl Geogr. 2013;44:23–32.
Ridgeway G. Package ‘gbm’. R-News. 2009. p. 15.
Elith J, Leathwick JR, Hastie T. A working guide to boosted regression trees. J Anim Ecol. 2008;77:802–13.
Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A. Very high resolution interpolated climate surfaces for global land areas. Int J Climatol. 2005;25:1965–78.
Guerra CA, Snow RW, Hay SI. Defining the global spatial limits of malaria transmission in 2005. Adv Parasitol. 2006;62:157–79.
Guerra CA, Snow RW, Hay SI. Mapping the global extent of malaria in 2005. Trends Parasitol. 2006;22:353–8.
Snow RW, Guerra CA, Noor AM, Myint HY, Hay SI. The global distribution of clinical episodes of Plasmodium falciparum malaria. Nature. 2005;434:214–7.
Adlaoui E, Faraj C, El Bouhmi M, El Aboudi A, Ouahabi S, Tran A, et al. Mapping malaria transmission risk in northern Morocco using entomological and environmental data. Malar Res Treat. 2011;2011:391463.
Baragatti M, Fournet F, Henry MC, Assi S, Ouedraogo H, Rogier C, et al. Social and environmental malaria risk factors in urban areas of Ouagadougou, Burkina Faso. Malar J. 2009;8:13.
Machault V, Vignolles C, Pages F, Gadiaga L, Tourre YM, Gaye A, et al. Risk mapping of Anopheles gambiae s.l. densities using remotely-sensed environmental and meteorological data in an urban area: Dakar, Senegal. PLoS ONE. 2012;7:e50674.
Stoler J, Weeks JR, Getis A, Hill AG. Distance threshold for the effect of urban agriculture on elevated self-reported malaria prevalence in Accra, Ghana. Am J Trop Med Hyg. 2009;80:547–54.
Scharlemann JP, Benz D, Hay SI, Purse BV, Tatem AJ, Wint GR, et al. Global data for ecology and epidemiology: a novel algorithm for temporal Fourier processing MODIS data. PLoS ONE. 2008;3:e1408.
Potere D, Schneider A. A critical look at representations of urban areas in global maps. GeoJournal. 2007;69:55–80.
Utzinger J, Keiser J. Urbanization and tropical health—then and now. Ann Trop Med Parasitol. 2006;100:517–33.
Dorélien A, Balk D, Todd M. What is urban? Comparing a satellite view with the demographic and health surveys. Popul Dev Rev. 2013;39:413–39.
Esch T, Marconcini M, Felbier A, Roth A, Heldens W, Huber M, et al. Urban footprint processor—fully automated processing chain generating settlement masks from global data of the TanDEM-X mission. IEEE Geosci Remote Sens Lett. 2013;10:1617–21.
Gamba P, Lisini G. Fast and efficient urban extent extraction using ASAR wide swath mode data. IEEE J Sel Top Appl Earth Obs Remote Sens. 2013;6:2184–95.
Gong P, Wang J, Yu L, Zhao Y, Zhao Y, Liang L, et al. Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data. Int J Remote Sens. 2013;34:2607–54.
Pesaresi M, Ehrlich D, Ferri S, Florczyk A, Freire S, Haag F, et al. Global human settlement analysis for disaster risk reduction. In: The international archives of the photogrammetry, remote sensing and spatial information sciences, vol XL-7/W3, 36th International symposium on remote sensing of environment, Berlin, pp 837–43.
Schneider A, Friedl MA, Potere D. A new map of global urban extent from MODIS satellite data. Environ Res Lett. 2009;4:044003.
NOAA Climate Prediction centre (CPC), USAID/Famine Early Systems Network (FEWS). RFE 2.0 rainfall estimates. http://www.cpc.noaa.gov/products/international/data.shtml. Accessed Sep 2014.
CWK, CL and RWS developed the study concept and study design. CWK was responsible for data assembly and data analysis and wrote the first draft of the manuscript. CL and MG guided study design, model development and data interpretation and helped in drafting the manuscript. AMN and RWS provided the policy and epidemiological interpretation of the study results. All authors reviewed the manuscript and contributed to the final submission. All authors read and approved the final manuscript.
All persons involved in data acquisition, analysis and interpretation of data as well as drafting and revision of the manuscript are included in the authorship of the paper.
The authors declare that they have no competing interests.
Availability of data and materials
The household survey datasets used in this study are publicly available to registered users on the Demographic and Health Surveys (DHS) website  and Malaria Indicator Surveys (MIS) website . Urbanization datasets used in this study are available free for registered users [13, 28, 54]. All environmental and geographical variables used in this study are freely available to the public [19, 20, 55]. All other data used in this study are presented in the publication.
This study is the secondary analysis of DHS and MIS data. The procedures and questionnaires for DHS surveys have been reviewed and approved by the ICF International Institutional Review Board (IRB). The ICF International IRB ensures that the survey complies with the U.S. Department of Health and Human Services regulations for the protection of human subjects (45 CFR 46).
AM is supported by the Wellcome Trust, UK as an intermediate fellow (# 095127); RWS is supported by the Wellcome Trust as Principal Research Fellow (# 079080 and # 103602), that’s also supported CWK. CL is supported by funding from the Belgian Science Policy (SR/00/304). MG is funded by the ‘Fonds National de la Recherche Scientifique’. CWK is also grateful to the KEMRI-Wellcome Trust Overseas Programme Strategic Award (# 084538) for additional support during her Ph. D.
About this article
Cite this article
Kabaria, C.W., Gilbert, M., Noor, A.M. et al. The impact of urbanization and population density on childhood Plasmodium falciparum parasite prevalence rates in Africa. Malar J 16, 49 (2017) doi:10.1186/s12936-017-1694-2
- Population density
- Boosted regression trees