Spatially variable risk factors for malaria in a geographically heterogeneous landscape, western Kenya: an explorative study
© Homan et al. 2015
Received: 17 July 2015
Accepted: 9 December 2015
Published: 4 January 2016
Large reductions in malaria transmission and mortality have been achieved over the last decade, and this has mainly been attributed to the scale-up of long-lasting insecticidal bed nets and indoor residual spraying with insecticides. Despite these gains, considerable residual and spatially heterogeneous transmission remains. To reduce transmission in these foci, researchers need to consider the local demographical, environmental and social context, and design an appropriate set of interventions. Exploring spatially variable risk factors for malaria can give insight into which human and environmental characteristics play important roles in sustaining malaria transmission.
On Rusinga Island, western Kenya, malaria infection was tested by rapid diagnostic tests during two cross-sectional surveys conducted 3 months apart in 3632 individuals from 790 households. For all households demographic data were collected by means of questionnaires. Environmental variables were derived using Quickbird satellite images. Analyses were performed on 81 project clusters constructed by a travelling salesman algorithm, each containing 50–51 households. A standard linear regression model containing multiple variables was fitted to determine how much of the spatial variation in malaria prevalence could be explained by the demographic and environmental data. Subsequently, a geographically-weighted regression (GWR) was performed assuming non-stationarity of risk factors. Special attention was paid to the effects of residual spatial autocorrelation and local multicollinearity.
Combining the data from both surveys, overall malaria prevalence was 24 %. Scan statistics revealed two clusters which had significantly elevated numbers of malaria cases compared to the background prevalence across the rest of the study area. A multivariable linear model including environmental and household factors revealed that higher socioeconomic status and outdoor occupation were associated with increased malaria risk, and higher population density with slightly reduced risk. The local GWR model improved the model fit considerably and the relationship of malaria with risk factors was found to vary spatially over the island; in different areas of the island socioeconomic status, outdoor occupation and population density were found to be positively or negatively associated with malaria prevalence.
Identification of risk factors for malaria that vary geographically can provide insight into the local epidemiology of malaria. Examining spatially variable relationships can be a helpful tool in exploring which set of targeted interventions could be implemented locally. Supplementary malaria control may be directed at areas that are identified as at risk. For instance, areas where many people work outdoors at night may need more focus in terms of vector control.
Trial registration: Trialregister.nl NTR3496—SolarMal, registered on 20 June 2012
Across sub-Saharan Africa, malaria remains one of the leading causes of morbidity and mortality with up to 200 million symptomatic cases every year. In Kenya, 75 % of the population is at risk of malaria infection, but due to intensified control efforts the number of malaria cases has decreased twofold in one decade to well under five million annually. Interventions which have contributed to the decline of malaria transmission and mortality are the use of insecticide-treated nets (ITNs), long-lasting insecticidal nets (LLINs), indoor residual spraying (IRS) and treatment of patients with artemisinin-based combination therapy (ACT) [2, 3]. The goal of WHO and Roll Back Malaria (RBM) is to continue the efforts to fight malaria until local elimination and eventually eradication is achieved [4–6].
Since large successes have been realized and many areas have moved into a pre-elimination phase, the epidemiology of malaria is changing. Although malaria transmission has always been geographically heterogeneous, under pressure of current interventions the spatial heterogeneity of malaria becomes more pronounced, typically characterized by areas or clusters of households that persistently have higher proportions of infected individuals compared with the population average. In order to aid the malaria elimination phase, a better understanding of the epidemiology of malaria, considering geographical heterogeneity, is needed. Heterogeneity in malaria transmission is not a new phenomenon, but because of improved research methods and the enhanced capacity of information technology, recent studies have more frequently shed light on the smaller-scale geographical heterogeneity of malaria [10–12]. Studies suggest that factors associated with the spatial clustering of malaria include house structure, human behaviour, and environmental, geographical and demographical variables [13–17].
Many studies have investigated clustering and the spatial heterogeneity of malaria risk [18–21], but fewer studies have investigated ways in which the relationships of factors influencing this heterogeneity vary over space. Lessons can be learnt from studies that investigated the geographically varying nature of factors on agricultural and environmental [23, 24] outcomes. Relatively few studies have addressed the causes of spatial heterogeneity in health outcomes [25, 26] such as malaria [27–30].
In the present study, it is explored whether risk factors for malaria also vary over space. Household and environmental risk factors contributing to malaria prevalence were studied by means of a frequentist non-spatial risk model and clusters of elevated malaria risk were identified through scan statistics. The final aim of this study was to investigate the spatial heterogeneity in relationships between malaria prevalence and associated risk factors by Geographically Weighted Regression (GWR). The added value of using this geostatistical model is explored, and the advantage compared to a standard linear regression model is evaluated.
The study is embedded as part of a baseline study in a large malaria vector control trial (SolarMal) on Rusinga Island, western Kenya. The SolarMal trial aims to reduce malaria transmission on Rusinga Island by mass trapping of malaria vectors with odour-baited traps (OBTs), which contain a blend of organic volatiles that mimic a human odour. Through daily removal trapping the project aims to reduce malaria vector populations and eventually decrease malaria transmission. The analysis of spatial heterogeneity of risk factors for malaria can give a better understanding of malaria epidemiology and can be of value for programme managers who want to explore targeting interventions to specific geographical locations.
Study site and population
On Rusinga Island, the population is traditionally part of the Luo tribe. The principal occupation is fishing and labour associated with fishing; many of the other inhabitants are involved in rain-fed subsistence agriculture. Malaria transmission occurs throughout the year, with peaks in transmission late in the rainy seasons when parasite prevalence is approximately 30 % across the population. Plasmodium falciparum is the most prevalent species of malaria in western Kenya, accounting for 98 % of the cases, and the malaria-transmitting vectors are Anopheles funestus and to a lesser extent Anopheles gambiae s.s. and Anopheles arabiensis [33–35].
Field set up
The SolarMal project is based at the Thomas Odhiambo Campus of the International Centre of Insect Physiology and Ecology (TOC-icipe) in the village of Mbita Point, one kilometre from the causeway which connects the island to the mainland. Meteorological data such as daily temperature and precipitation were obtained from the Suba meteorological field station at Rusinga Island (0°24′19.28″ South and 34°08′51.94″ East). A health and demographic surveillance system (HDSS) was set up to visit every individual living on Rusinga Island three times per year. A census enumeration survey, conducted from May to July 2012, recorded 23,337 individuals residing in 6954 residential structures (henceforth termed houses) divided into 4063 economically independent households. During the census HDSS round, the coordinates of all residential structures, as well as public buildings, were recorded. Fieldworkers were equipped with mobile tablet computer devices (Samsung Galaxy Tab 2, 10.1) with an inbuilt global positioning system (GPS) receiver for the data collection. All individuals were asked to provide their full name, sex, date of birth, main occupation and their relationship to the head of household. An individual was considered eligible for participation in the study when he or she intended to live for at least 6 months on the island. Data collection and handling were conducted using general structured questionnaires in the OpenHDS data collection and management platform. Data were transferred on a daily basis to a secured local server, enabling researchers to work with a completely digital, near real-time database. Clean data were deposited in a MySQL database. During baseline studies one HDSS update survey was conducted from January to June 2013. For the rollout of the intervention the island was divided into 81 geographically contiguous clusters with 50–51 households per cluster.
The households were allocated to clusters according to a travelling salesman algorithm by which the shortest imaginary route connecting every household on the island was identified. A new cluster was created after every 50–51 households (Fig. 1). Eighty-one clusters provide a sufficient number of units to carry out regression, while a sample from approximately 50 households provides enough statistical power to estimate the true value for a cluster.
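The allocation step described above can be illustrated with a minimal sketch. It assumes a greedy nearest-neighbour heuristic as a stand-in for the travelling salesman solver (the text does not say which solver was used), and the function names `nearest_neighbour_route` and `split_route` are hypothetical.

```python
import math


def nearest_neighbour_route(points):
    """Greedy approximation of a short route that visits every point once.

    Assumption: a nearest-neighbour heuristic replaces the (unspecified)
    travelling salesman solver used in the study.
    """
    unvisited = list(range(len(points)))
    route = [unvisited.pop(0)]  # start at the first household
    while unvisited:
        last = points[route[-1]]
        nxt = min(unvisited, key=lambda i: math.dist(last, points[i]))
        unvisited.remove(nxt)
        route.append(nxt)
    return route


def split_route(route, n_clusters):
    """Cut the route into consecutive groups whose sizes differ by at most one."""
    base, extra = divmod(len(route), n_clusters)
    clusters, start = [], 0
    for k in range(n_clusters):
        size = base + (1 if k < extra else 0)
        clusters.append(route[start:start + size])
        start += size
    return clusters
```

Cutting a route through 4063 households into 81 consecutive groups in this way yields 13 groups of 51 and 68 groups of 50 households, which matches the 50–51 households per cluster reported above.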
During the baseline period before the rollout of the intervention commenced, two parasitological prevalence surveys were conducted in a cross section of the study population. Households were randomly selected for inclusion in each prevalence survey to the point where 10 % of the population was included. All members of selected households were informed in advance of the date and time of the survey and were invited to assemble at a public place such as a church or a school near their home for malaria testing. In total, residents of 790 randomly selected households were sampled, covering 1223 houses. The first survey examined 1822 individuals (7.8 % of the total island population) and was carried out at the start of the short rainy season, from September to November 2012. A second prevalence survey examined 1810 individuals (7.7 % of the total population) and was conducted from February to April 2013. Individual body temperature was measured by means of a Braun™ IRT 3020 ear thermometer. A drop of blood was obtained through a finger prick and directly tested for antigens of malaria parasites using an SD BIOLINE™ Malaria Ag P.f/Pan (HRP-II/pLDH) Rapid Diagnostic Test (RDT). The SD Bioline RDT kit results distinguish between infection with P. falciparum and other Plasmodium species. However, test results with more than one positive reading or indicating multiple species of Plasmodium were pooled. If the individual tested positive for malaria antigens, an appropriate dose of Coartem® (Artemether/lumefantrine) was provided free of charge.
Table 1. Variables considered for the global regression model of malaria prevalence
Variable | Description for GWR per project cluster
 | % of children under 5 years old
 | % of children between 5 and 15 years old
 | % of people above the age of 15
 | % outdoor occupation
People per sleeping room | Mean people per sleeping room
People per house | Mean people per house
 | % houses with open eaves
Condition of bed nets | % bed nets without damages
House sprayed last 12 months | % sprayed houses in last 12 months
Nets per person | Mean number of nets per person
Socio economic status1 | % of people with highest SES
Socio economic status2 | % of people with lowest SES
 | % of houses owned
 | Mean population density
 | Mean malaria mosquito catches per house
Distance to lake | Mean distance to the lake
Elevation from lake | Mean elevation from lake
Distance to clinic | Mean distance to nearest health clinic
Monitoring of mosquitoes took place across five consecutive rounds from September 2012 until June 2013, selecting 80 households per round, each time by means of a simple random sample, with replacement, of all households on the island. Mosquitoes were collected inside and outside selected households using odour-baited MM-X traps (American Biophysics Corporation, RI, USA) [32, 39]. Data from the first, second, fourth and fifth rounds of surveillance (September to November 2012 and March to June 2013) were pooled as they corresponded temporally with the two baseline malaria prevalence surveys. In total, entomological data from 353 households were included in this study. The total number of female anophelines caught inside and outside each household was pooled as a single observation for that particular household.
A multispectral QuickBird image, taken on 17/03/2010 with a spatial resolution of 2.4 m, was obtained through DigitalGlobe®. Initially, the image was used for geo-referencing of residential and public structures and infrastructure. The image was geo-referenced, radiometrically corrected, corrected for sensor and platform-induced distortions, and was ready for orthorectification. Orthorectification was performed using a Digital Elevation Model (DEM). The DEM used was an ASTER GDEM 2, and the geographical coordinate system was referenced to the 1984 World Geodetic System (WGS84). Several geographic variables were derived for each household using the image and DEM: elevation relative to lake, distance to lake, distance to nearest clinic, population density, the Normalized Difference Vegetation Index (NDVI) and the Topographic Wetness Index (TWI). The NDVI is a commonly used indication of greenness and is calculated based on the values of the red and near infrared spectral bands within a radius of 250 m. The TWI defines the wetness of an area and combines the upstream area with the local slope, with the upstream area expressed as the number of cells ‘upstream’ of cells measuring 30 × 30 m (900 m²). Population density measures were calculated within a radius of 250 m. All the geographical variables per household were averaged per project cluster for data analysis, and the analysis was performed at cluster level. Geographic data and variables were pre-processed, compiled and displayed using ArcGIS (ArcGIS 10.2.1, ESRI Inc., Redlands, CA, USA).
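The two indices can be written out with their standard textbook definitions, NDVI = (NIR − Red) / (NIR + Red) and TWI = ln(a / tan β), where a is the upslope contributing area per unit contour width and β the local slope. The sketch below is illustrative only: counting the cell itself in the contributing area and flooring the slope at a small minimum are assumptions made here, not details taken from the study's ArcGIS workflow.

```python
import math


def ndvi(red, nir):
    """Normalized Difference Vegetation Index from red and near-infrared reflectance."""
    total = nir + red
    if total == 0:
        return 0.0  # assumption: report 0 for pixels with no signal in either band
    return (nir - red) / total


def twi(upstream_cells, slope_radians, cell_size=30.0, min_slope=0.001):
    """Topographic Wetness Index ln(a / tan(beta)) for one DEM cell.

    upstream_cells: number of cells draining into this cell.
    Assumptions: the cell itself is counted in the contributing area, and
    flat cells are given a small minimum slope to avoid division by zero.
    """
    area = (upstream_cells + 1) * cell_size * cell_size  # m^2
    specific_area = area / cell_size  # area per unit contour width, m
    slope = max(slope_radians, min_slope)
    return math.log(specific_area / math.tan(slope))
```

Per-household values would then be averaged over the pixels within 250 m (for NDVI) and over the households of each project cluster, as described above.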
For this analysis the measurements of both prevalence surveys were pooled, and the mean malaria prevalence per project cluster, based on individual RDT outcomes, was analysed and mapped with smoothing using an areal interpolation technique. Areal interpolation is a kriging-based interpolation method that accommodates polygons of different shapes. A Gaussian distribution for data averaged over polygons was used to produce semivariograms. Semivariograms were then used to investigate the degree of spatial variation; the model function that captured the most empirical data points within its confidence intervals was chosen.
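As a rough illustration of what a semivariogram summarizes, the sketch below computes a classical empirical semivariogram, γ(h) = Σ(zᵢ − zⱼ)² / (2·N(h)), over distance bins. It is a simplification: it treats each cluster as a single point (for example its centroid), whereas the areal interpolation used in the study works with polygons. The function name and the binning scheme are assumptions.

```python
import math
from collections import defaultdict


def empirical_semivariogram(coords, values, bin_width):
    """Return (bin centre, semivariance, number of pairs) per distance bin.

    Assumption: each areal unit is represented by one point, which is a
    simplification of the polygon-based method described in the text.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    n = len(values)
    for i in range(n):
        for j in range(i + 1, n):
            h = math.dist(coords[i], coords[j])
            b = int(h // bin_width)
            sums[b] += (values[i] - values[j]) ** 2
            counts[b] += 1
    return [((b + 0.5) * bin_width, sums[b] / (2 * counts[b]), counts[b])
            for b in sorted(sums)]
```

Semivariance that rises with distance and then levels off indicates that nearby clusters are more alike than distant ones, which is the spatial structure a fitted model function is meant to capture.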
Unlike the regression analyses, which were based on continuous household or individual data aggregated per project cluster (Table 1), potential ‘hot spots’ of malaria cases were analysed at the individual level with a binomial distribution, with malaria positive or negative as the outcome variable. Kulldorff spatial scan statistic analyses were performed (SaTScan, v9.1.1) [41, 42] using a circular window that gradually scans the map of the island, quantifying the observed and expected numbers of cases within the window for every house. Within each circle, values in a radius around each household were compared to the expected values and a likelihood ratio test was subsequently performed. P values were obtained by 999 Monte Carlo replications, and when p values were ≤0.05, houses in this circle were considered to be part of a significant hot spot of elevated malaria prevalence. The maximum scan window was set at 1.5 km and a maximum of 50 % of the population was allowed in one possible hot spot.
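The likelihood ratio test for a single candidate window can be sketched as follows, using the standard Bernoulli form of the spatial scan statistic. This is a simplified illustration and not SaTScan's implementation; the function names are hypothetical, and the search over all circular windows as well as the random relabelling of cases for the Monte Carlo replicates are left out.

```python
import math


def _xlogy(x, y):
    """x * log(y), defined as 0 when x is 0."""
    return 0.0 if x == 0 else x * math.log(y)


def bernoulli_llr(c, n, C, N):
    """Log likelihood ratio for a window with c cases among n people,
    given C cases among N people in the whole study area."""
    if n == 0 or n == N:
        return 0.0
    p_in = c / n
    p_out = (C - c) / (N - n)
    if p_in <= p_out:
        return 0.0  # only windows with elevated prevalence are of interest
    p_all = C / N
    alt = (_xlogy(c, p_in) + _xlogy(n - c, 1 - p_in)
           + _xlogy(C - c, p_out) + _xlogy((N - n) - (C - c), 1 - p_out))
    null = _xlogy(C, p_all) + _xlogy(N - C, 1 - p_all)
    return alt - null


def monte_carlo_p(observed, simulated):
    """Rank-based p value of the observed maximum LLR among simulated maxima."""
    exceed = sum(1 for s in simulated if s >= observed)
    return (exceed + 1) / (len(simulated) + 1)
```

With 999 replicates the smallest attainable p value is 1/1000 = 0.001, which is why replicate counts of this form are a common choice.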
Stationary epidemiological risk models assume that observations are geographically independent. These ‘global’ models assume that the coefficients of predictor variables for malaria apply to the whole island. Outcomes can be biased because these models do not account for spatial dependence, even though the relationships between malaria and its risk factors, such as demographical and environmental features, can vary over space. In order to gain an enhanced insight into variation in malaria outcomes, incorporating potential spatial dependence of predictor and dependent variables is vital where disease patterns are spatially heterogeneous. Moreover, to effectively capture spatially variable associations between risk factors and malaria outcomes, regression coefficients may need to vary locally as well. To include these considerations of spatial non-stationarity, a geographically weighted regression (GWR) model was deployed. A log transformation was performed to normalize the slightly positively skewed malaria prevalence data at cluster level.
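The core idea of GWR is that a separate weighted least-squares fit is made at each location, with nearer observations weighted more heavily: β(uᵢ) = (XᵀWᵢX)⁻¹XᵀWᵢy. The sketch below is a minimal illustration assuming a fixed-bandwidth Gaussian kernel; it is not the spgwr or GWR4 implementation, the kernel and bandwidth choices of the study are not reproduced, and the function name `gwr_fit` is hypothetical.

```python
import numpy as np


def gwr_fit(coords, X, y, bandwidth):
    """Local regression coefficients (intercept first) at every observation site.

    Assumption: fixed-bandwidth Gaussian kernel, weights exp(-0.5 * (d / bandwidth)^2).
    """
    coords = np.asarray(coords, dtype=float)
    y = np.asarray(y, dtype=float)
    X = np.column_stack([np.ones(len(y)), np.asarray(X, dtype=float)])
    betas = np.empty((len(y), X.shape[1]))
    for i, site in enumerate(coords):
        d = np.linalg.norm(coords - site, axis=1)  # distance from this site to all sites
        w = np.exp(-0.5 * (d / bandwidth) ** 2)    # kernel weights
        XtW = X.T * w                              # scales each observation by its weight
        betas[i] = np.linalg.solve(XtW @ X, XtW @ y)
    return betas
```

As the bandwidth grows, all weights approach 1 and every local fit converges to the global ordinary least-squares solution, so the stationary model can be seen as a limiting case of GWR.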
To explore which predictor variables to include in the GWR model, a global multivariable regression (stationary model) was initially performed. In selecting the best model for explaining log-transformed risk, several model features besides goodness-of-fit and statistical significance of predictors were considered. First, the assumption of normally distributed residuals of the estimated outcome was tested (Jarque–Bera test), as the model prediction function relies on normally distributed unexplained variance. Second, the included predictor variables should not show multicollinearity (indicated by a Variance Inflation Factor of <7.5), to prevent the same predictive effect from being captured twice. Moreover, regression residuals need to be randomly distributed, so that observed relationships are not inflated by residuals (observed minus predicted values) that are not independent of each other. Regression residuals were therefore examined for residual spatial autocorrelation (RSA). Furthermore, a test to detect heteroscedasticity (Breusch–Pagan statistic) was carried out to get an idea of heterogeneity in the relationship between the predictor and dependent variables. The model that satisfied all these requirements and had the highest R² was selected for further analysis in a GWR model. The model did not control for possible correlated observations.
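The Variance Inflation Factor used as the multicollinearity criterion has the standard definition VIFⱼ = 1 / (1 − Rⱼ²), where Rⱼ² comes from regressing predictor j on all other predictors. A minimal sketch follows; the function name is hypothetical and the code is not the routine used by the software named in the text.

```python
import numpy as np


def variance_inflation_factors(X):
    """VIF for each column of the predictor matrix X (no intercept column)."""
    X = np.asarray(X, dtype=float)
    vifs = []
    for j in range(X.shape[1]):
        target = X[:, j]
        # regress predictor j on an intercept plus all remaining predictors
        others = np.column_stack([np.ones(len(X)), np.delete(X, j, axis=1)])
        coef, *_ = np.linalg.lstsq(others, target, rcond=None)
        resid = target - others @ coef
        r2 = 1.0 - (resid @ resid) / np.sum((target - target.mean()) ** 2)
        vifs.append(1.0 / (1.0 - r2))
    return vifs
```

Uncorrelated predictors give a VIF of 1, and values above the 7.5 threshold mentioned above would flag a predictor that is largely explained by the others.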
A set of local goodness-of-fit statistics was derived by plotting the local R² per cluster. Furthermore, the local coefficients and p values of the predictor variables were plotted to explore the geographically varying relationships with malaria prevalence. A semivariogram of regression residuals was constructed to explore the spatial structure of the model. To examine the final GWR model for possible spatial autocorrelation in the residuals (RSA), a Moran’s I test was performed on the residuals between observed and predicted values of malaria prevalence. Finally, the model predictions were validated by means of exhaustive cross validation. Many different samples of training and validation sets were considered to validate predictions in every cluster.
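Moran's I for a set of residuals can be computed directly from its definition, I = (n / S₀) · Σᵢⱼ wᵢⱼ(zᵢ − z̄)(zⱼ − z̄) / Σᵢ(zᵢ − z̄)², where S₀ is the sum of all spatial weights. The sketch below assumes a user-supplied weight matrix (for example 1 for neighbouring clusters and 0 otherwise); the weighting scheme used in the study is not specified here, and the significance test is omitted.

```python
def morans_i(values, weights):
    """Moran's I for values observed at n sites, given an n x n spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)  # total weight
    num = sum(weights[i][j] * dev[i] * dev[j]
              for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / s0) * (num / den)
```

Values near the null expectation of −1/(n − 1) suggest spatially random residuals, while clearly positive values indicate that neighbouring clusters have similar residuals.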
Special attention was given to the issue of local multicollinearity, because GWR outcomes can be heavily biased and local coefficients can become inflated if different predictor variables have similar geographical patterns. Local multicollinearity was assessed by the condition number. This number increases if predictor variables show similar patterns, and when it is above 30, the model is assumed to be unstable and unreliable.
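One common way to obtain a local condition number is to take the ratio of the largest to the smallest singular value of the kernel-weighted design matrix at each location, after scaling its columns to unit length. The sketch below follows that approach under stated assumptions: a fixed-bandwidth Gaussian kernel and unit-length column scaling are choices made here, and the exact computation in the software used for the study may differ.

```python
import numpy as np


def local_condition_numbers(coords, X, bandwidth):
    """Condition number of the weighted, column-scaled design matrix at each site.

    Assumptions: Gaussian kernel with fixed bandwidth; an intercept column is
    added; columns are scaled to unit length before the decomposition.
    """
    coords = np.asarray(coords, dtype=float)
    X = np.column_stack([np.ones(len(coords)), np.asarray(X, dtype=float)])
    result = []
    for site in coords:
        d = np.linalg.norm(coords - site, axis=1)
        w = np.exp(-0.5 * (d / bandwidth) ** 2)
        Xw = X * np.sqrt(w)[:, None]            # apply square-root weights row-wise
        Xw = Xw / np.linalg.norm(Xw, axis=0)    # scale columns to unit length
        s = np.linalg.svd(Xw, compute_uv=False)
        result.append(s[0] / s[-1])
    return np.array(result)
```

Locations where the returned value exceeds 30 would be the ones flagged as unstable under the rule of thumb stated above.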
Statistical analysis and model building were performed using R software (RStudio, Inc.© version 0.98.1102 package spgwr), GWR4© (Newcastle University, UK) and ArcGIS (10.2.1, ESRI Inc., Redlands, USA).
Ethical approval was obtained from the Kenya Medical Research Institute (KEMRI); non-SSC Protocol No. 350. All participants were provided with written and oral information regarding the project aims, the ongoing demographic and entomological surveillance activities, the implementation of the intervention, and the collection and use of blood samples. Adults, mature minors and caregivers of children provided written informed consent in the local language agreeing to participation in the SolarMal project activities.
Table 2. Summary results of hot spots detected by SaTScan (columns: number of individuals; expected infected individuals)
Global linear regression model
Table 3. Summary results for best non-spatial linear regression model for malaria prevalence (columns: robust std error; robust p value)
Reported statistics, in order: Joint Wald statistic; 18.75 (p = 0.001); 0.45 (p = 0.21); 8.86 (p = 0.03); 4.05 (p = 0.13)
Because heteroscedasticity was significantly present in the global linear regression (GLR) model, the robust p values and standard errors were used to assess the relationships of the predictor variables with malaria prevalence. Outdoor occupation was the strongest significant predictor in the model, with a coefficient of 0.57 (p < 0.0001). Furthermore, belonging to a household with a high socioeconomic status (SES) was positively associated with malaria prevalence, with a significant coefficient of 0.24 (p = 0.02). A third significant predictor variable was population density, although the coefficient was only −0.004 (p = 0.001). All predictor variables in the final global model were tested for multicollinearity, and all variance inflation factors were well below the threshold of 7.5 (Table 3).
Geographically weighted regression model
Table 4. Comparison between global regression and GWR model (rows include residual sum of squares and −2 log likelihood)
Reported statistics, in order: 0.45 (p = 0.21); 0.23 (p = 0.25)
Over the past decade large reductions in malaria have been achieved, yet the current distribution of malaria is still spatially heterogeneous [7, 49]. Considerable research is currently being conducted to find tools for malaria control that are able to target residual malaria transmission, in order to reach the goals set by the RBM initiative to eliminate malaria where possible, or reduce it to a minimum [50, 51]. Established interventions such as LLINs, IRS and case management have proven to be effective, but this one-size-fits-all strategy is not appropriate when moving into the elimination phase. These existing methods will need to be complemented by novel tools, which may entail interventions targeting local geography, demography and societal context. Exploring locally varying relationships of risk factors for malaria may aid in exploring and eventually targeting appropriate interventions. Traditional descriptions and models report on the progressively heterogeneous nature of malaria transmission, but analyses reporting on risk factors for malaria and disease usually ignore spatial heterogeneity of the underlying risk factors of disease.
In exploring spatially varying relationships of risk factors for malaria, factors that are directly related to malaria risk as well as proxy factors were used. Socioeconomic status, screened eaves and condition of bed nets are examples of factors directly influencing malaria risk, whereas distance to nearest clinic and environmental variables such as TWI and NDVI can have an indirect effect because of access to anti-malarials or proximity to possible breeding sites for malaria vectors. The GLR model explained 27 % of the spatial variance in malaria prevalence; GWR analysis greatly improved model fit, to 69 %. A better fit by the GWR model is confirmed by a reduction in the residual sum of squares as well as an increased likelihood when comparing the global and the local model (Table 4). Local estimates of model fit did vary somewhat over the island (Fig. 4a), and whilst there are several areas where the local model explains no more than 50 % of the variance, an improved fit using the GWR was observed in all study clusters compared with the global model.
Outdoor occupation and activity at night have previously been associated with higher risk for malaria [53, 54]. In the case of Rusinga Island, many people are involved in fishing and labour related to fishing, and these activities are generally performed in shifts during the night. It is known that in between shifts, fishermen spend their time around fishing beaches close to their home with little or no protection against biting malaria mosquitoes. It is during the night that Anopheles gambiae s.l. and An. funestus mosquitoes exhibit their peak host-seeking behaviour, biting mostly indoors but also outdoors; thus people who are active at night are expected to be at increased risk of receiving infective mosquito bites. Spatial heterogeneity of outdoor occupation in the south-east of the island is characterized by a large area where having an outdoor occupation is associated with increased risk of malaria. This is the area of Rusinga with the highest proportion of fishermen. Malaria infections could be acquired there, subsequently fuelling the malaria reservoir and infection risk for others in these areas, a concept that has been proposed previously. Study clusters that include fishing beaches almost all appear to have higher risk because of outdoor occupations, for example the small cluster in the north and the smaller clusters in the west of the island, which fall within a malaria hot spot (Fig. 2b). In the northern part of the island there are also clusters with a reduced risk of malaria for outdoor occupation; these clusters lie in one of the malaria hot spots. The effect is not as large and is also less significant, but a possible explanation is that in this area farming, also an outdoor occupation, is the dominant occupation and is usually performed during the day when mosquitoes are less active.
Nevertheless, working outside at dawn and dusk is becoming increasingly important as a predictor of malaria risk, as the mosquito vectors are recurrently reported to bite after sunrise and before sunset.
Socioeconomic status has often been linked with risk of malaria. Better schooling, improved housing and a higher income are commonly associated with reduced malaria risk. On Rusinga, areas with a higher risk as well as areas with a lower risk for malaria when residing in the highest SES category were identified. The local patterns of SES show that a positive association with malaria, that is an increased risk of malaria among those with the highest SES, mostly affects the central western part of the island and the tip in the north-east (orange clusters). The south-eastern part of the island (green clusters), by contrast, yields clusters that show a reduced risk of malaria among those with the highest SES.
Socioeconomic status itself does not affect malaria directly; hence the components of SES were further explored. It was found that in most of the clusters where high SES is associated with increased malaria risk, most farmland and dwellings are owned by the occupants while house structure is predominantly poor. This could suggest that variables such as owning land and a house, indicators for being in a high SES class, do not necessarily relate directly to reduced malaria risk. Thus even though people are in the highest SES class, the house structure could allow for considerable malaria risk because there is poor protection against mosquitoes entering the house. A higher education level of the head of household could indicate that there is more financial freedom within the family. This can possibly result in a higher expenditure on health care and malaria prevention, which would presumably lead to reduced malaria risk. The components of location of kitchen and wall structure in this SES PCA are proxies of exposure to mosquitoes. When people cook outside during sunset and at night-time they may be exposed to outdoor-biting mosquitoes. Finally, and interestingly, SES did not have a strong (Fig. 5b) or significant (Fig. 6b) relationship with malaria in the hot spots (Fig. 2b). Thus, residing in a malaria hot spot was independent of house ownership, educational level or other SES factors.
A higher population density was associated with a slightly reduced risk of malaria in the GLR model, in keeping with previous findings from various studies in both urban and rural settings in Africa. Higher population density has a large protective effect in some clusters farther from the lake and from potential breeding sites, whereas the association between population density and malaria risk was positive in some clusters closer to the lake. It appears that the effect of a higher population density depended on proximity to possible breeding sites of malaria vectors near the lake shore. In a large simulation study the dynamics of a spatially heterogeneous human and mosquito population were modelled, and it was suggested that where there are few mosquitoes or breeding sites, the chance of receiving an infective bite is reduced in densely populated areas whereas the chance of receiving an infective bite is not reduced in sparsely populated areas. On the other hand, if there are many breeding sites and many mosquitoes close to a densely populated area, the chance of malaria transmission increases considerably compared to areas that are less densely populated, where the chance of malaria transmission does not increase further with increasing mosquito numbers.
Other risk factors considered in the GLR model have all been suggested in previous literature as predictive for malaria risk. Remarkably, human age and mosquito counts as a proxy for exposure did not enter the final model. Young children (0–5 years) and adolescents typically have a higher risk of malaria because of different behaviour regarding malaria prevention and less well developed immune systems. However, on Rusinga age was not significantly related to malaria, and there was no spatial heterogeneity in the effect of age on malaria. Furthermore, increased numbers of mosquitoes caught in some clusters were not accompanied by higher local prevalence. Screened eaves was not a significant predictor, but this can be explained by the fact that more than 90 % of the households did not have screened eaves and therefore there was insufficient information relating to the impact of this variable. There was a fairly homogeneous coverage of bed nets and IRS activities across the island in the year prior to the present study. Bed nets continued to be used, but no further IRS treatments took place. This lack of variability could explain why number of bed nets and IRS coverage were not significantly associated with malaria. NDVI and TWI were also rather homogeneous over the island and therefore not important predictors for malaria. Finally, the average distance to a clinic did not play a role in this model. On this relatively small island, there are five health clinics or dispensaries, and even the households furthest away from a health clinic are at a walking distance of only 3 km.
A first advantage of this study is that allowing for non-stationarity of underlying risk factors for malaria improved model fit considerably, and the resulting local estimates can be used to explore geographically varying factors responsible for spatial patterns of malaria. Local outcomes and relationships can shed light on why malaria persists in certain areas. Secondly, as the data collected for this analysis serve as the baseline survey for a large vector control study, this analysis can assist in exploring further research and may help explain why the interventions ultimately perform better in some areas than in others. One could consider increasing the intensity of available malaria interventions near fishing beaches at night, accounting for poor housing structures, and reducing the number of traps in a densely populated area where high population density is associated with lower risk of malaria.
It is essential to understand the degree to which the results could be influenced by the unit of analysis. The use of discrete zones to perform spatial analysis is very common, but rather contradictory because geographical variation is a continuous process. Project clusters were defined and used to perform the intervention study, with the baseline malaria data described here. The number of clusters and population size per cluster were optimized and adopted for the rollout of the vector control intervention with optimal statistical power as well as community acceptance. Creation of 81 clusters with an even number of households per cluster was calculated to provide sufficient generalizability and randomness to detect a possible difference in malaria incidence (T Smith, personal communication). As the intervention trial is analysed on the basis of geographical divisions, it was logical to use the same clusters for analysis of the baseline data, which gave rise to this work. Spatial analyses are often performed at the same scale at which the data were collected, for instance at village or county level. Published work stresses that a societal or biological rationale is important when constructing discrete geographical zones. The rationale for using the project clusters in this study is that it will be valuable to know which factors influenced the outcome of the vector intervention study, which was conducted at this cluster scale. However, using different discrete clusters, different cluster sizes or individual-level data may yield slightly different outcomes. Smaller units yield more detailed variation in coefficients, and larger units less. Additionally, when an adaptive kernel function is used, the radius within which data are included in each local regression is variable. Here too, local regression at a smaller scale usually leads to more variation in coefficients, and this mostly leads to weaker or stronger local relationships rather than reversed relationships.
Nonetheless, when a global linear regression is performed first, one can be confident that the risk factors obtained are important predictors of malaria and that the local GWR coefficients derived subsequently are justified, even though the strength of the relationships is influenced by the scale chosen.
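The effect of the adaptive kernel described above can be illustrated with a minimal sketch. It assumes a bisquare kernel whose bandwidth at each location is the distance to the k-th nearest neighbour; the function names, the choice of kernel, the values of k and the synthetic data are all assumptions made for illustration and do not reproduce the software or settings used in this study.

```python
import numpy as np

def adaptive_bisquare_weights(coords, i, k):
    """Bisquare weights around point i; bandwidth = distance to its k-th nearest neighbour."""
    d = np.linalg.norm(coords - coords[i], axis=1)
    bandwidth = np.sort(d)[k]          # index 0 is the focal point itself (distance 0)
    w = (1.0 - (d / bandwidth) ** 2) ** 2
    w[d >= bandwidth] = 0.0            # points at or beyond the bandwidth get no weight
    return w

def gwr_coefficients(coords, X, y, k):
    """Fit one weighted least-squares regression per location (intercept added here)."""
    n = len(y)
    Xd = np.column_stack([np.ones(n), X])
    betas = np.empty((n, Xd.shape[1]))
    for i in range(n):
        w = adaptive_bisquare_weights(coords, i, k)
        XtW = Xd.T * w                 # weights each observation's column
        betas[i] = np.linalg.solve(XtW @ Xd, XtW @ y)
    return betas

# Synthetic illustration only (hypothetical data, not the Rusinga survey).
rng = np.random.default_rng(42)
n = 81                                         # same number of units as the project clusters
coords = rng.uniform(0, 10, size=(n, 2))
x = rng.normal(size=n)
true_slope = (coords[:, 0] - 5) / 5            # effect changes sign from west to east
y = 0.2 + true_slope * x + rng.normal(scale=0.1, size=n)

for k in (20, 80):
    slopes = gwr_coefficients(coords, x, y, k)[:, 1]
    print(f"k={k}: local slope range {slopes.min():.2f} to {slopes.max():.2f}")
```

Comparing the printed ranges for a small and a large k shows how the number of neighbours included in each local regression controls how much the local coefficients are smoothed towards a single global value.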
Further limitations of this analysis are linked to the statistical methods underlying GWR. GWR has been criticized for lacking an integrated statistical framework: because it is a collection of local spatial regressions, precise inference is not possible. When interpreting the varying coefficients, one has to bear in mind that the estimates should be regarded as exploratory rather than as exact inference. Since this issue was raised, significance tests have been developed to reduce uncertainty about the relationships identified using this approach. These local tests were incorporated in our analysis, showing areas where relationships were more significant than in other areas. Another concern regarding GWR is that the technique yields local effects that can be inflated because of residual spatial autocorrelation (RSA) and multicollinearity. RSA occurs when regression residuals cluster spatially, violating the assumption of independence in a linear regression model. Even though GWR accounts for this by adding a random error term for observations, coefficients can become inflated due to clustering of residuals. In this analysis much care was taken to examine and test for RSA, minimizing possible uncertainty in coefficients resulting from RSA. Finally, in recent years another limitation of GWR has been put forward: inflation of local coefficients because of local multicollinearity. If predictor variables locally show the same patterns, their effect on the outcome variable can be overestimated. Since this problem was raised, several tools have been developed to assess the extent of local multicollinearity. In this analysis a measure of local multicollinearity, the condition number, was incorporated, and it was concluded that this issue had a negligible distorting effect on the local coefficients.
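The two diagnostics discussed above can be sketched as follows. This is an illustration under stated assumptions, not the procedure used in this study: spatial autocorrelation is measured with Moran's I using row-standardised k-nearest-neighbour weights and a one-sided permutation test, and local multicollinearity is measured as the condition number of the kernel-weighted design matrix after scaling its columns to unit length. The neighbour counts, the bisquare kernel and all function names are hypothetical choices. A condition number above roughly 30 is a commonly cited rule of thumb for problematic collinearity; that threshold is not taken from this article.

```python
import numpy as np

def knn_weights(coords, k):
    """Row-standardised binary weights linking each point to its k nearest neighbours."""
    d = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)                    # a point is not its own neighbour
    W = np.zeros_like(d)
    nearest = np.argsort(d, axis=1)[:, :k]
    W[np.arange(len(coords))[:, None], nearest] = 1.0
    return W / W.sum(axis=1, keepdims=True)

def morans_i(values, W):
    """Global Moran's I of a variable (e.g. regression residuals)."""
    z = values - values.mean()
    return len(values) / W.sum() * (z @ W @ z) / (z @ z)

def morans_i_permutation_test(values, W, n_perm=999, seed=0):
    """Observed Moran's I and a one-sided pseudo p-value for positive autocorrelation."""
    rng = np.random.default_rng(seed)
    observed = morans_i(values, W)
    sims = np.array([morans_i(rng.permutation(values), W) for _ in range(n_perm)])
    p_value = (np.sum(sims >= observed) + 1) / (n_perm + 1)
    return observed, p_value

def local_condition_numbers(coords, X, k):
    """Condition number of the weighted, column-scaled design matrix at each location."""
    n = len(coords)
    Xd = np.column_stack([np.ones(n), X])
    cond = np.empty(n)
    for i in range(n):
        d = np.linalg.norm(coords - coords[i], axis=1)
        bw = np.sort(d)[k]                         # adaptive bandwidth (assumed bisquare kernel)
        w = np.where(d < bw, (1 - (d / bw) ** 2) ** 2, 0.0)
        Xw = Xd * np.sqrt(w)[:, None]
        Xw = Xw / np.linalg.norm(Xw, axis=0)       # scale columns to unit length
        s = np.linalg.svd(Xw, compute_uv=False)    # singular values, largest first
        cond[i] = s[0] / s[-1]
    return cond

# Synthetic illustration only (hypothetical data, not the Rusinga survey).
rng = np.random.default_rng(7)
coords = rng.uniform(0, 10, size=(81, 2))
residuals = rng.normal(size=81)                    # stand-in for model residuals
W = knn_weights(coords, k=5)
I, p = morans_i_permutation_test(residuals, W)
print(f"Moran's I of residuals: {I:.3f} (pseudo p-value {p:.3f})")

X = rng.normal(size=(81, 2))                       # two unrelated predictors
cn = local_condition_numbers(coords, X, k=30)
print(f"largest local condition number: {cn.max():.1f}")
```

In practice the residuals passed to the Moran's I test would come from the fitted global or local model, and locations with high condition numbers would be flagged so that their local coefficients are interpreted with extra caution.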
In this study, geographically varying risk factors for malaria were modelled. The aim was to explore the spatial heterogeneity of malaria risk factors rather than to draw exact inferences. The study reveals that predictor variables for malaria vary geographically even over small distances of several kilometres. The exploration demonstrates that assuming stationarity of risk factors by means of a global statistical model ignores spatial components that can yield useful information and improve model fit. Belonging to the highest SES group, working outdoors (during night time) and population density were most predictive for malaria patterns on Rusinga Island. When considering SES as a risk factor for malaria, one has to bear in mind that its effect depends on the local setting and the components included; hence results need to be interpreted with caution. All relationships with risk factors were spatially heterogeneous, and these varying effects can be used to explore why vector control interventions on the island may have different effects in different areas.
TS, NM, WT, AH, CM, WRM and TH designed the study. TH, NM, AdP, IK and KO designed and implemented the questionnaires and managed the database of the Health and Demographic Surveillance System. AH and WT designed the entomological monitoring study. TS and AR assisted with the spatial analysis, and TH performed the statistical analyses. TH, AH and WT wrote the manuscript. All authors read and approved the final manuscript.
We want to thank the population of Rusinga Island for cooperating with us and for embracing the project. We are also very thankful to our fieldworkers and staff at the International Centre of Insect Physiology and Ecology, enabling us to implement and manage all our scientific activities from the field station in Mbita. This study was funded by a grant from the COmON Foundation through the campaign Food for Thought of the Wageningen University Fund. AR gratefully acknowledges support from the Julia Bangerter-Rhyne Stiftung and the Novartis Foundation for Medical Biological Research project 13A13.
The authors declare that they have no competing interests.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.