Comparing prioritization strategies for delivering indoor residual spray (IRS) implementation, using a network approach

Background Indoor residual spraying (IRS) is an effective method to control malaria-transmitting Anopheles mosquitoes and often complements insecticide-treated mosquito nets, the predominant malaria vector control intervention. With insufficient funds to cover every household, malaria control programs must balance the malaria risk to a particular human community against the financial cost of spraying that community. This study creates a framework for modelling the distance to households for targeting IRS implementation, and applies it to potential risk prioritization strategies in four provinces (Luapula, Muchinga, Eastern, and Northern) in Zambia. Methods Optimal network models were used to assess the travel distance of routes between operations bases and human communities identified through remote sensing. Network travel distances were compared to Euclidean distances, to demonstrate the importance of accounting for road routes. The distance to reaching communities for different risk prioritization strategies were then compared assuming sufficient funds to spray 50% of households, using four underlying malarial risk maps: (a) predicted Plasmodium falciparum parasite rate in 2–10 years olds (PfPR), or (b) predicted probability of the presence of each of three main malaria transmitting anopheline vectors (Anopheles arabiensis, Anopheles funestus, Anopheles gambiae). Results The estimated one-way network route distance to reach communities to deliver IRS ranged from 0.05 to 115.69 km. Euclidean distance over and under-estimated these routes by − 101.21 to 41.79 km per trip, as compared to the network route method. There was little overlap between risk map prioritization strategies, both at a district-by-district scale, and across all four provinces. At both scales, agreement for inclusion or exclusion from IRS across all four prioritization strategies occurred in less than 10% of houses. The distances to reaching prioritized communities were either lower, or not statistically different from non-prioritized communities, at both scales of strategy. Conclusion Variation in distance to targeted communities differed depending on risk prioritization strategy used, and higher risk prioritization did not necessarily translate into greater distances in reaching a human community. These findings from Zambia suggest that areas with higher malaria burden may not necessarily be more remote than areas with lower malaria burden.

In contrast to ITNs, which in 2011 cost an estimated $2.20 per year of protection delivered, IRS was much more expensive [5]. Using the chemicals DDT, pyrethroids, deltamethrin, and lambdacyhalothrin, the cost of IRS was $6.70 per year of protection per household, with the cost of insecticide ranging from 29 to 81% of the total cost and minimal economies of scale [5]. With second generation ITNs and the drive for lower insecticides for IRS these prices have changed somewhat over the past 10 years. Pyrethroids have decreased in cost substantially over the past 25 years [6], but widespread insecticide resistance threatens the long-term viability of using pyrethroids for malaria control [7][8][9]. Currently no alternative insecticides for ITNs are available for use at scale, although novel chemicals and dual-chemical ITNs are in various stages of development. Alternative insecticides for IRS are available but come at a greater cost than those of DDT, pyrethroids, deltamethrin, and lambdacyhalothrin. Indeed, IRS programmes funded by the organizations such as the US President's Malaria Initiative (PMI) have seen reductions in coverage due to the increasing cost of insecticide [10].
Often countries are faced with the challenge of insufficient funds to cover every household in malaria endemic areas, and as such are forced to determine which houses receive the intervention. Zambia's National Malaria Elimination Centre (NMEC) has encountered this challenge; in at least some areas health facility malaria incidence is used to prioritize areas to receive IRS [11]. Recent work suggests that using vector probability maps has greater impact on malaria transmission than human case data [12]. Other approaches have also been used to prioritize which areas receive IRS, but whichever methodology is used, malaria programmes must balance the malaria risk of a particular human community with the financial cost of spraying a particular human community.
The use of network modelling for optimizing the distribution of goods along road networks is commonly used for market analyses [13,14], and has been applied in the health arena to least-cost routing for hospital access [15][16][17] and delivery of vaccines [18,19]. Several authors have noted that a primary limitation to application of network modelling in the developing world is the availability of accurately mapped road networks [20,21]. This article creates a framework for modelling the cost of IRS implementation and applies that framework to potential intervention prioritization strategies in Zambia.

Study area
Zambia lies in southern Africa and has a range of malaria transmission intensity, from pre-elimination status in Southern and Lusaka provinces, to intense malaria transmission in Luapula Province. The modern history of indoor residual spray (IRS) in Zambia began in 2003 when the Government of the Republic of Zambia (GRZ) began spraying to complement the private sector's IRS campaigns. Zambia's approach is to support as many districts as possible with IRS, and resources often do not allow for complete coverage of districts. This study focuses on four provinces in the eastern part of the country: Luapula, Northern, Muchinga, and Eastern provinces ( Fig. 1). Malaria indicator surveys estimate Plasmodium falciparum parasite prevalence rate to be > 25% and household ownership of at least one ITN > 50% in these areas at the time of the study [22].

Community data
Technicians in Lusaka, Zambia digitized structures in Eastern, Luapula, Muchinga, and Northern province visible in publicly available satellite imagery as part of the planning process of IRS campaigns in 2015 and 2016, as has been described elsewhere [23]. Digitized structures were spatially aggregated into communities based upon distance between structures (< 50 m), and communities with fewer than 25 houses were deemed too small for IRS [11], and not included in the modelling presented here. In this study, IRS operation bases were assumed to be located at city/village centres, taken as point locations from Google Map Maker (see next section for details), totaling 236 city centre locations across the four provinces in this analysis.

Road network data
Road network data were made available to this analysis via Google Map Maker. While Google Map Maker was an openly collaborative resource during the course of this study, it was retired in March, 2017 [24,25], and approved contributions merged with Google Maps. Google Map Maker data were chosen for this study due to more complete road network mapping than Open-StreetMap (OSM) [26], or any other digital road map resource available at the time. For the four provinces, the mapmaker (MM) road file used in this study contained a total of 13,762,691.86 m of roads, in 21,082 polyline segments (average segment length 653 m). While MM road attributes describe 'primary' and 'local' roads separately, the absence of descriptors beyond identification meant that all roads in the network were treated equally.

Network analysis
For each province, the road network was converted to a network database in ArcGIS 10.1's Network Analysis toolbox, and impedance was set in metres. The Closest Facility analysis in ArcGIS is based on Dijkstra's algorithm [27] to find the shortest path between two specified nodes on a network, thus providing optimal routing. This algorithm calculates the distance between all nodes on a network, along all possible routes, and then for each node pair, reports the minimum of those distances. In this analysis, the shortest route between each community centroid and its nearest city centre was returned.
The centroid for each digitized community served as a destination node on the network. As many of these were not located directly on the road network, a tolerance of inclusion in the network of 5000 m was set, both for the city centre locations and for the community centroid locations. Since the communities were not all within 5000 m of the available road network, the network analysis was conducted on a subset of actual communities and city centres, as described in Additional file 1: Table S1.
In addition to the network modelled route distance, because a tolerance of 5,000 m was set, the Euclidean distance from each community centroid to the nearest point on the nearest road in the network was added, and the distance along the optimal route plus distance to the nearest road was calculated, to estimate the total distance to the community centroid.
To examine the impact of modelling distribution routes as optimal network routes, rather than simple Euclidean distance routes between communities and their nearest city centres, the simple Euclidean distances were calculated, and compared to the network distances.

Establishing spatial malarial risk prioritization strategies
The distance framework was applied to two separate prioritization strategies based on underlying risk maps, namely: Plasmodium falciparum prevalence rate among kids aged 2-10 years old (PfPR 2-10 ) from the Malaria Atlas Project (MAP), estimated for 2010 [28]; and MAP vector suitability for Anopheles gambiae, Anopheles funestus, and Anopheles arabiensis [29]. MAP estimates of both PfPR 2-10 and vector suitability estimates are available from the malariaAtlas package in R [30]. Mean estimates were extracted from raster files to community polygons, and aggregated to average values, using the Raster package [31,32] in R version 3.3.2 [33].

Analysis
Two separate modelling strategies were employed, the "within district prioritization" and the "across district prioritization". The 50% prioritization approach was based on an assumption of budget limitations, not a coverage target for a programme. For the first strategy, half of all households within each district of the four provinces were targeted for IRS, with prioritization of communities based on one of the underlying risk maps. Second, half of all households across the four provinces were targeted for IRS with the same risk map prioritization. Since IRS is administered by community, but this study aimed to cover 50% of all households, prioritization was conducted by ranking communities in order of risk (high to low), and summing household numbers (counts within communities) from the top-ranked down, until 50% was exhausted. t-tests were used to determine differences by prioritization strategy and Kappa test scores were used to estimate the agreement between prioritization strategies. To generate a measure of overall agreement between the different prioritization strategies the arithmetic mean between pair statistics was taken [34]. Stata version 13.1 was used for these analyses.

Mapping prioritized targets to a road network
The road network data used in this study represents only around a third of visible roads and tracks on the ground (Ryan, unpublished) and, therefore, should be considered a primary road network, rather than a full road network. Among the 18,448 communities created in the enumeration and digitization process, 11,146 (60.4%) were captured to the incomplete road network. Of these communities, 3198 (28.7%) had at least 25 houses, which would put them in consideration for IRS, and were included in further analysis. There was minimal difference in risk estimates between communities included and excluded from this study, based on PfPR, An. arabiensis, and An. funestus, and included communities had somewhat higher estimates of risk based on An. gambiae (Table 1).

Road network versus Euclidean routing
The differences between Euclidean ("straight-line") and optimal road network route distances for each province are shown as histograms in Fig. 2a-d. These show that both over and underestimates arise under a simple Euclidean distance modelled view, generating an overall range of a maximum of 41.79 km (overestimate) and a minimum of − 101.21 (underestimate) of the distance of a one-way route to deliver spray to a community.

Within district prioritization
In the first strategy examined, district-by-district community ranked prioritization, allocated IRS to the top 50% risk households based upon use of the MAP risk approaches. As seen in Table 2, few communities were excluded from any prioritization strategy (6.9%) and few were included within all prioritization strategies (7.4%), indicating low levels of agreement among MAP risk strategies. In pairwise strategy comparisons, agreement was statistically better than chance between PfPR and An. funestus risk, but no agreement between other prioritization strategies was observed (Additional file 1: Table S2). The overall Kappa statistic of agreement between the different prioritization strategies was − 0.039.
The distance to communities from a city centre to deliver IRS ranged from 0 to 115 km, with 75% of communities located within 25 km of a city centre. Differences in distances to reach communities were higher in prioritized communities compared to non-prioritized communities for PfPR and An. funestus risk, and distances were lower in prioritized communities when based on An. gambiae risk (Table 3).

Across-province prioritization
When considering deploying IRS across the entire fourprovince study area, using the same MAP risk prioritization approach, large variations are seen in the number of communities included by strategy, within province (Table 4), leading to high district-level variation in the allocation of IRS by strategy (Fig. 3).
Similarly to the district-by-district approach, there was large disagreement in communities covered between the different risk prioritization strategies. As seen in Additional file 1: Table S3, very few communities were excluded from any prioritization strategy (3.1%) and even fewer were included in all the prioritization strategies (2.6%). Comparing pairwise strategy sets, agreement was statistically better than chance between PfPR and An. funestus, PfPR and An. gambiae, and An. arabiensis and An. funestus. The overall Kappa statistic of agreement between the different prioritization strategies was − 0.103 (Additional file 1: Table S4).
The optimal route distance to communities ranged from 53 m to 116 km, with 75% of communities 25 km or less from a city centre. Distance to communities were no different between prioritized and non-prioritized communities for An. gambiae and PfPR prioritization, while prioritized communities were statistically nearer to city  . 2 The range of differences in distance, for the four provinces (a-d) between Euclidean distance and optimal network route distance from city centres to nearest target community  centres based on An. arabiensis and An. funestus risk ( Table 5).

Discussion
In this paper, models of allocating IRS were examined, using a combination of optimal network distributions based on road network routing, and spatial prioritization using risk maps estimating under 5 PfPR, and suitability for the three major Anopheline vectors implicated in malaria transmission in Zambia. Differences were observed in the cost to prioritize communities with higher estimates of PfPR and Anopheles species' vector capacity, whether strategizing IRS application district-by-district, or across the four provinces in the study. In many cases, reaching communities prioritized by risk strategies did not differ significantly in cost from reaching non-prioritized communities, and were in some cases cheaper. Prioritizing communities for intervention at the provincial level rather than equally allocating coverage across districts led to variation in the proportion of houses receiving IRS at the district level, as shown in Fig. 3. The findings of this study suggest areas with higher malaria burdens may not necessarily be more remote than with lower malaria burdens. Additionally, a complete lack of agreement in IRS allocation between PfPR and vector prioritization strategies presents a challenge to malaria programmes, requiring programmes to pick which measure of risk is most appropriate given their context. Risk maps are not often used when planning malaria interventions [35], and there is little literature to suggest which prioritization strategy has the most impact on reducing malaria. Further, these analyses utilized global risk maps of malarial risk indicators rather than local risk maps. The use of global risk maps can be considered both a strength and a limitation, with data availability being one of the primary strengths. Two of the most recent malaria risk maps in Zambia were subnational, and even sub-provincial, and so would not be useful for national IRS campaigns [36,37]. These risk maps need to be validated on a larger scale before they can be useful for malaria control programs. It remains to be seen whether localized, more specific risk maps have better agreement in prioritizing communities to receive limited IRS resources.
It has been noted by several authors that availability of accurately mapped road networks greatly limits the application of optimizing network routing models in the developing world [20,21]. This study demonstrated the differences in estimating route distances between using simple Euclidian distance mapping, and network routing, suggesting this is an important gap to fill for effective planning for distribution programs of many kinds. In this comparison between these two distance estimation methods, it was shown that while the overestimates given by Euclidian distance range a substantial amount, in this particular case, overestimation errs in a conservative direction for operational planning. However, the underestimates, if part of multiple round trips, could rapidly amount to large unanticipated costs for spray operations.
In this study, the best available data were used, but results must be interpreted with the data limitations in mind. Forty percent of the communities enumerated were not captured by the network analysis because they were not within 5000 m of the road network and were, therefore, removed from further analyses. It is likely that the road network data is incomplete, rather than that these communities are indeed more remote than their counterparts nearer to the available road data. Indeed, there was no difference in malaria risk between captured and non-captured communities by any of the four measures used herein. Investment into geospatial data such as road networks would improve predictive modelling and precision public health delivery [38] of interventions such as IRS.
Further, while this study developed a robust model, there is always notable variability in IRS operations; for example, team size and number and location of operational bases may differ across a country context, and even within a province or district. It may be that the degree of this variability differs in harder to reach areas. While this model does not fully capture that variability, it offers a basis of comparison across the prioritization strategies and creates a framework for adding such complexity in the future. A caveat to note is that these are findings from one country, Zambia, and that while the methods may prove practical to assess IRS distribution in other countries, the results may differ due to differing population density and distribution, as well as transmission patterns.