Spatial analysis of malaria in Anhui province, China
- Wenyi Zhang†1,
- Liping Wang†2,
- Liqun Fang†1,
- Jiaqi Ma2,
- Youfu Xu1,
- Jiafu Jiang1,
- Fengming Hui1,
- Jianjun Wang3,
- Song Liang4,
- Hong Yang1 and
- Wuchun Cao1Email author
© Zhang et al; licensee BioMed Central Ltd. 2008
Received: 22 July 2008
Accepted: 10 October 2008
Published: 10 October 2008
Malaria has re-emerged in Anhui Province, China, and this province was the most seriously affected by malaria during 2005–2006. It is necessary to understand the spatial distribution of malaria cases and to identify highly endemic areas for future public health planning and resource allocation in Anhui Province.
The annual average incidence at the county level was calculated using malaria cases reported between 2000 and 2006 in Anhui Province. GIS-based spatial analyses were conducted to detect spatial distribution and clustering of malaria incidence at the county level.
The spatial distribution of malaria cases in Anhui Province from 2000 to 2006 was mapped at the county level to show crude incidence, excess hazard and spatial smoothed incidence. Spatial cluster analysis suggested 10 and 24 counties were at increased risk for malaria (P < 0.001) with the maximum spatial cluster sizes at < 50% and < 25% of the total population, respectively.
The application of GIS, together with spatial statistical techniques, provide a means to quantify explicit malaria risks and to further identify environmental factors responsible for the re-emerged malaria risks. Future public health planning and resource allocation in Anhui Province should be focused on the maximum spatial cluster region.
Malaria is one of the leading causes of morbidity and mortality in the world. Indeed, more than 2.4 billion people are exposed to the risk of malaria . Malaria kills between 1.1 and 2.7 million people each year [1, 2]. Malaria is one of major parasitic diseases with a wide distribution in China. The prevalence gradually decreases from south to north. Southern parts of 25°NL (Nanling Mountains) used to be the hyper- or meso-endemic regions, where falciparum malaria was widely present. Meso- and hypo-endemic areas were between 25–33°NL (from Nanling to Qinling Mountains and the Huai River), where vivax malaria was predominant, though falciparum malaria also existed and focal outbreaks often occurred. In the region north of 33°NL (north of Qinling Mountain and the Huai River), malaria was of low endemicity and Plasmodium vivax was the only species present; temporary epidemics were occasionally caused by imported falciparum malaria .
The provinces of Yunnan and Hainan are the areas where malaria has been the most endemic with high transmission of Plasmodium falciparum. Since 2000, a malaria resurgence has occurred in China. In addition to the southern mountainous area of Hainan province and the border area of Yunnan province, most re-emerged malaria occurred in central China, along the Huai River. Anhui Province is the most seriously affected area in China, with the highest number of malaria cases in 2006. The incidence of malaria shows high variability at the county level. A better understanding of the spatial distribution patterns of malaria would help to identify areas and population at high risk and may better prevent and control malaria in this province.
The use of GIS with spatial statistics, including spatial smoothing and cluster analysis, has been applied to other diseases, in which it is often used to analyse and more clearly characterize the spatial patterns [4–8]. Spatial smoothing is used to reduce random variation associated with small populations and enables observation of gradients or holes in disease incidence that may not be apparent from direct observation of the raw data. Spatial cluster analysis is conducted to identify whether cases of disease are geographically clustered [9–11].
In this study, GIS-based spatial analyses involving spatial smoothing and spatial clustering analysis were conducted to characterize geographic distribution patterns of malaria cases. Spatial analysis was used to identify the distribution pattern of malaria and population at high risk at the county level. The technique corrects for multiple comparisons, adjusts for the heterogeneous population densities among the different areas, detects foci without prior specification of suspected location or size, and thereby overcomes pre-selection bias and allows for adjustment of confounders [11, 12].
Materials and methods
Data collection and management
The study site is Anhui Province (114.85° ~119.69°E, 29.38° ~34.74°N), located in the area between the Changjiang River and Huai River, the third largest river in China. The maximum distance from east to west is about 430 km and from north to south is about 586 km. The area has a population of 58,358,232 (the fifth national census in 2000) and encompasses 139,600 square kilometers.
Records on malaria cases between 2000 and 2006 were obtained from the National Notifiable Disease Surveillance System (NNDSS). For conducting a GIS-based analysis on the spatial distribution of malaria, the county-level polygon map at 1:1,000,000 scale was obtained, on which the county-level point layer containing information regarding latitudes and longitudes of central points of each county was created. Demographic information based on the 2000 census was integrated in terms of the administrative code. All malaria cases were geo-coded and matched to the county-level layers of polygon and point by administrative code using the software ArcGIS 9.1 (ESRI Inc., Redlands, CA, USA).
GIS mapping and smoothing
To alleviate variations of incidence in small populations and areas, annualized average incidences of malaria per 100,000 at each administrative region over the seven year period were calculated, and spatial rate smoothing was implemented. Based on annualized average incidence, all counties were grouped into four categories: non-endemic area, low endemic area with annualized average incidence between 0 and 5 per 100,000, medium endemic area with the incidence between 5 and 30 per 100,000, and high endemic area with the incidence over 30 per 100,000. The four types of counties were colour-coded on maps.
To assess the risk of malaria in each county, an excess hazard map was produced. The excess hazard represents the ratio of the observed incidence at each county over the average incidence of all endemic areas, the later was calculated by the number of cases over the total number of people at risk instead of the annualized incidence of a county .
The technique of incorporating data from surrounding areas in an image or map to define a new data value for the area of interest is called spatial filtering. Spatial filtering can involve smoothing or sharpening the data of interest. The spatial smoothing was performed to reduce random noise in the data that comes from the high variance characteristic of small populations or small case numbers . The smoothed incidence was computed from the total number of cases in a spatial 'window' divided by the total number of people at risk within the 'window', which was specified using a spatial weights file including both county and its neighbor counties' locations. Each smoothed incidence was calculated once the 'window' core overlapped with a county center. So the first step in the analysis was to construct a spatial weights file that contained information on 'neighborhood' structure of each county. The k-nearest neighbour criterion ensured each observed object had exactly the same number (k) of neighbours. In the analysis, six neighbours were chosen for each county by k-nearest neighbour criterion. The second step was to load the weight file and carry out smoothing analysis .
Spatial cluster analysis was performed on the confirmed cases of malaria to test whether the cases were distributed randomly over space and, if not, to evaluate any identified spatial disease clusters for statistical significance. 'Spatial scan statistics' was used to test the null hypothesis that the relative risk (RR) of malaria was the same between any county groups, or collection of county groups, and the remaining county groups. Areas with differing sizes were scanned without knowledge on cluster size and location to avoid selection bias. SaTScan software version 6.0 , designed specifically to implement this test, imposed a circular window on the map. This window moved over the area and centered on the centroid of each county group. The area within the circular window varied in size from zero to a maximum radius, never including > 50% of the total population. The SaTScan software tested for possible clusters within the variable window around the centroid of each county group. Cluster analysis was performed with the default maximum spatial cluster size of < 50% of the population and again with a smaller maximum cluster size of < 25% to look for possible sub-clusters. For each window of varying position and size, the software tested the risk of malaria within and outside the window, with the null hypothesis of equal risk. This procedure compensated for the inherent bias in multiple testing .
Spatial distribution of malaria in Anhui Province
Spatial clustering of malaria in Anhui Province
SaTScan statistics for the most likely cluster, Anhui province, China, 2000–2006.
The 1st iteration1
Cluster radius (km)
The 2st iteration2
Cluster radius (km)
The investigation of infectious disease clustering is receiving renewed interest, not least because of advances in geographical information systems (GIS) and spatial statistics, which allow for the quantification of the degree of clustering of infections. Such approaches have been used to investigate the spatial clustering of dengue , sleeping sickness , human granulocytic ehrlichiosis , haemorrhagic fever with renal syndrome , but their application to malaria has been limited [19, 20]. An improved understanding of the spatial clustering of malaria on low-lying lands in China may provide useful insights into local epidemic control and resource allocation.
Using GIS and spatial statistics, the spatial distribution of confirmed malaria cases and increased risk regions in the highly endemic area were identified. The spatial statistics analyses clearly yielded a nonrandom distribution of malaria in the province. Spatial smoothing identified areas of increased risk located in the north of Huai River, the third largest river in China (Figure 3). Spatial cluster analysis identified a statistically significant cluster in the same area, in the north of Huai River, including the counties and cities of Suixi, Guoyang, Xiao, Mengcheng, Guzhen, Lingbi, Huaiyuan, Lixin, Suzhou and Huaibei, where Anopheles sinensis is the principal vector and the people have the habit of sleeping in the open in the summer . These areas were different from the previous foci in the centre part of the province during the past two decades, where Anopheles anthropophagus is the principal vector .
On the basis of this study, prevention strategies are recommended that focus on these high epidemic areas. In an area where malaria disease is highly endemic, targeting prevention strategies at areas of highest risk can potentially increase the programme's effectiveness. People at highest risk should be informed of the high risk and the possibilities for risk reduction. Funds spent on programmes might be better spent on areas where cost-effectiveness can be maximized.
The present study analysed the associations between human population and malaria cases only. Gathering and including vector population data (including species, population density, distribution and infection prevalence rates) and environmental variables in the risk analysis of malaria in these areas provide a more comprehensive view of the disease risk. Future research, to investigate the underlying causes of increased risk in the identified areas, will be analysis of landscape attributes and identification of the environmental variables characteristic of high-risk areas.
This study analysed the spatial distribution of malaria in Anhui Province, China, during 2000 to 2006 using the spatial smoothing and spatial scan statistic method. SaTScan identified a geographic area in northern Anhui Province as the most likely endemic cluster region. This has not been previously reported. The data show practical malaria control measures, as well as methods for future study of malaria and other vector-borne diseases. Further, GIS and GIS-based spatial statistical techniques may provide an opportunity to clarify and quantify the epidemic situation of malaria within re-emerged epidemic areas, and lay a foundation to pursue future investigations into the environmental factors responsible for the increased disease risk. To implement specific and geographically appropriate risk-reduction programmes, the use of such spatial analysis tools should become an integral component in epidemiology research and risk assessment of malaria.
This study was supported by the National Science Fund for Distinguished Young Scholars (30725032), Natural Science Foundation of China (30590374 and 30700682) and Natural Science Foundation of Beijing (7061005).
- World Health Organization Expert Committee on Malaria: 20th Report. World Health Organ Tech Rep. 2000, 735-Google Scholar
- Breman JG, Alilio MS, Mills A: Conquering the intolerable burden of malaria: what's new, what's needed: a summary. Am J Trop Med Hyg. 2004, 71 (2 Suppl): 1-15.PubMedGoogle Scholar
- Tang LH: Progress in malaria control in China. Chin Med J. 2000, 113: 89-92.PubMedGoogle Scholar
- Frank C, Fix AD, Peña CA, Strickland GT: Mapping Lyme disease for diagnostic and preventive decisions, Maryland. Emerg Infect Dis. 2002, 8: 427-429.PubMed CentralView ArticlePubMedGoogle Scholar
- Odoi A, Martin SW, Michel P, Middleton D, Holt J, Wilson J: Investigation of clusters of giardiasis using GIS and a spatial scan statistic. Int J of Health Geog. 2004, 3: 11-21. 10.1186/1476-072X-3-11.View ArticleGoogle Scholar
- Nkhoma ET, Ed Hsu C, Hunt VI, Harris AM: Detecting spatiotemporal clusters of accidental poisoning mortality among Texas counties, U.S., 1980–2001. Int J Health Geog. 2004, 3: 25-37. 10.1186/1476-072X-3-25.View ArticleGoogle Scholar
- Wu J, Wang J, Meng B, Chen G, Pang L, Song X, Zhang K, Zhang T, Zheng X: Exploratory spatial data analysis for the identification of risk factors to birth defects. BMC Public Health. 2004, 4: 23-10.1186/1471-2458-4-23.PubMed CentralView ArticlePubMedGoogle Scholar
- Zeman P: Objective assessment of risk maps of tick-borne encephalitis and Lyme borreliosis based on spatial patterns of located cases. Int J Epidemiol. 1997, 26: 1121-1129. 10.1093/ije/26.5.1121.View ArticlePubMedGoogle Scholar
- Kulldorff M, Feuer EJ, Miller BA, Freedman LS: Breast cancer clusters in the northeast United States: a geographic analysis. Am J Epidemiol. 1997, 146: 161-170.View ArticlePubMedGoogle Scholar
- Kulldorff M, Nagarwalla N: Spatial disease clusters: detection and inference. Stat Med. 1995, 14: 799-810. 10.1002/sim.4780140809.View ArticlePubMedGoogle Scholar
- Kulldorff M: A spatial scan statistic. Communications in Statistics. Theory and Methods. 1997, 26: 1481-1496. 10.1080/03610929708831995.View ArticleGoogle Scholar
- Chaput EK, Meek JI, Heimer R: Spatial analysis of human granulocytic ehrlichiosis near Lyme, Connecticut. Emerg Inf Dis. 2002, 8: 943-948.View ArticleGoogle Scholar
- Anselin L, Syabri I, Kho Y: GeoDa: An Introduction to Spatial Data Analysis. 2005, [http://www.csiss.org]Google Scholar
- Rushton R, Lolonis P: Exploratory spatial analysis of birth defect rates in an urban population. Stat Med. 1996, 15: 717-726. 10.1002/(SICI)1097-0258(19960415)15:7/9<717::AID-SIM243>3.0.CO;2-0.View ArticlePubMedGoogle Scholar
- Kulldorff M, Information Management Services, Inc: SaTScan™ v6.0: Software for the spatial and space-time scan statistics. 2005, [http://www.satscan.org]Google Scholar
- Morrison AC, Getis A, Santiago M, Rigau Perez JG, Reiter P: Exploratory space-time analysis of reported dengue cases during an outbreak in Florida, Puerto Rico 1991–1992. Am J Trop Med Hyg. 1998, 58: 287-298.PubMedGoogle Scholar
- Fèvre EM, Coleman PG, Odiit M, Magona JW, Welburn SC, Woolhouse MEJ: The origins of a new Trypanosoma brucei rhodesiense sleeping sickness outbreak in eastern Uganda. The Lancet. 2001, 358: 625-628. 10.1016/S0140-6736(01)05778-6.View ArticleGoogle Scholar
- Fang L, Yan L, Liang S, de Vlas SJ, Feng D, Han X, Zhao W, Xu B, Bian L, Yang H, Gong P, Richardus JH, Cao W: Spatial analysis of hemorrhagic fever with renal syndrome in China. BMC Infect Dis. 2006, 6: 77-10.1186/1471-2334-6-77.PubMed CentralView ArticlePubMedGoogle Scholar
- Chadee DD, Kitron U: Spatial and temporal patterns of imported malaria cases and local transmission in Trinidad. Am J Trop Med Hyg. 1999, 61: 513-517.PubMedGoogle Scholar
- Schellenberg J, Newell JN, Snow RW, Mung'ala V, Marsh K, Smith PG, Hayes RJ: An analysis of the geographical distribution of severe malaria in children in Kilifi District, Kenya. Int J of Epidemiol. 1998, 27: 323-329. 10.1093/ije/27.2.323.View ArticleGoogle Scholar
- Xu FN, Jia SC, Shen YZ: Malaria situation in Anhui Province in 2001. Anhui Prev Med. 2002, 8: 321-322.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.