- Research
- Open access
- Published:
Parasite genetic diversity reflects continued residual malaria transmission in Vhembe District, a hotspot in the Limpopo Province of South Africa
Malaria Journal volumeĀ 20, ArticleĀ number:Ā 96 (2021)
Abstract
Background
South Africa aims to eliminate malaria transmission by 2023. However, despite sustained vector control efforts and case management interventions, the Vhembe District remains a malaria transmission hotspot. To better understand Plasmodium falciparum transmission dynamics in the area, this study characterized the genetic diversity of parasites circulating within the Vhembe District.
Methods
A total of 1153 falciparum-positive rapid diagnostic tests (RDTs) were randomly collected from seven clinics within the district, over three consecutive years (2016, 2017 and 2018) during the wet and dry malaria transmission seasons. Using 26 neutral microsatellite markers, differences in genetic diversity were described using a multiparameter scale of multiplicity of infection (MOI), inbreeding metric (Fws), number of unique alleles (A), expected heterozygosity (He), multilocus linkage disequilibrium (LD) and genetic differentiation, and were associated with temporal and geospatial variances.
Results
A total of 747 (65%) samples were successfully genotyped. Moderate to high genetic diversity (mean Heā=ā0.74āĀ±ā0.03) was observed in the parasite population. This was ascribed to high allelic richness (mean Aā=ā12.2āĀ±ā1.2). The majority of samples (99%) had unique multi-locus genotypes, indicating high genetic diversity in the sample set. Complex infections were observed in 66% of samples (mean MOIā=ā2.13āĀ±ā0.04), with 33% of infections showing high within-host diversity as described by the Fws metric. Low, but significant LD (standardised index of association, ISAā=ā0.08, Pā<ā0.001) was observed that indicates recombination of distinct clones. Limited impact of temporal (FST range āā0.00005 to 0.0003) and spatial (FST = āā0.028 to 0.023) variation on genetic diversity existed during the sampling timeframe and study sites respectively.
Conclusions
Consistent with the Vhembe Districtās classification as a āhighā transmission setting within South Africa, P. falciparum diversity in the area was moderate to high and complex. This study showed that genetic diversity within the parasite population reflects the continued residual transmission observed in the Vhembe District. This data can be used as a reference point for the assessment of the effectiveness of on-going interventions over time, the identification of imported cases and/or outbreaks, as well as monitoring for the potential spread of anti-malarial drug resistance.
Background
Malaria remains a global health problem, with about 228Ā million cases reported worldwide in 2018, 93% of which occurred in the World Health Organization (WHO) African Region. As the southern Africa region, excluding high-transmission countries like Mozambique, accounted for <ā10% of the 213Ā million cases reported in the WHO African region, several southern African countries have been earmarked for malaria elimination by 2023 guided by the WHO Global Technical Strategy for Malaria [1]. Unfortunately, like a number of other regions in the world, southern Africa experienced a resurgence in malaria cases and deaths during the 2017/2018 season [2]. This resulted in South Africa reporting more than 30 000 cases, a surge in numbers previously only experienced during the 1999/2000 drug and insecticide resistance outbreak [3,4,5].
Plasmodium falciparum is the predominant species which accounts for the majority of cases and fatalities in the South Africa [5], with Anopheles funestus and the Anopheles gambiae complex the main vector species associated with transmission, which mainly occurs in the hot and rainy season between September and May [5]. While South Africa has made significant progress in the reduction in malaria cases since the 1999/2000 outbreak through the implementation of sustained vector control and case management interventions, progress has stalled [3,4,5,6,7,8,9]. The country is characterized with heterogeneous transmission settings in the three remaining endemic provinces: Kwa-Zulu Natal (KZN), Mpumalanga and Limpopo [4, 5]. The Vhembe District in the Limpopo Province with the greatest burden of disease is classified as a moderate transmission area with 3.79 local cases/1000 population at risk in 2018, compared to very few locally transmitted cases (<ā0.1 cases/1000 population at risk) reported for the KwaZulu-Natal (KZN) Province [5]. The Vhembe District is situated in the north-eastern border region of the country bordered by Mozambique to the southeast, Zimbabwe to the north and Botswana to the northwest (Fig.Ā 1). This region experiences sustained, seasonal malaria transmission and accounts for 60% of the countryās burden [4]. While importation of cases, mostly from Mozambique and Zimbabwe, has been implicated in on-going transmission in the Vhembe District, the majority (63%) of the cases collected from 1998 to 2017Ā in the district were classified as from local transmission based on travel histories [4, 10]. Other causative factors for the persistent residual transmission observed in the Vhembe District, despite sustained vector control strategies and public health interventions, include antimalarial drug resistance, insecticide resistance and vector species variance between An. gambiae and An. funestus [7, 11,12,13,14,15].
The addition of malaria parasite population genetic data to the standard surveillance data collected by malaria control programmes has assisted in understanding malaria transmission dynamics and also allowed for spatio-temporal inferences to be made from e.g. local vs. imported malaria cases [16,17,18,19,20,21,22,23,24]. These data reflect that typically, parasite genetic complexity decreases with a decline in malaria transmission, which holds true for low transmission settings such as Southeast Asia and Latin America compared to high transmission settings of sub-Saharan Africa based on incidence data. However, the limited data available from microsatellite genotyping of parasites from low transmission settings in sub-Saharan Africa (from the KZN Province in South Africa, Eswatini, and Namibia [9, 25, 26]), using the same approach as in the current study, suggests the opposite. In these study sites, parasites were highly complex and diverse, and while in some of these settings this relatively high diversity was attributed to frequent importations from neighbouring high transmission settings, in other settings this diversity was characterized by local transmissions suggesting that something different is happening to the parasites in the region.
To better understand the malaria parasite-associated factors that contribute to residual malaria transmission in the Vhembe District, microsatellite genotyping was applied to describe the population of P. falciparum parasites in South Africa at different spatial and temporal scales. This work identifies the contributing factors associated with sustained residual transmission in the Vhembe District in South Africa as the country works towards malaria elimination.
Methods
Study site and study design
Study samples were randomly collected from seven primary health clinics within five known source health districts (Nzhelele/Tshipise, Thohoyandou, Mutale, Elim, and Levubu/Shingwedzi) within the Vhembe District, Limpopo Province, South Africa (Fig.Ā 1). Out of a total of 1892 confirmed cases that occurred during the entire study period and reported only at the 7 study clinics, a total of 1153 (61%) RDT samples from symptomatic patients that tested positive for malaria on P. falciparum-specific RDTs (First ResponseĀ® Malaria Antigen P. falciparum card test HRP2, Premier Medical Corporation, India) were randomly collected from the seven different clinics in the wet and dry seasons from January to December of the years 2016, 2017 and 2018. Samples were transported and stored at room temperature in sealed bags with desiccant. Demographic and travel history information on each of the individual patients was obtained from the LMC database. After linking genotyped data with patient information from the LCM database, samples were stratified based on the source of infection health district (Fig.Ā 1) as it was established that patients that acquired their infections from similar source areas did not necessarily visit the same clinics for testing and treatment. Therefore, it was concluded that source health districts would be more informative for geospatial analysis than clinics. 45% (45%) of the samples however, could not be linked to source health districts and were therefore classified as āunknownā. Source of infection could not be established not because information was not available on the LCM database, but because the genotyped samples could not be linked back to patient identity as the only unique identifiers that could link the patient RDT to their information on the database were either missing or illegible on the RDT. The study was started off with blinded patient identities with only knowledge of year of infection and clinic visited by patients from where samples were then collected.
DNA isolation and quantification
DNA was extracted from the nitrocellulose strip of First Responseā¢ RDTs in accordance with the Worldwide Antimalarial Resistance Network Guidelines [27]. The Saponin-Chelex method [28] was used for all DNA extractions and extracted genomic DNA stored at ā 20 Ā°C. Isolates were screened for P. falciparum using ultra-sensitive varATS [29] and TARE-2 [29, 30] quantitative PCR (qPCR) as described before. Parasite density was quantified on a random subset of 353 high volume samples from all years. It was not possible to quantify all samples due to a low DNA volume for a proportion of samples. The genotyping threshold was set at a parasite concentration of ā„ā10 parasites/ĀµL of blood.
Microsatellite genotyping
A total of 1153 samples were genotyped using a previously described 26 microsatellite marker panel protocol [25, 26, 31]. Briefly, the 26 microsatellite loci were amplified using a semi-nested PCR protocol. The primary PCR was performed in 4 groups of multiplex reactions, and 1 ĀµL of the primary amplified product used as a template for the secondary individual PCR for each marker. To determine repeat length sizes, the labelled PCR products were diluted and sized by denaturing capillary electrophoresis on an ABI 3730XL analyser using GeneScanā¢ 400HD ROXā¢ Size Standard (Thermo Fisher Scientific). MicroSPAT software (https://github.com/Greenhouse-Lab/MicroSPAT/releases/tag/v2.0.3) was then used to automate identification of true alleles and differentiate real peaks from artefacts of the resulting electropherograms using a classifier algorithm based on the location and size of locus-specific patterns relative to a primary peak as done in studies conducted in Eswatini [25], Namibia [26], the KZN Province of South Africa [9] and China [31] that used similar experimental conditions as those used in his study. Multiple alleles per locus were scored if minor peaks were at least a third of the height of the major peak. Collated genotyping data from all samples was processed with similar microSPAT software settings in which a semi-supervised naĆÆve Bayes classifier was used [25] to avoid variability in allele calling. All samples that met the genotyping threshold were genotyped and had to have a genotyping coverageāā„ā60% (alleles detected on at least 15 or more loci) to be included in the downstream population genetics analysis.
Characterisation of within-host genetic diversity
The within-host diversity of infections was determined using multiplicity of infection (MOI, or genetically distinct P. falciparum clones) and the within-host fixation index (FWS). To reduce the probability false positive alleles influencing MOI, the MOI in an individual sample was defined as the second highest number of alleles detected at any of the 26 microsatellite loci genotyped. Since the P. falciparum parasite is haploid in the human host, multiple peaks or alleles correspond to an infection with multiple genotypes or strains (a polygenomic or polyclonal infection).
The other measure of the within-host diversity, the FWS index, is a measure of diversity in an individual infection relative to the population level genetic diversity and was determined as previously described [25, 26, 32]. A low FWS indicates high within-host diversity relative to the population thereby suggesting higher chances of outbreeding. FWS was calculated for each sample using the equation: \(Fws=1-Hw/Hs\), where Hw is the allele frequency of each unique allele found at a particular locus for each individual and Hs is the heterozygosity of the local parasite population. Outbreeding is reported as 1-FWS with a 0 value indicative of a perfect clone and therefore low within-host diversity. Previously described thresholds of Fwsāā„ā0.95 (1-Fwsāā¤ā0.05) were used to identify samples containing a single genotype (or āclonalā infections) in spite of additional genotypes that may be present at relatively low proportions; and Fwsāā¤ā0.70 (1-Fwsāā„ā0.30) to describe samples with highly diverse infections respectively [25, 32,33,34].
Characterization of population level genetic diversity
Population level genetic diversity was estimated using expected heterozygosity (He) which is defined as the probability of randomly drawing a pair of different alleles from the allele pool. Heterozygosity was, therefore, calculated on each locus using the equation: \(He=\left[\left(\frac{n}{n-1}\right)*\right(1-\sum {p}_{{i}^{2}}\left)\right]\), where n is the number of genotyped samples and pi is the frequency of the ith allele in the population [25, 26]. Values for He range from 0Ā indicating no diversity, to 1Ā indicating that all alleles are different and therefore there is maximum diversity. The mean He was calculated by taking the average He across all loci. The number of haplotypes (unique multilocus genotypes) as well as the unique alleles detected per locus (A - allelic richness), were also determined.
Linkage disequilibrium
To assess whether alleles from different loci were associated with each other, the multilocus linkage disequilibrium (LD) was determined as previously described [35] using the Poppr package in R software [36]. LD was determined for the whole dataset which includes monoclonal and polyclonal infections, with LD for monoclonal infections alone determined as a precaution against the bias that may result from presence of any false dominant haplotypes [16]. The Monte Carlo method was used to test the significance of LD in the complete data set of each population stratified by geographic origin of the infection (source health district). In this study, 10 000 permutations were completed. LD values range from 0 (no loci in LD) to 1 (all loci in LD). Pairwise standardized index of association (ISA) over all loci was assessed to determine whether the observed pattern of LD is due to a single or multiple pairs of loci.
Geospatial population substructure and genetic differentiation
To determine the influence of geographic origin of infections on genetic diversity, the ANOVA pairwise t-test was used to compare MOI, 1-FWS and He between the parasite populations stratified by source health district. Population substructure between the geographic areas was investigated by measuring Wrightās F-statistics (FST), using the adegenet package [37] in R. Hendrickās GST and Jostās D, were calculated using the mmod package [38] in R. Genetic differentiation between populations ranges from 0 to 1 representing absence of to complete differentiation, respectively. The Monte Carlo method was used to test the significance of pairwise FST between source health districts. In this study, 999 permutations were completed. Additionally, discriminant analysis of principal components (DAPC) using the adegenet package in R software was used to confirm signatures of population structure [37, 39]. DAPC infers population structure based on whether haplotypes (estimated from multi-locus genotypes generated from all major and minor allele data) clustered into distinct genetic populations. Unlike the traditional principal component analysis (PCA) which identifies linear axes that explain the most variability in all groups together, DAPC seeks to detect the linear axes which explain the most between-group variability in data [37]. K-means clustering was used to detect the number of inferred genetic clusters in the parasite population, and the best number of clusters chosen was that with the lowest associated Bayesian Information Criterion (BIC). To prevent overfitting of clusters, the optimal number of principal components (PC) to be retained was confirmed by cross validation of the DAPC. Data was divided into a training set (90% of data), and a validation set (10% of data), and members of each of the identified clusters were stratified by random sampling to ensure that at least one member of each cluster is represented in both training and validation sets. DAPC was then performed on the training set with variable numbers of PCs retained. The extent to which the analysis was able to accurately predict group memberships of individuals in the validation set was used to identify the optimal number of PCs to be retained. Sampling and DAPC procedures were repeated 1000 times at each level of PC retention, and the optimal number of PCs retained was associated with the lowest root mean square error. The resultant clusters were then plotted in a scatterplot of the first and second linear discriminants of DAPC.
Characterising pairwise genetic relatedness between infections
To determine the genetic connectivity/relatedness of pairs of infections including all alleles detected in both monoclonal and polyclonal infections of successfully genotyped samples, a modified identity by state (IBS) metric was used [26]. Overall, pairwise IBS was calculated as the average of locus specific estimates under the assumption of independent loci. This metric was calculated using the formula: \(IBS= \frac{1}{n}\sum _{i=1}^{n}\frac{Si}{XiYi}\) where n is the number of genotyped loci; Si is the total number of shared alleles at locus i between samples X; and YiXi is the number of alleles in sample X at locus I; and Yi is the number of alleles in sample Y at locus i. A total of 85,078 pairs of infection within the Vhembe District dataset were analysed and highly related infection pairs above the cut-off of IBSāā„ā0.5 [26] identified. Pairwise comparisons of relatedness of parasite pairs were then grouped into two categories depending on whether they occurred between two parasites isolated from individuals who acquired infections in the same (within) source health district or between two parasites from individuals who acquired infections from different (between) source health districts respectively. ANOVA pairwise t-test was used to compare differences in the proportions of highly related infections within and between the groups.
Influence of temporal variation on genetic diversity
To determine the influence of temporal changes in transmission on genetic diversity, the ANOVA pairwise t-test was used to compare MOI, 1-FWS and He between the parasite populations stratified by year of transmission. Finer scale stratification by month of infection was also assessed to determine how MOI changes in the dry and wet transmission seasons and pairwise t-tests were used to compare MOI between the different seasons.
Assessing the impact of control interventions on the complexity of infections
The impact of the main strategy for control, indoor residual spraying (IRS) on the within-host diversity of P. falciparum was also assessed based on the number of unique genotypes (MOI) identified in sprayed vs. unsprayed households. IRS in the Limpopo Province typically takes place at the beginning of each malaria season (wet season between September and May), with the number of households to be sprayed determined by the provincial malaria control programme based on factors such as the number of structures within the malaria endemic area, insecticide availability and resistance data [4].
Results
A total of 1153 P. falciparum RDT positive samples were collected from the Vhembe district, the highest proportion (56%) of which were collected in 2017 and, for those with known source health district data, the majority came from within the Mutale district (Fig.Ā 1). A slightly higher proportion (54 vs. 46%) of females compared to males was sampled during the study with a median age of 23 years, ranging between 19 and 65 years. The median parasite density of a subset of randomly selected quantified samples (nā=ā313) was 660 parasites/ĀµL of blood, which was above the genotyping threshold of ā„ā10 parasites/ĀµL of blood. All 1153 samples were therefore subjected to microsatellite genotyping.
Microsatellite genotyping indicates parasite complexity and diversity
Of the 1153 genotyped samples, 65% of the samples (747/1153) had sufficient coverage at a minimum of 15 of the 26 microsatellite loci evaluated and were included in the final sample set for population genetics analysis. Overall, a high proportion (66%) of polyclonal infections was observed with a mean MOIā=ā2.13Ā in the genotyped samples (Fig. 2a). This indicates a moderate complexity of infection within individual samples and thus moderate to high within-host diversity in the parasite population. Mean MOI did not differ significantly (Pā=ā0.73, ANOVA, nā=ā353 excluding unknowns) between male (mean MOIā=ā2.11) and female participants (mean MOIā=ā2.16). The different age groups also exhibited similar mean MOIs (from 2 to 2.17) which were not significantly different (Pā=ā0.94, ANOVA, nā=ā353 excluding unknowns).
A significant positive relationship (Pearsonās rā=ā0.85 [95% CI 0.83ā0.87], Pā<ā0.001, t-test, nā=ā747) was seen between outbreeding (1-FWS) and MOI, which suggests that both metrics support the presence of within-host diversity. Similarly to the one third of samples appearing monoclonal observed, only 40% of samples had 1-FWS < 0.05 (Fwsāā„ā0.95, indicating effectively clonal infections, Fig. 2b). Mean outbreeding (1-FWS) was low at 0.22, with only 33% of samples with a stringent 1-Fws value of ā„ā0.30 (Fwsāā¤ā0.70), suggesting that these were the most highly diverse infections (Fig.Ā 2b).
On a population level, the majority of the samples (99%, 742/747) had unique haplotypes thus indicating high levels of outcrossing and, therefore, high genotypic diversity in the parasite population. The high number of unique haplotypes implies underlying allelic richness which is indeed reflected in the high mean number of unique alleles (mean A of 12.2) with anywhere from 3 to 26 unique alleles detected across all 26 loci (Fig.Ā 2c). This was supported by a moderate to high mean He of 0.74 (Fig.Ā 2d) that indicates frequent recombination of different parasite clones. This suggests a larger P. falciparum population than that expected in a low transmission setting, but is consistent with the Vhembe District being the highest transmission setting in South Africa. Locus PfPK2 was the most diverse marker (Neiās genetic diversity of 0.91) and contributed to the high genotypic diversity whereas locus Ara2 had the most evenly distributed alleles (0.86) and, therefore, contributed to the most genotypic evenness. Overall, the sample set had a moderate to high genotypic richness, evenness and diversity.
Additionally, low LD (standardized index of association, ISAā=ā0.08) was observed between alleles of the P. falciparum haplotypes (TableĀ 1) and this was not due to a single locus. The observed ISA value fell outside of the re-sampled distribution expected under no linkage when compared to histograms showing results of 10 000 permutations. The Monte Carlo method was used to test the significance of LD in the complete data set and alleles of monoclonal infections (nā=ā253) were linked across loci with Pā=ā0.0001. For all infections including polyclonal infection, LD was also low at 0.138 but significant (Pā=ā0.0001, nā=ā747). This was emphasized by a small proportion (0.26%, 221/85,078) of pairwise infections in the sample set being highly related (IBSāā„ā0.5). The significantly low LD therefore indicates high recombination of distinct parasite clones which supports the moderate to high within-host diversity observed in Vhembe, and is consistent with the presence of some degree of local transmission.
Parasites are fragmented based on their level of within-host diversity
To further evaluate the genetic relatedness between the parasite genotypes, k-means clustering was employed based on individual multi-locus genotype discrimination. Eight genetic clusters were identified in the parasite population with the parasite populations in genetic clusters 1, 2, 4, 7 and 8 observed using DAPC (Fig.Ā 3a) separated by linear discriminant 1 (LD1) from parasites in clusters 5 and 6. Linear discriminant 2 (LD2) further separated clusters 2, 4 and 8 from clusters 1 and 7. The proportion of correct assignment of the haplotypes to each inferred cluster ranged between 0.95 and 1. The DAPC analysis could be performed with missing data in place with missing data which was randomly distributed in the data set basically replaced by the mean allele frequency in the case of multi-locus genotype (MLG) calculations. MLGs were generated from all major and minor allele data. Although all samples with any missing genotypes/data were included, loci PfPK2 and TA1 in which the majority of 83% and 90% of data was missing, respectively, were discarded in the DAPC analysis. Interestingly, clusters 5 and 6 contained the majority (98%, 249/253) of monoclonal (MOIā=ā1) infections (Fig.Ā 3b). This was consistent with the mixture of mostly clonal (1-Fwsā<ā0.05) and less diverse (medium, 1-Fwsā>ā0.05 but <ā0.30) infections in clusters 5 and 6 (Fig.Ā 3c). Clusters 1 and 2, and to a lesser extent cluster 7, appeared fragmented and contained only highly diverse infections (1-Fwsā>ā0.30, Fig.Ā 3c). This indicates that although the majority of the parasite population is indeed structured, the most highly diverse infections result in fragmentation in the population. The moderate genetic diversity and MOI, and relatively pronounced population structure are all indicative of constant transmission at overall relatively moderate levels.
No geospatial correlation exists to explain genetic diversity
Overall, the levels of within-host and population level diversity were not influenced by where infections were reported since the mean MOI (global ANOVA Pā=ā0.36, nā=ā747, Fig.Ā 4a); the level of outbreeding (global ANOVA, Pā=ā0.27, nā=ā747, Fig.Ā 4b) and heterozygosity (global ANOVA Pā=ā0.23, nā=ā747, Fig.Ā 4c) between the different source health districts did not significantly differ. Only differences in the MOI from Elim and Thohoyandou were observed compared to the overall mean MOI (pairwise t-test, nā=ā747, Pāā¤ā0.01 and Pāā¤ā0.05, respectively), which may be due to small sample size from these sites. The different source districts also did not contribute to the genetic clusters observed from the DAPC analysis, as infections from the different areas were represented/distributed in the eight different inferred genetic clusters, implying free gene flow and possible parasite mixing between these sites. This was confirmed by the very low FST values (Fig.Ā 4d), with only the Elim and Levubu/Shingwedzi districts sharing significant FST with two other districts each, implying some degree of differentiation but again associated with small sample size from these areas. A Hendrickās GST at ā 0.0394 and Jostās D at ā 0.0122 supports general parasite mixing for the complete Vhembe district. This lack of separation based on geospatial data agrees with patient demographic and travel history information, with 99% of the infections locally acquired of which 95% are within the Mutale health district. This also correlated to residential status, with 93% of the individuals residing in Mutale. Additionally, 1-IBS analysis showed that infections were genetically connected within and between source areas, with highly related infections significantly higher (Pā<ā0.0001, ANOVA, nā=ā221) within source areas (mean 1-IBSā=ā0.76āĀ±ā0.022) compared to between source areas (mean 1-IBSā=ā0.56āĀ±ā0.007) which confirms some level of local transmission.
Transmission dynamics was not temporally influenced
The level of genetic diversity did not differ between the years of transmission as demonstrated by global ANOVA P values (nā=ā747) for MOI (Fig.Ā 5a), outbreeding (Fig.Ā 5b) and heterozygosity (Fig.Ā 5c) of 0.90, 0.77 and 0.07, respectively, providing no evidence for differing transmission intensity during that period. No genetic differentiation (FST range āā0.00005 to 0.0003) between parasite populations from the different years was observed, which suggests that the malaria outbreak experienced in 2017 was not due to an introduction of a completely distinct/new parasite population to Vhembe.
Stratification of MOI distribution by transmission season (month of infection) (Fig.Ā 5d) showed that infections were persistently complex throughout the year. In spite of an approximately 15-fold reduction in the number of cases and notable decrease in rainfall levels between the wet (high transmission) and dry (low transmission) seasons (Fig.Ā 5d), mean MOI was also not significantly different (Pā=ā0.42, Bonferroni P adjustment method) between the two seasons (wet season mean MOIā=ā2.14āĀ±ā0.056, and mean MOIā=ā1.96āĀ±ā0.178Ā in the dry season). Infections were similarly complex (mean MOIāĀ±āSEā=ā2.15āĀ±ā0.075 and 2.14āĀ±ā0.101; Pā=ā1, Bonferroni P adjustment method) between infections from sprayed or unsprayed households suggesting the maintenance of complex infections in spite of vector control implementation during wet seasons, suggesting possible co-transmission of strains. These data may indicate that the mean MOI may be seasonally stable thus reflecting the impact of continued residual transmission on the complexity of infections in the Vhembe District. Out of the 220 patients who knew which insecticides were sprayed in their households, 53% (116/220), 44% (97/220) and 3% (7/220) reported use of DDT, FendonaĀ® and K-OthrineĀ®, respectively.
Discussion
The impact of malaria transmission intensity on P. falciparum parasite genetic diversity is only now being clarified in South Africa. By using multilocus genotyping, this is the first study that shows that continued residual malaria transmission in a malaria hotspot within South Africa is associated with parasite diversity.
The results generated in this study are a useful addition to the growing resource of P. falciparum genetic data in the southern African E8 region which will facilitate more detailed evaluations of cross-border parasite spread in future studies. The genetic data may also be useful to control programmes with decision making for malaria elimination as it can be used as a reference point for the assessment of the effectiveness of on-going interventions over time, selection and/or targeting of interventions. Furthermore, the genetic data may be used for the identification of imported cases and/or outbreaks, as well as monitoring for the potential spread of anti-malarial drug resistance and potentially revealing infection patterns.
Although South Africa is overall a low transmission setting, the Vhembe District is a transmission hotspot and here it is shown that the parasite population within this hotspot behaves genetically like those typically seen in high transmission settings (based on incidence data) such as Guinea, Mali and The Gambia [16, 40,41,42] - the P. falciparum parasite population in the Vhembe District is complex and diverse. This is typically associated with high levels of gene flow between areas of different transmission intensities that serve to compound allelic richness by introducing new alleles into the population, thereby increasing the level of heterozygosity in the parasite population [9, 25]. Although the Vhembe district is located along the border with Mozambique and Zimbabwe, very limited evidence exists for imported malaria cases and rather, the high level of heterozygosity observed in this study implies localised diversity in a transmission hotspot, where relatively higher transmission occurs compared to other areas in the country. This is supported by a marked level of gene flow and parasite mixing between parasite populations from the different source areas within the district, resulting in a low but significant LD, with frequent and random association between alleles and a panmictic parasite population [43]. Geospatial and temporal variances had little effect on within-host and population diversity in the Vhembe District suggesting that genetic diversity was stable over space and time. The fact that mean MOI remained seasonally stable may be a reflection of the contribution of complex infections to continued residual transmission in the Vhembe District.
A major limitation of this study was that a large number of samples were classified as āunknownā and could not be linked to their district of origin, thereby leaving the geospatial analyses to a certain extent underpowered in most districts. Additionally, the majority of the samples coming from the Mutale district may have been influenced by a biased sampling plan from the onset based on the fact that samples were collected from a few selected clinics in a known hotspot area based on case incidence data. It was, therefore, not possible to get genetic representation of parasites from the other source districts and those seeded through importation from neighbouring high transmission settings reported in the province if any and what their contribution to the local transmission dynamics is.
Hotspot areas in other low transmission settings, such as Malaysia have demonstrated opposite trends where parasite complexity and heterozygosity are low, and LD is high using both microsatellite markers and merozoite surface protein variants for genotyping [21, 44]. In Madagascar, however, high levels of genotypic diversity and a high proportion of polyclonal infections were associated with a transmission hotspot using single nucleotide polymorphism genotyping [45]. These differences may have been due to differences in technology platforms used for genotyping in comparison to this study or due to the challenge of non-standardisation of study designs in molecular epidemiology studies. Additionally, since ālowā in Africa is very different than ālowā in Asia (which is generally much lower based on for example entomological inoculation rates and parasite prevalence rates) it could have also been due to differences in underlying epidemiology of parasite transmission between Malaysia and Madagascar. In other low transmission areas of southern Africa [9, 25, 26], where a similar panel of microsatellite markers was used, the level of genetic diversity was as high as that in the Vhembe District. However, in Eswatini and in the KZN province of South Africa, the high levels of within-host and population diversity and lack of parasite population structure could be explained by high levels of importation from neighbouring high transmission areas. On the other hand, in north-eastern Namibia, while infections were genetically diverse, transmission was mostly as a result of locally acquired infections and fine-scale parasite fragmentation based on the geographic origin of parasites was observed [26]. Detectable genetic clusters of different genotypes mean if studies are strategically designed and patient parasite genetic and demographic data accurately linked in all remaining endemic areas in South Africa and neighbouring endemic countries, particularly at the border areas, parasites from different countries can easily be detected or traced back to their origins. This information can then be used to make better decisions on a national level and as a regional block on what interventions should be put in place, and concentrated in which areas. Therefore, regional comparison of parasite genotypes will be informative to understand parasite mixing in the context of malaria transmission.
Some clones in an infection may exist at lower proportions than others due to either competitive suppression by other genotypes or possibly host specific selection due to immunity or receptor polymorphisms. In Plasmodium chabaudi, minor clones in mixed/multiple infections may produce as many, or more, oocysts than they would have as a single clone infection [46], highlighting that competitive stress may increase transmission of certain clones. Multiple distinct parasite clones are implicated in high gametocyte production and emergence of highly virulent and drug resistant parasite strains due to intense within-host competition [47,48,49,50]. While the relationship between parasite density and MOI is complex, and may not be enough to explain possible within-host competition of parasite clones in individual infections in the Vhembe District, a detailed, longitudinal study of the contribution of specific clones to parasite transmissibility and virulence would therefore be important.
Conclusion
In this study, the impact of continued malaria transmission intensity on P. falciparum genetic diversity in the Vhembe District is demonstrated. The P. falciparum population is moderate to highly diverse and genetically complex, which is key and advantageous to the parasiteās evolution and survival. This data could be informative as a reference point in evaluating the efficacy of strategic control interventions over time, aimed at eliminating residual malaria transmission in malaria transmission āhotspotsā in South Africa. Furthermore, this data can be used to identify imported cases and/or outbreaks, as well as monitor for the potential spread of antimalarial drug resistance. Linking data on P. falciparum genetic diversity from Vhembe/South Africa to that from neighbouring sub-Saharan countries in the E8 regional initiative would need to be investigated as the transmission dynamics in the region is not fully understood.
Availability of data and materials
The datasets supporting the conclusions of this article are available from the corresponding authors on reasonable request.
Abbreviations
- DAPC:
-
Discriminant analysis of principal components
- IRS:
-
Indoor residual spraying
- KZN:
-
Kwa-Zulu Natal
- LD:
-
Linkage disequilibrium
- LMC:
-
Limpopo Malaria Case
- MOI:
-
Multiplicity of infection
- PCA:
-
Principal component analysis
- RDT:
-
Rapid diagnostic tests
References
WHO. World malaria report 2019. Geneva: World Health Organization; 2019.
WHO. Global technical strategy for malaria 2016ā2030. Geneva: World Health Organization; 2015.
Maharaj R, Raman J, Morris N, Moonasar D, Durrheim D, Seocharan I, et al. Epidemiology of malaria in South Africa: From control to elimination. S Afr Med J. 2013;103:779ā83.
Raman J, Morris N, Frean J, Brooke B, Blumberg L, Kruger P, et al. Reviewing South Africaās malaria elimination strategy (2012ā2018): progress, challenges and priorities. Malar J. 2016;15:438.
Malaria Elimination Strategic Plan for South. Africa 2019ā2023 [Internet]. South African Department of Health. 2019.
Moonasar D, Maharaj R, Kunene S, Candrinho B, Saute F, Ntshalintshali N, et al. Towards malaria elimination in the MOSASWA (Mozambique, South Africa and Swaziland) region. Malar J. 2016;15:419.
Moonasar D, Nuthulaganti T, Kruger PS, Mabuza A, Rasiswi ES, Benson FG, et al. Malaria control in South Africa 2000ā2010: beyond MDG6. Malar J. 2012;11:294.
Khosa E, Kuonza LR, Kruger P, Maimela E. Towards the elimination of malaria in South Africa: a review of surveillance data in Mutale Municipality, Limpopo Province, 2005 to 2010. Malar J. 2013;12:7.
Raman J, Gast L, Balawanth R, Tessema S, Brooke B, Maharaj R, et al. High levels of imported asymptomatic malaria but limited local transmission in KwaZulu-Natal, a South African malaria-endemic province nearing malaria elimination. Malar J. 2020;19:152.
Adeola A, Ncongwane K, Abiodun G, Makgoale T, Rautenbach H, Botai J, et al. Rainfall trends and malaria occurrences in Limpopo Province, South Africa. Int J Environ Res Public Health. 2019;16:5156.
Brooke B, Koekemoer I, Kruger P, Urbach J, Misiani E, Coetzee M. Malaria vector control in South Africa. S Afr Med J. 2013;103:784ā8.
Christian R, Dahan-Moss Y, Munhenga G, Lobb L, Erlank E, Dandalo L, et al. Malaria Vector Surveillance Report, South Africa, JanuaryāDecember, 2017. Johannesburg: National Institute for Communicable Diseases-Bulletin; 2016.
Kyalo D, Amratia P, Mundia CW, Mbogo CM, Coetzee M, Snow RW. A geo-coded inventory of anophelines in the Afrotropical Region south of the Sahara: 1898ā2016. Wellcome Open Res. 2017;2:57.
Samuel M, Qwabe B, Dlamini D, Mabaso N, Manyawo Z, Zhikali J, et al. Malaria Vector Surveillance Report, South Africa, JanuaryāDecember 2018. Johannesburg: National Institute for Communicable Diseases-Bulletin; 2018.
Ukpe IS, Moonasar D, Raman J, Barnes K, Baker L, Blumberg L. Case management of malaria: treatment and chemoprophylaxis. S Afr Med J. 2013;103:793ā8.
Anderson TJ, Haubold B, Williams JT, Estrada-Franco Ā§ JG, Richardson L, Mollinedo R, et al. Microsatellite markers reveal a spectrum of population structures in the malaria parasite Plasmodium falciparum. Mol Biol Evol. 2000;17:1467ā82.
Auburn S, Barry AE. Dissecting malaria biology and epidemiology using population genetics and genomics. Int J Parasito. 2017;47:77ā85.
Patel JC, Taylor SM, Juliao PC, Parobek CM, Janko M, Gonzalez LD, et al. Genetic evidence of importation of drug-resistant Plasmodium falciparum to Guatemala from the Democratic Republic of the Congo. Emerg Infect Dis. 2014;20:932.
Escalante AA, Ferreira MU, Vinetz JM, Volkman SK, Cui L, Gamboa D, et al. Malaria molecular epidemiology: lessons from the International Centers of Excellence for Malaria Research Network. Am J Trop Med Hyg. 2015;93:79ā86.
Carter TE, Malloy H, Existe A, Memnon G, Victor YS, Okech BA, et al. Genetic diversity of Plasmodium falciparum in Haiti: insights from microsatellite markers. PLoS One. 2015;10:e0140416.
Razak MRMA, Sastu UR, Norahmad NA, Abdul-Karim A, Muhammad A, Muniandy PK, et al. Genetic diversity of Plasmodium falciparum populations in malaria declining areas of Sabah, East Malaysia. PLoS One. 2016;11:e0152415.
Bei AK, Niang M, Deme AB, Daniels RF, Sarr FD, Sokhna C, et al. Dramatic changes in malaria population genetic complexity in Dielmo and Ndiop, Senegal, revealed using genomic surveillance. J Infect Dis. 2018;217:622ā7.
Amambua-Ngwa A, Jeffries D, Amato R, Worwui A, Karim M, Ceesay S, et al. Consistent signatures of selection from genomic analysis of pairs of temporal and spatial Plasmodium falciparum populations from The Gambia. Sci Rep. 2018;8:9687.
Daniels R, Chang H-H, SĆ©ne PD, Park DC, Neafsey DE, Schaffner SF, et al. Genetic surveillance detects both clonal and epidemic transmission of malaria following enhanced intervention in Senegal. PLoS One. 2013;8:e60780.
Roh M, Tessema S, Murphy M, Nhlabathi N, Mkhonta N, Vilakati S, et al. High genetic diversity of Plasmodium falciparum in the low transmission setting of the Kingdom of Eswatini. J Infect Dis. 2019;220:e1346-54.
Tessema S, Wesolowski A, Chen A, Murphy M, Wilheim J, Mupiri A-R, et al. Using parasite genetic and human mobility data to infer local and cross-border malaria connectivity in Southern Africa. Elife. 2019;8:e43510.
Module M. Preparation of Rapid Diagnostic Tests (RDTs) for DNA extraction v1.1. WWARN Procedure. 2011.
Plowe CV, Djimde A, Bouare M, Doumbo O, Wellems TE. Pyrimethamine and proguanil resistance-conferring mutations in Plasmodium falciparum dihydrofolate reductase: polymerase chain reaction methods for surveillance in Africa. Am J Trop Med Hyg. 1995;52:565ā8.
Hofmann N, Mwingira F, Shekalaghe S, Robinson LJ, Mueller I, Felger I. Ultra-sensitive detection of Plasmodium falciparum by amplification of multi-copy subtelomeric targets. PLoS Med. 2015;12:e1001788.
Awandu SS, Raman J, Bousema T, Birkholtz L-M. Ultralow-density Plasmodium falciparum Infections in African Settings. Clin Infect Dis. 2019;69:1463ā4.
Liu Y, Tessema SK, Murphy M, Xu S, Schwartz A, Wang W, et al. Confirmation of the absence of local transmission and geographic assignment of imported falciparum malaria cases to China using microsatellite panel. Malar J. 2020;19:244.
Auburn S, Campino S, Miotto O, Djimde AA, Zongo I, Manske M, et al. Characterization of within-host Plasmodium falciparum diversity using next-generation sequence data. PLoS ONE. 2012;7:e32891.
Manske M, Miotto O, Campino S, Auburn S, Almagro-Garcia J, Maslen G, et al. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing. Nature. 2012;487:375.
Mobegi VA, Duffy CW, Amambua-Ngwa A, Loua KM, Laman E, Nwakanma DC, et al. Genome-wide analysis of selection on the malaria parasite Plasmodium falciparum in West African populations of differing infection endemicity. Mol Biol Evol. 2014;31:1490ā9.
Agapow PM, Burt A. Indices of multilocus linkage disequilibrium. Mol Ecol Notes. 2001;1:101ā2.
Kamvar ZN, Tabima JF, GrĆ¼nwald NJ. Poppr: an R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ. 2014;2:e281.
Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24:1403ā5.
Winter DJ. MMOD: an R library for the calculation of population differentiation statistics. Mol Ecol Resour. 2012;12:1158ā60.
Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11:94.
Mobegi VA, Loua KM, Ahouidi AD, Satoguina J, Nwakanma DC, Amambua-Ngwa A, et al. Population genetic structure of Plasmodium falciparum across a region of diverse endemicity in West Africa. Malar J. 2012;11:223.
Murray L, Mobegi VA, Duffy CW, Assefa SA, Kwiatkowski DP, Laman E, et al. Microsatellite genotyping and genome-wide single nucleotide polymorphism-based indices of Plasmodium falciparum diversity within clinical infections. Malar J. 2016;15:275.
Nabet C, Doumbo S, Jeddi F, KonatƩ S, Manciulli T, Fofana B, et al. Genetic diversity of Plasmodium falciparum in human malaria cases in Mali. Malar J. 2016;15:353.
Conway DJ, Roper C, Oduola AM, Arnot DE, Kremsner PG, Grobusch MP, et al. High recombination rate in natural populations of Plasmodium falciparum. Proc Natl Acad Sci USA. 1999;96:4506ā11.
Anthony TG, Conway DJ, Cox-Singh J, Matusop A, Ratnam S, Shamsul S, et al. Fragmented population structure of Plasmodium falciparum in a region of declining endemicity. J Infect Dis. 2005;191:1558ā64.
Rice BL, Golden CD, Anjaranirina EJG, Botelho CM, Volkman SK, Hartl DL. Genetic evidence that the Makira region in northeastern Madagascar is a hotspot of malaria transmission. Malar J. 2016;15:596.
Taylor LH, Walliker D, Read AF. Mixedāgenotype infections of malaria parasites: withināhost dynamics and transmission success of competing clones. Proc Biol Sci. 1997;264:927ā35.
Sondo P, Derra K, Lefevre T, Diallo-Nakanabo S, Tarnagda Z, Zampa O, et al. Genetically diverse Plasmodium falciparum infections, within-host competition and symptomatic malaria in humans. Sci Rep. 2019;9:127.
Nassir E, Abdel-Muhsin A-MA, Suliaman S, Kenyon F, Kheir A, Geha H, et al. Impact of genetic complexity on longevity and gametocytogenesis of Plasmodium falciparum during the dry and transmission-free season of eastern Sudan. Int J Parasitol. 2005;35:49ā55.
Bose J, Kloesener MH, Schulte RD. Multiple-genotype infections and their complex effect on virulence. Zoology. 2016;119:339ā49.
Pollitt LC, Mideo N, Drew DR, Schneider P, Colegrave N, Reece SE. Competition and the evolution of reproductive restraint in malaria parasites. Am Nat. 2011;177:358ā67.
Acknowledgements
Gratitude is expressed to the study population in the Vhembe District, Limpopo Province, as well as the Folovhodwe, Madimbo, Manenzhe, Masisi, Mulala, Makuya and Tshipise clinic and laboratory staff. Eric Mabunda and health information officers from the Limpopo Department of Health are acknowledged for facilitating access to patient information through the LMC database. The University of Pretoria Institute for Sustainable Malaria Control acknowledges the South African Medical Research Council as Collaborating Centre for Malaria Research.
Funding
We thank the South African National Research Foundation (NRF) South African Research Chair (SARChI) programme (the DST/NRF South African Research Chairs Initiative Grant (LB UID 84627), and the South African Medical Research Council for financial support. Opinions expressed and conclusions arrived at, are those of the author and are not necessarily attributed to the NRF.
Author information
Authors and Affiliations
Contributions
JR, LB and BG designed the study. HBG, JR and LB coordinated collection of the field samples and patient information. HBG and SKT processed the samples and HBG analysed the data. HBG drafted the manuscript and SKT, JR, BG and LB revised the draft. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Ethical approval for the study was obtained from the University of Pretoria, Faculty of Health Sciences Research Ethics Committee (Ethics Reference No. 406ā2014). Patient information was extracted from the Limpopo Malaria Case database based on the clinics from which RDT samples were collected, after receiving ethical approval from the Limpopo Department of Health (Ref: LP_201906_011).
Consent for publication
Consent to publish the data presented in this paper was obtained from the Limpopo Department of Health and the University of Pretoria.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisherās note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Gwarinda, H.B., Tessema, S.K., Raman, J. et al. Parasite genetic diversity reflects continued residual malaria transmission in Vhembe District, a hotspot in the Limpopo Province of South Africa. Malar J 20, 96 (2021). https://doi.org/10.1186/s12936-021-03635-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12936-021-03635-z