Impact of the household environment risk for maintenance of natural foci of Leishmania infantum transmission to human and animal hosts in endemic areas for visceral leishmaniasis in Sao Paulo State, Brazil

When it comes to visceral leishmaniasis (VL) in Brazil, one of the main targets of public health policies of surveillance is the control of domestic canine reservoirs of Leishmania infantum. This paper aims to evaluate the effect of the household environment risk in the maintenance of natural foci and in the transmission to human and animal hosts in an endemic city for VL, Bauru, in Brazil. We collected 6,578 blood samples of dogs living in 3,916 households from Nov.2019 to Mar.2020 and applied geospatial models to predict the disease risk based on the canine population. We used Kernel density estimation, cluster analysis, geostatistics and Generalized Additive Models (GAM). To validate our models, we used cross-validation and created a ROC graph. We found an overall canine VL (CVL) prevalence of 5.6%. Odds ratios (OR) for CVL increased progressively according to the number of canines for >2 dogs (OR 2.70); households that already had CVL in the past increased the chances for CVL currently (OR 2.73); and the cases of CVL increase the chances for human VL cases (OR 1.16). Our models were statistically significant and demonstrated an association between the canine and human disease, mainly in VL foci that remain endemic. Although the Kernel ratio map had the best performance (AUC=82), all the models showed high risk in the city's northwest area. Canine population dynamics must be considered in public policies and geospatial methods may help target priority areas and planning VL surveillance in low and middle-income countries.


Introduction
Leishmaniasis is a group of infectious diseases caused by a protozoan of the Leishmania genus that affects humans and animals. The transmission occurs by the bite of the dipterous of the subfamily Phlebotominae, the sand flies. It is considered one of the most widely distributed neglected diseases worldwide(1), being a health problem in North and East Africa, West and East Asia, and the Americas(2). More than one billion and a half persons live in risk areas for leishmaniasis around the However, for control programs, an integrated knowledge about the ecological niche of the vector and environmental conditions is fundamental to address effective measures (5). For this reason, geospatial modeling is a valuable instrument to target interventions of control programs (6).
In Brazil, there is a great difficulty for the effective implementation and operation of the VL control programs (7). Overall, the Brazilian Visceral Leishmaniasis Control Program (VLCP) is based on the control of canine reservoirs, which consists in serosurvey and culling of dogs; control of the vector spraying insecticides inside the households; and early diagnosis and treatment of human cases (8).
The first evidence of VL in Sao Paulo state was the presence of Lutzomyia longipalpis in the urban area of Araçatuba municipality in 1997 (9). In 1998, autochthonous VL dogs were reported, and for the first time in the state, the sand fly was suspected as the vector; and in 1999 autochthonous human cases were reported for the first time (10) (11).

Several factors may be responsible for increasing the cases and the number of deaths in
Sao Paulo state, such as difficulty for early diagnosis and specific treatment in human; difficulty in the correct identification and control of domestic reservoirs; and difficulty in controlling the vector population (8). In addition, there is unclear knowledge about other determinants that may influence the design of novel strategies for control and prevention of VL (12).
In Sao Paulo state, Bauru had the first evidence of VL in 2002 when the sandflies were found and the first autochthonous infection in a dog was reported. The first human records occurred in 2003. Since that time, there have been 580 cases and 46 deaths, a lethality rate of 7.9% from 2003 to 2020(13). The municipality of Bauru was chosen to perform this research because of the high number of cases and its endemicity in the region. In Bauru, there is a lack of information about the spatial distribution and a long-term follow-up of CVL, information that could aid in the global understanding of the problem. The spread of the disease (human cases) came from one cluster in the west to east, and the environmental characteristics suggest a causal relationship of deforestation and human occupation is associated with the emergence of new VL cases (14). Mapping the exact occurrence of the human or canine cases may help better understand the disease and plan public policies regarding VL.
This study aimed to calculate the impact of the household environment risk for visceral leishmaniasis using geospatial methods. We hypothesize that: a) the number of dogs in the . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) with Leishmania-specific antibodies present in serum samples. The diagnostic was run in a Multiscan spectrophotometer using a 450 nm filter and cutoff values ("Cutoff" = CO): CO = average negative controls x 2. The diagnoses were performed according to the manufacturer's instructions and the directions of the VLCP.

Definition of cases
A combination of TR DPP® and ELISA reagent was considered a positive result according to the Brazilian VLCP recommendations for canine diagnoses, routinely used by the Centers for zoonoses control in Sao Paulo (19). TR DPP® non-reagent was considered a negative resultsupplementary material. The prevalence was calculated based on the outcome, being a proportion of a dog found positive for CVL divided by the analyzed dog population. The consent for sample collection involving domestic dogs was provided by the dog owners in the areas surveyed. All serosurvey was supervised by the veterinary group of the Adolfo Lutz Institute in conjunction with the veterinarians responsible for the Center for Zoonoses Control in Bauru municipality. Households without dogs, closed or that refused to give the consent were excluded from the analysis.
The human laboratory diagnoses are based mainly on serological methods and microscopic diagnoses (parasitological). When amastigotes are identified, it is considered a certainty diagnostic.
Patients with clinical manifestation and reagent rapid immunochromatographic test rK39 and/or Indirect immunofluorescence with titers equal to or greater than 80 are considered positive for VL (19).
Human cases addresses come from the epidemiological surveillance center (CVE) (13).

Study design
Canine data were grouped into the households. The number of dogs was categorized as binary data to verify each risk group: one dog; two dogs; and more than two dogs. Households that already had a positive dog or a human case were also categorized as binary. We considered 1 for cases and 0 for non-cases of VL.

Mapping
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Points (households) were categorized as negative or positive for VL in each survey. Figure 2 shows the mapped data. To analyze the area of influence of households with infected dogs in the environment, we created buffer zones of 100m ( Figure 2)supplementary material. We then calculated the number of dogs, negative dogs, and positive dogs using spatial analysis tools. Finally, we aggregated features of point data into polygons, using the census tracts database, to stratify the prevalence spatially.

Statistical analysis
For all the performed calculations, we considered a significant value at p≤0.05. We used the geographic information system (GIS) ArcGIS 10.2.2 and R language, with several packages described in the sections below.

Pearson's correlation
Pearson's correlation was calculated to identify a possible association between the number of cases of CVL and: i) the number of investigated samples or ii) the number of households that . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted May 22, 2021. ; https://doi.org/10.1101/2021.05.18.21257380 doi: medRxiv preprint already had an infected dog, or iii) the number of households that already had and currently have an infected dog/dogs.

Binary logistic regression
We tested if the households with an infected dog (outcome = 0 for a household with no infected dog/dogs or outcome = 1 for a household with an infected dog/dogs) or an area of influence of household (outcome = 0 for areas of influence of household with no infected dog/dogs or outcome = 1 for area of influence of household with infected dog/dogs) could possibly increase the chances to have cases of the disease.

K-function
Being aware of spatial dependence of CVL promoting different risks or protection, we evaluate, locally, the spatial interactions in the urban neighborhoods. Ripley's K-function with 999 permutations was applied to identify households' spatial patterns at distances (21). In this function, K(t) is the number of events within a distance of an arbitrary event, divided by the overall density of events. We plotted maximum and minimal envelopes of the simulated values of K(t), giving the statistical significance for clustered or dispersed patternssupplementary material.

Cluster analysis
We used cluster analysis to detect significant concentrations of CVL within Confidence Intervals (CI) of 90%, 95%, and 99% -supplementary material. Clusters were calculated using Getis-Ord Gi statistic, which identifies features with either high or low values cluster spatially. The pattern can be expressed by clustered, dispersed, or random features and represents a measurable spatial aggregation unit (ESRI, Imagem).

Kernel density
Using the K-funcion dependence, we choose the minimal distance of concentration of our data, 0.5 km, to set the bandwidth. We used the quartic kernel function (22), which is given by . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the distance between the point and the observed event in location, and is the radius centered on .
We plotted Kernel density maps for CVL cases and canine samples. A Kernel density ratio map was then performed (CLV: samples), which gives a visualization of the risk for the disease.

Geostatistical approach
According to the number of dogs, a geostatistical approach was performed to predict the higher risk areas for CVL. We used the Ordinary Kriging method and select two datasets: cases of CVL and number of dogs. We adjusted data in a stable model in a semivariogram, in which for a set of experimental values z(x) and Z (x1+h), separated by h distance, is defined by the equation 2: Where,

N(h) is the number of experimental pairs;
h is the regular interval that separates z(xi) e z( +h) where, . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Cross-validation
For further analysis, we validated our data using cross-validation. We created random samples in ArcGIS and then split our database into training (75%, 2,937 points) and testing (25%, 979 points). Spatial models were created using the training dataset to predict the risk for the testing dataset. For each model, the best threshold was chosen, and we calculated specificity, sensitivity, and accuracy for correctly predicting the observed value of a case or non-case at the testing coordinates.
To sum up, we calculated the area under the receiver operating characteristic (ROC) curve (AUC) with 95% confidence interval, which plots the true positive rate versus false positive rate, allowing identifying the performance of the models. We used the 'pROC' and 'ggplot2' packages in RStudio.

Results
The current study investigated 6,578 dogs (Table 1). Anti-Leishmania spp. antibodies were present in 8.1% of TR DPP® (535/6,578) and 5.6% in both TR DPP® and ELISA diagnoses (369/6,578). We found different spatial prevalence in the investigated census tract, ranging from 0 to 50%, but the mean prevalence was 2.67%. Higher prevalence (>7.5%) was regularly distributed in the city in the sampled area ( Figure 2).

Cluster analysis
We identified a clustered pattern of households with CVL with statistical significance from approximately 0.5 to 6.5 km and a clustered pattern of human cases from 0.5 to 4 km ( Figure 3S).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.   Kernel density ratio map ranging from 0 (blue) to 0.7 (red), which gives a visualization of the risk diving the concentration of cases of CVL ( Figure 4S) by the concentration of dog samples ( Figure 5S). The areas of higher risk are in the west and southwest.

Pearson correlation
Pearson's correlation was positive and moderate considering the number of infected dogs and the households investigated (0.508, p-value=0.000); positive low for infected dogs and the households that already had an infected dog/dogs (0.240, p-value=0.000); and positive and low for infected dogs and the households that already had and currently have a dog/dogs with VL (0.129, p-value=0.000). All conditions were statistically significant.

Binary logistic regression for visceral leishmaniasis
According to

Spatial risk
Considering high OR for CVL according to the number of dogs, we created the spatial models using the number of dogs as a predictor. Figure 5 shows that both models (geostatistical and GAM) were considerable commonality in the spatial pattern. Higher risk is in the borders of the city, especially in the northwest and in the southeast. The last one can be a border effect. Moreover, both models are consistent with the Kernel density ratio map (Figure 4).

Cross-validation
Kernel, Geostatistical and GAM models were plotted in the ROC graph ( Figure 6). The  For each model, the AUC was calculated with 95% confidence intervals. The best model in predicting canine risk disease was the Kernel density ratio map.

Discussion
In the current study, we found a CVL TR DPP® sero-reaction rate of 8.1% (535/6578) and Particularly, prevalence can reveal bias once it may not represent the real number of canines. Overall, the serosurveys are directed to human case areas and/or areas of a suspect or identified canine VL case (19)(8). Historically, in Bauru, some neighborhoods have never performed a serosurvey before this study. On the other hand, some neighborhoods were investigated more than once since the first human case appearance, recognized by its recurrence of CVL. Our study planned the serosurvey to collect a large number of dog samples in different neighborhoods, giving a panorama of VL's endemicity and spatial epidemiological profile in a short time. Nevertheless, examined canines comprised less than 7% of the estimated dog population (6,578/99,815 dogs).
In the present study, our scale is the household instead of only the dogs, identifying spatial characteristics regarding the domiciles and canine population. We highlight that on the household scale, the positivity index of domiciles with infected dogs (8.7%) is superior that the global prevalence of CVL (5.6%), which emphasizes the importance of the household in the disease context. Clusters of households that already had CVL can point out the areas that remain a source of infection and are unnoticed. Almost all investigated areas had these clusters. Additionally, asymptomatic dogs can be highly competent (33) and remain a source of infection without being identified. They contribute to the silent endemic areas. It can turn out into highly endemic areas or possibly a human case site. Cluster gave us previous information of critical areas regards CVL.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted May 22, 2021. ; https://doi.org/10.1101/2021.05.18.21257380 doi: medRxiv preprint The recent expansion of VL to new endemic areas has been attributed to the adaptation of L. longipalpis (sandflies) to naïve ecological niches. The risk of expansion of VL increases in areas identified as migratory poles of attraction. Moreover, CVL has been highlighted as the primary cause of outbreaks (34). In these areas, canine enzootic disease precedes the appearance of human cases.
In our study, CVL increased the chances 102% for human cases and 116% for dogs, demonstrating an association between canine and human VL. Other studies found that the risk increased substantially for individuals when the presence of seropositive dogs (35) or previous cases of CVL in the household (36) (37). Furthermore, we identified a clustered pattern for both human and canine cases from approximately 500m.
As we identified, households with CVL and the dog population can increase the chances for VL and maintenance of natural foci of Leishmania infantum transmission to human and animal hosts, which urges specific public policies focused on animal health, especially in areas target as critical. We found the same mean number of dogs per household (1.6), as reported in previous research (18).
Less than 15% of the investigated households have more than 3 dogs, which is the minority, easily to monitor and investigate as a possible infection site. In the neighborhoods where the canine population is large, animal health assistance is required. Therefore, canine population dynamics must be considered in public policies.
Our results revealed that the risk of increasing CVL or human cases oscillated by areas. Of note, kernel maps studies have used the total number of cases or applied a constant (30). Our study used the number of cases and samples, which gives a visualization of the risk. In accordance, (29) used the same methodology and found a similar pattern of critical areas in the city's borders, a pattern that seems to be expected in small and medium-sized cities of similar urbanization process in low and middle-income countries where VL is endemic. The Kernel density ratio map was the best in the ROC graph, showing spatial analysis tools potential. Spatial models predicting disease risk are promising for decision-making regarding the control of VL. Such studies use machine learning for cutaneous leishmaniasis vectors prediction (38) or human cases prediction (39). Bi et al. stress that future research about VL should focus on spatial simulation and agent-based simulation (40). Machine learning is a novel approach that allows the forecast of disease risk. It can anticipate disease transmission dynamics and identify disease control strategies to fight endemic and emerging diseases (40). Our models bring new insights for thinking VL . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted May 22, 2021. ; https://doi.org/10.1101/2021.05.18.21257380 doi: medRxiv preprint through canines from a social perspective, which has been one of the most debatable points of control programs and tends to be of low priority in the context of general public health.
It is a time of changing public policies in relation to VL. The general principles that guided the past control programs are now questionable. Brazilian VLCP, performed by municipal levels, have presented operational difficulties in executing VL control strategies (41). Additionally, we highlight the unavailability of the proven effectiveness of technical alternatives for laboratory diagnosis, identification, and elimination or protection of reservoirs (42).
In Brazilian cities, culling dogs has been recommended as a control measuring to reduce VL Vast territorial areas should be treated by priority order, emphasizing different profiles of VL.
Furthermore, considering the genetic diversity of vectors (57)(58) and the protozoa (59) (60), even at the local levels, seem to be alternatives to rethink new VL approaches. The decision-making should be supported by an integrated approach, considering education, health, and environment, including vectors, causal agent, canines, households, population density, urbanization, presence of buildings, industries, and environmental factors, such as vegetation, water bodies, temperature, and precipitation. Animal health needs to be discussed in public policies without its stigma. Furthermore, VL should be addressed in the context of One Health (42).
To conclude, this paper had several limitations that should be recognized. Firstly, we had to use the census tract information based on the human population to calculate the canine population grid because of the lack of animal information. This could be solved with an updated canine census, hardly achieved in low and medium-income countries. Secondly, the performance of our spatial models had medium and low accuracy, although the critical areas being commonly similar to the Kernel ratio map of higher performance, which emphasizes a high chance that the classifier distinguishes the positive class values from the negative. The better performance of the models could . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted May 22, 2021. ; https://doi.org/10.1101/2021.05.18.21257380 doi: medRxiv preprint be improved with an updated census and adding real-world covariates when data become available.
There are still research gaps concerning VL, and many areas of study remain unexplored. It remains the question of balancing the effectiveness and costs involved in such a VL control plan (40). As future work, the next step of our research is to analyze the canines' role with new insights of controlling VL, for instance, canine cohort studies of insecticide-impregnated collars, vaccination, and treatment in different areas of this endemic site, as an individual and collective measure in the environment.

Conclusions
As a rule of thumb, one can say that the number of canines and the households impact the risk for maintenance of natural foci of Leishmania infantum transmission to human and animal hosts in endemic areas for VL. Overall, this study serves as a case study for regional and global applications. It reveals the importance of canines on the household scale in low and middle-income countries. It is time for changing VL public policies using a targeted plan of priority through spatial analysis. This statement invites further investigations regarding VL characteristics involving socioeconomic and environmental variables and VL in one health context. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 22, 2021. ; https://doi.org/10.1101/2021.05.18.21257380 doi: medRxiv preprint