Journal of The Royal Society Interface
You have accessResearch articles

Early-stage loss of ecological integrity drives the risk of zoonotic disease emergence

Lara Marcolin

Lara Marcolin

Department of Biology and Biotechnologies ‘Charles Darwin’, Sapienza Università di Roma, Rome, Italy

[email protected]

Contribution: Conceptualization, Formal analysis, Methodology, Project administration, Writing – original draft

Google Scholar

Find this author on PubMed

,
Andrea Tonelli

Andrea Tonelli

Department of Biology and Biotechnologies ‘Charles Darwin’, Sapienza Università di Roma, Rome, Italy

Contribution: Writing – original draft, Writing – review and editing

Google Scholar

Find this author on PubMed

and
Moreno Di Marco

Moreno Di Marco

Department of Biology and Biotechnologies ‘Charles Darwin’, Sapienza Università di Roma, Rome, Italy

[email protected]

Contribution: Conceptualization, Methodology, Supervision, Writing – review and editing

Google Scholar

Find this author on PubMed

    Abstract

    Anthropogenic pressures have increasingly disrupted the integrity of ecosystems worldwide, jeopardizing their capacity to provide essential contributions to human well-being. Recently, the role of natural ecosystems in reducing disease emergence risk has gained prominence in decision-making processes, as scientific evidence indicates that human-driven pressure, such as habitat destruction and deforestation, can trigger the emergence of zoonotic infectious diseases. However, the intricate relationship between biodiversity and emerging infectious diseases (EIDs) remains only partially understood. Here, we updated the most comprehensive zoonotic EID event database with the latest reported events to analyse the relationship between EIDs of wildlife origin (zoonoses) and various facets of ecological integrity. We found EID risk was strongly predicted by structural integrity metrics such as human footprint and ecoregion intactness, in addition to environmental variables such as tropical rainforest density and mammal species richness. EID events were more likely to occur in areas with intermediate levels of compositional and structural integrity, underscoring the risk posed by human encroachment into pristine, undisturbed lands. Our study highlights the need to identify novel indicators and targets that can effectively address EID risk alongside other pressing global challenges in sustainable development, ultimately informing strategies for preserving both human and environmental health.

    1. Introduction

    Emerging infectious diseases (EIDs) of zoonotic origin pose a major threat to public health and socio-economic stability [1,2]. Most EIDs in recent decades derive from pathogens originating in wildlife [3], but the link between biodiversity and EID risk remains only partially explored. As zoonotic spillover cannot occur in the absence of reservoir hosts that maintain the pathogen, wildlife can be considered the necessary but not sufficient condition for the occurrence of zoonotic spillover [4]. Global-scale analyses [3,5] identified mammal species richness as a key predictor of zoonotic EIDs. The relationship between mammal species richness and EID risk generally shows idiosyncratic trends, where both low and high values of species richness correspond to high risk [5]. Such trends suggest that the biodiversity–risk relationship is linked to context-dependent dynamics of pathogen, host and human interaction [6], and is likely influenced by anthropogenic drivers that alter these dynamics. A recent study [7] showed that the proportion of zoonotic hosts in wildlife communities increases in human-dominated systems, compared with undisturbed areas. Both richness and abundance of host species in sites under substantial human use (such as agricultural and urban ecosystems) were higher compared with those found in assemblages sited in undisturbed habitats. This finding highlights the role of anthropogenic disturbance in increasing EID risk. Moreover, it emphasizes that efforts to anticipate EID risk need to take into consideration ecosystem-level metrics that capture the effects of human activities and related environmental changes on biodiversity. In this perspective, EID risk is expected to respond positively to high levels of human pressures that cause habitat degradation, fragmentation and biodiversity loss, thereby altering host communities’ composition and abundance in a way that facilitates pathogen circulation and their consequent transmission to human hosts [7,8]. However, the risk of EIDs can be expected to decrease beyond a certain level of habitat modification, because highly modified areas support fewer species of wildlife, leading to a reduced diversity of available pathogens.

    Habitat degradation caused by land-use changes has a direct influence on human exposure to wildlife pathogens, increasing contact rates at the human–wildlife interface and leading to a higher risk of zoonotic spillover [9,10]. Also, habitat degradation causes substantial changes in species assemblages, which may alter disease dynamics within wildlife communities. In multi-host disease systems, higher species diversity may reduce the transmission of a pathogen through a variety of mechanisms collectively referred to as dilution effects [11]. As different species have different levels of host competence for a given pathogen [12] (i.e. the ability to harbour the pathogen and transmit it to new hosts or vectors), the dilution effect hypothesis assumes that in more diverse wildlife communities, the presence of less-competent hosts reduces contact rates between highly competent hosts (known as the ‘encounter reduction’ mechanism), limiting the opportunities for pathogens to spread. Another dilution mechanism is ‘host regulation’, which occurs when predators or competitors of competent hosts are abundant. Alterations that decrease species diversity may therefore increase pathogen transmission in wildlife communities, resulting in a higher risk of zoonotic spillover once dilution effects are suppressed. On the other hand, the decline in biodiversity itself does not necessarily result in increasing disease risk. Indeed, high levels of biodiversity loss reduce disease risk if competent host species disappear from highly modified environments [11].

    Following the same logic, high levels of environmental and biodiversity integrity are supposed to reduce EID risk [13,14]. While anthropogenic pressures alter species diversity and abundance, intact lands support natural ecological and evolutionary processes operating with minimal human disturbance. Intact forests provide essential functions in climate mitigation, conservation of biodiversity and, importantly in this context, human health [14]. When compared with degraded ones, intact forests host a higher richness of forest-dependent species, a higher functional and intraspecific genetic diversity, as well as a higher connectivity important for gene flow and genetic adaptation. All these features are supposed to give intact forests the potential to reduce infectious disease risk. However, empirical evidence of how various components of ecological integrity contribute to reducing EID risk remains unknown.

    The recently approved Kunming–Montreal Global Biodiversity Framework [15] formalizes the importance of conserving ecological integrity, in Target 1: ‘…bring the loss of areas of high biodiversity importance, including ecosystems of high ecological integrity, close to zero by 2030…’. The Framework also recognizes the value of nature in reducing disease risk, in Target 11: ‘Restore, maintain and enhance nature’s contributions to people, including ecosystem functions and services, such as […] reduction of disease risk…’. We argue that these two target elements have the potential to generate important synergies in the implementation of the Framework [16] and its relationship with the broader 2030 Sustainable Development Agenda [17], but the relationship between ecological integrity and disease risk reduction needs to be tested empirically. A comprehensive large-scale test of these effects has not been performed yet, as large-scale modelling exercises have mostly focused on species richness and ignored ecological integrity and other aspects of biodiversity [3,5]. In this work, we explore how global zoonotic EID risk responds to ecological integrity, analysing the relationship between zoonotic EID events of wildlife origin and several biodiversity metrics that reflect anthropogenic alteration.

    2. Methods

    2.1. Mapping zoonotic emerging infectious disease events

    We followed the zoonotic EID event definition given by Jones et al. [3], considering an EID event as ‘the first temporal emergence of a pathogen in a human population which was related to the increase in distribution, increase in incidence or increase in virulence or other factor which led to that pathogen being classed as an emerging disease’. The zoonotic EID event database used in our work is an updated version of the database used by Jones et al. [3] and modified by Allen et al. [5], where events range in time from 1940 to 2008 (n = 224). Based on the Emerging Infectious Disease Repository (EIDR) database [18], we filtered and updated 7 out of 33 reported events until 2013 (n = 231) (electronic supplementary material, table S1) following the same criteria used by Allen et al. [5], so as to include zoonotic diseases of wildlife origin reported in the peer-reviewed literature, keeping only the first emergence of a new disease-causing agent. Single case reports and events lacking evidence for their geographic and temporal origin were excluded from the analysis. For each new event, we created a spatial polygon corresponding to the most precise municipal region where the event occurred. Since there are multiple temporal layers among covariates, such as land-use and population variables, events were selected keeping only those which occurred since 1970 (n = 154) in order to temporally match them with predictors.

    2.2. Predictors of zoonotic emerging infectious disease risk

    Following Allen et al. [5], we first collected spatial data layers for 19 environmental and anthropogenic drivers already tested for their contribution to zoonotic EID risk prediction. Then, we collected six biodiversity metrics representing different aspects of ecological integrity, to test our hypothesis about the role of integrity in mitigating EID risk. These data were downscaled to the lowest common spatial resolution of 1° (WGS84, approximately 110 km at the equator). A full list of predictor layers and details of sources, original resolutions and rescaling are shown in table 1.

    Table 1. List of predictors included in the models.

    variable type source data set processing temporal resolution and extent
    human population human activity GPCGv1 [19] rescaled decadal (1970–2000)
    population change human activity GPCGv1 [19] (calculated) calculated from rescaled layers inter-decadal (1970–2000)
    cropland human activity LUH2 [20] rescaled decadal (1970–2000)
    cropland change human activity LUH2 [20](calculated) calculated from rescaled layers inter-decadal (1970–2000)
    pasture human activity LUH2 [20] rescaled decadal (1970–2000)
    pasture change human activity LUH2 [20](calculated) calculated from rescaled layers inter-decadal (1970–2000)
    urban land human activity EarthEnv [21] rescaled N/A
    managed/cultivated vegetation human activity EarthEnv [21] rescaled N/A
    global environmental stratification environment GEnS [22] rescaled N/A
    evergreen/deciduous needleleaf trees environment EarthEnv [21] rescaled N/A
    evergreen broadleaf trees environment EarthEnv [21] rescaled N/A
    deciduous broadleaf trees environment EarthEnv [21] rescaled N/A
    mixed/other trees environment EarthEnv [21] rescaled N/A
    shrubs environment EarthEnv [21] rescaled N/A
    herbaceous vegetation environment EarthEnv [21] rescaled N/A
    regularly flooded vegetation environment EarthEnv [21] rescaled N/A
    mammal species richness animal/host IUCN [23] (see electronic supplementary material) static (2022)
    livestock mammal headcount animal/host GLW [24] rescaled static (2016)
    poultry animal/host GLW [24] rescaled static (2016)
    biodiversity habitat index biodiversity [25] reprojected and rescaled decadal (2000-2020)
    biodiversity intactness index biodiversity [26] reprojected and rescaled static (2016)
    contextual intactness biodiversity [27] reprojected and rescaled static (2013)
    ecoregion intactness biodiversity [28] reprojected and rescaled decadal (1993–2009)
    human footprint biodiversity [29] reprojected and rescaled decadal (1993–2009)
    wilderness biodiversity [30] (see electronic supplementary material) decadal (1993–2009)
    2.2.1. Environmental and anthropogenic variables

    We represented environmental conditions using eight climatic and land cover features, respectively, from the Global Environmental Stratification dataset [22] and the EarthEnv database [21]. Such environmental variables are known to greatly influence the distribution of terrestrial mammals and the pathogens they host [3133].

    We then included data on human population density, population change and land-use changes to control for the human-driven pressure that increases exposure to EIDs. Population density has a strong correlation with the distribution of EIDs, affecting disease transmission dynamics and increasing outbreak detection probability in densely populated areas [3,5,33]. Population change serves as an indicator of changing demands on ecosystems, leading to environmental disruptions and disease emergence [13]. These variables were derived from the Global Population Count Grid Time Series Estimates, v1 database [19]. Land-use changes, such as agriculture and deforestation, impact terrestrial species richness and abundance, altering host and vector communities and facilitating human–wildlife contact [13,3436]. Land-use data were obtained from the Land Use Harmonization 2 database (LUH2) [20]. Domestic animal density was also included owing to the role of livestock as intermediate or amplifying hosts in disease outbreaks in humans [20,33]. Data on livestock density were retrieved from the Gridded Livestock of the World dataset [24].

    2.2.2. Biodiversity and ecological integrity variables

    We included mammal species richness, a well-known correlate of EID risk [5], as a proxy of pathogen species richness. Here, for simplicity, we assumed that the pool of pathogens with zoonotic potential increases with an increased number of mammal species richness [37], although we acknowledge the question of whether zoonotic pathogen richness is homogeneously distributed across mammalian taxonomy is still debated. Terrestrial mammals’ distribution data were obtained from the IUCN Red List [23], which includes spatial maps for 5624 terrestrial mammal species’ known range. We filtered data excluding ranges where species were declared extinct prior to 1970 and we then assessed the mammal species richness index on a global scale by overlaying each species’ spatial polygon which overlaps with each 1° resolution grid cell and counting the number of species present in each grid cell.

    To explore the relationship between ecological integrity and EID risk, we selected six biodiversity metrics, each capturing unique dimensions of ecological integrity. As defined by the Convention on Biological Diversity, ecological integrity is ‘an ecosystem’s capacity to maintain its composition, structure and functioning within a natural range of variability over time’. Structure refers to the three-dimensional component of ecosystems, encompassing the biotic and abiotic elements that shape the heterogeneous matrix which supports the composition and functioning of the ecosystem; composition pertains to the diversity and range of organisms existing within the ecosystem and function relates to the ecological processes and ecosystem services provided by the ecosystem [38]. Changes in the structure of ecosystems owing to human activities can have significant consequences for EID risk. Alterations of ecosystem structure can affect the spatial distribution of host species, vectors and potential intermediate hosts involved in the transmission of diseases. Anthropogenic pressures, such as deforestation and habitat conversion, can lead to changes in ecosystem structure, resulting in increased contact between humans, livestock and pathogens. This can facilitate the transmission of EIDs [9] by increasing pathogen sharing from wildlife to humans and creating novel opportunities for pathogen evolution and adaptation to new hosts [38,39]. Changes in ecosystems’ composition owing to human activities, such as deforestation, habitat destruction and land-use change, can cause shifts in pathogens’ ecology by altering zoonotic host communities’ composition through loss, turnover and homogenization of biodiversity [7,8]. Biodiversity loss may increase disease transmission and incidence when the lost species are less-competent hosts, by intensifying encounter rates between pathogens and competent hosts [34,40]. More diverse host communities instead may inhibit the spread of pathogens according to the dilution effect hypothesis [41].

    We chose four ecosystem-level metrics to account for the overall impact of anthropogenic pressures on the ecosystem’s structure.

    1. The human footprint index (HFP) [29] is an indicator of cumulative human pressures on natural ecosystems. HFP is a global index with a resolution of 1 km2 based on eight key layers of human pressures such as infrastructures, land cover and human access to natural areas, which are a suite of anthropogenic stressors that are known to impact ecological systems.

    2. Wilderness areas [30,42] are defined as ‘ecologically intact areas free of industrial-scale activities and other human pressures which result in significant biophysical disturbance’ [43]. Hosting original species assemblages in the absence of large-scale anthropogenic disturbances, fully functional ecosystems may mitigate pathogen spillover risk [14]. We derived the wilderness global index using wilderness area maps from Allan et al. [30], representing ‘pressure-free’ lands with a contiguous area >10 000 km2. It is important to notice that this definition of wilderness does not preclude human presence (e.g. indigenous populations) but rather industrial-scale activities. The proportion covered by wilderness areas for each 1° cell was binarized using a threshold of 25% (i.e. cells with >25% wilderness coverage were considered to include wilderness).

    3. Ecoregion intactness [28] quantifies the extent to which natural ecosystems within an ecoregion remain undisturbed and retain their original ecological characteristics. It is a measure of human alteration of terrestrial ecosystems which captures habitat loss, quality and fragmentation effects arising from anthropogenic disturbance at an ecoregional scale.

    4. Contextual intactness [27] is obtained by combining local habitat conditions with estimates of spatial turnover in species composition, via generalized dissimilarity modelling [44,45] with a resolution of 30 arcsec. The metric identifies valuable local habitat conditions, in the context of other locations where similar species assemblages are found. It measures the proportion of all locations expected to have once supported a similar assemblage of species to the focal grid cell, but which have suffered higher impact from human activities than the focal cell. Contextual intactness values range between 0 and 1, where higher values mean the focal cell has a higher level of intactness within the context of biologically similar cells.

    We also accounted for intactness in biological communities, as determined by levels of anthropogenic alterations on species richness and abundance. We chose two metrics:

    1. The biodiversity habitat index [25] is a global index with a resolution of 30 arcsec designed to quantify the level of species diversity expected to be retained within a given geographic area, compared with its original (pristine) condition. This estimate is based on factors such as the unit’s size, and the connectivity and integrity of natural ecosystems across it.

    2. The biodiversity intactness index [26] expresses the average abundance of the native terrestrial species, relative to their abundance in an undisturbed location. It is a global spatial raster with a resolution of 30 arcsec which integrates a measure of abundance-based compositional similarity with pressure variables of land-use, land-use intensity, human population density and proximity to the nearest road.

    2.3. Predicting zoonotic emerging infectious disease risk

    We explored the relationship between EID risk and biodiversity variables with a random forest classifier algorithm [46] in the R package ‘ranger’ [47]. We ran six distinct models, one for each metric of ecological integrity. Every model included the same set of 19 environmental and anthropic predictors used in the original Allen et al. [5] paper, including mammal species richness, plus a new variable referring to ecological integrity (i.e. human footprint, biodiversity habitat index, biodiversity intactness index, ecoregion intactness, contextual intactness and wilderness). We decided not to run a full model with all integrity metrics together in order to avoid the effects of collinearity between these predictors in our analysis (electronic supplementary material, table S2). We selected random forest models over the boosted regression trees models used by Allen et al. [5] after conducting a preliminary random tenfold cross-validation, wherein random forest models exhibited a higher predictive performance, as measured by the true skill statistics metric (TSS) [48] (electronic supplementary material, table S3).

    The fitted models were used to generate relative influence box plots and partial dependence plots with empirical 90% confidence intervals. The importance of each variable was measured as the mean decrease accuracy (MDA) which measures the decrement of the accuracy of model predictions caused by the variable’s permutation, averaged across all trees. The MDA scores for each model were then normalized by their largest value so that the most influential variable in every model took a value of 1 and every other variable was rescaled accordingly, in order to compare variables’ importance among models.

    We also evaluated how EID risk responds to the interaction between each integrity metric and levels of forest cover, a well-known predictor of EID risk, by implementing bivariate partial dependence plots. We used the R package ‘pdp’ [49] to create both partial dependence plots and bivariate partial dependence plots.

    2.4. Dealing with uncertainty in emerging infectious disease events

    To account for spatial and sampling uncertainty in EID events, we used a bootstrap resampling regime: each model was fitted on 100 bootstrap samples consisting of 154 zoonotic EID ‘events’, and the resultant predictions from the multiple models were then combined to create a single score and to generate empirical confidence intervals. For each event, we selected a presence point, which is one of the grid cells intersecting the spatial polygon where that event is known to have occurred (a municipality, a region, a country). Each presence point was associated with a background point, that is, a grid cell chosen within all other cells which do not intersect that polygon. Both presence and background points were sampled randomly, but the probability of selecting them was weighted based on event coverage and reporting effort. Event coverage is the proportion of a grid cell that overlaps the polygon within which the event has occurred. We assigned higher weights to grid cells with larger overlap (i.e. a higher probability of being selected as presences). Reporting effort is the extent to which disease events are reported to become part of the scientific literature, which is a proxy for the power of disease detection and reporting (for details, see Allen et al. [5]). Cells with a high value of reporting effort have a higher probability to be chosen both as presence and background points.

    2.5. Model fitting and validation

    For each model, we selected the optimal number of variables randomly sampled as candidates for each split (mtry) and node size using a fivefold cross-validation. We defined a grid of hyperparameter values (n = 200) and repeated the cross-validation process for each combination. We then calculated the average performance of the model across all folds for each hyperparameter set and chose the combination with the highest average performance, quantified by the true skill statistic metric.

    To estimate models’ predictive performance, we ran two validations: a classical random cross-validation and a stricter spatial block cross-validation. In the random cross-validation, data were randomly partitioned into ten folds: data from nine folds were used to fit the model, whereas data from the remaining fold were used to test predictive ability. We fitted the models iteratively on all folds but one and validated on the left-out fold. In the spatial block cross-validation, we created six different spatial blocks, one for each continent. We then fitted the models iteratively on all blocks but one and validated them on the left-out block. This latter strategy allowed us to separate training and testing sets by using spatial blocks [50] which take into account the spatial autocorrelation of ecological data. Models’ predictive performance was estimated with the true skill statistic.

    3. Results

    3.1. Model accuracy

    For each integrity model, the ability to recognize both presence and background points was always better than random with specificity and sensitivity both higher than 0.5 (electronic supplementary material, table S4). The random tenfold cross-validation showed a high predictive power across all models per sample combinations (averaged TSS = 0.46, range: 0.33–0.60), whereas the spatial cross-validation showed a lower performance (averaged TSS = 0.27, range: 0.12–0.52) when extrapolating into novel geographical areas [50].

    3.2. Variable’s importance in the random forest models

    Evergreen broadleaf trees variable was the strongest predictor of EID events across all models (median MDA = 0.76, range: 0.74–0.80; figure 1), followed by global environmental stratification (median MDA = 0.60, range: 0.54–0.61), and evergreen/deciduous needleleaf trees (median MDA = 0.54, range: 0.45–0.57). Additionally, several variables related to human activity showed a high level of relative importance, including human population (median MDA = 0.44, range: 0.41–0.48), cropland coverage (median MDA = 0.37, range: 0.34–0.38) and cropland change (median MDA = 0.35, range: 0.34–0.37). Notably, while the TSS scores were similar across all models, and also similar to a baseline model without any integrity metric, there was high variation in the relative predictive importance of integrity metrics. Specifically, human footprint showed the largest relative importance (median MDA = 0.55) among the biodiversity and ecological integrity metrics, immediately followed by mammal species richness (median MDA = 0.53, range: 0.49–0.57). Ecoregion intactness (median MDA = 0.44) and biodiversity habitat index (median MDA = 0.38) also showed good predictive power, while wilderness had the lowest normalized MDA (median MDA = 0).

    Variable’s importance plots for each separate model run with the same set of environmental and anthropogenic predictors plus a single integrity metric (in green).

    Figure 1. Variable’s importance plots for each separate model run with the same set of environmental and anthropogenic predictors plus a single integrity metric (in green): (a) human footprint; (b) ecoregion intactness; (c) biodiversity habitat index; (d) biodiversity intactness index; (e) contextual intactness; (f) wilderness. The importance of each variable is quantified as the MDA of models’ predictions caused by variable permutation. Accuracy measures the proportion of observations correctly classified out of the total observations in the dataset. Variables’ MDA scores within each model were normalized, so that the most influential variable took a value of 1, and the others were rescaled accordingly. Here, the boxplots show the minimum, first quartile, median, third quartile and maximum MDA scores across 100 replicate models.

    We employed partial dependence plots to display the relationship between EID risk and explanatory variables (figure 2; electronic supplementary material, figure S1). The partial dependence plot for human footprint showed the highest probability of EID events for values which correspond to rural areas subject to intermediate modification (HFP ~ 5), while the risk is low both for values indicating highly modified areas and for values corresponding to intact and near-intact lands (HFP <3). EID risk correlated with ecoregion intactness and biodiversity habitat index in a similar way, with the highest risk for intermediate values of integrity. For the remaining integrity metrics, the predicted probability showed little variation, indicating a weak influence on EID risk.

    Partial dependence plots showing the effect of integrity metrics on EID risk. Each plot represents how the risk (on the y-axis) changes given different values of each variable (on the x-axis).

    Figure 2. Partial dependence plots showing the effect of integrity metrics on EID risk. Each plot represents how the risk (on the y-axis) changes given different values of each variable (on the x-axis): (a) human footprint; (b) ecoregion intactness; (c) biodiversity habitat index; (d) biodiversity intactness index; (e) contextual intactness; (f) wilderness. Black lines show the median and coloured areas show the 90% confidence intervals, computed using a bootstrap resampling regime incorporating uncertainty in EID event locations. Below the plots, we report the density distribution of each biodiversity metric upon which the model was built.

    We implemented bivariate partial dependence plots to display how the interaction between integrity metrics and forest cover affects EID risk (figure 3). For all integrity metrics, the highest risk of EID was observed at intermediate values of forest cover and integrity values, namely: human footprint values of ∼5, ecoregion intactness around 0.6, and biodiversity habitat index around 0.9. While the predicted probability for wilderness areas exhibited minimal variation between wilderness (indicated as 1) and non-wilderness areas (indicated as 0), it is noteworthy that EID risk consistently remained lower in wilderness areas.

    Bivariate partial dependence plots showing the effect on EID risk of the interaction between tropical rainforest cover and integrity metrics.

    Figure 3. Bivariate partial dependence plots showing the effect on EID risk of the interaction between tropical rainforest cover and integrity metrics: (a) human footprint; (b) ecoregion intactness; (c) biodiversity habitat index; (d) biodiversity intactness index; (e) contextual intactness; (f) wilderness. EID probability has been rescaled by its minimum and maximum to allow comparison across integrity metrics.

    4. Discussion

    We developed spatial models to examine how the risk of zoonotic EID changes in response to different metrics of ecological integrity. Our approach allowed us to evaluate the predictive power of these metrics in assessing disease risk and to investigate the intensity and direction of their relationship with EID event probability. This research explores various ecosystem-level metrics, extending beyond the traditional focus on mammal species richness alone. We showed that certain aspects of ecological integrity are important for predicting EID risk, both individually and in interaction with levels of forest cover.

    Models consistently performed well in recognizing presence and background points, with high specificity and sensitivity, especially in random 10-fold cross-validation, but showed lower performance in spatial cross-validation. The differences in model accuracy resulting from the two validation methods are in line with those expected [50] because random cross-validation often suffers from the dependence structure of the ecological data, since when validation data are randomly selected from the entire spatial domain, training and validation data from nearby locations will be dependent, and this spatial autocorrelation results in an overoptimistic assessment of models’ performance.

    Environmental variables were the features that better characterized EID risk, with evergreen broadleaf trees as the strongest predictor of the distribution of EID events, followed by global environmental stratification and mammal species richness. This pattern was already identified in the original Allen et al. model [5], and we confirmed it here with models which include integrity metrics and more recent disease events. Given the high biodiversity levels of tropical forests [51], it is possible that these variables represent a similar process. This trend is consistent with existing theories which suggest that greater host biodiversity expands the pool of available pathogens, thus increasing the probability for novel zoonotic pathogens to emerge [13].

    Among integrity metrics, human footprint and ecoregion intactness—respectively representing human pressure and its effects on ecosystems’ structure—were especially important in shaping EID risk. The biodiversity habitat index, which measures the effects of anthropogenic disturbances on ecosystems’ biological composition, also plays an important role in predicting risk. EID risk was high for values of ecological integrity that indicate moderate levels of anthropogenic alteration. These patterns remain consistent regardless of the extent of forest cover, suggesting that human activities altering ecologically intact and nearly intact areas pose a more significant risk of zoonotic disease emergence compared with intensification of activities within areas that have already undergone human modification [13].

    The human footprint index was the most important metric in predicting zoonotic EID risk. A low risk of EID was associated both with very high values (>30), and low values of HFP (<3) which correspond to heavily impacted areas and intact or near-intact lands, respectively [29]. A HFP threshold of 3 has already proved to be an important predictor of transitions in extinction risk for terrestrial mammals [52,53] suggesting that the same drivers of extinction risk, such as habitat loss, degradation and fragmentation, are also associated with increased zoonotic risk. In fact, the probability of EID events is lower in wilderness areas compared with non-wilderness areas, and this pattern occurs for each level of forest cover. At the same time, highly modified areas are less susceptible to EID risk from wildlife, as very few species are able to live in these areas, leading to a much lower diversity of pathogens. The highest risk was associated with HFP values around 5. Values of HFP between 3 and 5 indicate moderately modified rural areas subject to intermediate levels of human pressures and land conversion. HFP greater than 3 corresponds to a level of human pressure comparable to that found in pasture lands, while an HFP of 7 or greater is considered equivalent to intensive agriculture [29]. The relationship between human footprint and EID risk emerging from this analysis is consistent with previous findings about the role of human presence and anthropogenic land-use changes in increasing EID risk. Importantly, human footprint was identified as a key predictor of epidemic cases of major vector-borne diseases such as dengue, chikungunya and Zika, owing to human-driven ecological changes that affect vector species distribution and disease incidence [54]. Rural areas are associated with agricultural expansion and intensification that promote EID risk in close proximity to natural highly diverse habitats [36,55]. Anthropogenic pressures have been shown to alter ecological systems and continue to expand into new areas, with land conversion mostly occurring in tropical rainforests [56]. Changes in land-use increase contact rates between humans and wildlife, disrupting natural disease dynamics [34,36]. These changes often alter species diversity and abundance within ecological communities, affecting pathogens’ transmission rates and exposing novel hosts to infection [13,35]. Moreover, the expansion and intensification of livestock production bring domestic animals in close proximity to wildlife habitats, creating a pathway for the transmission of zoonotic pathogens from wildlife species to farming communities. High livestock population density, coupled with poor genetic diversity and health conditions, can increase the probability of zoonotic spillover to humans by facilitating pathogen shedding and transmission within livestock, leading to an ‘amplification effect’ [57]. Land conversion is also associated with predictable changes in the local diversity and taxonomic composition of known wildlife hosts of zoonotic pathogens, probably mediated by covariance between traits that influence both host status and tolerance to humans [7]. This could be explained by some life-history features such as small body size, short lifespans and fast reproduction [58]. These features enable some wildlife species to cope with human disturbance, but at the same time, make them more prone to acquire and transmit pathogens owing to low energetic investment in adaptive immunity [59,60]. Furthermore, when compared with natural habitats, assemblages in areas under land-use change have more widespread species on average, especially in tropical latitudes [8]. In fact, land-use changes can increase the abundance of selected species that have habitat and dietary niches which overlap with humans enabling direct and indirect contact with similarly adapted sympatric species, domesticated species and humans [61]. Johnson et al. [62] suggest that spillover risk is higher from animal species adapted to human-dominated landscapes that have increased in abundance and expanded their range following land-use change. Additionally, their research found that less common wildlife species, categorized with increasingly threatened status by the IUCN Red List, host significantly fewer viruses shared with humans when compared with widespread and abundant wild mammalian species. These trends of biological homogenization occurring in ecosystems under anthropogenic transformation may amplify disease risk in human-modified lands when widespread, generalist and synanthropic species are also more competent hosts for pathogens [7].

    Along with human footprint, other integrity metrics resulted to be important in predicting EID risk: ecoregion intactness, which accounts for the extent of habitat degradation, fragmentation and the loss of habitat quality caused by human pressures, and biodiversity habitat index, which quantifies the degree of biodiversity intactness in terms of persistence of species diversity. Once again, values representing intermediate levels of human pressures—approximately 0.67 for ecoregion intactness and approximately 0.9 for biodiversity habitat index, according to the original sources [25,28]—are associated with an increase in risk. A consequence of the processes of land conversion is the transformation of contiguous natural habitats into smaller, discrete remnant patches embedded in a matrix of human-modified land. The resulting changes in edge density may have cascading ecological effects that influence resource availability, population carrying capacities, species persistence and community composition, as well as increase interspecies contact rates influencing how pathogens are transmitted within and between species. As demonstrated by Faust et al. [55], intermediate levels of habitat loss correspond to the maximum edge density, while at high levels of habitat conversion edge density declines, reducing spillover risk. Intermediate values of the biodiversity habitat index indicate an environment where moderate habitat changes can lead to altered population dynamics among wildlife species, creating ecological conditions that can facilitate pathogen transmission among species. The loss of diversity and changes in species composition can increase the risk of EIDs, especially when it involves animal species that can play a key role in pathogen transmission [63]. Biodiversity loss owing to habitat loss, fragmentation and degradation is size-selective, especially in tropical forests, and the species most likely to disappear are large-bodied species [64], while smaller-bodied species tend to increase in abundance [63]. Smaller-bodied species, such as rodents, are more likely to be competent hosts for many pathogens [59] and several disease systems, such as Lyme disease [40], showed a correlation between large wildlife loss, fluctuations in the abundance of susceptible hosts and increased disease risk [65]. The positive but weak correlation between wilderness and EID risk could derive from an oversimplification of habitat conditions, as both near-intact and impact areas are classified as ‘non-wilderness’ in this case. Contextual intactness also revealed a weak influence on zoonotic risk. This could be associated with the complexity of that metric, which accounts for both habitat conditions and patterns of compositional dissimilarity [27]. In fact, locations in very different environmental conditions might exhibit similar contextual intactness values, if both are found at the edge of more modified areas while determining different EID risk levels.

    This study acknowledges significant limitations. First, data on disease emergence events are constrained because the primary source, the EIDR database [18], was last updated in 2013, omitting events since then. Additionally, many spillover events, particularly those associated with mild or non-specific symptoms and no human-to-human transmission, often go undetected or unreported and the specificity of geographic information for these events varies widely, ranging from precise coordinates to broader regions. Furthermore, we acknowledge the presence of representation bias in our analysis, as reports of emerging diseases are mostly from developed countries [3], while surveillance and spillover detection are lacking in rural and low-income regions. To address this bias, we incorporated a reporting effort index derived from Allen et al. [5] which accounts for the varying likelihood of disease events being reported in the scientific literature. The response variable includes all zoonotic EID events originating from wildlife reported since 1970, regardless of pathogen type or transmission route (electronic supplementary material, figure S2). This introduces a potential limitation as the effects associated with biodiversity metrics may indirectly reflect variations in pathogen biology and ecology, assuming that not all diseases respond to biodiversity in the same way [54]. In fact, the degree to which biodiversity affects disease risk may depend on several factors related to pathogens: host-specialist diseases that are directly transmitted, without free-living stages, intermediate hosts or vectors are less likely to respond to changes in biodiversity, compared with vector-borne and multi-host pathogens with complex life cycles or free-living infectious stages. Additionally, the influence of biodiversity on disease risk is contingent on various pathogen-related factors, such as transmission type, with density-dependent pathogens responding differently from frequency-dependent ones [55,66]. Pathogens may have multiple transmission routes, and their primary pathway can vary depending on geographic region and host population, increasing uncertainty in large-scale global analyses [67]. It is also possible that at larger spatial scales, models might be unable to capture the mechanisms underlying the relationship between biodiversity metrics and EID risk. Johnson et al. [68] suggested that this relationship varies with scale, being most pronounced locally where species interactions influence it, and weaker at larger scales where climate and abiotic factors take over as dominant drivers. The spatial mismatch between the data of zoonotic spillover events and the explanatory variables also presents a challenge. For instance, the biodiversity intactness index and the contextual intactness variables have a fine resolution reflecting the spatial compositional turnover at the local ecological scale of 1 km. However, when aggregated to a coarse 1° resolution, these variables might lose valuable information, potentially reducing their importance within risk models. Hence, interpreting these results in absolute terms requires caution, as the predictive power and effect of these variables may vary when assessed on a different, finer scale.

    5. Conclusion

    We showed how metrics that represent different dimensions of ecological integrity affect EID risk, providing evidence that goes beyond simple biodiversity metrics (such as species richness) typically used before. By considering these factors, the study offers a broader picture of how anthropogenic pressures influence ecosystem structure and composition, subsequently affecting zoonotic risk. Our findings reveal that the highest EID risk occurs at the initial stages of ecosystem degradation, as demonstrated by human footprint, ecoregion integrity and biodiversity habitat index. This suggests that altering areas with high structural and compositional integrity may result in an immediate increase in EID risk. Another important finding emerging from this study is the complex relationship between tropical forest cover and disease risk. We found the assumption that high levels of forest density directly result in high risk is an oversimplification. Our results clearly illustrate this complexity, showing that risk levels increase when, under conditions of consistent forest density, intermediate levels of compositional or structural integrity are present. This study underscores the need to expand our perspective on biodiversity metrics beyond species richness alone. While species diversity is important, it is not the only factor influencing EID risk. Among the integrity variables we examined, the human footprint emerged as a highly predictive factor indicating the importance of the cumulative environmental impact of anthropogenic pressures on zoonotic risk predictions [54].

    Empirical findings from our study underscore the importance of preserving ecosystems with high ecological integrity. These areas not only play a crucial role in conserving biodiversity but also contribute to the reduction of zoonotic disease risk, aligning with the targets outlined in the Kunming–Montreal Global Biodiversity Framework [14]. The relationship found between human footprint and EID risk supports the Framework’s objectives and emphasizes the need for integrated efforts to safeguard both ecosystem integrity and public health in the face of global environmental changes. Future research should continue to explore the complex connections between biodiversity conservation, ecological integrity and disease risk, providing a robust empirical foundation for the effective implementation of the Framework’s goals [17].

    Ethics

    This work did not require ethical approval from a human subject or animal welfare committee.

    Data accessibility

    The datasets generated and/or analysed during the current study along with the essential code employed for statistical modelling, including the final modelled dataset, are available from the Zenodo Repository [69].

    Data is available in the electronic supplementary material [70].

    Declaration of AI use

    We have not used AI-assisted technologies in creating this article.

    Authors’ contributions

    L.M.: conceptualization, formal analysis, methodology, project administration, writing—original draft; A.T.: writing—original draft, writing—review and editing; M.D.M.: conceptualization, methodology, supervision, writing—review and editing.

    All authors gave final approval for publication and agreed to be held accountable for the work performed therein.

    Conflict of interest declaration

    We declare we have no competing interests.

    Funding

    This research was supported by EU funding within the NextGeneration EU-MUR PNRR Extended Partnership initiative on emerging infectious diseases (project no. PE00000007, INF-ACT Spoke4).

    Footnotes

    Electronic supplementary material is available online at https://doi.org/10.6084/m9.figshare.c.7214492.

    Published by the Royal Society. All rights reserved.