Modeling and Mapping the Probability of Occurrence of Invasive Wild Pigs across the Contiguous United States

  • Meredith L. McClure,

    Affiliation Conservation Science Partners, Truckee, California, United States of America

  • Christopher L. Burdett,

    Affiliation Department of Biology, Colorado State University, Fort Collins, Colorado, United States of America

  • Matthew L. Farnsworth,

    Affiliation Conservation Science Partners, Truckee, California, United States of America

  • Mark W. Lutman,

    Affiliation National Wildlife Disease Program, Animal and Plant Health Inspection Service, United States Department of Agriculture, Fort Collins, Colorado, United States of America

  • David M. Theobald,

    Affiliation Conservation Science Partners, Truckee, California, United States of America

  • Philip D. Riggs,

    Affiliation Center for Epidemiology and Animal Health, Animal and Plant Health Inspection Service, United States Department of Agriculture, Fort Collins, Colorado, United States of America

  • Daniel A. Grear,

    Affiliation Center for Epidemiology and Animal Health, Animal and Plant Health Inspection Service, United States Department of Agriculture, Fort Collins, Colorado, United States of America

  • Ryan S. Miller

    Affiliation Center for Epidemiology and Animal Health, Animal and Plant Health Inspection Service, United States Department of Agriculture, Fort Collins, Colorado, United States of America


Wild pigs (Sus scrofa), also known as wild swine, feral pigs, or feral hogs, are one of the most widespread and successful invasive species around the world. Wild pigs have been linked to extensive and costly agricultural damage and present a serious threat to plant and animal communities due to their rooting behavior and omnivorous diet. We modeled the current distribution of wild pigs in the United States to better understand the physiological and ecological factors that may determine their invasive potential and to guide future study and eradication efforts. Using national-scale wild pig occurrence data reported between 1982 and 2012 by wildlife management professionals, we estimated the probability of wild pig occurrence across the United States using a logistic discrimination function and environmental covariates hypothesized to influence the distribution of the species. Our results suggest the distribution of wild pigs in the U.S. was most strongly limited by cold temperatures and availability of water, and that they were most likely to occur where potential home ranges had higher habitat heterogeneity, providing access to multiple key resources including water, forage, and cover. High probability of occurrence was also associated with frequent high temperatures, up to a high threshold. However, this pattern is driven by pigs’ historic distribution in warm climates of the southern U.S. Further study of pigs’ ability to persist in cold northern climates is needed to better understand whether low temperatures actually limit their distribution. Our model highlights areas at risk of invasion as those with habitat conditions similar to those found in pigs’ current range that are also near current populations. This study provides a macro-scale approach to generalist species distribution modeling that is applicable to other generalist and invasive species.


Globally, invasive species are inflicting increasing amounts of economic and environmental damage on agricultural and ecological systems [1,2]. In the United States (U.S.), the economic cost of invasive species has been estimated to be $120 billion per year [3]. Much of this cost represents losses incurred by agricultural industry. Conflicts between both native and non-native wildlife species and agriculture are increasingly challenging the ability of institutions to mitigate their negative consequences [4,5]. Invasive species are also one of the most serious threats to biodiversity conservation [6] and have been identified as the primary factor threatening approximately 42% of all species of conservation concern in the U.S. [3]. The threats that invasive species pose to both agricultural and ecological systems may continue to increase in future decades along with the increased globalization of commerce [7,8].

Vertebrates are particularly successful invaders. Jeschke and Strayer [9] found that approximately 50% of introduced vertebrate species exchanged between North America and Europe successfully establish, and 50% of those successfully spread. In the U.S., at least 30 species of exotic free-ranging mammals have become established since European colonization [10,11]. These species often become serious pests that negatively impact native species and their environments [11,12]. Large mammals, such as ungulates, are particularly successful invaders due to their intelligence (i.e., large brain sizes), irruptive population dynamics, and declining abundance of predators [13–17].

Wild pigs (Sus scrofa), the ungulate species that includes feral and domestic pigs (S. s. domesticus), several subspecies of the wild boar (S. s. spp.) [18], and hybrids, are one of the world’s most widely distributed mammals [19]. The native range of S. scrofa is Eurasia and Northern Africa [20], thus wild pigs are an invasive species throughout much of their current geographic range. In North America, wild pigs were first introduced in the 14th and 15th centuries by Spanish explorers in the southern U.S. Wild pigs have recently expanded their North American range to include at least 38 states and three provinces in Canada [21,22]. Wild pig populations continue to increase due to their reproductive capacity, adaptability to novel environments, and intentional or accidental introduction by humans [21,23,24].

The range expansion of wild pigs has resulted in substantial impacts to agricultural production, human food safety, ecosystems, and threatened species in the U.S. There is a well-established association of wild pigs with agricultural and environmental damage, though precise estimates of the economic cost of wild pig damage are limited [21,25]. Wild pigs also present considerable risks to human health through environmental contamination of water and agricultural crops [26] or through direct human exposure to bacterial, viral, or parasitic pathogens [27,28]. The rooting behavior and omnivorous diet of pigs can have dramatic ecosystem-level effects on soil properties as well as plant and animal communities [20,29].

Despite their considerable impacts on agricultural and ecological systems, little is known about the environmental factors that most strongly influence the broad-scale distribution and spread of the species in the U.S. In this context, the principal objectives of this study were to evaluate physiological and environmental factors associated with the current distribution of wild swine in the contiguous U.S., and to use these factors to predict where pigs may be most likely to occur (and potentially establish and spread) beyond their current documented U.S. range.


Swine distribution data

The National Feral Swine Mapping System [30], collected and maintained by Southeastern Cooperative Wildlife Disease Study (SCWDS), describes the distribution of wild pigs across the lower forty-eight United States. This spatially explicit dataset, compiled at variable intervals from 1982 to 2004 and annually since 2008, consists of polygons describing the known geographic extent of established wild pig populations that have been present for two or more years and have evidence of reproduction. Data are reported nationally from wildlife professionals in state wildlife resources agencies and the United States Department of Agriculture via manual drawing of polygons using topographical maps to reference areas where pigs have been observed.

We aggregated the SCWDS data to watersheds described by the United States Geological Survey's (USGS) Hydrologic Unit Codes (HUC) database (HUC10; mean area of 512 ± 255 km²) [31]. Aggregation of the original polygon data to discrete sampling units was necessary because drawn polygons varied greatly in size and detail (e.g., virtually all of Texas and most of California were each encompassed by single large polygons), and did not represent consistent, comparable sampling units. We chose watersheds as our sampling unit because they are ecologically relevant landscape-level sampling units for large scale studies [32–34], and have been used to model the occurrence of other species [35,36]. Furthermore, watersheds are expected to represent a more discrete set of biotic and abiotic factors and thus serve as a more ecologically relevant unit for aggregating covariates than an arbitrary rectangular grid. We chose a watershed size (HUC10) that was much larger than the mean home range size estimated for wild pigs in the U.S. (4.92 ± 6.37 km²; S1 Table) and thus was expected to be capable of encompassing an entire population of pigs. To aggregate the SCWDS data to the watershed level (Fig 1), we used two criteria to decide whether or not to assign pig presence to a watershed: (a) the area of a SCWDS pig population (polygon) had to be greater than 13 km² (5 mi²), or approximately three times the national mean home range size, and (b) the proportion of each watershed occupied by a given population had to be greater than 2.5%. The first criterion ensures that the occupied portion of a watershed is large enough to support multiple wild pig home ranges, and the second criterion ensures that the occupied portion of large watersheds is large enough relative to total size for covariate values to be meaningfully linked to wild pig presence.
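The two presence criteria described above can be sketched in a few lines of Python. This is only an illustration of the stated thresholds: the function and argument names are hypothetical, and the sketch assumes that polygon, overlap, and watershed areas have already been computed in a GIS.

```python
# Thresholds come from the text; all names below are illustrative assumptions.
MIN_POPULATION_AREA_KM2 = 13.0     # roughly three mean home ranges
MIN_WATERSHED_PROPORTION = 0.025   # 2.5% of the watershed


def watershed_is_occupied(population_area_km2, overlap_area_km2, watershed_area_km2):
    """Return True if a mapped population assigns presence to a watershed.

    population_area_km2: total area of the mapped population polygon
    overlap_area_km2:    part of the watershed covered by that polygon
    watershed_area_km2:  total area of the watershed
    """
    if watershed_area_km2 <= 0:
        raise ValueError("watershed area must be positive")
    # Criterion (a): the population must be large enough to hold several home ranges.
    large_enough = population_area_km2 > MIN_POPULATION_AREA_KM2
    # Criterion (b): the population must cover a meaningful share of the watershed.
    proportion = overlap_area_km2 / watershed_area_km2
    return large_enough and proportion > MIN_WATERSHED_PROPORTION
```

For example, a 40 km² population that covers 20 km² of a 512 km² watershed passes both tests, whereas the same population covering only 5 km² of that watershed fails the second.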

Fig 1. Spread of wild pigs in the contiguous United States.

This map illustrates cumulative documented occurrence of wild pigs from 1982 to 2012 based on Southeastern Cooperative Wildlife Disease Study (SCWDS) records aggregated to watersheds (Hydrologic Unit Code 10). Areas occupied by wild pigs in a given year continue to be occupied in later years, with rare exception.

In addition to spreading locally through population growth and natural dispersal, wild pigs are occasionally introduced to novel locations by humans for hunting. Because patterns generated by anthropogenic spread are not driven by all of the same ecological factors influencing natural spread, we excluded likely introductions from the distribution data. Based on published estimates of annual dispersal capabilities and the distribution of observed distances between newly occupied watersheds and watersheds occupied in the previous year, we excluded new pig populations that were highly likely to have been introduced by humans from further analysis (4.4% of all records). Note that if a watershed that was deemed to harbor an introduced population continued to be occupied in subsequent years or if adjacent watersheds were later reported as occupied, these occurrences were included in analyses.

Model Covariates

We identified physiological constraints and ecological requirements that we hypothesized may influence the observed and potential occurrence of feral swine across the contiguous U.S. We then identified covariates that best represented these factors, which include physiological limits imposed by temperature, access to water, and thermal cover, and ecological requirements for forage and protective cover (Table 1). We used a geographic information system (GIS) to derive spatial data layers for all covariates at the local watershed (HUC10) level across the contiguous U.S. from publicly available national-scale datasets (Fig 2), then standardized all covariates prior to model fitting (data are available: doi:10.5061/dryad.vt46n).

Fig 2. Occurrence model covariates.

Mapped covariate layers used to model wild pig occurrence probability across the contiguous United States. All covariate values are depicted using a quantile classification.

Pigs are known to have physiological characteristics making them sensitive to both high and low temperatures [37–39]. Porter and Gates [40] (1969) report that pig mortality results from exposure to full sun when ambient temperatures exceed 23°C and exposure to partial sun when ambient temperatures exceed 35°C. The Swine Care Handbook [41] recommends that cooling be provided when temperatures exceed 35°C for domestic pigs of most growth stages and that supplemental heat should be provided to juvenile pigs when temperatures fall below -4°C. We derived the cumulative number of days above 35°C and below -4°C for each watershed in a given year from National Oceanic and Atmospheric Administration (NOAA) weather station data. We identified weather stations within 250 km of each watershed centroid (up to 10 closest stations), then calculated the number of days each station had an observed maximum temperature above 35°C and the number of days with an observed minimum temperature below -4°C. We then adjusted for the difference in elevation between each weather station and the watershed centroid using the average adiabatic lapse rate temperature correction [42], applying a change in temperature (ΔT) of 6.49°C for every 1000 meters of elevation gained or lost between the weather station location and the watershed centroid. We averaged across the selected weather stations and over 30 years of observations, or all years in a 30-year period for which data were available.
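As a rough illustration of the elevation adjustment and day counts described above, the following Python sketch shifts station temperatures to the watershed centroid elevation before counting threshold days. The function names are hypothetical, and the order of operations (adjusting daily temperatures first, then counting) is an assumption; the text does not specify exactly how the correction was combined with the counts.

```python
# Average lapse rate from the text: 6.49 degrees C per 1000 m of elevation.
LAPSE_RATE_C_PER_M = 6.49 / 1000.0


def adjust_to_centroid(temp_c, station_elev_m, centroid_elev_m):
    """Shift a station temperature to the watershed centroid elevation.

    Temperature drops as elevation rises, so a centroid above the station
    receives a cooler estimate and a centroid below it a warmer one.
    """
    return temp_c - LAPSE_RATE_C_PER_M * (centroid_elev_m - station_elev_m)


def count_threshold_days(daily_max_c, daily_min_c, station_elev_m, centroid_elev_m,
                         hot_c=35.0, cold_c=-4.0):
    """Count days above hot_c and below cold_c after elevation adjustment."""
    hot_days = sum(
        1 for t in daily_max_c
        if adjust_to_centroid(t, station_elev_m, centroid_elev_m) > hot_c
    )
    cold_days = sum(
        1 for t in daily_min_c
        if adjust_to_centroid(t, station_elev_m, centroid_elev_m) < cold_c
    )
    return hot_days, cold_days
```

Per-station counts obtained this way would then be averaged across the selected stations and across years to give one value per watershed.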

Wild pig survival and reproductive success at low temperatures in natural environments is expected to be influenced by snow presence and depth [39,43,44]. Mean snow depth was estimated from the Snow Data Assimilation System (SNODAS), which integrates snow data from satellite platforms, airborne platforms, ground stations, and models to estimate snow cover and depth [45]. Using estimates from April 1, which is assumed by most resource managers to be the date closest to maximum snow accumulation in temperate latitudes of the northern hemisphere [45], we calculated the average maximum snow depth over 10 years, then averaged within each watershed.

Wild pigs thermo-regulate by accessing shade and water resources [46,47], and restricted access to water is known to cause increased piglet mortality [48]. Mean distance to water was derived from the National Hydrography Dataset Plus (NHDPlus) [49]. First, streams with very low average annual flow (< 3 cubic feet per second) were removed to exclude ephemeral water sources. Distances from remaining stream features and water body perimeters were then measured and summarized by watershed to yield the average distance to the nearest water source from any grid cell within each watershed. Forest canopies also offer important thermal as well as protective cover for wild pigs [44,46,50]. Availability of forest cover was derived from the 2006 National Land Cover Dataset by calculating the percent area of each watershed classified as deciduous forest, coniferous forest, mixed forest, or woody wetlands cover.

Wild pigs are known to be a highly adaptable generalist species in terms of their dietary range [51]. We identified two major forage classes typically available to wild swine at the national scale: crops [37,39] and hard mast (i.e., acorns and other nuts) [37,44,52]. Crop cover was derived from the USDA National Agricultural Statistics Service Cropland Data Layer (2012). All crop types were included as potential forage resources given that wild pigs are known to consume a diverse array of crops, as well as insects and other food sources associated with crops [51]. Mast-producing cover was derived from the USGS Gap Analysis Program (GAP) National Land Cover dataset v2 (2011). We screened cover class names and descriptions to identify those dominated by or containing a significant presence of hard mast-producing tree or shrub species, including oak (Quercus spp.), hickory (Carya spp.), chestnut (Castanea spp.), walnut (Juglans spp.), beech (Fagus spp.), birch (Betula spp.), maple (Acer spp.), elm (Ulmus spp.), and ash (Fraxinus spp.). We calculated the proportion of area within each watershed classified as crop or mast-producing cover as an index of forage availability.

Finally, we derived an index of habitat heterogeneity representing the availability of all three of the resources that meet wild pigs’ key ecological and physiological requirements–water, cover, and forage–within the average group home range area as estimated in previous studies (9 km2 based on 95–100% utilization distributions for mixed groups or sounder groups; S1 Table). By applying a moving window approach to raster maps of each habitat component, we calculated the number of components present within the area of an average sounder range centered at each focal cell. We then averaged these counts across each watershed to generate a continuous index (0–3) of habitat heterogeneity.
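The moving-window calculation described above might look like the following minimal Python sketch. It assumes each habitat component is a binary raster stored as a list of rows, and it uses a square window defined by a cell radius as a stand-in for the average sounder range; the actual window shape, cell size, and function names are assumptions, not details from the study.

```python
def heterogeneity_index(water, cover, forage, radius):
    """For each focal cell, count how many of the three resources
    (water, cover, forage) occur anywhere within a square window.

    Each input is a binary raster (list of equal-length rows); the result
    is a raster of integer counts from 0 to 3.
    """
    layers = (water, cover, forage)
    n_rows, n_cols = len(water), len(water[0])
    index = [[0] * n_cols for _ in range(n_rows)]
    for r in range(n_rows):
        for c in range(n_cols):
            # Clip the window at the raster edges.
            r0, r1 = max(0, r - radius), min(n_rows, r + radius + 1)
            c0, c1 = max(0, c - radius), min(n_cols, c + radius + 1)
            index[r][c] = sum(
                1 for layer in layers
                if any(layer[i][j] for i in range(r0, r1) for j in range(c0, c1))
            )
    return index


def watershed_mean(index, cells):
    """Average the per-cell counts over the (row, col) cells of one watershed."""
    return sum(index[r][c] for r, c in cells) / len(cells)
```

Averaging the integer counts over all cells in a watershed yields the continuous 0 to 3 index described in the text.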

Model fitting

Within an information-theoretic framework [53,54], we used logistic regression and multi-model inference [54–56] to estimate a logistic discrimination function [57,58] representing the relative probability of occurrence of wild pigs. The function discriminates between watersheds where a species is present and random ‘background’ watersheds based on the distributions of covariates associated with each. This approach is similar to fitting a resource selection function (RSF) [59], but differs in that presence and background watersheds are sampled independently, allowing watersheds occurring in the presence sample to also occur in the background sample. The logistic discrimination function avoids the problematic assumption that background watersheds represent absences or ‘pseudo-absences’, but rather reflects the probability of species occurrence given the distribution of habitat covariates at presence watersheds, relative to background watersheds. While other more recent methods of estimating occurrence probability from presence-only data were considered (e.g., MaxEnt [60], MaxLike [61], scaled binomial loss (SBL) [62], presence-background learning algorithm (PBL) [63]), each has been shown to fail to estimate a quantity proportional to absolute probability of occurrence as estimated by more complete presence-absence data, either in general (MaxEnt [61,64–66]) or when required parametric assumptions regarding species prevalence (MaxLike [62,67]) or empirical estimates of prevalence (SBL, PBL) were not accurate. We instead chose a simpler approach with transparent interpretation that we expected to be more robust for this application.

The presence sample included all watersheds in which wild pigs were reported throughout the sampling period (1982–2012), except those identified as likely recent human introductions (N = 4459). We sampled background locations from all watersheds in the contiguous United States, including those with recorded presences. We selected a background sample size that was twice that of the presence sample and approximately half of all contiguous U.S. watersheds. Although selection of the background sample size was arbitrary and a larger sample could have been selected, this approach avoided both inflating the degrees of freedom in our model and excessive overlap of presence and background samples, though the logistic discrimination model is robust to sample overlap [57].

Our global model included the entire suite of covariate (j) linear terms, along with a quadratic term for number of days above 35°C because we hypothesized increased probability of occurrence in warmer climates up to a threshold beyond which additional hot days would be detrimental to pigs. We tested for collinearity by calculating pairwise Pearson correlations and variance inflation factors, but no terms exceeded cutoff values of 0.7 or 10.0, respectively [68,69], and thus no exclusion of terms from the model was necessary.
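A collinearity screen of the kind described above can be sketched in plain Python. The sketch computes pairwise Pearson correlations and obtains variance inflation factors from the diagonal of the inverse correlation matrix, which is a standard identity; the function names are illustrative, and this is not the authors' code.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_matrix(columns):
    """Pairwise Pearson correlations among covariate columns."""
    return [[pearson(a, b) for b in columns] for a in columns]


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("correlation matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def variance_inflation_factors(columns):
    """VIF for each covariate: the diagonal of the inverse correlation matrix."""
    inverse = invert(correlation_matrix(columns))
    return [inverse[j][j] for j in range(len(columns))]
```

Covariate pairs with an absolute correlation above 0.7, or covariates with a VIF above 10.0, would be flagged under the cutoffs cited in the text.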

We used all-subsets model averaging and multi-model inference to arrive at a final predictive logistic discrimination function. Rather than base inferences and prediction on a single, selected ‘best’ model from an a priori set of models, more robust inference can be based on the entire set of models considered [54,55]. Model averaging across all model subsets produces parameter and error estimates that are not conditional on any one model but are instead informed by the entire model set [54,55,70]. This is particularly advantageous when several models have similar weights of evidence, or probability of being the ‘best’ model [56]. Averaging over all possible subsets of a global model is recommended over selection of candidate model sets when the aim is to produce a model averaged predictive model, provided there is strong support for inclusion of each covariate in the global model to avoid a ‘fishing expedition’ [70]. The superiority of model-averaged inferences compared to a traditional ‘best’ model selection strategy has been demonstrated repeatedly (e.g., [55,71,72]).

We used the ‘dredge’ and ‘model.avg’ functions in the MuMIn package [73] for R [74] to fit all additive subsets of the global model and compute model-averaged regression coefficients, unconditional standard errors (SEs), cumulative AIC weights of evidence as a measure of variable importance [54–56], and 95% confidence intervals [54,55]. We used a shrinkage estimation approach to produce unconditional model averaged parameter estimates, in which covariates that did not appear in a particular model subset were assigned coefficients of zero to avoid biasing coefficient estimates away from zero [54,75]. Our interpretation of the explanatory power of the regression coefficients in our model was guided by three measures: 1) the weights of evidence, ranging from 0 to 1.0, where higher weights indicated greater relative importance; 2) the 95% confidence interval for each regression coefficient; and 3) effect sizes indicated by each regression coefficient.
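The arithmetic behind the model-averaged coefficients and weights of evidence can be illustrated with a short Python sketch. It is not a substitute for the MuMIn functions named above: it assumes the candidate models have already been fitted and that each is summarized by an information criterion value (AIC or AICc) and a dictionary of coefficients, both hypothetical inputs.

```python
import math


def akaike_weights(ic_values):
    """Convert information criterion values (AIC or AICc) to model weights."""
    best = min(ic_values)
    relative = [math.exp(-0.5 * (v - best)) for v in ic_values]
    total = sum(relative)
    return [r / total for r in relative]


def model_average(models):
    """Shrinkage model averaging over a set of fitted models.

    models: list of dicts with keys 'ic' (criterion value) and
            'coef' (mapping of term name to estimated coefficient).
    Returns (averaged coefficients, cumulative weight of evidence per term).
    """
    weights = akaike_weights([m["ic"] for m in models])
    terms = sorted({t for m in models for t in m["coef"]})
    # Shrinkage: a term absent from a model contributes a coefficient of zero.
    averaged = {
        t: sum(w * m["coef"].get(t, 0.0) for w, m in zip(weights, models))
        for t in terms
    }
    # Variable importance: summed weight of all models containing the term.
    importance = {
        t: sum(w for w, m in zip(weights, models) if t in m["coef"])
        for t in terms
    }
    return averaged, importance
```

With two equally supported models, a term that appears in only one of them receives half of its estimate and a weight of evidence of 0.5, which shows how shrinkage pulls weakly supported coefficients toward zero.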

Model validation

Standard model validation metrics test discrimination between presence and absence locations and are thus not appropriate for testing the predictive performance of a model designed to discriminate between presence and background locations [76]. We instead used the “RSF plot index”, a variation of k-fold cross-validation designed for presence-only data, to assess proportionality of the relative probability of occurrence predicted by the model and the observed frequency of occurrence [76]. Based on Huberty’s rule [77], we first randomly divided the wild pig data among four cross-validation folds. We used each possible set of three folds to fit a predictive model, again employing multi-model averaging, which we then used to predict the fourth withheld fold. Results of 100 iterations of this process, each with a new random allocation of data across four cross-validation folds, were averaged to avoid dependency of validation results on a single random allocation of data across folds.

We binned predicted values from our cross validation results, then calculated a Pearson correlation between those values and the proportion of watersheds within each bin for which the species was recorded as present. Because validation results can be sensitive to binning method [76], we applied and compared both equal interval and quantile binning methods. Lastly, we assessed the performance of our final model using the Pearson correlation rather than the Spearman rank correlation as in Boyce et al. [76] because the former provides a more rigorous measure of the linear agreement between predicted probability of occurrence and observed frequency of occurrence.
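The binning and correlation step can be sketched as follows. This is a simplified illustration with hypothetical function names: it takes predicted values and 0/1 presence records for withheld watersheds, bins them by either method, and correlates the mean prediction in each bin (used here in place of bin midpoints) with the observed proportion of occupied watersheds.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def rsf_plot_correlation(predicted, present, n_bins=10, method="quantile"):
    """Correlate binned predictions with observed occupancy proportions."""
    pairs = sorted(zip(predicted, present))
    if method == "quantile":
        # Equal numbers of watersheds per bin (as nearly as possible).
        size = len(pairs) / n_bins
        bins = [pairs[round(i * size):round((i + 1) * size)] for i in range(n_bins)]
    elif method == "equal_interval":
        # Equal-width bins spanning the range of predicted values.
        low, high = pairs[0][0], pairs[-1][0]
        if high == low:
            raise ValueError("predicted values are constant")
        width = (high - low) / n_bins
        bins = [[] for _ in range(n_bins)]
        for p, y in pairs:
            k = min(int((p - low) / width), n_bins - 1)
            bins[k].append((p, y))
    else:
        raise ValueError("method must be 'quantile' or 'equal_interval'")
    bins = [b for b in bins if b]  # drop empty bins
    mean_predicted = [sum(p for p, _ in b) / len(b) for b in bins]
    observed = [sum(y for _, y in b) / len(b) for b in bins]
    return pearson(mean_predicted, observed)
```

In a cross-validation setting, this function would be applied to the predictions for each withheld fold and the resulting correlations averaged across folds and iterations.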


Based on the final inferential model (Table 2), the distribution of wild pigs in the contiguous U.S. was most strongly limited by frequent cold temperatures and the availability of water and is most strongly associated with frequent high temperatures and high habitat heterogeneity within a home range. Covariates representing each of these four factors had AICc weights of evidence of 1.0, indicating high importance.

Covariates representing frequency of cold temperatures and distance from water had significant negative model coefficients, indicating a decrease in the relative probability of wild pig occurrence with increasing average number of days below -4°C (standardized 95% CI: -2.9563 to -2.5413) and with increasing distance from water (standardized 95% CI: -0.6329 to -0.4259). In terms of effect sizes, for each additional day with observed minimum temperature below -4°C per year within a watershed, there was an estimated 15% decrease in the odds of wild pig occurrence, and for each 1 km increase in the average distance to water from a given location in a watershed, there was an estimated 27% decrease in the odds of wild pig occurrence.

Habitat heterogeneity had a significant positive model coefficient, indicating an increase in the relative probability of wild pig occurrence with increasing habitat heterogeneity (95% CI: 0.1045–0.2887). The availability of an additional heterogeneity component (i.e., water, forage, cover) within an average home range area increased the odds of wild pig occurrence by 32%.
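Effect sizes such as these follow from the usual conversion of a logistic coefficient to a percent change in odds. The sketch below shows that conversion under the assumption that a standardized coefficient is divided by the covariate's standard deviation to return it to original units; the function name and the example values are hypothetical, since the standard deviations used for standardization are not reported in the text.

```python
import math


def percent_change_in_odds(coef, unit_change=1.0, covariate_sd=1.0):
    """Percent change in the odds for a given change in a covariate.

    coef:         logistic regression coefficient
    unit_change:  change in the covariate, in original units
    covariate_sd: standard deviation used to standardize the covariate
                  (leave at 1.0 if coef is already in original units)
    """
    return (math.exp(coef * unit_change / covariate_sd) - 1.0) * 100.0
```

For instance, a coefficient of ln(1.32) per unit corresponds to a 32% increase in odds, and a coefficient of ln(0.85) per unit corresponds to a 15% decrease.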

Our final model contained a significant positive linear coefficient (95% CI: 0.5523–0.7229) and a significant negative quadratic coefficient (95% CI: -0.1431 to -0.0966) for average number of days above 35°C. Together, these coefficients indicated that the odds of wild pig occurrence increased with the number of days with maximum temperature above 35°C up to a peak at approximately 59 days per year, beyond which additional days with maximum temperature above 35°C reduced the odds of occurrence. This threshold occurs well above the observed mean of 14 days with maximum temperature above 35°C across the contiguous U.S. (SD = 22 days).

The linear terms of forest cover, forage availability, and snow depth had AICc weights of evidence of 0.6, 0.36, and 0.31, respectively, and did not have significant effects on the relative probability of wild pig occurrence based on 95% confidence intervals on estimated coefficients.

The top-ranked model had an AICc weight of 27% (given the candidate set) and was 4054 AICc units better (i.e., lower) than the null model, suggesting that the selected suite of covariates approximated the data well.

Cross-validation based on the RSF plot index indicated that the final model had strong predictive capacity (Fig 3). The quantile binning method (equal numbers of watersheds in each bin) produced a Pearson correlation of 0.989 between midpoints of predicted probability of occurrence values and observed proportions of occupied watersheds in each bin. Similarly, the equal interval binning method produced a Pearson correlation of 0.988, indicating low sensitivity of the cross-validation results to binning method.

Fig 3. Cross validation results.

Estimation of the predictive capacity of the wild pig occurrence model based on RSF plots using a) quantile and b) equal interval binning methods.

We used the exponential form of our final model to predict the relative probability of wild pig occurrence across the contiguous United States (Fig 4). As expected, high occurrence probabilities were predicted in the South, generally aligning well with known wild pig occurrence, while low probability of occurrence was predicted in cold regions (e.g., Rocky Mountains, Northern Great Plains) and in arid regions (e.g., desert regions of Nevada and inland southern California).

Fig 4. Predicted wild pig occurrence.

Predictive map of relative wild pig occurrence probability based on a logistic discrimination function relating Southeastern Cooperative Wildlife Disease Study (SCWDS) records collected from 1982 to 2012 with covariates representing ecological and physiological requirements, with actual reported distribution overlaid.


Validation results indicated that our model performed well in predicting high occurrence of wild pigs over much of the species’ current U.S. range. Perhaps of greatest interest and utility, the model also identified additional areas that are apparently unoccupied but may be capable of supporting wild pig populations (Fig 4). These areas, such as the Pacific Northwest, mid-Atlantic region, and Southwest, have climatic and habitat conditions similar to those within the species’ current U.S. range and are also near known populations. The mechanisms for wild pig dispersal in the United States are largely unknown, but human-mediated dispersal may be a key cause of population spread in recent decades [11]. Recent analyses indicated that genetic sources of wild pigs throughout their United States range include genes from historic wild pig populations in the Southeastern United States, as well as introduction of new sources [78]. The full range of genetic sources is not yet characterized but could include escape or intentional release of domestic pigs or farmed wild boar. Although our model does not account for dispersal, the high-risk areas we identified represent key geographic areas warranting surveillance to identify newly established wild pig populations, whether through translocations from existing populations or propagules from agricultural operations.

Temperature is a key limiting factor for many species, especially at higher latitudes [79], and is often used to predict species distributions [80]. There is a well-established morbidity response of pigs to both high and low temperatures in captive conditions. A review of 54 wild boar population density estimates from Eurasia found that low winter temperatures were associated with smaller populations [43]. Our results suggest that this physiological limitation may also be present for wild pigs in the contiguous U.S. However, the current data available for the known distribution of wild pigs is confounded with higher temperature regions where wild pigs have been present longest in the U.S., which was influenced by the original introductions in the Southeastern states. Hence, we suggest there is a need to collect more data in Northern regions of the U.S. and in Canada where wild pigs may be underreported due to historic absence and/or lower density.

Wild pigs are in the same taxonomic order (Cetartiodactyla) as other large mammalian herbivores and, despite their omnivorous food habits, vegetation still comprises most of their diet [51]. Fine-scale telemetry studies have found wild boar and feral pigs using a range of natural and anthropogenic habitats to access either food or cover [39,81,82], and we suspected pigs might show a similar positive relationship between presence and habitat heterogeneity as other large mammalian herbivores [83–86]. The response by ungulates to heterogeneity often differs based on the spatial scale at which heterogeneity is measured [87]. Most previous ungulate-heterogeneity studies evaluated responses from local (i.e., patch or home-range) to landscape scales and, interestingly, often found ungulates showed the strongest positive response to heterogeneity at the broader landscape scale [83,87]. By measuring heterogeneity at the scale of a home range and then summarizing these measurements across watersheds, our study helps extend ungulate-heterogeneity studies beyond landscape scales to evaluate whether heterogeneity influences their distribution at a near-continental scale.

The individual components of our habitat heterogeneity index had different influences on the occurrence of wild pigs (Table 2). For example, the probability of pig occurrence decreased with increasing average distance to water, as predicted based on pigs’ physiological dependence on behavioral thermoregulation. In contrast, forest cover and forage availability showed little association with the current distribution of wild pigs in the U.S. These results were somewhat surprising because food and cover are usually influential covariates in species distribution models [80]. We suspect their minimal influence here results from the generalist food and habitat affinities of wild pigs. However, the collective presence of food, cover, and water summarized as a metric of habitat heterogeneity at a landscape scale had a strong positive relationship with the distribution of wild pigs in the U.S. While this result may be strongly influenced by water availability, it still provides evidence that access to three critical resources of food, cover, and water is a critical aspect of landscapes currently supporting pig populations in the U.S.

Although wild pigs present a growing problem as an invasive species in North America, they also exemplify many of the characteristics that make generalist species successful invaders. Notably, their highly plastic diet [51] and adaptability to novel habitats [82] present challenges to identifying the drivers of distribution and spread of this generalist species. By taking a macro-scale (continental) view of the wild pig distribution in the U.S., our modeling approach confirmed that wild pigs are extreme generalists. We not only quantified a wide range of habitats in which pigs had a high relative probability of occurrence, but also identified regions that are within the environmental conditions known to support pig populations in the U.S. but are apparently unoccupied. Such insights would not be revealed with local-scale data. We recommend that, whenever feasible, similar macro-scale approaches to generalist species distribution modeling be pursued to capture the extent of conditions that support a species population and to generate hypotheses about species limitations or invasion potential that can be tested in combination with finer-scale research.

Our model identified some areas with long-established populations that had only moderate predicted occurrence probabilities (e.g., portions of Florida). This is largely due to the strong influence of temperature on the model and the overall positive relationship between high temperatures and occurrence driven by pigs’ historical distribution in the U.S. As a result, coastal areas with mild temperatures relative to nearby inland areas are predicted to be less likely to harbor pigs despite otherwise favorable conditions and historic pig presence. Furthermore, occurrence models are not designed to identify a threshold probability below which pigs are not expected to occur. Most areas with moderate occurrence probabilities are expected to be highly suitable for pigs given their generalist nature and capacity for adaptation to a variety of conditions. Even areas with very low occurrence probabilities (e.g., northern states) may be capable of supporting pig populations, though perhaps with lower establishment success or at lower densities than in more favorable areas [22].


In this paper, we identified the areas currently occupied by wild pigs in the contiguous United States and, more importantly, predicted those areas that would most likely support pigs if colonized in the future. Areas predicted to be highly suitable for pigs that are not currently occupied but that are near wild pigs’ current range may be particularly at risk of invasion. We suggest that this information, particularly when coupled with spatial patterns of agricultural production, biodiversity indices, or the distribution of species and habitats that may be sensitive to the impacts of pigs, can help guide prioritization of wild pig management practices so as to minimize the impacts of spreading pig populations on agricultural and ecological systems.
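The screening logic described above (high predicted suitability, currently unoccupied, and close to the current range) can be sketched as a simple filter. The snippet below is a minimal illustration only: the `Watershed` fields, the planar coordinates, and the cutoffs `min_prob` and `max_distance_km` are hypothetical placeholders, and the occurrence model itself does not define any probability threshold for suitability.

```python
from dataclasses import dataclass
from math import hypot
from typing import List


@dataclass(frozen=True)
class Watershed:
    """Hypothetical watershed record; coordinates are planar centroids in km."""
    name: str
    x_km: float
    y_km: float
    occurrence_prob: float  # modeled probability of occurrence, 0 to 1
    occupied: bool          # True if wild pigs are currently reported


def at_risk(watersheds: List[Watershed],
            min_prob: float = 0.5,
            max_distance_km: float = 100.0) -> List[str]:
    """Names of unoccupied, high-probability watersheds near occupied ones.

    Both cutoffs are illustrative assumptions, not values from the study.
    """
    occupied = [w for w in watersheds if w.occupied]
    flagged = []
    for w in watersheds:
        if w.occupied or w.occurrence_prob < min_prob:
            continue
        near_current_range = any(
            hypot(w.x_km - o.x_km, w.y_km - o.y_km) <= max_distance_km
            for o in occupied
        )
        if near_current_range:
            flagged.append(w.name)
    return flagged


if __name__ == "__main__":
    example = [
        Watershed("A", 0.0, 0.0, 0.9, True),
        Watershed("B", 50.0, 0.0, 0.8, False),
        Watershed("C", 500.0, 0.0, 0.9, False),
        Watershed("D", 10.0, 0.0, 0.2, False),
    ]
    print(at_risk(example))
```

In practice the flagged set could then be intersected with layers such as agricultural production or sensitive habitats, as suggested above, to rank areas for management attention.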

Although large portions of the contiguous U.S. are predicted to have very low probability of wild pig occurrence, recent studies have shown that wild pigs can occur in environments previously thought to be inhospitable, for example in Saskatchewan, Canada [22]. Thus, future work is needed to improve our understanding of the drivers of wild pig occurrence throughout North America, particularly the extent to which cold temperatures actually limit wild pig colonization and establishment. A critical first step in this process would be to obtain new and improved occurrence data for wild pigs in the northern U.S. and Canada. Wild pigs are expected to occur at lower densities in these regions, which may make detection of populations difficult. However, developing cost-effective surveys that can be widely distributed to state wildlife managers is one approach that should be considered. In addition, more field data are needed along a latitudinal gradient to understand how the determinants of occurrence, space-use, and vital rates vary at this continental scale. This may result in a set of regional occurrence models with potentially different driving covariates. Future work must also seek to estimate wild pig population size, and how it varies, across the contiguous U.S., which will result in improved updates to our national-level species distribution model and help managers identify locations requiring active management of wild pig populations.

Finally, one of the largest challenges limiting the understanding and management of recent wild pig range expansion concerns the mechanisms of spread. Our work, in combination with distribution modeling in Canada [22], suggests that much of North America has suitable conditions for wild pigs, including portions that are currently unoccupied. We suspect that the absence of wild pigs in suitable, yet apparently unoccupied habitat results from lack of introduction (i.e., the propagule pressure hypothesis) [88], as opposed to limiting environmental or habitat factors that our distribution model failed to capture. Although dispersal capacity and long-term movement data for wild pigs are lacking in North America, we also suspect that social factors such as the value of wild pigs as a recreational hunting resource or as a farmed species are at least as important as natural dispersal in driving the current distribution of wild pigs. As such, we recommend that future research investigating the distribution and invasiveness of wild pigs include social factors that may drive value and motivation for human translocation of wild pigs (what drives wild pig propagule pressure?), in addition to biological factors (what biotic and abiotic factors limit wild pig populations?), to address competing hypotheses and generate effective management solutions.

Supporting Information

S1 Table. Review of wild pig home range size estimates.

This supporting table compiles wild pig, feral hog, and wild boar home range size estimates available in published literature. Area estimates are given along with information regarding location, number of individuals, estimation method used, and other supporting details.



Disclaimer: The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of USDA-APHIS-Veterinary Services.

We thank Meaghan Pryde for assistance with preparing data used in the manuscript, Joe Corn at the University of Georgia for guidance and use of National Feral Swine Mapping System data, and USDA Wildlife Services biologists for providing additional location data and validation. We are grateful to Dan Walsh, Ryan Brook, Nicole Michel, and Brett Dickson for reviewing earlier drafts of this manuscript. We also thank David Oryang for facilitating early discussions and conceptualization of the project.

Author Contributions

Conceived and designed the experiments: MLM CLB MLF MWL DAG RSM. Performed the experiments: MLM. Analyzed the data: MLM CLB MLF DMT PDR DAG RSM. Contributed reagents/materials/analysis tools: MWL DMT PDR. Wrote the paper: MLM CLB MLF DMT DAG RSM.


  1. 1. Butchart SHM, Walpole M, Collen B, van Strien A, Scharlemann JPW, Almond REA, et al. Global biodiversity: indicators of recent declines. Science. 2010;328: 1164–1168. pmid:20430971
  2. 2. Ziska LH, Blumenthal DM, Runion GB, Hunt ER, Diaz-Soltero H. Invasive species and climate change: an agronomic perspective. Clim Change. 2010;105: 13–42.
  3. 3. Pimentel D, Zuniga R, Morrison D. Update on the environmental and economic costs associated with alien-invasive species in the United States. Ecological Economics. 2005. pp. 273–288.
  4. 4. Miller RS, Farnsworth ML, Malmberg JL. Diseases at the livestock-wildlife interface: Status, challenges, and opportunities in the United States. Prev Vet Med. Elsevier B.V.; 2013;110: 119–132. pmid:23254245
  5. 5. Miller RS, Sweeney SJ. Mycobacterium bovis (bovine tuberculosis) infection in North American wildlife: current status and opportunities for mitigation of risks of further infection in wildlife populations. Epidemiol Infect. 2013;141: 1357–70. pmid:23657134
  6. 6. Millennium Ecosystem Assessment. Ecosystems and Human Well-being: Synthesis. Ecosystems. Washington D.C.: Island Press; 2005.
  7. 7. Meyerson LA, Mooney HA. Invasive Alien Species in an Era of Globalization. Front Ecol Environ. 2007;5: 199–208.
  8. 8. Hulme PE. Trade, transport and trouble: managing invasive species pathways in an era of globalization. J Appl Ecol. 2009;46: 10–18.
  9. 9. Jeschke JM, Strayer DL. Invasion success of vertebrates in Europe and North America. Proc Natl Acad Sci U S A. 2005;102: 7198–7202. pmid:15849267
  10. 10. McKnight TL. Feral livestock in Anglo-America. Berkeley, CA: University of California Press; 1964.
  11. 11. Mayer JJ, Brisbin IL. Wild pigs in the United States: their life history, morphology and current status. 2008th ed. Athens, GA: University of Georgia Press; 1991.
  12. 12. Long J. Introduced mammals of the world: their history, distribution and influence. Collingwood, Victoria, Australia: CSIRO Publishing; 2003.
  13. 13. Forsyth DM, Caley P. Testing the irruptive paradigm of large-herbivore dynamics. Ecology. 2006;87: 297–303. pmid:16637354
  14. 14. Sol D, Vila M, Kuhn I. The comparative analysis of historical alien introductions. Biol Invasions. 2008;10: 1119–1129.
  15. 15. Ripple WJ, Estes JA, Beschta RL, Wilmers CC, Ritchie EG, Hebblewhite M, et al. Status and ecological effects of the world’s largest carnivores. Science. 2014;343: 1241484. pmid:24408439
  16. 16. Estes JA, Terborgh J, Brashares JS, Power ME, Berger J, Bond WJ, et al. Trophic downgrading of planet Earth. Science. 2011;333: 301–306. pmid:21764740
  17. 17. Maselli V, Polese G, Larson G, Raia P, Forte N, Rippa D, et al. A dysfunctional sense of smell: the irreversibility of olfactory evolution in free-living pigs. Evol Biol. 2014;41: 229–239.
  18. 18. Wilson DE, Reeder DM. Mammal species of the world: a taxonomic and geographic reference. JHU Press; 2005.
  19. 19. Jones KE, Bielby J, Cardillo M, Fritz SA, O’Dell J, Orme CDL, et al. PanTHERIA: a species-level database of life history, ecology, and geography of extant and recently extinct mammals. Ecology. 2009;90: 2648–2648.
  20. 20. Barrios-Garcia MN, Ballari SA. Impact of wild boar (Sus scrofa) in its introduced and native range: A review. Biol Invasions. 2012;14: 2283–2300.
  21. 21. Bevins SN, Pedersen K, Lutman MW, Gidlewski T, Deliberto TJ. Consequences Associated with the Recent Range Expansion of Nonnative Feral Swine. Bioscience. 2014;64: 291–299. Available:
  22. 22. Brook RK, van Beest FM. Feral wild boar distribution and perceptions of risk on the central Canadian prairies. Wildl Soc Bull. 2014;38: 486–494.
  23. 23. Gipson PS, Hlavachick B, Berger T. Range expansion by wild hogs across the central United States. Wildl Soc Bull. 1998;26: 279–286.
  24. 24. Seward N, VerCauteren K, Witmer G, Engeman R. Feral swine impacts on agriculture and the environment. Sheep Goat Res J. 2004;19: 34–40. Available:
  25. 25. Pimentel D. Environmental and Economic Costs of Vertebrate Species Invasions Into the United States. In: Witmer GW, Pitt WC, Fagerstone KA, editors. Managing Vertebrate Invasive Species. 2007. pp. 1–8.
  26. 26. Jay MT, Cooley M, Carychao D, Wiscomb GW, Sweitzer RA, Crawford-Miksza L, et al. Escherichia coli O157: H7 in feral swine near spinach fields and cattle, central California coast. Emerg Infect Dis. 2007;13: 1908. pmid:18258044
  27. 27. Giurgiutiu D, Banis C, Hunt E, Mincer J, Nicolardi C, Weltman A, et al. Brucella suis infection associated with feral swine hunting—three states, 2007–2008. Morb Mortal Wkly Rep. 2009;58: 618–621.
  28. 28. Meng XJ, Lindsay DS, Sriranganathan N. Wild boars as sources for infectious diseases in livestock and humans. Philos Trans R Soc Lond B Biol Sci. 2009;364: 2697–2707. pmid:19687039
  29. 29. Boughton EH, Boughton RK. Modification by an invasive ecosystem engineer shifts a wet prairie to a monotypic stand. Biol Invasions. 2014;16: 2105–2114.
  30. 30. Southeastern Cooperative Wildlife Disease Study (SCWDS). National Feral Swine Mapping System. 2013. Available:
  31. 31. United States Geological Survey Gap Analysis Program (USGS GAP). National GAP land cover data v2. 2011. Available:
  32. 32. Odum EP, Odum HT, Andrews J. Fundamentals of Ecology. Vol 3. Philadelphia, Pennsylvania: Saunders; 1971.
  33. 33. Wu J. Landscape Ecology. In: Leemans R, editor. Ecological Systems: Selected Entries from the Encyclopedia of Sustainability Science and Technology. eBook: Springer; 2013. pp. 179–200.
  34. 34. Montgomery DR, Grant GE, Sullivan K. Watershed analysis as a framework for implementing ecosystem management. J Am Water Resour Assoc. 1995;31: 369–386.
  35. 35. Collins SL, Glenn SM. A hierarchical analysis of species’ abundance patterns in grassland vegetation. Am Nat. 1990;135: 633–648.
  36. 36. Peterson A, Townsend MP, Kluza DA. Predicting the potential invasive distributions of four alien plant species in North America. Weed Sci. 2009;51: 863–868.
  37. 37. Geisser H, Reyer H-U. The influence of food and temperature on population density of wild boar Sus scrofa in the Thurgau (Switzerland). J Zool. 2005;267: 89–96.
  38. 38. Acevedo P, Escudero MA, Muñoz R, Gortázar C. Factors affecting wild boar abundance across an environmental gradient in Spain. Acta Theriol (Warsz). 2006;51: 327–336.
  39. 39. Danilov PI, Panchenko DV. Expansion and some ecological features of the wild boar beyond the northern boundary of its historical range in European Russia. Russ J Ecol. 2012;43: 45–51.
  40. 40. Porter WP, Gates DM. Thermodynamic equilibria of animals with environment. Ecol Monogr. 1969; 227–244.
  41. 41. National Pork Board. Swine Care Handbook. Des Moines, IA; 2003.
  42. 42. Brunt D. The Adiabatic lapse-rate for dry and saturated air. Q J R Meteorol Soc. 1933;59: 351–360.
  43. 43. Melis C, Szafranska PA, Jedrzejewska B, Barton K. Biogeographical variation in the population density of wild boar (Sus scrofa) in western Eurasia. J Biogeogr. 2006;33: 803–811.
  44. 44. Honda T. Environmental Factors Affecting the Distribution of the Wild Boar, Sika Deer, Asiatic Black Bear and Japanese Macaque in Central Japan, with Implications for Human-Wildlife Conflict. Mammal Study. 2009. pp. 107–116.
  45. 45. Barrett AP. National Operational Hydrologic Remote Sensing Center Snow Data Assimilation System (SNODAS) Products at NSIDC. NSIDC Spec Rep 11. 2003; 19.
  46. 46. Choquenot D, Ruscoe WA. Landscape complementation and food limitation of large herbivores: Habitat-related constraints on the foraging efficiency of wild pigs. J Anim Ecol. 2003;72: 14–26.
  47. 47. Choquenot D, Dexter N. Spatial variation in food limitation: the effects of foraging constraints on the distribution and abundance of feral pigs in the rangelands. In: Floyd R, Sheppard A, DeBarro P, editors. Frontiers of Population Ecology. Melbourne, Australia: CSIRO Publishing; 1996. pp. 531–546.
  48. 48. Fraser D, Phillips PA. Lethargy and low water intake by sows during early lactation: a cause of low piglet weight gains and survival? Appl Anim Behav Sci. 1989;24: 13–22.
  49. 49. United States Environmental Protection Agency, United States Geological Survey. National Hydrography Dataset Plus—NHDPlus v2.10. 2012. Available:
  50. 50. Fernández-Llario P. Environmental correlates of nest site selection by wild boar Sus scrofa. Acta Theriol (Warsz). 2004;49: 383–392.
  51. 51. Ballari SA, Barrios-García MN. A review of wild boar Sus scrofa diet and factors affecting food selection in native and introduced ranges. Mamm Rev. 2014;44: 124–134.
  52. 52. Schauss ME, Coletto HJ, Kutilek MJ. Population characteristics of wild pigs, Sus scrofa, in eastern Santa Clara County, California. Calif fish game. 1990;76: 68–77.
  53. 53. Akaike H. Information theory as an extension of the maximum likelihood principle. In: Petrov BN, Csaki F, editors. Second international symposium on information theory. Akademiai Kiado, Budapest; 1973. pp. 267–281.
  54. 54. Burnham KP, Anderson DR. Model selection and multimodel inference: a practical information-theoretic approach. 2nd ed. Springer; 2002.
  55. 55. Burnham KP, Anderson DR. Multimodel Inference: Understanding AIC and BIC in Model Selection. Sociol Methods Res. 2004;33: 261–304.
  56. 56. Burnham K, Anderson D, Huyvaert K. AIC model selection and multimodel inference in behavioral ecology: some background, observations, and comparisons. Behav Ecol Sociobiol. 2011;65: 23–35. Available:
  57. 57. Keating KA, Cherry S. Use and interpretation of logistic regression in habitat selection studies. J Wildl Manage. 2004;68: 774–789.
  58. 58. Pearce JL, Boyce MS. Modelling distribution and abundance with presence-only data. J Appl Ecol. 2006. pp. 405–412.
  59. 59. Manly BFJ, McDonald LL, Thomas DL, McDonald TL, Erickson WP. Resource Selection by Animals: Statistical Design and Analysis for Field Studies, 2nd Edition. Boston, MA: Kluwer Academic Publishers; 2002.
  60. 60. Phillips SJ, Anderson RP, Schapire RE. Maximum entropy modeling of species geographic distributions. Ecol Modell. 2006;190: 231–259.
  61. 61. Royle JA, Chandler RB, Yackulic C, Nichols JD. Likelihood analysis of species occurrence probability from presence-only data for modelling species distributions. Methods Ecol Evol. 2012;3: 545–554.
  62. 62. Phillips SJ, Elith J. Logistic Methods for Resource Selection Functions and Presence-Only Species Distribution Models. Proc Twenty-Fifth AAAI Conf Artif Intell Logist. 2011; 1384–1389.
  63. 63. Li W, Guo Q, Elkan C. Can we model the probability of presence of species without absence data? Ecography. 2011;34: 1096–1105.
  64. 64. Fitzpatrick M, Gotelli N, Ellison A. MaxEnt vs. MaxLike: Empirical comparisons with ant species distributions. Ecosphere. 2013;4: 1–15. Available:
  65. 65. Li W, Guo Q. How to assess the prediction accuracy of species presence-absence models without absence data? Ecography. 2013;36: 788–799.
  66. 66. Manly BFJ, Merrill A. Comments on statistical aspects of the U.S. Fish and Wildlife Service’s modeling framework for the proposed revision of critical habitat for the Northern spotted owl. Laramie and Cheyenne, Wyoming; 2012.
  67. 67. Phillips SJ, Elith J. On estimating probability of presence from use-availability or presence-background data. Ecology. 2013;94: 1409–1419. pmid:23923504
  68. 68. Booth GD, Niccolucci MJ, Schuster EG. Identifying proxy sets in multiple linear regression: an aid to better coefficient interpretation. 1994.
  69. 69. Belsley DA. Conditioning diagnostics. John Wiley & Sons, Inc; 1991.
  70. 70. Symonds MRE, Moussalli A. A brief guide to model selection, multimodel inference and model averaging in behavioural ecology using Akaike’s information criterion. Behav Ecol Sociobiol. 2011;65: 13–21.
  71. 71. Wasserman L. Bayesian Model Selection and Model Averaging. J Math Psychol. 2000;44: 92–107. pmid:10733859
  72. 72. Hansen MH, Kooperberg C. Spline adaptation in extended linear models (with comments and a rejoinder by the authors). Stat Sci. 2002;17: 2–51.
  73. 73. Bartón K. MuMIn: Multi-Model inference R package version 1.10.0. 2014. Available:
  74. 74. R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2013. Available:
  75. 75. Lukacs PM, Burnham KP, Anderson DR. Model selection bias and Freedman’s paradox. Ann Inst Stat Math. 2010;62: 117–125.
  76. 76. Boyce MS, Vernier PR, Nielsen SE, Schmiegelow FKA. Evaluating resource selection functions. Ecol Modell. 2002;157: 281–300.
  77. 77. Huberty CJ. Applied discriminant analysis. John Wiley & Sons, Inc; 1994.
  78. 78. McCann BE, Malek MJ, Newman RA, Schmit BS, Swafford SR, Sweitzer RA, et al. Mitochondrial diversity supports multiple origins for invasive pigs. J Wildl Manage. 2014;78: 202–213.
  79. 79. Hawkins BA, Field R, Cornell HV, Currie DJ, Guegan J-F, Kaufman DM, et al. Energy, water, and broad-scale geographic patterns of species richness. Ecology. 2003;84: 3105–3117.
  80. 80. Franklin J. Mapping species distributions: spatial inference and prediction [Internet]. New York, NY: Cambridge University Press; 2009.
  81. 81. Dexter N. The influence of pasture distribution and temperature on habitat selection by feral pigs in a semi-arid environment. Wildlife Research. 1998. p. 547.
  82. 82. Morelle K, Lejeune P. Seasonal variations of wild boar Sus scrofa distribution in agricultural landscapes: a species distribution modelling approach. Eur J Wildl Res. 2014;61: 45–56.
  83. 83. Kie JG, Terry Bowyer R, Nicholson MC, Boroski BB, Loft ER. Landscape heterogeneity at differing scales: Effects on spatial distribution of mule deer. Ecology. 2002;83: 530–544.
  84. 84. Anderson DP, Turner MG, Forester JD, Zhu J, Boyce MS, Beyer H, et al. Scale-dependent summer resource selection by reintroduced elk in Wisconsin, USA. J Wildl Manage. 2005;69: 298–310.
  85. 85. Saïd S, Servanty S. The influence of landscape structure on female roe deer home-range size. Landsc Ecol. 2005;20: 1003–1012.
  86. 86. Morellet N, Van Moorter B, Cargnelutti B, Angibault J, Lourtet B, Merlet J, et al. Landscape composition influences roe deer habitat selection at both home range and landscape scales. Landsc Ecol. 2011;26: 999–1010.
  87. 87. Turner MG, Pearson SM, Romme WH, Wallace LL. Landscape heterogeneity and ungulate dynamics: what spatial scales are important? Wildlife and Landscape Ecology. New York, NY: Springer; 1997. pp. 331–348.
  88. 88. Simberloff D. The Role of Propagule Pressure in Biological Invasions. Annual Review of Ecology, Evolution, and Systematics. 2009. pp. 81–102.