Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A Tale of Four “Carp”: Invasion Potential and Ecological Niche Modeling

  • Shannon C. DeVaney ,

    Current address: Natural History Museum of Los Angeles County, Los Angeles, California, United States of America

    Affiliation The University of Kansas, Natural History Museum and Biodiversity Research Center, Lawrence, Kansas, United States of America

  • Kristina M. McNyset,

    Current address: U.S. Environmental Protection Agency - ORD/WED, Corvallis, Oregon, United States of America

    Affiliation The University of Kansas, Natural History Museum and Biodiversity Research Center, Lawrence, Kansas, United States of America

  • Justin B. Williams,

    Current address: The University of Nebraska, Lincoln, Nebraska, United States of America

    Affiliation The University of Kansas, Natural History Museum and Biodiversity Research Center, Lawrence, Kansas, United States of America

  • A. Townsend Peterson,

    Affiliation The University of Kansas, Natural History Museum and Biodiversity Research Center, Lawrence, Kansas, United States of America

  • Edward O. Wiley

    Affiliation The University of Kansas, Natural History Museum and Biodiversity Research Center, Lawrence, Kansas, United States of America



Invasive species are a serious problem in ecosystems, but are difficult to eradicate once established. Predictive methods can be key in determining which areas are of concern regarding invasion by such species to prevent establishment [1]. We assessed the geographic potential of four Eurasian cyprinid fishes (common carp, tench, grass carp, black carp) as invaders in North America via ecological niche modeling (ENM). These “carp” represent four stages of invasion of the continent (a long-established invader with a wide distribution, a long-established invader with a limited distribution, a spreading invader whose distribution is expanding, and a newly introduced potential invader that is not yet established), and as such illustrate the progressive reduction of distributional disequilibrium over the history of species' invasions.

Methodology/Principal Findings

We used ENM to estimate the potential distributional area for each species in North America using models based on native range distribution data. Environmental data layers for native and introduced ranges were imported from state, national, and international climate and environmental databases. Models were evaluated using independent validation data on native and invaded areas. We calculated omission error for the independent validation data for each species: all native range tests were highly successful (all omission values <7%); invaded-range predictions were predictive for common and grass carp (omission values 8.8 and 19.8%, respectively). Model omission was high for introduced tench populations (54.7%), but the model correctly identified some areas where the species has been successful; distributional predictions for black carp show that large portions of eastern North America are at risk.


ENMs predicted potential ranges of carp species accurately even in regions where the species have not been present until recently. ENM can forecast species' potential geographic ranges with reasonable precision and within the short screening time required by proposed U.S. invasive species legislation.


Invasive species [2] pose both ecological [3], [4] and economic [5] risks to native ecosystems. Unfortunately, once established, invasives are generally difficult or impossible to eradicate (consider, for example, the case of northern snakeheads Channa argus in Maryland; [6]). Robust methods for anticipating the geographic potential of possible invaders on a continental scale would allow decision-makers and managers to make informed decisions and take effective actions towards excluding harmful species before they are established. Previous investigators have applied ecological niche modeling (ENM) to this problem in both terrestrial [7], [8], [9], [10], [11] and aquatic [12], [13], [14], [15] ecosystems.

More than 4500 non-native and invasive species live in natural ecosystems across the United States [5]. Non-indigenous species may harm native ecosystems through competition, predation, habitat modification, and hybridization with native species [3], [4], [5]. Invasive species are implicated as factors in listing ≥160 native species as threatened or endangered in the United States [5]. More generally, the overall environmental and economic impact of invasive species totals ∼US$137 billion annually in the United States, negatively affecting not just natural systems but also agriculture, aquaculture, forestry, transportation, utilities, recreation, and human health [5], [16].

The process of a species' invasion of a new area generally occurs in several steps: introduction, establishment, and spread [17]. Establishment often involves a “lag” phase, lasting even many years, in which the species is present in relatively low numbers in a restricted geographic area [18]. During the final phase, the species expands rapidly until it reaches its maximum distribution potential in the new landscape. The “rule of tens,” introduced by Williamson [17], holds that of species introduced into a new area, only ∼10% become established, and of species that become established, only ∼10% spread successfully. However, a recent study of species introduced from Europe to North America and vice versa found that ∼50 percent became established and ∼50% of those were able to spread [19]. Recent niche modeling applications suggest that these ‘numbers’ rules are constrained closely by the suitability of the landscape being invaded for the species [20], although the absolute nature of these constraints has been debated [21], [22], [23].

Common carp (Cyprinus carpio), tench (Tinca tinca), grass carp (Ctenopharyngodon idella), and black carp (Mylopharyngodon piceus) represent respectively a long-introduced and well-established invasive species (common carp), a long-introduced but less-successful non-native species (tench), a relatively recent but successful invasive species (grass carp), and a species being used in U.S. aquaculture that has not as yet become established in natural ecosystems (black carp). Our objective was to develop niche models for each species based on its native range and then project the niche model rule sets onto North America to identify areas at risk for establishment. This would allow us to test both distributional predictions quantitatively with independent occurrence data.

We define the “ecological niche” of species as the set of abiotic parameters within which a species is able to maintain populations without immigrational subsidy [24]—this “scenopoetic niche” can be distinguished from niches more closely related to interactions among species (the “Eltonian niche;” [25]) principally by the spatial scale at which it is manifested. We used landscape-scale environmental variables based on climate and topography, and developed tests of stability of ecological niches across the time span of the invasion events [26]. Our ENM application takes advantage of the fact that native populations have had more time and opportunity to “explore” environmental space via dispersal and colonization, and thus are likely to be most informative regarding niche dimensions [27], although combinations of models based on native and introduced distributional areas may be optimal for prediction [28].

Because the geographic extent of a species' potential distributional area is independent of its success to date as an invader, depending rather on time and opportunity for access to the region being invaded [29], we hypothesized that common carp and tench occurrences would already have filled a substantial part of their potential ranges, given multiple introductions for over 150 yr in North America. Grass carp populations are expanding still [30], however, and have probably not yet reached their full geographic potential after almost 40 yr since introduction, whereas black carp are only recently imported into North America, and are not as-yet established in natural waters. We wished to determine whether niche models could detect and demonstrate this pattern where species expand into new environments, out to the bounds imposed by their scenopoetic niches [29], [31], [32]. Finally, to the extent that potential ranges can be reconstructed predictively and robustly we wished to determine what geographic “behavior” we might expect from the newest arrival, the black carp?

Reactive vs. proactive approaches

Invasive species remediation efforts have too-often been highly reactive in nature [1], which means that species tend to establish populations before their threat is recognized. ENM allows assessment of potential geographic areas at risk for invasion before the introduction even takes place [33], [34], [35]. One method of generating niche models, the evolutionary computing algorithm GARP, uses a variety of rule-building methods in an iterative machine-learning process to generate rule sets [36]. Environmental data sets in the form of raster grids are entered into GARP, along with georeferenced occurrence points from the species' native range. By comparing environmental parameter combinations of known occurrence points against those of points randomly sampled from areas from which the species is not known to occur, GARP “learns” a pattern of relationships in the form of a rule set. Rules are iteratively combined, altered, and refined to maximize accuracy. The final result of the algorithm is the niche model, which describes a species' niche as a multi-dimensional hypervolume in ecological space [37], [38].

Although niche models are generated using environmental data and occurrence data from the species' native range, the rules contained within models are defined in environmental terms only, and are independent of geographic area. Hence, rule sets can be applied in any geographic area to identify areas of potential occurrence. We selected GARP as a modeling implement because distributional predictions derived from GARP models have proven accurate under a variety of circumstances [39], and because GARP is robust to small occurrence data sets [40]. Machine-learning techniques, such as GARP, are powerful because the modeling of non-linear functions with large numbers of variables becomes feasible as compared to traditional statistical approaches [41]. Although GARP was not ranked particularly highly in some broad intermodel comparisons [42], these comparisons have been demonstrated to be largely artifactual in more recent analyses [43].

The fishes

Common carp, native to Eurasia, were first introduced in North America in 1831 [44] and were intentionally released throughout most of the United States by the U.S. Fish Commission in 1877–1898 [45], [46]. Now considered an invasive nuisance, common carp are established in every U.S. state except Alaska [47], and seem to have established almost completely across their potential distributional range [15]. The impact of common carp on native aquatic species has been primarily through habitat modification, as it stirs up substrates, uprooting plants and muddying the water [46], [48]. Common carp occupy many microenvironments, are highly fecund, and spawn in shallow, slow-flowing water [49].

The U.S. Fish Commission originally imported tench into the United States in 1877 [45], and eventually provided tench stocks to 36 U.S. states [50]. However, many of the introductions seem to have been unsuccessful, possibly due to biotic interactions [47], [50]. Tench populations are probably established in California, Colorado, Connecticut, Idaho, Washington, Oregon, New Mexico, Maryland, New York (U.S.), and British Columbia (Canada), but their current status in many of these areas is uncertain [47], [51]. Tench usually spawn in weedy shallow areas, and their diet consists chiefly of insects and mollusks [52].

Grass carp were imported into the United States from Malaysia and Taiwan for aquaculture in 1963, and were released into natural waters shortly thereafter in Arkansas [53]. By 1993, grass carp were established in Arkansas, Kentucky, Illinois, Louisiana, Missouri, Mississippi, Tennessee, and Texas [30]; evidence of reproduction has been recorded from the Mississippi River drainage of all of these states except Texas [47]. Grass carp have already adversely affected U.S. ecosystems in several areas [54]. In China, grass carp generally prefer large rivers and lakes, requiring long rivers (50–180 km) with sufficient discharge (>400 m3/sec) and velocity (>0.8 m/sec) for successful reproduction [55], and eat mainly submerged vegetation [53].

Black carp have been used in U.S. fish farms as biological control against snails, which are secondary hosts for parasites that infect commercial fish [56], [57]. Although concerns about black carp escape, establishment, and potential impacts on North American ecosystems have been raised [58], the aquaculture industry maintains that use of the species is necessary [59]. Use of black carp has been focused in southeastern U.S. fish farms that raise channel catfish (Ictalurus punctatus) and striped/hybrid bass (Morone sp.) [60]. Although no black carp breeding populations are as yet known in the U.S., ∼30 black carp escaped into the Osage River from a fish farm in Missouri in 1994 [47]. Recently, black carp have been captured in Illinois [61] and Louisiana [51]; it is not known whether self-sustaining populations are established. In their native range, black carp inhabit major rivers, large tributaries, and lakes; they are river breeders that require a swift current for successful development, and spawning habitat has been described as similar to that of grass carp [60].


We constructed niche models using 58 native range occurrence points for common carp, 292 for tench, 41 for black carp, and 38 for grass carp. We compiled these data from specimen records in museum collections via online databases [62], [63], verified records from Chinese museum collections, and species accounts in scientific literature [64], [65], [66], [67]. We inspected fish collections from the Chinese museums and identified a random subsample of approximately 10 specimens per species to assure correct species assignments. In all cases, we excluded all points outside the known native range from the model-building (training) data pool. We are interested in the potential for ENM as a tool in assessing invasion threat before introduction; therefore, points from the invaded range were deliberately excluded from the model-building process for all species. For each species, we used all available unique, verified native range occurrence points; consequently, sample size varied among species. GARP predictions have been shown to reach 90% of maximum accuracy using only 10 training points, with smaller incremental changes in accuracy when more data are included, such that most of the change in accuracy occurs below 20 data points [68]. As all our data sets exceed 20 points, we expect the effect of variable sample size on model accuracy to be negligible.

For North American occurrences, we obtained 1303 points for common carp, 30 for tench, and 47 for grass carp through the above online databases and the USGS Nonindigenous Aquatic Species database [51] from records of fish not directly stocked (black carp are not as yet known from natural waters). For records not already georeferenced, we assigned latitude and longitude coordinates with the U.S. National Geospatial-Intelligence Agency GEOnet Names Server [69], the USGS Geographic Names Information System [70], and detailed printed map resources [71].

The native range of black carp includes the major Pacific drainages of eastern Asia from about 22°N to 51°N latitude, including areas in China and Russia [72]. Grass carp have a similar native range: the Pacific slope of Asia from the Amur River Basin to the West River, including areas in China, Russia, and Northern Indochina [53]. The native range for tench includes Europe and parts of western Asia [52], [64]. Because native range limits for common carp are poorly understood, as the species occurs throughout Eurasia, having been spread and released by humans there for centuries [47], we treated occurrences throughout Eurasia as native range for this species (after previous treatment of this species [15]). Any questionable occurrence points (e.g. market-collected specimens) were removed from the native range data pool. Duplicate occurrence points were also removed, leaving only verified, unique occurrence points.

Environmental data consisted of 15 layers (“coverages”) summarizing aspects of the ecological landscape of both the native and introduced ranges (Table 1). We were limited to only those environmental variables for which data is available. While we do not consider that the available coverages represent all possible variables affecting species distributions, we included all coverages in model-building that might either directly affect species occurrence, or act as proxies for unavailable data. Data layers varied in spatial resolution (the size of individual cells or pixels in a given grid layer), consequently, layers were resampled to 0.01×0.01° prior to analysis. We resampled layers for the common carp native range only to 0.05×0.05° resolution because of the vast area of analysis and associated computational restraints. Although higher spatial resolution is desirable, range predictions using GARP and similar ENM tools are only modestly affected by changes in resolution [73], [74].

Table 1. Environmental data layers used in the development of the models presented herein.

Prior to generating niche models, we evaluated the environmental coverages with a jackknife process, an analysis designed to maximize predictive accuracy while culling coverages prone to spurious overfitting [14], [75], [76]. This procedure allowed us to optimize environmental layer inputs for each niche model in terms of minimizing model omission error (i.e., exclusion of independent test data points from model prediction) [77]. As such, for each species, we built models based on all combinations of n−1 data layers, where n is the total number of layers. Occurrence data were randomly divided into training and test sets (50% each); 20 models for each n−1 layer subset were built and tested using these training and testing subsets. We calculated correlations between inclusion of each and omission error in the test data set, and removed layers showing positive correlations with omission error (r>0.1) from subsequent analyses. We repeated the jackknife procedure with the reduced set of data layers; when no strongly positive correlations with omission remained, the remaining subset of data layers was designated for use in the final model building process.

For generating final models, occurrence points were divided randomly into training and validation subsets. For all models (except tench; see below), 20 native-range occurrence points were excluded from model building and reserved for independent model-set validation. For tench models, because of the larger data set available for tench native range, we used 20% of the data (58 points) for independent validation. The GARP program further divides training data into intrinsic training and extrinsic testing subsets (80% and 20% respectively herein) prior to each model building process [78]. All experiments were performed using the desktop version of GARP [79].

Models were generated until 20 models with 0% extrinsic omission were compiled (i.e., all of the extrinsic testing subset completely predicted), and remaining models were discarded. We calculated the median of the commission index (calculated as the proportional area predicted present [80]) across all zero-omission models, and selected the 10 models with the lowest deviation from the median as the best model-set used in the distributional prediction [78]. We used the pixel-by-pixel sum of these 10 models (projected across both native ranges and North America), as inclusion of more models has not increased model accuracy or interpretability in limited experimentation.

Validation points set aside prior to model building were then overlaid as an independent test of model-set accuracy. We calculated percent omission (“%O”) as 1 minus the weighted proportion of validation points predicted by 0–10 of the best model set. For example, if 17 of 20 validation points were correctly predicted by all 10 of the best models, 2 by 8 and 1 by 6, then %O = 1−((17+1.6+.6)/20) = 4%.

We evaluated model-set sensitivity and specificity using the Area Under the Curve [AUC] in a Receiver Operating Characteristic [ROC] analysis [81], [82], [83]. In ROC, each pixel in the landscape receives scores from the diagnostic test being evaluated, in this case based on the predictions for the independent testing data. These data are then graphed on a sensitivity vs. 1-specificity plot (sensitivity and specificity determined using a standard 2×2 confusion matrix, with absence information based on all sites from which the species has not been detected previously), and the area under the curve is calculated. This AUC is compared to a “line of no information” with a slope of 1 (AUC = 0.5). No difference between the two AUCs indicates that the best model set is predicting presence no better than at random [82]. The AUC can be interpreted as the probability of a model set correctly predicting presence in a randomly selected grid cell. Because we used a conservative estimate of commission error, even a perfectly accurate model cannot achieve a perfect AUC of 1.0, and AUC scores will be lower for equally accurate model sets as the percent of the total study area predicted by the model set increases [84]. We are aware of the problems inherent in AUCs and their interpretation [43], [85], and interpret our results cautiously as a result.

We calculated the AUC for each model set and tested it against AUC = 0.5 using a z-test to determine significance [81]. We also calculated the maximum AUC (AUCmax) possible for each validation data set, given the distribution of validation datapoints available. We take the difference between AUC and AUCmax as a more informative measure of model set accuracy than the AUC alone [84]. To account for differences in AUC scores, we also calculated percent of the total study area predicted by all 10 best-subsets models.

After analysis on the native range, the best models for each taxon were projected onto central North America (∼25–54°N). We overlaid data points documenting non-native populations on these model predictions. We analyzed predictions with ROC analysis, as above, with the non-native occurrences as independent validation data. We also created maps of weighted proportion of area predicted present within hydrologic units for the lower 48 states by intersecting niche model grids with USGS 6-digit HUC polygons [86].


The best suite of coverages, as determined by the jackknife procedure, for each species is shown in Table 1. All AUCs for all taxa in both native and invasive model tests were significantly better than random (P = 0.01). Given this validation, we visualized both the native and non-native potential distributional areas for each taxon (Figure 1, Table 2), with details as follows. The common carp model predicted native presence across most of Eurasia; all 20 validation data points were predicted correctly by all of the best-subsets models (%O = 0, AUC = 0.80). For the non-native range, all 10 best models predicted a broad potential distribution covering much of the surface of the continental United States. Of 1303 occurrences available for common carp in North America, 991 were predicted present by all best-subsets models, with another 118 predicted present by 9 of the 10 best models; at the other end of the spectrum, 28 points were not predicted by any of the 10 models (%O = 8.8, AUC = 0.62). The native-range tench model performed well, with 10 models correctly predicting 57 of 58 validation points (%O = 0, AUC = 0.80); projection of this model to North America predicted presence across most of the upper Midwest and the eastern United States, and in parts of the northwest. Ten of the 30 North American tench validation points were predicted by all 10 models; 16 were predicted by 1–9 models; and five remained unpredicted (%O = 54.7, AUC = 0.62).

Figure 1. Niche models in native ranges and in the United States.

Ecological niche models for common carp (A, B), tench (C, D), grass carp (E, F), and black carp (G, H) on native and U.S. landscapes. Shading indicates the predicted suitability predicted (brick red = 7–10 models, canary red = 4–6 models, pink = 1–3 models). Occurrence points for each species are shown as training data (yellow circles) in the species' native range (A, C, E, G) or independent validation data (green triangles) in the native or introduced ranges (B, D, F, H).

Table 2. Statistics describing the results of ecological niche model validations.

The grass carp model encompassed its known native range in eastern Asia. All 20 validation occurrence points were predicted present by all of the best models (%O = 0, AUC = 0.86). The projection to North America was more limited than for common carp (Table 2), but covered extensive portions of the eastern, central, and northwestern United States (Figure 1d). Of 47 grass carp testing points available from North America, 36 were predicted present by all models, and 2 were not predicted by any models (%O = 19.8, AUC = 0.71).

The area predicted for black carp is smaller, both on its native range and in North America (Table 2). Seventeen of the 20 native-range validation points were predicted by all models, and all points were predicted by ≥2 models (%O = 6.5, AUC = 0.87). Most of the eastern United States is predicted as suitable for black carp by these models (Figure 1f). Because black carp are not known to be established in North America, and the few records of this species [51], [61] are recent escapees, no independent validation points were available for black carp in North America.


The ecological niche models performed well in predicting independent testing data across native-range landscapes, including areas outside the actual native range. For example, black carp are predicted to find parts of northern Indochina and Japan suitable (Fig. 1): in the literature, we see that black carp have already become established in the Tone River system in Japan [72], [87] and probably in parts of Vietnam [88]. Although grass carp and black carp have similar native range limits, the distribution of occurrences within the range is dissimilar: as a consequence, grass carp predictions covered broader areas on both native and non-native landscapes.

Niche models predicted the known dimensions of the introduced range for common carp and grass carp successfully, in the former case in close agreement with the results of a previous study [15]. This result is significant, given that no information about the biology of these fishes or their non-native occurrences was included in the modeling effort. The models did fail to anticipate some of the introduced occurrence points and omission error was higher for the introduced-range tests, although only 2% of U.S. common carp points were not anticipated by the model predictions. Failures of ENM to fully predict an invaded range could occur when the invasive species is limited in its native range by biotic interactions and is released from those pressures in the invaded range, or when an island species has environmental tolerances that exceed the limits it encounters in its small native range [18], [34], [89]. Given the large native ranges of the carp species investigated here, and the common carp in particular, we do not consider either hypothesis a likely explanation in this case. Most of the unpredicted points for common carp occur in the southwestern U.S. in three major drainages (Colorado, Rio Grande, and Great Basin). All three of these systems have highly altered hydrology, which have been indicated as a primary reason for fish invasions [90], and as such aquatic environmental conditions there may not be representative, and may be dependent on smaller-scale interactions that cannot easily be captured by coarse-resolution environmental data used in this study.

The non-native grass carp model, although it performed statistically significantly better than random expectations, nonetheless had a 19.8% omission rate, owing to several points falling within small areas predicted by few or none of the best-subsets models. Upon closer examination, many grass carp testing points were outside (but close to) predicted areas by <1 km (Figure 1f), which is equal to or less than the native spatial resolution of the environmental data. Imprecision of coordinates for both native and non-native occurrence points may also have played some part in creating this elevated omission error.

The high omission of the introduced range projections of tench merits further discussion. One omitted point in Washington was slightly outside areas of high prediction, as were 3 points in California, so again data precision may come into the picture. Perhaps more importantly, though, tench have not established successfully in many areas predicted by the model where they have been introduced intensively. Baughman [50] found that states in the Great Lakes Region received 13,849 tench from the U.S. Fish Commission in 1886–1896 for introduction into various waters. The models predict potential distributional areas for tench across much of the Great Lakes region, but (other than a limited established population in the Great Chazy River in upstate New York) tench have failed to become established there [47], [51]. Kolar and Lodge [91] also predicted that tench would be successful in the Great Lakes region based on multivariate analysis of life-history characteristics, habitat needs, and invasion history. The reasons for the broad failure of tench introductions are currently not known, although some observational evidence suggests that biotic interactions with sunfishes (Centrarchidae), which are not native to Europe, may be responsible [50]. Another possibility is that environmental factors acting at resolutions finer than the data considered herein may be interacting negatively with the biology of the species. The disparity between our predictions and tench occurrences demonstrates the caution needed when interpreting these predictive models.

That several testing occurrence points fell slightly outside areas predicted at high levels for common carp, tench, and grass carp might suggest that models could be improved with higher-resolution environmental data and occurrence data. However, this improvement would not come easily, as the two data sets must be improved in tandem—the best resolution possible will be limited by the coarser of the two resolutions. In the meantime, it may be more appropriate to quantify risk at coarser resolutions. To this end, we evaluated mean model prediction across hydrologic units (USGS 6-digit HUC units) and compared them qualitatively to the non-native occurrence data (Figure 2). Evaluating predictions at the “pixel” scale may give a false picture of fine-scale accuracy. The grass carp niche model had 19.8% omission when evaluated at the pixel scale, but coincidence with HUCs in which there was high model agreement in prediction of presence was better (Figure 2C): evaluated at this resolution, the tench model identified the areas where tench managed to establish populations (Figure 2B). Moreover, the HUC-level maps avoid problems with spatial autocorrelation and the independence of testing points, and summarize the data at a more appropriate scale and provide a more interpretable map of risks for managers.

Figure 2. Invasive range predictions by USGS hydrologic unit.

Mean ecological niche model predictions at the level of hydrologic unit (USGS 6-digit HUC) in the United States, for common carp (A), tench (B), grass carp (C), and black carp (D). Shade indicates mean suitability from the niche model outputs across all pixels within each HUC polygon (see legend). Known (testing) occurrence points are indicated by dotted circles. Recently captured black carp [23], [33] are shown with stars (D).

Although landscape-scale parameters are important in limiting fish distributions [92], exclusive use of this type of data can complicate potential distribution predictions for aquatic species, as we end up using putative proxies for instream parameters that are probably the truly causal variables. For example, while a comprehensive water temperature coverage is not presently available, air temperature interacts with factors, including dams, tree cover, volume, and groundwater input, to determine stream temperatures [93]. Given the limited data available, we included all layers that might provide useful surrogates, such as percent tree cover and solar radiation coverages as possible surrogates or partial surrogates for water temperature. Hence, ENMs could be improved by integration of datasets for aquatic parameters; in addition, several smaller-scale environmental and biological factors that influence occurrences of fish [94] could also be included. In the case of tench, biotic interactions may be limiting their establishment success: in cases like this one, ENMs can be viewed as the top layer of a multi-scale filtering framework, under which finer-scale models could improve predictions [95], [96].

Common carp have spread so broadly that their current range limits in North America reach (and in some areas exceed) their modeled potential distributional area [15]. In contrast, tench have been unsuccessful in establishing in many areas predicted as suitable by the ENMs; similar constraints could affect black carp, but anticipating their behavior in novel situations is not simple. More practically, resource managers may not be willing to take the risk that unforeseen biotic or environmental interactions may limit black carp establishment in areas predicted as suitable, given the possible negative impacts.

The negative ecological and economic impacts caused by establishment of non-native species in the United States are well known, but can be difficult to anticipate [1]. Black carp are molluscivores, and their presence may put native mollusks at risk, particularly in the southeastern U.S., which is home to 90% of the nation's threatened and endangered mollusk species [97]. Black carp can live for >15 years [88], and are capable of consuming large quantities of mollusks (four-year-old black carp consumed 1.4–1.8 kg of bivalves per day [98]!). Black carp ENMs predicted presence across much of the southeastern U.S., where many of the aquaculture facilities that use black carp are located [60]. Black carp have already escaped captivity in the United States–though not included in the present analyses, this species has been captured recently in Illinois [61] and Louisiana [51], both in hydrological units predicted as high risk (Figure 2D). The niche models predict presence in several watersheds containing major rivers, including almost the entire course of the Mississippi River, which appear to meet the hydrological requirements of black carp reproduction (Fig. 2d).

ENM provides the opportunity to assess invasion potential proactively by using occurrence data from museum collections and the scientific literature. Under Section 1105 of the pending National Aquatic Invasive Species Act of 2007 (NAISA) [99], U.S. federal agencies would have to complete a screening process for planned importations of live aquatic organisms, and make a determination to allow or restrict importation within 180 days of receiving a request for permission to import aquatic organisms [100]. If federal agencies adopted predictive ecological niche models, detailed maps of hydrologic units at risk, combined with brief synopses of the biology of the “new” species, could be distributed quickly to regional or state managers. Scientists with expert knowledge of local ecosystems could then evaluate risk of establishment and invasion by interpreting niche models and the natural history of the species.

Although not all species have similar (broad) invasion potential, our common carp analyses demonstrate that some species will eventually expand their ranges to approximately match the extents that we predicted. We also demonstrated potential range expansion in the more recently introduced grass carp. Tench occurrences, however, did not fill much of the geographic range predicted by the niche model, illustrating the difficulty in predicting the result of introductions given the multitude of factors that can determine the outcome. We believe, like Nico et al. [60], that black carp present a serious invasion threat: for this situation, we have identified regions at highest risk. While it is not possible to anticipate whether introduced species, like black carp, will exhibit the invasive potential of common carp versus the relative invasive ineptness of tench. However, regulatory agencies should give careful consideration to ecological niche models as an integral tool in achieving an effective strategy to limit potential negative impacts by invasive species.


We thank Dr. P. Chen (University of Kansas Natural History Museum) for help in translating and georeferencing Chinese data points, and Drs. C. Zhang (Beijing Institute of Zoology), J. Yang (Kunming Institute of Zoology), and. S. He (Wuhan Institute of Hydrobiology) for access to museum data. We also thank Dr. J. D. Williams, Dr. W. R. Courtenay, and A. Benson (USGS- Florida Integrated Science Center) for helpful discussions about carps and aquatic invasive species. We are especially appreciative of Dr. C.R. Robins (University of Kansas Natural History Museum) for helpful comments and guidance.

Author Contributions

Conceived and designed the experiments: SCD KMM ATP EOW. Performed the experiments: SCD KMM JBW EOW. Analyzed the data: SCD KMM JBW ATP EOW. Contributed reagents/materials/analysis tools: SCD KMM. Wrote the paper: SCD KMM JBW ATP EOW.


  1. 1. NAS (2002) Predicting Invasions of Nonindigenous Plants and Plant Pests. Washington, D.C.: National Academy of Sciences.
  2. 2. (1999) Invasive Species. pp. 6183–6186. Federal Register.
  3. 3. Cox GW (1993) Conservation Ecology. William C. Brown Publishers.
  4. 4. Drake JA, Mooney HA, diCastri F, Groves RH, Kruger FJ, et al. (1989) Biological invasions: a global perspective. New York: John Wiley & Sons.
  5. 5. (1993) Harmful non-indigenous species in the United States.
  6. 6. Courtenay WR, Williams JD (2004) Snakeheads (Pisces, Channidae): a biological synopsis and risk assessment: U.S. Geological Survey.
  7. 7. Benedict MQ, Levine RS, Hawley WA, Lounibos LP (2007) Spread of the tiger: Global risk of invasion by the mosquito Aedes albopictus. Vector-Borne and Zoonotic Diseases 7: 76–85.
  8. 8. Mohamed KI, Papeş M, Williams R, Benz BW, Peterson AT (2006) Global invasive potential of 10 parasitic witchweeds and related Orobanchaceae. AMBIO: A Journal of the Human Environment 35: 281–288.
  9. 9. Nyari A, Ryall C, Peterson AT (2006) Global invasive potential of the House Crow (Corvus splendens) based on ecological niche modelling. Journal of Avian Biology 37: 306–311.
  10. 10. Thuiller W, Richardson DM, Pysek P, Midgely GF, Hughes GO, et al. (2005) Global risk assessment for plant invasions–The role of climatic suitability and propagule pressure. Global Change Biology 11: 2234–2259.
  11. 11. Peterson AT, Stewart A, Mohamed KI, Araújo MB (2008) Shifting global invasive potential of European plants with climate change. PLoS ONE.
  12. 12. Wiley EO, McNyset KM, Peterson AT, Robins CR, Stewart AM (2003) Niche modeling and geographic range predictions in the marine environment using a machine-learning algorithm. Oceanography 16: 120–127.
  13. 13. Iguchi K, Matsuura K, McNyset K, Peterson AT, Scachetti-Pereira R, et al. (2004) Predicting invasions of North American basses in Japan using native range data and a genetic algorithm. Transactions of the American Fisheries Society 133: 845–854.
  14. 14. McNyset KM (2005) Use of ecological niche modelling to predict distributions of freshwater fish species in Kansas. Ecology of Freshwater Fish 14: 243–255.
  15. 15. Zambrano L, Martínez-Meyer E, Mendes N, Peterson AT (2006) Invasive potential of exotic aquaculture fish in American freshwater systems. Canadian Journal of Fisheries & Aquatic Sciences 63: 1903–1910.
  16. 16. Pimentel D, Lach L, Zuniga R, Morrison D (2000) Environmental and economic costs of nonindigenous species in the United States. BioScience 50: 53–65.
  17. 17. Williamson M (1996) Biological invasions. London: Chapman & Hall.
  18. 18. Mack RN, Simberloff D, Lonsdale WM, Evans H, Clout M, et al. (2000) Biotic invasions: Causes, epidemiology, global consequences, and control. Ecological Applications 10: 689–710.
  19. 19. Jeschke JM, Strayer DL (2005) Invasion success of vertebrates in Europe and North America. Proceedings of the National Academy of Sciences 102: 7198–7202.
  20. 20. Peterson AT (2003) Predicting the geography of species' invasions via ecological niche modeling. Quarterly Review of Biology 78: 419–433.
  21. 21. Fitzpatrick MC, Weltzin JF, Sanders NJ, Dunn RR (2007) The biogeography of prediction error: Why does the introduced range of the fire ant over-predict its native range? Global Ecology and Biogeography 16: 24–33.
  22. 22. Broennimann O, Treier UA, Müller-Schärer H, Thuiller W, Peterson AT, et al. (2007) Evidence of climatic niche shift during biological invasion. Ecology Letters.
  23. 23. Peterson AT, Nakazawa Y (2008) Environmental data sets matter in ecological niche modeling: An example with Solenopsis invicta and Solenopsis richteri. Global Ecology and Biogeography 17: 135–144.
  24. 24. Grinnell J (1917) The niche-relationships of the California thrasher. Auk 34: 427–433.
  25. 25. Soberón J (2007) Grinnellian and Eltonian niches and geographic distributions of species. Ecology Letters 10: 1115–1123.
  26. 26. Peterson AT, Soberón J, Sanchez-Cordero V (1999) Conservatism of ecological niches in evolutionary time. Science 285: 1265–1267.
  27. 27. Peterson AT (2005) Predicting potential geographic distributions of invading species. Current Science 89: 9.
  28. 28. Broennimann O, Guisan A (2008) Predicting current and future biological invasions: Both native and invaded ranges matter. Biology Letters 4: 585–589.
  29. 29. Soberón J, Peterson AT (2005) Interpretation of models of fundamental ecological niches and species' distributional areas. Biodiversity Informatics 2: 1–10.
  30. 30. Courtenay WR (1993) Biological pollution through fish introductions. In: McKnight BN, editor. Indiana University / Purdue University. Indianapolis, IN: Academy of Science.
  31. 31. Lodge DM (1993) Biological invasions - Lessons for ecology. Trends in Ecology and Evolution 8: 133–137.
  32. 32. Williams JD, Meffe GK (1998) Nonindigenous species. In: Mac MJ, Opler PA, Haecker CEP, Doran PD, editors. Status & Trends of the Nation's Biological Resources. Reston, VA: United States Geological Survey.
  33. 33. Panetta FD, Dodd J (1987) Bioclimatic prediction of the potential distribution of skeleton weed (Chondrilla-Juncea L) in Western Australia. Journal of the Australian Institute of Agricultural Science 53: 11–16.
  34. 34. Peterson AT, Vieglais DA (2001) Predicting species invasions using ecological niche modeling: New approaches from bioinformatics attack a pressing problem. Bioscience 51: 363–371.
  35. 35. VanderZanden MJ, Olden JD, Thome JH, Mandrak NE (2004) Predicting occurrences and impacts of smallmouth bass introductions in north temperate lakes. Ecological Applications 14: 132–148.
  36. 36. Stockwell DRB, Noble IR (1992) Induction of sets of rules from animal distribution data — a robust & informative method of data-analysis. Mathematics and Computers in Simulation 33: 385–390.
  37. 37. Hutchinson GE (1957) Population Studies - Animal Ecology & Demography - Concluding Remarks. Cold Spring Harbor Symposia on Quantitative Biology 22: 415–427.
  38. 38. Stockwell D, Peters D (1999) The GARP modelling system: Problems and solutions to automated spatial prediction. International Journal of Geographical Information Science 13: 143–158.
  39. 39. Peterson AT (2003) Predicting the geography of species' invasions via ecological niche modeling. Quarterly Review of Biology 78: 419–433.
  40. 40. Wisz MS, Hijmans R, Li J, Peterson AT, Graham CH, et al. (2008) Effects of sample size on the performance of species distribution models. Diversity and Distributions 14: 763–773.
  41. 41. Olden JD, Jackson DA (2002) A comparison of statistical approaches for modelling fish species distributions. Freshwater Biology 47: 1976–1995.
  42. 42. Elith J, Graham CH, Anderson RP, Dudik M, Ferrier S, et al. (2006) Novel methods improve prediction of species' distributions from occurrence data. Ecography 29: 129–151.
  43. 43. Peterson AT, Papeş M, Soberón J (2008) Rethinking receiver operating characteristic analysis applications in ecological niche modelling. Ecological Modelling 213: 63–72.
  44. 44. Lachner EA, Robins CR, Courtenay WR (1970) Exotic fishes and other aquatic organisms introduced into North America. Smithsonian Contributions to Zoology 59: 1–29.
  45. 45. Baird SF (1879) Report of the United States Fish Commission for 1876–1877. In: Commission USF, editor. Washington, D.C.: U.S. Government Printing Office.
  46. 46. Cole LJ (1905) The German carp in the United States. pp. 523–641.
  47. 47. Fuller PL, Nico L, Williams J (1999) Nonindigenous fishes introduced into inland waters of the United States. Bethesda, MD: American Fisheries Society.
  48. 48. Bellrichard SJ (1996) Effects of common carp (Cyprinus carpio) on submerged macrophytes and water quality in a backwater lake on the upper Mississippi River. La Crosse, WI: University of Wisconsin - La Crosse.
  49. 49. Pflieger WL (1975) The fishes of Missouri. Jefferson City, MO: Missouri Department of Conservation.
  50. 50. Baughman JL (1947) The tench in America. Journal of Wildlife Management 11: 197–204.
  51. 51. (2008) Nonindigenous Aquatic Species. United States Geological Survey.
  52. 52. Scott WB, Crossman EJ (1998) Freshwater Fishes of Canada. Ottawa: Fisheries Research Board of Canada.
  53. 53. Guillory V, Gasaway RD (1978) Zoogeography of grass carp in the United States. Transactions of the American Fisheries Society 107: 105–112.
  54. 54. Bain MB (1993) Assessing impacts of introduced aquatic species — Grass carp in large systems. Environmental Management 107: 221–224.
  55. 55. Stanley JG, Miley WW, Sutton DL (1978) Reproductive requirements and likelihoodfor naturalization of escaped grass carp in the United States. Transactions of the American Fisheries Society 107: 119–128.
  56. 56. Venable DL, Gaude AP, Klerks PL (2000) Control of the trematode Bolbophorus confusus in channel catfish (Ictalurus punctatus) ponds using salinity manipulation and polyculture with black carp (Mylopharyngodon piceus). Journal of the World Aquaculture Society 31: 158–166.
  57. 57. Ferber D (2001) Will black carp be the next zebra mussel? Science 292: 203.
  58. 58. (2000) Injurious wildlife: Review of information concerning black carp (Mylopharyngodon piceus). pp. 35314–35315.
  59. 59. Lambrecht B (2000) Invasion of Asian carp worries officials. St. Louis, MO: St Louis Post-Dispatch.
  60. 60. Nico LG, Williams JD, Herod JJ (2001) Carp (Mylopharyngodon piceus): A Biological Synopsis and Updated Risk Assessment. Gainesville, FL: Biological Resources Division, U.S. Geological Survey.
  61. 61. Chick JH, Maher RJ, Burr BM, Thomas MR (2003) First black carp captured in U.S. Science 300: 1876–1877.
  62. 62. (2005) The Species Analyst. Lawrence, KS: University of Kansas.
  63. 63. Froese R, Pauly D (2008) FishBase. World Wide Web electronic publication. pp.
  64. 64. Berg LS (1949) Freshwater Fishes of the U.S.S.R. and adjacent countries (Translated from Russian). Jerusalem, Israel: Smithsonian Institution and the National Science Foundation.
  65. 65. Nichols JT (1943) The freshwater fishes of China. New York: The American Museum of Natural History.
  66. 66. Banarescu P (1964) Pisces - Osteichthyes. Bucharest.
  67. 67. Keith P, Allardi J (2001) Atlas des poissons d'eau douce de France. Paris.
  68. 68. Stockwell DRB, Peterson AT (2002) Effects of sample size on accuracy of species distribution models. Ecological Modelling 148: 1–13.
  69. 69. (2008) GEOnet Names Server. Bethesda, MD: United States National Geospatial-Intelligence Agency.
  70. 70. (2008) Geographic Names Information System. United States Geological Survey (USGS).
  71. 71. (1979) China. Central Intelligence Agency.
  72. 72. Nico LG, Williams JD (1996) Risk Assessment on black carp (Pisces: Cyprinidae). Final report to the risk assessment and management committee of the aquatic nuisance species task force. Gainesville, FL: Biological Resources Division, U.S. Geological Survey.
  73. 73. Guisan A, Graham CH, Elith J, Huettmann F, Dudik M, et al. (2007) Sensitivity of predictive species distribution models to change in grain size. Diversity and Distributions 13: 332–340.
  74. 74. Guisan A, Zimmermann NE, Elith J, Graham CH, Phillips S, et al. (2007) What matters for predicting the occurrences of trees: techniques, data, or species' characteristics? Ecological Monographs 77: 615–630.
  75. 75. Peterson AT, Cohoon KP (1999) Sensitivity of distributional prediction algorithms to geographic data completeness. Ecological Modelling 117: 159–164.
  76. 76. Guisan A, Zimmermann NE (2000) Predictive habitat distribution models in ecology. Ecological Modelling 135:
  77. 77. Bowden GJ, Maier HR, Dandy GC (2003) An evaluation of methods for the selection of inputs for an artificial neural network based river model. In: Recknagel F, editor. Ecological Informatics: Understanding Ecology by Biologically-Inspired Computation. New York: Springer Verlag. pp. 215–232.
  78. 78. Anderson RP, Lew D, Peterson AT (2003) Evaluating predictive models of species' distributions: criteria for selecting optimal models. Ecological Modelling 162: 211–232.
  79. 79. Scachetti-Pereira R (2002) Desktop GARP version 1.1.3. Lawrence, KS: University of Kansas Natural History Museum.
  80. 80. Anderson RP, Lew D, Peterson AT (2003) Evaluating predictive models of species' distributions: Criteria for selecting optimal models. Ecological Modelling 162: 211–232.
  81. 81. Hanley JA, McNeil BJ (1983) A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 148: 839–843.
  82. 82. Centor RM (1991) Signal Detectability - the use of ROC curves and their analyses. Medical Decision Making 11: 102–106.
  83. 83. Fielding AH, Bell JF (1997) A review of methods for the assessment of prediction errors in conservation presence/absence models. Environmental Conservation 24: 38–49.
  84. 84. Wiley EO, McNyset KM, Peterson AT, Robins CR, Stewart AM (2003) Niche modeling and geographic range predictions in the marine environment using a machine-learning algorithm. Oceanography 16: 120–127.
  85. 85. Lobo JM, Jimenez-Valverde A, Real R (2007) AUC: A misleading measure of the performance of predictive distribution models. Global Ecology and Biogeography.
  86. 86. Seaber PR, Kapinos RP, Knapp GL (1987) Hydrologic unit maps. In: Survey USG, editor.
  87. 87. Masuda H, Amaoka C, Araga T, Uyeno T, Yoshino T (1984) The fishes of the Japanese archipelago. Tokyo: Tokai University Press.
  88. 88. Biro P (1999) Mylopharyngodon piceus. In: Banarescu P, editor. The freshwater fishes of Europe. Wiebelsheim, Germany: AULU-Verlag.
  89. 89. Loo SE, Mac Nally R, Lake PS (2007) Forecasting New Zealand mudsnail invasion range: Model comparisons using native and invaded ranges. Ecological Applications 17: 181–189.
  90. 90. Marchetti MP, Light T, Moyle PB, Viers JH (2004) Fish invasions in California watersheds: Testing hypotheses using landscape patterns. Ecological Applications 14: 1507–1525.
  91. 91. Kolar CS, Lodge DM (2002) Ecological predictions and risk assessment for alien fishes in North America. Science 298: 1233–1236.
  92. 92. Marsh-Matthews E, Matthews WJ (2000) Geographic, terrestrial, and aquatic factors: which most influence the structure of stream fish assemblages in the midwestern United States? Ecology of Freshwater Fish 9: 9–21.
  93. 93. Allan JD (1995) Stream Ecology: Structure and function of running waters. London: Chapman & Hall.
  94. 94. Angermeier PL, Krueger KL, Dolloff CA (2002) Discontinuity in stream-fish distributions: Implications for assessing and predicting species occurrence. In: Scott JM, Heglund PJ, Morrison ML, Haufler JB, Raphael MG, et al., editors. Predicting species occurrences: issues of accuracy and scale. Covelo, CA: Island Press.
  95. 95. Tonn WM (1990) Climate change and fish communities — a conceptual framework. Transactions of the American Fisheries Society 119: 337–352.
  96. 96. Poff NL (1997) Landscape filters and species traits: Towards mechanistic understanding and prediction in stream ecology. Journal of the North American Benthological Society 16: 391–409.
  97. 97. Naylor RL, Williams SL, Strong DR (2001) A gateway for exotic species. Science 294: 1655–1656.
  98. 98. Evtushenko N, Potrokhov A, Zinkovskii O (1994) The black carp as a subject for acclimation. Hydrobiological Journal 30: 1–10.
  99. 99. (2007) Prevention of Aquatic Invasive Species Act of 2007. 110th Congress ed.
  100. 100. (2007) National Aquatic Invasive Species Act of 2007. 110th Congress ed.
  101. 101. (IPCC) IPoCC (2001) Climate data archive.
  102. 102. Hansen CM, DeFries RS, Townsend JRG, Carroll M, Dimiceli C, et al. (2003) Global percent tree cover at a spatial resolution of 500 meters: First Results of the MODIS vegetation continuous fields algorithm.
  103. 103. (2001) HYDRO1k elevation derivative database. United States Geological Survey (USGS).