Figures
Abstract
The spotted lanternfly (Lycorma delicatula) has recently spread from its native range to several other countries and forecasts predict that it may become a global invasive pest. In particular, since its confirmed presence in the United States in 2014 it has established itself as a major invasive pest in the Mid-Atlantic region where it is damaging both naturally occurring and commercially important farmed plants. Quarantine zones have been introduced to contain the infestation, but the spread to new areas continues. At present the pathways and drivers of spread are not well-understood. In particular, several human activity related factors have been proposed to contribute to the spread; however, which features of the current spread can be attributed to these factors remains unclear. Here we collect county level data on infestation status and four specific human activity related factors and use statistical methods to determine whether there is evidence for an association between the factors and infestation. Then we construct a network model based on the factors found to be associated with infestation and use it to simulate local spread. We find that the model reproduces key features of the spread 2014 to 2021. In particular, the growth of the main infestation region and the opening of spread corridors in the westward and southwestern directions is consistent with data and the model accurately forecasts the correct infestation status at the county level in 2021 with 81% accuracy. We then use the model to forecast the spread up to 2025 in a larger region. Given that this model is based on a few human activity related factors that can be targeted, it may prove useful to incorporate it into more elaborate predictive forecasting models and in informing management efforts focused on interstate highway transport and garden centers in the US and potentially for current and future invasions elsewhere globally.
Citation: Strömbom D, Sands A, Graham JM, Crocker A, Cloud C, Tulevech G, et al. (2024) Modeling human activity-related spread of the spotted lanternfly (Lycorma delicatula) in the US. PLoS ONE 19(8): e0307754. https://doi.org/10.1371/journal.pone.0307754
Editor: Sean Michael Prager, University of Saskatchewan College of Agriculture and Bioresources, CANADA
Received: December 6, 2023; Accepted: July 8, 2024; Published: August 14, 2024
Copyright: © 2024 Strömbom et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting information files and all code required to replicate the results in this manuscript can be found at https://github.com/danielstrombom/SLFS.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The spotted lanternfly (Lycorma delicatula) is an insect native to China, India and Vietnam that is currently considered an invasive pest in South Korea, Japan and the United States [1, 2]. In regions where it is invasive, it is a threat to many naturally occurring plants and a number of economically important interests including the grape, orchard and lumber industries [3, 4].
The spotted lanternfly was first confirmed in the US in 2014 and has since rapidly spread and expanded its distribution [5]. Quarantine zones and permitting by respective state Departments of Agriculture have been implemented to restrict the movement of certain products and any living life stage of the spotted lanternfly in an attempt to limit the spread to new regions [6]. In areas where the lanternfly is already present a number of control measures, including insecticides [7], parasite-based biological controls [8, 9], traps [10], egg scraping [11], and removal of the primary host plant [12] have been applied. However, these efforts appear to be insufficient given that the lanternfly is still spreading rapidly to new regions [4, 13]. In addition, earlier published niche models based on climate and other factors forecast that the spotted lanternfly may eventually establish itself in a much larger portion of the US than it currently occupies and elsewhere globally [2, 14]. However, these models only forecast the eventual maximal distribution, they do not provide information about how the lanternfly is likely to spread from its current limited distribution to this maximal distribution [15] nor what the key drivers of the local spread are. To address the issue of forecasting how the spotted lanternfly, and other pests and pathogens, might spread over time a species agnostic dynamic spatio-temporal forecasting system was recently introduced [16]. Unlike previous models for the lanternfly infestation in the US this system can also provide estimates for when the lanternfly is expected to arrive in a particular region, rather than just that it eventually might. For example, it has been used to forecast that there is a high chance that California will have lanternfly infestations in 2033 [17]. This system represents a significant advance in dynamic forecasting and has been designed using a data-driven approach employing standard techniques that are ideal for models whose primary objective is prediction and forecasting [18, 19]. However, due to their high complexity and data-driven nature predictive models are less suitable for investigating which features of the spread specific proposed key drivers may be able to explain, and if so how to best target these to limit the spread. To address such questions minimal explanatory models based on a limited number of factors in isolation is more appropriate [20].
The spotted lanternfly is known to spread through two distinct processes; short distance dispersal and long distance dispersal [9]. Short distance dispersal includes natural flight capabilities of up to 40m to infest new host plants [21] and/or local human assisted transport [22]. Long distance dispersal refers to establishment of satellite populations away from the main area of infestation [9], and several potential pathways and drivers of long distance dispersal in the spotted lanternfly have been proposed in the literature. In particular, a number of human activity related factors including the transportation of packing materials and vehicles [1, 3], transportation of materials related to gardening and plants [23], and human activity/population density [9, 13, 24, 25]. Unlike climate, geography, host plant distribution, temperature, and other non-human activity related factors that are known, or believed, to affect the spread and settlement of the lanternfly, these human activity related factors may be more readily targetable and affectable by management efforts. A minimal model based on such factors capable of replicating key features of the actual spread could be used as a framework for assessing proposed management efforts before they are deployed, and potentially even to help determine what they should be.
Results
The data collected for a 166 county region that contained all infested counties [5] on September 28th 2021 (Fig 1A), including number of garden centers (Fig 1B), primary interstate highways (Fig 1C) and population (Fig 1D) is presented in Fig 1.
(A) County map showing the counties designated as infested by the New York State Integrated Pest Management [5] on September 28th 2021 and the 166 county region defined as the convex hull of those infested counties. (B) The number of garden centers present in each county. (C) The number of primary interstate highways transecting each county. (D) The population of each county on a log scale.
By fitting a generalized additive logistic regression model [26–28] we find that the data provides evidence for the increase in likelihood of infestation in a county with the presence of interstate highways and larger populations (Table 1). Additionally, the analysis also suggests that the data provides evidence for the increase in likelihood of infestation in a county with an increase in the number of garden centers only when there is also the presence of a primary interstate highway in that county.
Results are on the logit (log-odds) scale, that is, the scale of the predictor variables.
See the Methods section for information about the data collection and the statistical methods used.
Model
Based on insights presented in the literature and the results of our statistical analyses above we construct a network model of the spread consisting of two components. General short distance spread on the network of adjacent counties and long distance spread on the primary interstate highway network with garden center and population density dependent infestation probabilities. The components and parameters of the model are described in Table 2.
The spread on each of the two interconnected networks follow the independence model introduced in Leung et al. 2004 (See equation (1) in [29]) modified to include the garden center and population dependent spread on the primary interstate highway network. More specifically, the spread on the adjacency network is given by (1) and the garden center dependent spread on the primary interstate highway network by (2) and the population dependent spread on the primary interstate highway network by (3)
We implemented the combined modes of spread over the interconnected adjacency and primary interstate highway networks in Matlab and analyzed the model via simulations. See the Methods section for details on model construction, parameterization, implementation, simulation and assessment.
The results of simulating the model in the 166 county region starting with Berks county, PA, as the only infested county in 2014 and recording the proportion of simulations that yielded each county infested in each year 2015–2021 is presented in the bottom row of Fig 2. We note that the model constructed based on the included factors reproduces key characteristics of the observed spread both qualitatively and quantitatively. Comparing the Top and Bottom rows in Fig 2 shows that not only is the model forecasted growth of the infested region relatively consistent with the observed over the years, but the model also forecasts the opening up of spread corridors from the main infestation region around Berks County, PA, west toward the Pittsburgh area and south-southwest through VA. In particular, the model starts suggesting already in 2017–18 that Alleghany County, PA, had an increasing risk of infestation before it was finally designated as infested in 2020. Similar observations are made in each main direction of infestation spread. Quantitatively, using the rule that if more than half of the simulations resulted in a county being infested to designate a county as infested according to the model, the model predicts the correct 2021 county infestation status with 81% accuracy. More specifically, it forecasts the infestation correctly in 135 counties and incorrectly in 31 counties. Of the 31 mismatches there are 16 false positives, where the model suggests the county should be infested but the data disagrees, and 15 false negatives, where the model suggests the county should not be infested but the data says it is.
The Top row displays the actual observed infestation data from 2014–21. The Bottom row displays the corresponding model forecasts for 2015–21. The coloring indicates the proportion of simulations in which a county became infested in a specific year according to the model. More specifically, a county is solid green if all 1000 simulations rendered it infested in a particular year and white if none of the 1000 simulations resulted in it being infested.
Fig 3 shows the percentage of simulations in which the model predicted the correct infestation status of each county in each year. We note that as counties around the infestation source county (Berks County, PA) start to become infested in 2015 and onwards, an annular-shaped region of lower model accuracy diffuses out from the source and other low accuracy regions emerge at remote locations, in particular, in western PA. Over time the accuracy of the model increases behind the moving annular-shaped region as well as in the remote locations.
Here we display the percentage of simulations that predicted the correct infestation status for each county in each year. The coloring of each county in each year indicates the percentage of simulations that resulted in a correct prediction of its actual infestation status. That is, the percentage of simulations that resulted in it being infested if it indeed was infested, or the percentage of simulations that resulted in it not being infested if it was not infested. More specifically, a county is colored solid blue if all simulations predicted the correct infestation status in a given year (100%) and white if none of the 1000 simulations predicted the correct infestation status (0%).
Once the model had been assessed in the 166 county region the same parameterized model was simulated in a larger 581 county region. These simulations in the larger region forecasts that the spread along the western corridor through OH and IN, the southwestern corridor on the border of WV and VA, and the southern corridor through VA into NC, will intensify up to the year 2025 according to the model (Fig 4).
We note that the model suggests that the corridors of spread west, southwest, and south will all intensify over the years. In particular, the model suggests that the area around Cleveland, OH, has had an increased and increasing risk of infestation at least since 2021, and it was recently labeled as infested by the New York State Integrated Pest Management Program. The model also forecasts that the infestation in NC will increase substantially over the next few years.
See the Methods section for more detailed descriptions of the model, model assessment, and the simulation protocols.
Discussion
The computational model created based on the human activity related factors; primary interstate highways, garden centers and population reproduce both qualitative and quantitative features of the spread through 2014–21. Using the computational model over a larger region of the US to forecast the spread of infestation beyond 2021 suggests that spread will accelerate over this period if the spread is driven by the factors included in the model.
Despite being based on only three specific statistically supported human activity related factors, our model not only reproduces qualitative features of the actual spread, but also forecasts the correct infestation status for 81% of the counties included in the assessment region in 2021. This, together with the observed infestation pattern itself (Fig 2 Top row) supports the assertion that the primary interstate highway system may be a critical pathway for the long distance dispersal of the spotted lanternfly in the US and that human activity in general, and business dealing in plants and plant related materials in particular, are associated with the dispersal along this pathway. While this is consistent with findings and propositions by others as described previously, we note that our finding that primary interstate highways are associated with infestation, rather than road density in general [13] or even all interstate highways, may be a particularly useful insight. There are relatively few primary interstate highways which may make targeting for management and modeling based on those easier and less complex than if a larger classes of roads and transportation pathways are considered. In addition, our statistical finding that garden centers only appear to be associated with increased risk of infestation when primary interstate state highways are present may be useful for prioritizing monitoring and/or management efforts involving garden centers.
Our model does not take into account climate, geography, host plant distribution, temperature, and other factors that are known, or believed, to affect the spread and settlement of the lanternfly. In particular, it does not take into account any other potential mechanism of long distance dispersal, e.g. transport via rail [16, 17], air or waterways. This may explain why the model forecasts the incorrect infestation status in 19% of included counties. If a factor we do not include is severely disadvantageous to lanternfly infestation that factor could explain some of the 16 false positives, and if the excluded factor is severely advantageous to lanternfly infestation it could explain some of the 15 false negatives. However, other explanations for, in particular, the false positives do exist. Notably that sampling effort by authorities and reporting by the public is not uniform throughout the region under consideration [13] so lanternfly infestations may be present in a county but not reported. Further investigation seeking to explain these discrepancies may contribute to our understanding of the actual spread. In particular, the 2021 results because the region of consideration was chosen based on the actual infestation range at that time, but the model accuracy analysis at the county level over time (Fig 3) may also be useful to guide further investigation to explain discrepancies that may be temporal in nature. For example, non-uniform sampling effort causing delayed classification as infested by authorities [13]. While it may eventually be useful to add more factors and/or integrate the factors of our model into niche models [2, 14] and/or dynamic forecasting systems [16, 17] at present its current minimal form is advantageous. Primarily because it is already reasonably effective and adding further elements would increase its complexity and reduce its ability to elucidate within model causal effects [20] of the specific human activity related factors included in our model on the spread of infestation. However, once the infestation has progressed further, both the niche models and the dynamic forecasting system suggest that the conditions for successful establishment of infestation drops significantly (Fig 5 in [17]). If those forecasts are correct our model will produce false positives in these new regions because it does not take into account climate and other factors that they do. If/when the infestation reaches areas with low risk of infestation it will not only serve as a test of our model but also both the niche models [2, 14] and forecasting system [16, 17]. If the spread significantly slows down or stops our model needs revision, but if it proceeds at the same rate in designated low risk areas or move into areas forecasted as unlikely to be suitable the niche models and forecasting system, and potentially our model too, need revision.
In the larger region our model forecasts that the lanternfly infestation will continue its long range dispersal along the primary interstate highway system in all directions and short distance dispersal will drive the spread outward from the new satellite infestations along these routes (Fig 4). This information may be useful on the ground as a complement to the forecasting system in [17] that includes long distance dispersal via rail networks, rather than highway networks as we do here, to forecast where lanternfly infestation is likely to occur next when planning for the application of control measures. In part, because preventing infestation or managing an infestation early is preferable to dealing with a full blown infestation later on [30], but also because of the findings in [31]. Based on available data on the life-stage survival and fecundity [31] estimated that the average annual growth rate of the US spotted lanternfly population is about 5.47. Given this rapid growth even a control measure that kills 100% of treated lanternfly require at least 35% of all lanternfly present in all stages to be treated to induce even the slightest decline in the annual population and this percentage increases rapidly for less effective controls. Given that resources are limited this suggests that using them in the right locations may be critically important. For example, our model forecasts that over the next few years the infestation will intensify in North Carolina (Fig 4). This state has become one of the largest wine producing states with over 525 individually owned vineyards in 2017 [32] and in previously infested regions the grape industry has been particularly adversely affected with one vineyard in Pennsylvania reporting a 90% loss in yield [6]. Our work, including both model and statistical analysis, in conjunction with the infestation data suggests that spread occurs along the primary interstate highways, so efforts to limit the spread risk southward along primary interstates 77, 85, and 95 may lower the risk for North Carolina’s wine industry. We are aware that the Pennsylvania Department of Agriculture are focusing on transportation corridors such as major highways and tourist destinations [33], but we are unable to find information about which specific highways or areas they are targeting and/or what decisions to target these are based on. Our model, or a model similar to it, could help inform such decisions. Whether it is to assess two or more competing management alternatives with given estimated county-level impacts before implementation, or to minimize spread to a particular region, or even to minimize the overall spread. Comparing alternatives will be particularly straightforward since that would amount to changing county level parameters. For example, say that some action is estimated to decrease the interstate highway mediated long-distance dispersal through a county by 30%, then the weights on the network links from that county would be reduced by 30% and new simulations run to observe the outcome. Minimizing spread to a particular region, or minimizing the overall spread, requires more work, but there is a wealth of analysis tools available for entities that spread on networks (e.g. infections [34, 35]) that could be used to rank the nodes (counties) and the links (transmission pathways) based on their contribution to the overall spread. We believe that using minimal explanatory modeling as a complement to the existing data-driven predictive models [2, 14, 16, 17] and other approaches to assess where the limited resources available to combat the infestation should be deployed for maximum impact with respect to whichever objective is set by professionals on the ground will be necessary to significantly reduce the spread of lanternfly in the US. The low complexity model based on statistically established factors we present here represents one step in this direction.
Methods
Data collection
First, we defined a region for our statistical analyses and model assessment such that it is plausible that the lanternfly would have spread to and infested the counties in this region if the conditions had been favorable by following the convex hull approach used in [13]. This method led to a 166 county region as the convex hull of the counties designated as infested by the New York State Integrated Pest Management [5] on September 28th 2021 (Fig 1A). We also collected data on the number of garden centers (Fig 1B), primary interstate highways (Fig 1C) and population (Fig 1D) for each county in the region. More specifically, for each county in the 166 county region we collected
- The names and FIPS codes of its adjacent counties from the United States Census Bureau [36].
- The year in which it was designated as infested by consolidating information from the New York State Integrated Pest Management [5], the Pennsylvania Department of Agriculture [37–40], State of New Jersey Department of Agriculture [41], and the Virginia Polytechnic Institute and State University [42].
- The number of businesses designated as garden centers (retail or wholesale). This data was collected in October 2021 by manually searching “Garden centers in [county name] county” in Google Maps [43] and verifying that each search result was indeed a garden center that lies within the county boundary. (Fig 1B).
- The 2019 population estimate from the United States Census Bureau [44]. (Fig 1D).
- The primary (two-digit) interstate highways that transect it and their identification numbers from Google Maps [43] (Fig 1E).
In the larger 581 county region we collected the same data using the same sources. The exception being the infestation year because no county outside the 166 county region was infested at that time.
See the Data availability statement for how to access the complete data for the 166 and 581 county regions.
Statistical analysis to determine relevance of factors
For a statistical test of our hypotheses we used R version 4.3.1 and package mgcv version 1.9 to fit a generalized additive logistic regression model [26–28]. Specifically, we model the log-odds of county infestation in 2021 with tensor product smoothing for longitude and latitude to control for spatial-autocorrelation and include linear parametric terms for presence/absence of two-digit interstate highway, number of garden centers, county population, and an interaction term for presence/absence of primary (two-digit) interstate highway and number of garden centers. To deal with issues of convergence and variables of different scale, we log transformed the population and normalized the number of garden centers. We used standard methods for model diagnostics and model selection to determine the best possible combination of predictor variables to come to a parsimonious model such that the necessary distributional assumptions are met [45, 46]. We used the packages DHARMa version 0.4.6 and gratia version 0.8.1.34 for diagnostics to assess model assumptions [47, 48]. Fig 5 shows the estimates on the odds-ratio scale, that is, the ratio of the probability of infestation to the probability of no infestation for the parametric terms in our generalized additive logistic regression, as a complement to Table 1 which displays the values for the same estimated coefficients for the linear parametric terms on the log-odds scale, that is, the scale of the predictor variables.
Results are on the scale of odds-ratio, that is, the ratio of the probability of infestation to the probability of no infestation. The likelihood of infestation for a county in 2021 is predicted to increase with the presence of primary interstate highways and larger populations. Interestingly, the likelihood of infestation for a county in 2021 is predicted to increase with an increase in the number of garden centers only when there is also the presence of a primary interstate highway for that county.
See the Data availability statement for how to access the complete R code needed to reproduce these results.
Model construction, parameterization, implementation, simulation and assessment
The model presented in the results section (Table 2 and Eqs 1–3) was implemented in Matlab and here we describe the main model related elements in more detail. For further information, see the complete code for the model and simulation framework that may be found via the information in the Data availability statement.
Model spaces.
The model space consists of two interconnected networks with the included counties as the nodes. See Fig 6. The adjacency matrices for the adjacent counties networks (A) were constructed using data from [36]. The adjacency matrices for the primary interstate highway networks (H) were created using collected data on which primary interstates transect each county.
The model space for the 166 county and the 581 county models consist of the combination of their adjacency and primary interstate networks. We note that the adjacency networks in both cases are fully connected, but that the primary interstate networks are not.
Model parameterization.
We fit the parameter p that represents the probability that an uninfested county adjacent to an infested county becomes infested from one year to the next year using the infestation data for 2014–2020 and following the approach in [29]. More specifically, we proceed in the following steps.
- Find the ‘time t to time t + 1’ infestation distribution as a function of infested adjacent counties using the 2014–2020 data. For all counties that are not infested at time t count the number of infested adjacent counties Ni they have and calculate the proportion pI of the counties that got infested in the next time step t + 1.
- Use the result from step 1 to fit the parameter a in . The Matlab function fit was used for this and the result was a = 0.2902 with 95% confidence interval (0.2048, 0.3756).
- Use a to calculate the parameter p via p = 1 − e−a which yields p = 0.2519.
Model implementation.
See Table 1 for the symbols and description of the model components. Each simulation starts with Berks county, PA, as the only infested county in 2014. Berks county is county number 76 in our 166 county data so at time 0 (2014) the infestation vector, which is a 166 element column vector, contains only 0’s except for the 76th element which is a 1. Then on each timestep t from 1 to T the infestation vector is updated by applying Eqs 1–3 and adding the result as a new column in a matrix Q via the following Matlab code
Q=[q]; %Initiate the matrix for collecting infested over time.
for t = 1:T
q=(1-(1-p).^(A*q))>rand(N,1)-q; % adjacent spread
q=(1/2)*(1-(1-sg).^(H*q))>rand(N,1)-q; % garden c interstate spread
q=(1/2)*(1-(1-sp).^(H*q))>rand(N,1)-q; % population interstate spread
Q=[Q,q]; %add updated infestation vector to matrix
end
The output Q is a matrix that contains the infestation status of each county (rows) in each year from 2014 to 2014+T (columns) resulting from that simulation. Note that this matrix Q only contain 0’s and 1’s, and that the element in row j and column i is 1 if the simulation resulted in county j being infested in year 2014+i, and 0 if it was not infested in that year. sg is the garden center establishment probability vector ( in Table 2) and sp is the population establishment probability vector ( in Table 2). The reason for including the -q in the right hand sides of the implementation of each of the model equations is to ensure that counties that have once become infested stay infested. This works because if qi = 1 then the corresponding left hand side is always larger than the right hand side and county i will remain infested. This reflects reality, as no county that has become infested has subsequently become uninfested during the current US infestation.
Model simulation and assessment.
To assess the accuracy of the model in the 166 county region we ran 1000 simulations starting with Berks county, PA, as the only infested county in 2014 and calculated the proportion of simulations that yielded each county infested in each year 2015–2021. The calculation of the proportions were carried out by adding the output matrices Q from the 1000 simulations, which results in a new matrix that encodes the number of simulations in which each county was infested at each time, and then multiplying this new matrix by (1/1000) gives the proportion of simulations in which each county was infested at each time. Model assessment was carried out qualitatively by comparing infestation maps with model forecast maps through 2014–21 and quantitatively by defining a county infested if more than half of the simulations resulted in that specific county being infested in 2021. The qualitative comparison focused on comparing the overall annual range of the infestation and on the empirically observed corridors of spread west into OH, southwest along the VA-WV border, and south through VA. The quantitative assessment consists of two parts; one aggregate for 2021 and one for accuracy at the county level over time. The aggregate assessment was carried out by calculating the proportion of counties in the assessment region (Fig 1A) that the model accurately predicted the infestation status of. For example, if the 2021 data says that a county is infested and more than 500 (of 1000) model simulations resulted in it being infested that is a match, as is if both model and data says not infested. Any disagreement between model and data on a particular county was recorded as a mismatch. The county level assessment of model accuracy in prediction over time was performed as follows. Denote the proportion of the 1000 simulations that resulted in a particular county i being infested at time t by pi,t. Then the accuracy of the model to predict the correct infestation status of that county (in %) is 100pi,t if the county was actually infested at that time, and 100(1 − pi,t) if the county was not infested at that time. For example, if county i was infested at time t in 600 of the 1000 simulations then pi,t = 0.6 and if the data says it was infested at this time the model predicts it correctly in 60% of simulations, and if the data says it was not infested at this time the model predicts it correctly in 40% (100(1 − 0.6)) of simulations. The result of these accuracy calculations for each county at each time is displayed in Fig 3 and the exact values, as well as the the formulas used for the calculations, may be found in the ‘166 Region model Accuracy’ tab in supplementary materials S1 Data.
Once the model had been assessed in the 166 county region we extended the region to include 581 counties and collected data for each of the factors included in the original 166 county model. To forecast the future spread in the 581 county region we used the same parameterized model and approach as for the simulation in the 166 county region, but simulated for a longer period of time (2014–25). We again started with Berks County, PA, as the only infested county in 2014 and ran 1000 simulations and collected the proportion of simulations that each county got infested in each year up to and including 2025.
Supporting information
S1 Data. Data and simulation results.
This Excel file contains all the data collected and generated in this study. It contains 5 tabs; 166 Region Data, 581 Region Data, 166 Region Model, 166 Region Model Accuracy, and 581 Region Model. The two Data tabs contain the collected data for each county in the respective regions. Specifically, county name, FIPS, number of garden centers, number of primary interstate highways (and their numbers) and population. The Model tabs contain the simulation outputs used to obtain the results presented in the manuscript with interactive maps. Readers can use these to run their own new simulations in Matlab (with or without changes to the code), export the results to the corresponding tab in this Excel file and see the results in map form. The 166 Region Model and the 166 Region Model Accuracy tabs also contains the calculations for the quantitative results, i.e. the number of model-data matches, false positives and false negatives, and model accuracy at the county level over time.
https://doi.org/10.1371/journal.pone.0307754.s001
(XLSX)
References
- 1. Dara SK, Barringer L, Arthurs SP. Lycorma delicatula (Hemiptera: Fulgoridae): a new invasive pest in the United States. Journal of Integrated Pest Management. 2015;6(1):20.
- 2. Jung JM, Jung S, Byeon Dh, Lee WH. Model-based prediction of potential distribution of the invasive insect pest, spotted lanternfly Lycorma delicatula (Hemiptera: Fulgoridae), by using CLIMEX. Journal of Asia-Pacific Biodiversity. 2017;10(4):532–538.
- 3. Barringer LE, Donovall LR, Spichiger SE, Lynch D, Henry D. The first new world record of Lycorma delicatula (Insecta: Hemiptera: Fulgoridae). Entomological news. 2015;125(1):20–23.
- 4. Urban JM, Leach H. Biology and management of the spotted lanternfly, Lycorma delicatula (Hemiptera: Fulgoridae), in the united states. Annual Review of Entomology. 2023;68:151–167. pmid:36206772
- 5.
New York State Integrated Pest Management. Spotted lanternfly. https://cals.cornell.edu/new-york-state-integrated-pest-management/outreach-education/whats-bugging-you/spotted-lanternfly/spotted-lanternfly-reported-distribution-map; 2023.
- 6. Urban JM. Perspective: shedding light on spotted lanternfly impacts in the USA. Pest management science. 2020;76(1):10–17. pmid:31525270
- 7. Leach H, Biddinger DJ, Krawczyk G, Smyers E, Urban JM. Evaluation of insecticides for control of the spotted lanternfly, Lycorma delicatula,(Hemiptera: Fulgoridae), a new pest of fruit in the Northeastern US. Crop Protection. 2019;124:104833.
- 8. Liu H. Occurrence, seasonal abundance, and superparasitism of Ooencyrtus kuvanae (Hymenoptera: Encyrtidae) as an egg parasitoid of the spotted lanternfly (Lycorma delicatula) in North America. Forests. 2019;10(2):79.
- 9. Lee DH, Park YL, Leskey TC. A review of biology and management of Lycorma delicatula (Hemiptera: Fulgoridae), an emerging global invasive species. Journal of Asia-Pacific Entomology. 2019;22(2):589–596.
- 10. Francese JA, Cooperband MF, Murman KM, Cannon SL, Booth EG, Devine SM, et al. Developing traps for the spotted lanternfly, Lycorma delicatula (Hemiptera: Fulgoridae). Environmental Entomology. 2020;49(2):269–276. pmid:31990325
- 11. Cooperband MF, Mack R, Spichiger SE. Chipping to destroy egg masses of the spotted lanternfly, Lycorma delicatula (Hemiptera: Fulgoridae). Journal of Insect Science. 2018;18(3):7. pmid:29868780
- 12.
Spotted lanternfly control program in the mid-Atlantic region–environmental assessment. USDA APHIS, Washington DC. https://www.aphis.usda.gov/plant_health/ea/downloads/2020/slf-mid-atlantic-region.pdf; 2020.
- 13. Cook RT, Ward SF, Liebhold AM, Fei S. Spatial dynamics of spotted lanternfly, Lycorma delicatula, invasion of the Northeastern United States. NeoBiota. 2021;70:23.
- 14. Wakie TT, Neven LG, Yee WL, Lu Z. The establishment risk of Lycorma delicatula (Hemiptera: Fulgoridae) in the United States and globally. Journal of Economic Entomology. 2020;113(1):306–314. pmid:31579914
- 15. Hao T, Elith J, Guillera-Arroita G, Lahoz-Monfort JJ. A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD. Diversity and Distributions. 2019;25(5):839–852.
- 16. Jones CM, Jones S, Petrasova A, Petras V, Gaydos D, Skrip MM, et al. Iteratively forecasting biological invasions with PoPS and a little help from our friends. Frontiers in Ecology and the Environment. 2021;19(7):411–418. pmid:34588928
- 17. Jones C, Skrip MM, Seliger BJ, Jones S, Wakie T, Takeuchi Y, et al. Spotted lanternfly predicted to establish in California by 2033 without preventative management. Communications Biology. 2022;5(1):1–9. pmid:35676315
- 18.
Kuhn M, Johnson K. Applied predictive modeling. 1st ed. Springer, New York, NY; 2013.
- 19.
Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB. Bayesian data analysis. 3rd ed. Texts in Statistical Science Series. CRC Press, Boca Raton, FL; 2014.
- 20. Baker RE, Pena JM, Jayamohan J, Jérusalem A. Mechanistic models versus machine learning, a fight worth fighting for the biological community? Biology letters. 2018;14(5):20170660. pmid:29769297
- 21. Baker T, Smyers E, Urban J, Meng Z, Damadaram KP, Myrick A, et al. Progression of seasonal activities of adults of the spotted lanternfly, Lycorma delicatula, during the 2017 season of mass flight dispersal behavior in eastern Pennsylvania. Journal of Asia-Pacific Entomology. 2019;22(3):705–713.
- 22. Wolfin MS, Binyameen M, Wang Y, Urban JM, Roberts DC, Baker TC. Flight dispersal capabilities of female spotted lanternflies (Lycorma delicatula) related to size and mating status. Journal of Insect Behavior. 2019;32(3):188–200.
- 23. Hong KJ, Lee JH, Lee GS, Lee S. The status quo of invasive alien insect species and plant quarantine in Korea. Journal of Asia-Pacific Entomology. 2012;15(4):521–532.
- 24. Namgung H, Kim MJ, Baek S, Lee JH, Kim H. Predicting potential current distribution of Lycorma delicatula (Hemiptera: Fulgoridae) using MaxEnt model in South Korea. Journal of Asia-Pacific Entomology. 2020;23(2):291–297.
- 25. Ladin ZS, Eggen DA, Trammell TL, D’Amico V. Human-mediated dispersal drives the spread of the spotted lanternfly (Lycorma delicatula). Scientific Reports. 2023;13(1):1098. pmid:36658159
- 26.
R Core Team. R: A Language and Environment for Statistical Computing; 2023. Available from: https://www.R-project.org/.
- 27. Wood SN. Thin-plate regression splines. Journal of the Royal Statistical Society (B). 2003;65(1):95–114.
- 28.
Wood SN. Generalized additive models: an introduction with R. CRC press; 2017.
- 29. Leung B, Drake JM, Lodge DM. Predicting invasions: propagule pressure and the gravity of Allee effects. Ecology. 2004;85(6):1651–1660.
- 30.
Dent D, Binks RH. Insect pest management. Cabi; 2020.
- 31. Strömbom D, Pandey S. Modeling the life cycle of the spotted lanternfly (Lycorma delicatula) with management implications. Mathematical biosciences. 2021;340:108670. pmid:34302819
- 32.
Winslow W. North Carolina WINE Industry Facts. https://www.ncwine.org/media#:~:text=North%20Carolina%20is%20home%20to,and%20European-style%20vinifera%20grapes; 2017.
- 33.
Agriculture Department To Business Travelers: Don’t Take It With You Get A Spotted Lanternfly Permit. Pennsylvania Pressroom. https://www.nj.gov/agriculture/news/press/2018/approved/press180823.html; 2021.
- 34. Nandi AK, Medal HR. Methods for removing links in a network to minimize the spread of infections. Computers & Operations Research. 2016;69:10–24.
- 35. Enns EA, Mounzer JJ, Brandeau ML. Optimal link removal for epidemic mitigation: A two-way partitioning approach. Mathematical biosciences. 2012;235(2):138–147. pmid:22115862
- 36.
County Adjacency File. United States Census Bureau.; 2020. https://www.census.gov/programs-surveys/geography/library/reference/county-adjacency-file.html.
- 37.
Pennsylvania Department of Agriculture 2015 Entomology Program Summary. Pennsylvania Department of Agriculture, Bureau of Plant Industry, Division of Entomology. https://www.agriculture.pa.gov/Plants_Land_Water/PlantIndustry/Entomology/Documents/PA%202015%20Activities%20Report%20for%20EPB.pdf; 2015.
- 38.
Pennsylvania Department of Agriculture 2016 Entomology Program Summary. Pennsylvania Department of Agriculture, Bureau of Plant Industry, Division of Entomology. https://www.agriculture.pa.gov/Plants_Land_Water/PlantIndustry/Entomology/Documents/PA%202016%20Entomology%20Program%20Highlights.pdf; 2016.
- 39.
Pennsylvania Department of Agriculture 2017 Entomology Program Report. Pennsylvania Department of Agriculture, Bureau of Plant Industry, Division of Entomology. https://www.agriculture.pa.gov/Plants_Land_Water/PlantIndustry/Entomology/Documents/PA%202017%20Activities%20Report%20for%20EPB.pdf; 2017.
- 40.
Pennsylvania Department of Agriculture 2019 Entomology Program Report. Pennsylvania Department of Agriculture, Bureau of Plant Industry, Division of Entomology. https://www.agriculture.pa.gov/Plants_Land_Water/PlantIndustry/Entomology/Documents/2019-Entomology-Program-Highlights.pdf; 2019.
- 41.
Spotted lanternfly sighting confirmed in Hunterdon County. State of New Jersey Department of Agriculture. https://www.nj.gov/agriculture/news/press/2018/approved/press180823.html; 2018.
- 42.
Spotted Lanternfly in Virginia. Virginia Polytechnic Institute and State University. https://www.nj.gov/agriculture/news/press/2018/approved/press180823.html; 2018.
- 43.
Google Maps. Google.; 2021. https://www.google.com/maps.
- 44.
County Population Totals: 2010-2019. United States Census Bureau.; 2019. https://www.census.gov/data/tables/time-series/demo/popest/2010s-counties-total.html.
- 45.
Faraway JJ. Linear models with R. 2nd ed. Texts in Statistical Science Series. CRC Press, Boca Raton, FL; 2015.
- 46.
Faraway JJ. Extending the linear model with R. 2nd ed. Chapman & Hall/CRC Texts in Statistical Science Series. CRC Press, Boca Raton, FL; 2016.
- 47.
Hartig F. DHARMa: Residual Diagnostics for Hierarchical (Multi-Level / Mixed) Regression Models; 2022. Available from: https://CRAN.R-project.org/package=DHARMa.
- 48.
Simpson GL. gratia: Graceful ggplot-Based Graphics and Other Functions for GAMs Fitted using mgcv; 2023. Available from: https://gavinsimpson.github.io/gratia/.