There is a need for fit-for-purpose maps for accurately depicting the types of seabed substrate and habitat and the properties of the seabed for the benefits of research, resource management, conservation and spatial planning. The aim of this study is to determine whether it is possible to predict substrate composition across a large area of seabed using legacy grain-size data and environmental predictors. The study area includes the North Sea up to approximately 58.44°N and the United Kingdom’s parts of the English Channel and the Celtic Seas. The analysis combines outputs from hydrodynamic models as well as optical remote sensing data from satellite platforms and bathymetric variables, which are mainly derived from acoustic remote sensing. We build a statistical regression model to make quantitative predictions of sediment composition (fractions of mud, sand and gravel) using the random forest algorithm. The compositional data is analysed on the additive log-ratio scale. An independent test set indicates that approximately 66% and 71% of the variability of the two log-ratio variables are explained by the predictive models. A EUNIS substrate model, derived from the predicted sediment composition, achieved an overall accuracy of 83% and a kappa coefficient of 0.60. We demonstrate that it is feasible to spatially predict the seabed sediment composition across a large area of continental shelf in a repeatable and validated way. We also highlight the potential for further improvements to the method.
Citation: Stephens D, Diesing M (2015) Towards Quantitative Spatial Models of Seabed Sediment Composition. PLoS ONE 10(11): e0142502. https://doi.org/10.1371/journal.pone.0142502
Editor: Carlo Nike Bianchi, Università di Genova, ITALY
Received: April 24, 2015; Accepted: October 22, 2015; Published: November 23, 2015
Copyright: © 2015 Stephens, Diesing. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was made possible through funding from the project DEVOTES (Development of innovative tools for understanding marine biodiversity and assessing good Environmental Status) funded by the EU FP7 (grant agreement no. 308392), www.devotes-project.eu. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The seabed of the world’s oceans accounts for 71% of the surface of the Earth, harbours significant living and non-living resources, fulfils vital ecosystem services and provides a wide range of habitats for various living organisms. Yet, when it comes to accurately depicting the types of seabed substrate and habitat and the properties of the seabed for the benefits of research, resource management, conservation and spatial planning, it becomes immediately clear that there is a lack of accurate and fit-for-purpose maps and spatial models for most parts of the world’s oceans. The European Marine Observation and Data Network (EMODnet) is an attempt by the European Union to overcome such deficits in European waters. EMODnet assembles marine data, data products and metadata from diverse sources with the purpose to unlock fragmented and hidden marine data resources and to make these available. Among those data types that are being harmonised and made available are bathymetry, seabed substrates and benthic habitats.
Whilst such initiatives to unlock hidden data products and to harmonise these across national boundaries are laudable, it has become increasingly clear that this might be a challenging task. For example, EMODnet-Geology attempts to harmonise seabed substrate maps produced by geological surveys across Europe based on more than thirty differing classification schemes. The resulting classification is hence likely to be the lowest common denominator, with fairly broad classes similar to those proposed by . In any case, maps of seabed substrate will be provided in categorical form (substrate classes) rather than as a quantity. This is somewhat unsatisfactorily, especially if one considers that sediment composition and grain size are of importance for such diverse aspects as benthic community structure , Nephrops norvegicus burrow densities , spatial variation in the abundance of Human Pathogen Indicator Bacteria within estuarine environments , permeability in coastal marine sands  and compressional velocity of shelf sediments .
Alternatively, if data on surface sediment composition (e.g. percentages of mud, sand and gravel) are available, then it should be possible to spatially predict the sediment composition of the seabed across the area of interest. Spatial prediction is the estimation of unknown quantities, based on sample data and assumptions regarding the form of the trend and its variance and spatial correlation . Spatial prediction of environmental properties can be achieved in various ways: Deterministic models such as inverse distance weighted, natural neighbour and nearest neighbour interpolation use observations of a target variable to calculate values at unsampled locations with mathematical functions. Such models use arbitrary or empirical model parameters and don’t provide estimates of model error . Stochastic models incorporate the concept of randomness and provide both estimations of a target variable and an associated error . This class of models includes kriging and regression models among others. In recent years, data-driven machine learning algorithms have become a popular choice. Such algorithms have the advantage that the input data does not need to satisfy strict statistical assumptions as is the case for stochastic models. Machine learning algorithms are flexible statistical prediction techniques that ‘learn’ patterns in data to predict an associated value. Machine learning is defined as “programming computers to optimise a performance criterion using example data or past experience” . As a discipline it falls between computing and statistics and is also referred to as statistical learning. Terrestrial remote sensing has successfully employed these techniques for optical data for several years and they are being used more regularly in seabed mapping using acoustic data.
There is a vast array of statistical learning techniques, including Maximum Likelihood Estimation [11–13], k-Nearest Neighbour [14,15], various decision tree methods [12,13,15–20], Artificial Neural Networks [15,20–22], Support Vector Machines [12,15,20] and Bayesian Decision Rules [15,23], and it would be challenging to incorporate and compare them all in a single study. However, with the exception of a few studies [24,25], such methods have been applied to classification (e.g. prediction of substrate or habitat class) rather than regression (e.g. prediction of sediment composition) tasks. Here, we make quantitative predictions of sediment composition (fractions of mud, sand and gravel) with the Random Forest (RF) algorithm , which has become one of the most widely used and successful statistical learning models for classification and regression, showing good performance in a large number of domains [12,14,15,20,25,27–34]. RF is a non-parametric technique, i.e. no assumptions regarding the shape of distributions of the response or predictor variables are made . It can handle complex, non-linear relationships between predictor and response variables.
The aim of this study is to determine whether it is feasible to spatially predict substrate composition across a large area of seabed using legacy grain-size data and environmental predictors. The study combines outputs from hydrodynamic models as well as optical remote sensing data from satellite platforms and bathymetric variables, which are mainly derived from acoustic remote sensing. These variables are tested to determine their contribution to predicting substrate composition. As well as producing mapped layers of best estimate of substrate composition across the study region, prediction intervals are also produced indicating the variability of the conditional distribution.
Materials and Methods
The study area includes the North Sea up to approximately 58.44°N and the United Kingdom’s (UK) parts of the English Channel and the Celtic Seas (Fig 1). The area covered measures approximately 670,600 km2. The water depth extends from 0 m to 2030 m; however the majority of the study area is much shallower continental shelf sea with a median depth of 58 m and 95% of the area being less than 155 m deep.
Substrate observations were collated from a number of sources[35–37] (S1 Table for more details). The minimum requirement for inclusion in this study was that the percentages of the sediment fractions mud (grain size d < 63 μm), sand (63 μm ≤ d < 2 mm) and gravel (d ≥ 2 mm) were reported. Only those observations that intersected with all the predictor variables (see below) were used; this resulted in a total of 57,590 substrate observations (see S1 Fig for the geographic distribution of samples by data source).
Pre-treatment of response data
The mud, sand and gravel percentages are compositional data i.e. each fraction is part in a total and is constrained between 0 and 1 and the three fractions must sum to 1 (or 100%). Because of this, each component should not be considered in isolation from the others. Here we followed recommendations of Aitchison  and transform the data onto the additive log-ratio scale where they can be analysed as two continuous, unconstrained response variables which can assume any value. The outputs of analysis are permutation invariant regarding the denominator , here we have chosen to use the gravel fraction.
The data were then split randomly into training and test datasets. The training data contained 66% of the observations (38,009 observations) and the test set contained 33% (19,581 observations). The test set was used to analyse the prediction performance of the fitted models.
Predictor features were initially selected based on their expected importance for explaining the distribution of shelf sediments and their availability. The predictor features comprise digital elevation models (DEM), earth observation data and hydrodynamic model outputs.
A bathymetry DEM for the study area was created from a 6” DEM of the UK continental shelf area  and the EMODnet-Bathymetry ¼° (equal to 15”) DEM. Both datasets were merged, whereby the higher-resolution 6” DEM was given priority due to higher resolution, which effectively means that data west of 4°E is derived from the 6” DEM and east of 4°E is derived from the EMODnet-Bathymetry model. The resulting DEM has a cell size of 500 m and is projected in ETRS 1989 LAEA projection. All other predictor features (see below) were resampled onto the same grid. Water depth is considered an important predictor variable, as it is expected to indirectly influence sediment composition, e.g. sediments on shoals tend to be coarser grained than in basins.
Several secondary features were derived from the DEM. These include slope, roughness, rugosity, plan curvature, profile curvature and maximum curvature, all calculated for local neighbourhoods (3x3 kernel). Bathymetric position indices  were calculated for neighbourhood sizes of 3, 5, 10, 25, 50, 100, 150, 200, 300, 400 and 500 pixels. The vector ruggedness measure  was calculated for neighbourhood sizes of 3, 5, 15, 21, 27, 31 and 33 pixels.
Remote-sensed suspended particulate matter (SPM) were obtained from the MyOcean data portal at a spatial resolution of 1 km. This is algorithmically estimated mineral suspended matter from the MODIS optical sensor . Mean seasonal values for summer (June-July-August) and winter (December-January-February) were calculated between 2002 and 2010. SPM was included in the analysis because certain topographic features (such as the North Norfolk Sandbanks and Dogger Bank) were at least partially visible in the imagery as areas of higher turbidity. Higher SPM concentrations are also found around headlands and areas typically associated with higher current velocities.
Modelled hydrodynamics comprised average current speed and peak orbital velocity of waves at the seabed. These data were calculated for a single year and assumed to be representative. Depth-averaged tidal and wind-driven currents were calculated using the POLCOMS model  forced with 15 tidal constituents and hourly wind and pressure at 12 km resolution. Using the same meteorological forcing, the WAM spectral wave model was used to provide the wave orbital velocity at the seabed for the most extreme conditions during the modelling period (denoted as ‘peak orbital velocity’) . Waves and currents cause erosion, transport and deposition of sediments depending on the shear stress they cause at the seabed in relation to grain-size dependent critical shear stresses. They are hence expected to be important direct predictor features for sediment composition.
The Euclidean distance to the coastline was calculated in ESRI ArcGIS v10 and was expected to be an indicator for distance to sediment source (coastal erosion). Table 1 summarises predictor features that were retained subsequent to the feature selection procedure (described below). See S2 Fig for spatial plots of these predictor features.
The RF prediction algorithm was chosen as the modelling tool for the analysis because it has shown high predictive accuracy in a number of domains, it is versatile and can be used without extensive parameter tuning, it can handle a large number of predictor features and is insensitive to the inclusion some noisy/irrelevant features. The RF is an ensemble technique developed by Breiman . The algorithm ‘grows’ a large number of regression trees. It is called a random forest because two elements of randomness are introduced. Firstly, each tree is constructed from a bootstrapped sample of the training data. Secondly, only a random subset of the predictor variables is used at each split in the tree building process. This has the effect of making each every tree in the forest unique. The underlying principal of the technique is that although each tree in the forest may individually be a poor predictor and that any two trees could give very different answers, by aggregating the predictions over a large number of uncorrelated trees, prediction variance is reduced and accuracy improved  (P.316). Observations not included in each tree construction (the ‘out-of-bag’ samples) are then used to create a form of cross-validated prediction error. RF also provides a relative estimate of predictor feature importance. This is a measure of the average reduction in prediction error associated with each variable. The randomForest package  in R  was used for the implementation of the model. Each forest had 1000 trees, model parameters such as nodesize (the maximum size of terminal nodes) and mtry (number of features tested at each split) were kept as default of the package, while some small gains in predictive accuracy can be gained by fine tuning these parameters, typically the improvements are not large.
The Boruta feature selection wrapper algorithm  was used to establish predictor features that were significantly important for predicting substrate composition. Wrapper functions identify relevant features by performing multiple runs of predictive models, testing the performance of different subsets . The RF algorithm is a good candidate as a ‘black-box’ for this process because as described previously, it is relatively insensitive to tuning parameters (providing enough trees are grown) and it implicitly produces an estimate of feature importance. The algorithm tests the performance of each predictor feature against introduced random noise features to test whether an increased importance score of that feature is likely to be a ‘real’ or a result of random chance.
The standard implementation of RF aggregates the conditional mean from each tree in the forest to make its ensemble predictions. Rather than just returning the conditional mean, quantile regression infers the whole conditional distribution of the response variable . This is useful when attempting to determine the underlying variability of an estimate and the reliability of a prediction. Substrate composition in some situations will be estimated much more accurately than others. Quantile regression returns prediction intervals, meaning the range in which future observations would be expected to fall, given a certain probability. This should not be confused with confidence intervals, which would typically imply the range in which a population mean is expected to fall). When mapped spatially these intervals could help guide future sampling programs to locations that are predicted with low accuracy. Prediction intervals give an idea of the prediction variability and how confident the model is in its own predictions. The quantregForest package  in R was used to calculate 95% prediction intervals. The default parameters of the package were used (nodesize = 10, ntree = 1000).
The RF implicitly carries out a form of cross-validation (CV) using the OOB observations. This usually gives a reliable measure for real model performance assuming enough trees are grown . In addition to this performance indicator, the models constructed here are tested against the test set of observations. For both the CV and the test set, the performance is assessed by calculating the mean of the squared prediction error:
In order for the predictions to be useful they must be back-transformed from the log-ratios to mud, sand and gravel fractions. This is illustrated for the observed and predicted value of a single test set observation in Fig 2. The 95% prediction intervals appear as a grey window.
Example of transformation of predictions from log-ratio space to M/S/G fractions for a single test set observation. Predictions are made in the log-ratio space and reverse transformed into M/S/G fractions. The observed value is shown in red, predicted value (conditional mean) shown in green. 95% prediction intervals represented by the grey rectangle.
Spearman rank correlation index is reported for the observed vs predicted M/S/G values for the test set observations. This is used in preference to the Pearson product-moment correlation which assumes that the variables are approximately normally distributed variables . Also, to give a more intuitive idea of how well the models are predicting substrate at new locations, the test data were transformed into EUNIS substrate classes . These substrate classes provide insight into which substrates the models are predicting effectively and where it is not performing well. Classification accuracy and the kappa coefficient of agreement  were calculated from the error matrix.
Finally, we compare classified model outputs against existing high-resolution maps derived from multibeam acoustic data. It cannot be assumed that such maps are accurate, just because they were derived from high-resolution data. Therefore, we only selected maps that fulfilled the following criteria: 1) Maps were derived from multibeam acoustic data and seabed samples reporting sediment composition or class. 2) The mapping process is repeatable, i.e. expert interpretation was kept to a minimum. 3) The maps were validated against an external test set. 4) The maps were published, preferably in a peer-review journal. 5) The maps overlapped with our study site and had a size of at least 1,000 km2 to allow for a meaningful comparison. 6) The maps depicted classes that could be derived from the predicted sediment composition (e.g. [1,54]).
The comparison was carried out with the Map Comparison Kit , using a ‘per-pixel’ approach, i.e. comparing the class allocations of two maps for every pixel. Therefore, the high-resolution maps were resampled at the resolution of our sediment model. The results of the per-pixel comparisons are reported in a contingency table from which several statistics are calculated. These include the overall agreement between two maps and the kappa coefficient. All data used in the training and validation of the models are included in S1 File.
Exploration of results
Partial dependence plots  (P.369) were generated for the two most important features indicated in model training. They were generated using a random subset of training data. Partial dependency plots indicate the response of the dependant variable across the range of individual features of interest, while averaging out the effects of all other features. They are useful for understanding the nature of the relationship between predictor and response variable, the shape of the underlying function and identifying thresholds in predictor which may be particularly important.
The feature selection process indicated that all features contributed significantly to the model performance. This would justify training the final model using all the predictor features. However, manual thinning of the features indicated that the CV error rate did not increase when less important features were discarded. The same eight features were identified as most important for both alrm and alrs. Using these features yielded a CV error almost identical to the model that used all features and so it was decided to use these eight features in the final model (see Table 1). Fig 3 shows the relative importance of the eight features to prediction accuracy.
The model validation statistics (Table 2) indicate that approximately 66% and 71% of the variability of alrm and alrs are explained by the predictive models. The close agreement between cross-validated and test set statistics indicates that the models are not ‘over-fitted’ to the training data and that they are generalising the true functions. Fig 4 shows the predictions for 1,000 random observations from the test set. The left panels show observed versus predicted values for alrm and alrs along with 95% prediction intervals (black vertical lines). The observed versus predicted scatter plots show a considerable spread in the residuals. The right panels show the observed values relative to 95% prediction intervals. The plots show that the intervals have a high probability of capturing the observed values, 94.3% of alrm and 95.3% of alrs observed values are inside the prediction intervals, showing that they are good indicators of prediction reliability.
The left panels show predicted vs observed values for alrm and alrs along with prediction intervals. The diagonal line indicates y = x. The right panels show the observed values relative to the centre of prediction interval. Points are coloured according to whether the observed value is within the 95% prediction interval.
Spearman rank correlation for the back-transformed sediment fractions of the test data are as follows: Mud 0.71; Sand 0.75; Gravel 0.78 (p-values for all tests were <0.001).
Observed and predicted sediment fractions were classified into EUNIS substrate types (Table 3), which encompass four broad substrate categories. The confusion matrix shows observed (rows) and predicted (columns) substrate class. The results of the classified test set show a classification agreement of 83% and a kappa coefficient of 0.60. This suggests that the model is performing reasonably well at predicting broad substrate types. However, the error rates for each class show large variability. The lowest misclassification is achieved for ‘sand and muddy sand’ with an error rate of 0.05; however this increases to 0.4 for ‘coarse sediment’, 0.47 for ‘mud and sandy mud’ and 0.81 for ‘mixed sediment’. This indicates that the model is biased towards sandy substrates and is consistently over-predicting the sand fraction. It also highlights the imbalance in substrate types represented in the sampling with the majority of samples falling in the ‘sand and muddy sand’ class.
Classified model outputs were compared with two high-resolution maps previously produced by the authors: a five model ensemble of substrate type according to a simplified Folk classification  and a RF prediction of EUNIS substrates . In the first case (Table 4), overall agreement between the two maps was 83% (kappa = 0.21). However, the relatively high overall agreement is due to a high agreement in the ‘sand’ class, which is dominating the site. Agreement is lower for ‘muddy sand’ and especially ‘gravelly sand’. ‘Sandy gravel’, the least frequent class in the high-resolution map, wasn’t even predicted by our model. These results would suggest that the gravel content is under-predicted by the model. In the second case (Table 5), the overall agreement was lower at 75%, but kappa was higher at 0.34. Contributions to the overall agreement are mainly made by ‘sand and muddy sand’ and, to a lesser extent by ‘coarse sediment’. Agreement between the two maps is very low for the infrequently occurring classes ‘mud and sandy mud’ and ‘mixed sediment’.
Exploration of results
For both alrm and alrs, average current speed is the most important contribution to prediction accuracy (Fig 3). For alrm the increase in node purity is considerably larger than for the following features. Bathymetry is the second most important feature for both response variables. Bivariate dependence plots were generated for mean current speed and water depth (Fig 5). The highest mud concentrations (>10%) are located in deeper areas (< -50 m) with low current speeds (<0.25 m/s). Gravel content is highest in moderately shallow areas (around -50 m), where current speeds are highest (>0.75 m/s). Sandy substrates are associated with intermediate conditions (0 to -50 m depth and 0.25–0.5 m/s current speed) and shallow depths, regardless of the current speed. These relationships are mirrored in the fourth panel, which shows simplified Folk classes. Gravelly sands are related to high current speeds (>0.6 m/s) in water depths below -25 m. Muddy sand is most likely to occur below -50 m water depth, where current speeds are low(<0.125 m/s). Sand dominates in the remainder of the diagram.
Sediment composition and types
Results of the spatial distribution of sediment fractions (mud, sand and gravel) are shown in Fig 6. It is apparent that the sand fraction dominates over large parts of the study area. This is especially the case in the North Sea. The highest amounts of gravel fraction are found in the central English Channel, the Bristol Channel and north of the Irish north coast. Not surprisingly, the locations of maxima in gravel content show a marked correspondence with the locations of tidal-induced bottom shear stress maxima . The highest mud contents are located west of the Scottish west coast beyond the shelf break, on Fladen Ground, in the Norwegian Trough, in the north-western Irish Sea and around Arran Island (-5.2,55.4). The estimates of sediment composition can be easily classified according to EUNIS substrates  or simplified Folk textural groups . This is shown in Fig 7.
The 95% prediction intervals for alrm and alrs show considerable spatial variation (Fig 8). Although both appear to co-vary across the study site, it is apparent that prediction intervals are generally larger for alrm. Prediction intervals for both alrm and alrs are generally low in parts of the German Bight, the English Channel and the Celtic Sea. High prediction intervals are apparent close to the coast, especially around the UK.
Our aim was to predict sediment composition based on legacy grain-size data across parts of the NW European continental shelf. The decision to use legacy data was made out of necessity as it would be unrealistic to collect, process and analyse a sufficient amount of samples over such a large area of seabed. However, this meant that samples had to be obtained from various sources and consequently the data set contains samples that were collected at different times with various sampling gear and analysed according to differing protocols. The BGS dataset for example includes samples that have been collected with 15 different types of equipment (albeit the vast majority with a Shipek grab) between the years 1967 and 1994. It is also known that different grain-size analysis methods produce differing results . Lack of standardisation might lead to increased uncertainty in predictions. Sources of uncertainty might include acquisition method, vintage and timing of sampling, representativeness of subsampling, pre-treatment methods, systematic errors due to particle properties and imperfect conversion models . Harmonisation of data sets would be desirable; however due to incomplete or missing metadata this was not possible.
As indicated above, sampling of the seabed was conducted over several decades. Additionally, the data of the predictor features capture different time intervals, e.g. the hydrodynamics were modelled for the year 2008, whilst SPM estimates were derived for the years 2002 to 2010. An implicit assumption of our models therefore is that the response and predictor variables are constant through time. Principally, such an assumption is unlikely to hold. For example, a decrease in water clarity in the southern and central North Sea over the last 25 years is most likely driven by increased SPM concentrations . Also, waves and currents cause disturbance of the seabed in the study area, with the highest levels occurring in areas of high tidal stress and in shallow regions exposed to waves with large fetch . Such disturbance events might result in net changes in sediment distribution and seabed morphology through time, if net transport of the mobilised sediment is occurring. The influence of waves on the seabed is usually assumed to be limited to water depths less than half the wave length. The wave base, which is the seaward limit of wave influence, typically occurs in depths of 50–70 m around the UK . However, fast (event to inter-annual time scales) morphodynamic adaptations to hydrodynamic forcing conditions are restricted to the upper shoreface , which has a depth limit significantly lower than the wave base. It is also known that sediment distribution patterns with high grain-size contrast frequently found in continental shelf settings, might remain stable over years to decades [64–67] despite frequent remobilisation of sediment. Time and space scales of sediment characteristics and forcing functions on continental shelves are inter-linked e.g. . Given the relatively low resolution of our models (500 m pixels), it is unlikely that significant changes in sediment distribution and seabed morphology have occurred within a few decades (equal to the temporal spread of the data).
Predictor features are resolved on various length scales, 500 m for the DEM, 1000 m for the SPM data and 12 km for the hydrodynamic model outputs. These are dictated by various considerations, e.g. limitations to computing power for hydrodynamic modelling and resolution of sourced data sets. It is realistic to expect considerable variability of sediment type within an area of 500 m by 500 m of seabed, but the sediment observations were not sampled at the same scale. The various sampling devices obtain a sample from typically less than 1 m2 of seabed; this is the support size of the data and it is different from the support size of the estimates we are making. Averaging quantities over a large area has the effect of reducing the variance and making the distributions more normal . The implications are that it is unrealistic to expect to account for all the variability in our observations when the predictor features have a coarser resolution (larger support size) than the sampled data.
It is likely that there is bias in the model towards the sand fraction. This is a result of the fact that the study is generally dominated by sandier sediments so the sand fraction is overrepresented in the training samples. This means an over-estimate of the sand content at the expense of the mud and gravel contents, as was indicated by the error assessment for the EUNIS substrate map and the comparison with high-resolution maps. It would be worthwhile to investigate methods of compensating for this bias. Possible approaches to explore for mitigating this effect would be to stratify the sampling when building the RF model. This could be done either by substrate class strata so that each tree is built with an equal number of samples from each substrate type. Alternatively, stratifying the sampling spatially could also be an option, this would insure that each tree was built with a sample that was approximately evenly distributed across the study area.
Our model predicts the sediment composition, but it cannot account for rock outcrops at the seabed. Several studies have demonstrated the presence of bedrock at the seabed in the English Channel [70,71], Bristol Channel , off Northern Ireland [73,74], the North Sea  and elsewhere. However, no estimates on the total area of rocky seabed exist to date. Principally, it would be feasible to predict the presence and absence of bedrock outcrops at the seabed, provided there is a sufficient amount of observations and suitable predictor features.
Significance of results
We have described a quantitative spatial model of seabed sediment composition for parts of the NW European continental shelf that has been produced with a repeatable method and validated with an independent set of test data. Model validation statistics indicate that our model is performing reasonably well and is not over-fitted to the training data. A EUNIS substrate model, derived from the predicted sediment composition, achieved an overall accuracy of 83% and a kappa coefficient of 0.60. A quantitative comparison with existing high-resolution maps of seabed sediment types demonstrates that the model is reproducing known distribution patterns reasonably well, with overall agreement of 75% and 83%, respectively. To our knowledge, this is the first such model that has been published for this sea area and it is hoped that the results will be utilised by other researchers. The model outputs are downloadable from http://doi.pangaea.de/10.1594/PANGAEA.845468 and might be useful for a wide variety of purposes including, but not limited to, broad-scale habitat mapping, species distribution modelling, hydrodynamic modelling, ecosystem modelling, environmental survey planning, selection of monitoring stations, estimation of other sediment parameters (e.g. porosity, bulk density, total organic carbon) and stock assessments (e.g. Nephrops norvegicus). The model outputs are versatile in that they can easily be classified according to different sediment classification schemes, such as EUNIS substrate types  or Folk textural groups . This is preferable over classified maps, as different classification schemes might be used for different tasks. However, a Folk map cannot easily be re-classified into a EUNIS substrate map, as the boundary between sandy and muddy substrates is defined as a sand-to-mud ratio of 4:1 in  and such a boundary does not exist in the Folk classification.
The presented model belongs to the group of empirical models, which sacrifice generality for precision and reality [75,76]. Because of that, it is considered site-specific and cannot be applied to other areas of the seabed. A dedicated spatial prediction model will have to be built for those areas; however the principal methodology is transferrable. The success of such a model will depend on the availability of suitable target and predictor data sets.
Large parts of the study area have been described as tidally-dominated continental shelf seas, augmented by varying degrees of sub-ordinate storm impact in the North Sea, Celtic Sea and English Channel and fair-weather wave impact in the German Bight . In the southern half of the North Sea, tides dominate the sand transport in most of the Southern Bight and along the east coast of the UK, and storms (stirring by waves and transport by wind-driven currents) dominate in the north-eastern part of the Southern Bight, to the north of the Southern Bight, along the Friesian Islands, in the German Bight and on the Dogger Bank . This is borne out by our model results, in that the feature importance scores (Fig 3) show average current speed as the most important predictor for both alrm and alrs. The relative importance of storms is also reflected in the variable importance plots, with peak orbital velocity featuring among the eight selected predictor features. With such a strong forcing by hydrodynamic processes in the study area, one might argue that a process-based model, which describes the cause-effect relationships of hydrodynamic forcing and resultant sediment distribution patterns, would be more appropriate to predict the sediment composition of the NW European continental shelf seabed. However, process-based models tend to be realistic and general, but not precise , whilst our goal was to make accurate quantitative predictions of sediment composition. Sediment composition of the seabed is a product of multiple processes acting on a wide spectrum of temporal scales from orbital water movements caused by a passing wave (seconds) to glacial-interglacial cycles (100,000 years). In analogy to ecological systems  or morphodynamics of coastal systems , continental shelf sedimentation and the resulting sediment composition of the seabed are governed by constraints (i.e. laws, such as those describing the mobilisation of sediment by ambient currents) and contingencies. Process-based models would struggle to capture historical contingencies, such as extreme and rare events and geological inheritance. In the particular case, sea-level changes that occurred since the last sea-level lowstand ca. 21,000 years BP  have submitted the study area to different sedimentary processes through time and space. Relicts of the past, such as glacial morphological features are still present on the shelf (e.g. moraines, drumlins, flutes and eskers in the northern North Sea and Irish Sea [82–84]) and associated sediments might not be explainable by ambient hydrodynamic forcing alone.
We employed an empirical static model, which assumes equilibrium between the environmental conditions and sediment composition. Arguably, relict features, as mentioned above, and their associated sediments might not be in equilibrium with the ambient hydrodynamic forcing due to the relatively short time span since the submergence of the NW European continental shelf area and stabilisation of the sea level at the current height since approximately 6,000 years BP. However, indirect predictor features such as bathymetry and BPIs might to some extent compensate for this deficit, in that they capture the morphological features that relate to non-equilibrium sediment compositions.
The results indicated by the partial dependency plots (Fig 5) were generally in agreement with expectations, i.e. elevated mud contents associated with deeper sheltered areas, coarser substrates associated with high current speed, and sandy substrates dominating the intermediate conditions. There appears to be certain thresholds of current speed which are associated with boundaries of substrate type, for example 0.125 m/s is approximately the boundary between ‘muddy sand’ and ‘sand’ and 0.5 m/s approximates the boundary from ‘sand’ to ‘gravelly sand’. This relationship is not applicable to shallower areas where ‘sand’ dominates consistently. However, a map of sediment classes derived by applying the above classification rules would only partially resemble the map presented in Fig 7 and misses important features such as Fladen Ground. This indicates that predictor features other than average current speed and bathymetry are also of importance for deriving an accurate representation of the seabed sediment composition.
There appears to be an association between the large prediction intervals and shallower coastal areas, especially around the UK coast. These areas might represent particular conditions which are under-sampled so cannot be predicted accurately. Alternatively, strong environmental gradients and increased temporal variability, which are known to occur in coastal areas, might not be sufficiently resolved in the predictor features. The spatial pattern of the highest prediction interval widths suggests that it is associated with areas of increased hydrodynamic disturbance and this could mean that these areas are inherently more temporally or spatially variable. Large parts of the west coast of Scotland have high PI widths also, these are areas which are highly exposed to Atlantic swell and large parts of this coastline are known to be rocky.
Potential for improvements
We have demonstrated that it is feasible to spatially predict the seabed sediment composition across a large area of continental shelf in a repeatable and validated way. However, this research has also highlighted the potential for further improvements. These might include:
- The spatial prediction of rock outcrops across the study area based on rock observations (response variable) and suitable predictor features, such as seabed rugosity and hydrodynamics [85,86].
- Addressing the imbalance of the sample data set, which is skewed towards sandy substrates.
- Inclusion of higher-resolution predictor features, especially peak wave orbital velocities, which are likely to poorly resolve processes close to the coast. Also, a higher resolution EMODnet bathymetry data set has recently become available.
- Extension of the spatial coverage to include the northern North Sea and the Irish and French parts of the NW European continental shelf.
S1 Fig. Spatial distribution of substrate observations by data source.
The number of samples is shown.
Conceived and designed the experiments: DS MD. Performed the experiments: DS MD. Analyzed the data: DS MD. Contributed reagents/materials/analysis tools: DS MD. Wrote the paper: DS MD.
- 1. Long D. BGS detailed explanation of seabed sediment modified Folk classification. 2006.
- 2. Coggan R, Barrio Frojan CRS, Diesing M, Aldridge J. Spatial patterns in gravel habitats and communities in the central and eastern English Channel. Estuar Coast Shelf Sci. 2012;111: 118–128.
- 3. Campbell N, Allan L, Weetman A, Dobby H. Investigating the link between Nephrops norvegicus burrow density and sediment composition in Scottish waters. ICES J Mar Sci J du Cons. 2009;66: 2052–2059.
- 4. Perkins TL, Clements K, Baas JH, Jago CF, Jones DL, Malham SK, et al. Sediment Composition Influences Spatial Variation in the Abundance of Human Pathogen Indicator Bacteria within an Estuarine Environment. PLoS One. Public Library of Science; 2014;9: e112951. pmid:25397595
- 5. Wilson AM, Huettel M, Klein S. Grain size and depositional environment as predictors of permeability in coastal marine sands. Estuar Coast Shelf Sci. 2008;80: 193–199.
- 6. Goff JA, Kraft BJ, Mayer LA, Schock SG, Sommerfield CK, Olson HC, et al. Seabed characterization on the New Jersey middle and outer shelf: correlatability and spatial variability of seafloor sediment properties. Mar Geol. 2004;209: 147–172.
- 7. Bivand RS, Pebesma EJ, Gómez-Rubio V. Applied Spatial Data Analysis with R (Use R). Use R. 2008.
- 8. Hengl T. A practical guide to geostatistical mapping [Internet]. Office. 2009. Available: http://book.spatial-analyst.net/system/files/cover_geostat_2009.pdf
- 9. Li J, Heap AD. Spatial interpolation methods applied in the environmental sciences: A review. Environ Model Softw. 2014;53: 173–189.
- 10. Alpaydin E. Introduction to Machine Learning. Second Edi. Cambridge, MA: The MIT Press; 2010.
- 11. Buhl-Mortensen P, Dolan M, Buhl-Mortensen L. Prediction of benthic biotopes on a Norwegian offshore bank using a combination of multivariate analysis and GIS classification. ICES J Mar Sci. 2009;66: 2026–2032.
- 12. Che Hasan R, Ierodiaconou D, Monk J. Evaluation of Four Supervised Learning Methods for Benthic Habitat Mapping Using Backscatter from Multi-Beam Sonar. Remote Sens. 2012;4: 3427–3443.
- 13. Ierodiaconou D, Monk J, Rattray a., Laurenson L, Versace VL. Comparison of automated classification techniques for predicting benthic biological communities using hydroacoustics and video observations. Cont Shelf Res. 2011;31: S28–S38.
- 14. Lucieer V, Hill NA, Barrett NS, Nichol S. Do marine substrates “look” and “sound” the same? Supervised classification of multibeam acoustic data using autonomous underwater vehicle images. Estuar Coast Shelf Sci. Elsevier Ltd; 2012; 1–13.
- 15. Stephens D, Diesing M. A comparison of supervised classification methods for the prediction of substrate type using multibeam acoustic and legacy grain-size data. PLoS One. 2014;9: e93950. pmid:24699553
- 16. Dartnell P, Gardner J V. Predicting Seafloor Facies from Multibeam Bathymetry and Backscatter Data. Photogramm Eng Remote Sensing. 2004;70: 1081–1091.
- 17. Rattray A, Ierodiaconou D, Laurenson L, Burq S, Reston M. Hydro-acoustic remote sensing of benthic biological communities on the shallow South East Australian continental shelf. Estuar Coast Shelf Sci. Elsevier Ltd; 2009;84: 237–245.
- 18. Rooper CN, Zimmermann M. A bottom-up methodology for integrating underwater video and acoustic mapping for seafloor substrate classification. Cont Shelf Res. 2007;27: 947–957.
- 19. Che Hasan R, Ierodiaconou D, Laurenson L. Combining angular response classification and backscatter imagery segmentation for benthic biological habitat mapping. Estuar Coast Shelf Sci. Elsevier Ltd; 2012;97: 1–9.
- 20. Huang Z, Nichol S, Daniell J, Siwabessy J, Brooke B. Predictive Modelling of Seabed Sediment Parameters Using Multibeam Acoustic Data: A Case Study on the Carnarvon Shelf, Western Australia. 2008; 1–6.
- 21. Ojeda GY, Gayes PT, Van Dolah RF, Schwab WC. Spatially quantitative seafloor habitat mapping: example from the northern South Carolina inner continental shelf. Estuar Coast Shelf Sci. 2004;59: 399–416.
- 22. Marsh I, Brown C. Neural network classification of multibeam backscatter and bathymetry data from Stanton Bank (Area IV). Appl Acoust. Elsevier Ltd; 2009;70: 1269–1276.
- 23. Simons DG, Snellen M. A Bayesian approach to seafloor classification using multi-beam echo-sounder backscatter data. Appl Acoust. Elsevier Ltd; 2009;70: 1258–1268.
- 24. Li J, Heap AD, Potter A, Huang Z, Daniell JJ. Can we improve the spatial predictions of seabed sediments? A case study of spatial interpolation of mud content across the southwest Australian margin. Cont Shelf Res. Elsevier; 2011;31: 1365–1376.
- 25. Zhi H, Siwabessy J, Nichol SL, Brooke BP. Predictive mapping of seabed substrata using high-resolution multibeam sonar data: A case study from a shelf with complex geomorphology. Mar Geol. 2014;357: 37–52.
- 26. Breiman L. Random Forests. Mach Learn. 2001;45: 5–32.
- 27. Che Hasan R, Ierodiaconou D, Laurenson L, Schimel A. Integrating Multibeam Backscatter Angular Response, Mosaic and Bathymetry Data for Benthic Habitat Mapping. PLoS One. Public Library of Science; 2014;9: e97339. pmid:24824155
- 28. Diesing M, Green SL, Stephens D, Lark RM, Stewart HA, Dove D. Mapping seabed sediments: Comparison of manual, geostatistical, object-based image analysis and machine learning approaches. Cont Shelf Res. Elsevier; 2014;84: 107–119.
- 29. Cutler DR, Edwards TC, Beard KH, Cutler A, Hess KT, Gibson J, et al. Random forests for classification in ecology. Ecology. 2007;88: 2783–92. Available: http://www.ncbi.nlm.nih.gov/pubmed/18051647 pmid:18051647
- 30. Prasad AM, Iverson LR, Liaw A. Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction. Ecosystems. 2006;9: 181–199.
- 31. Pal M. Random forest classifier for remote sensing classification. International Journal of Remote Sensing. 2005. pp. 217–222.
- 32. Chapman DS, Bonn A, Kunin WE, Cornell SJ. Random Forest characterization of upland vegetation and management burning from aerial imagery. J Biogeogr. 2010;37: 37–46.
- 33. Chan JCW, Paelinckx D. Evaluation of Random Forest and Adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery. Remote Sens Environ. 2008;112: 2999–3011.
- 34. Oliveira S, Oehler F, San-Miguel-Ayanz J, Camia A, Pereira JMC. Modeling spatial patterns of fire occurrence in Mediterranean Europe using Multiple Regression and Random Forest. For Ecol Manage. Elsevier B.V.; 2012;275: 117–129.
- 35. Valerius, J, Van Lancker V, Van Heteren S, Leth JO, Zeiler M. Trans-national database of North Sea sediment data. Data compilation by Federal Maritime and Hydrographic Agency (Germany); Royal Belgian Institute of Natural Sciences (Belgium); TNO (Netherlands) and Geological Survey of Denmark and Greenland (Denmark). 2014.
- 36. Jenkins J. Seabed substrates data integrated with the dbSEABED system. INSTAAR University of Colorado; 2014.
- 37. British Geological Survey. BGS Legacy Particle Size Analysis uncontrolled data export. 2014; Available: www.bgs.ac.uk
- 38. Aitchison J. The Statistical Analysis of Compositional Data. London: Chapman and Hall; 1986.
- 39. Weltje G. Tenernary sandstone composition and provenance: an evaluation of the “Dickson model.” In: Buccianti A, Mateu-Figueras G, Pawlowsky-Glahn V, editors. Compositional Data Analysis in the Geosciences. 264th ed. London: The Geological Society; 2006. pp. 79–99.
- 40. Astrium OceanWise. Creation of a high resolution digital elevation model (DEM) of the British Isles continental shelf. 2011.
- 41. Lundblad ER, Wright DJ, Miller J, Larkin EM, Rinehart R, Naar DF, et al. A Benthic Terrain Classification Scheme for American Samoa. Mar Geod. 2006;29: 89–111.
- 42. Sappington JM, Longshore KM, Thomson D. Quantifying Landscape Ruggedness for Animal Habitat Analysis: A Case Study Using Bighorn Sheep in the Mojave Desert. J Wildl Manage. 2007;71: 1419–1426.
- 43. Gohin F., Loyer S., Lunven M., Labry C., Froidefond J.M., Delmas D., Huret M. HA. Satellite-derived parameters for biological modelling in coastal waters: Illustration over the eastern continental shelf of the Bay of Biscay. Remote Sens Environ. 2005;95.
- 44. Holt JT, James ID. An s coordinate density evolving model of the northwest European continental shelf: 1. Model description and density structure. J Geophys Res Ocean. 2001;106: 14015–14034.
- 45. Aldridge JN, Parker ER, Bricheno L, Green SL, van der Molen J. Assessment of the physical disturbance of the northern European Continental shelf seabed by waves and currents. Cont Shelf Res. 2015;108: 121–140.
- 46. James G, Witten D, Hastie T, Tibshirani R. An Introduction to Statistical Learning. Springer; 2013.
- 47. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2: 18–22.
- 48. R Development Core Team. R: A Language and Environment for Statistical Computing [Internet]. Team RDC, editor. R Foundation for Statistical Computing. R Foundation for Statistical Computing; 2014. https://doi.org/10.1007/978-3-540-74686-7
- 49. Kursa M, Rudnicki W. Feature selection with the Boruta Package. J Stat Softw. 2010;36. Available: http://www.jstatsoft.org/v36/i11/paper/
- 50. Guyon I, Elisseeff A. An Introduction to Variable and Feature Selection. J ofMachine Learn Res. 2003;3: 1157–1182.
- 51. Meinshausen N. Quantile Regression Forests. J Mach Learn Res. 2006;7: 983–999.
- 52. Dalgaard P. Introductory Statistics with R. New York: Springer; 2002.
- 53. Cohen J. A Coefficient of Agreement for Nominal Scales. Educ Psychol Meas. 1960;20: 37–46.
- 54. Folk RL. The distinction between grain size and mineral composition in sedimentary-rock nomenclature. J Geol. 1954;62: 344–359.
- 55. Visser H, De Nijs T. The map comparison kit. Environ Model Softw. 2006;21: 346–358.
- 56. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Second Edi. Springer-Verlag New York Inc; 2008.
- 57. Diesing M, Stephens D. A multi-model ensemble approach to seabed mapping. J Sea Res. 2015;100: 62–69.
- 58. Pingree RD, Griffiths DK. Sand transport paths around the British Isles resulting from M2 and M4 tidal interactions. J Mar Biol Assoc United Kingdom. 1979;59: 497–513.
- 59. Konert M, Vandenberghe J. Comparison of laser grain size analysis with pipette and sieve analysis: a solution for the underestimation of the clay fraction. Sedimentology. 1997;44: 523–535.
- 60. Van Heteren S, Van Lancker V. Collaborative Seabed-Habitat Mapping: Uncertainty in Sediment Data as an Obstacle in Harmonization. In: Diviacco P, Fox P, Pshenichy C, Leadbetter A, editors. Collaborative Knowledge in Scientific Research Networks. Hershey PA, USA: Information Science Reference; 2015. pp. 154–176.
- 61. Capuzzo E, Stephens D, Silva T, Barry J, Forster RM. Decrease in water clarity of the southern and central North Sea during the 20th century. Glob Chang Biol. 2015; n/a–n/a.
- 62. Connor DW, Gilliland PM, Golding N, Robinson P, Todd D, Verling E. UKSeaMap: the mapping of seabed and water column features of UK seas. Peterborough: Joint Nature Conservation Committee; 2006.
- 63. Stive MJF, de Vriend HJ. Modelling shoreface profile evolution. Mar Geol. 1995;126: 235–248.
- 64. Diesing M, Kubicki A, Winter C, Schwarzer K. Decadal scale stability of sorted bedforms, German Bight, southeastern North Sea. Cont Shelf Res. 2006;26: 902–916.
- 65. Goff JA, Mayer LA, Traykovski P, Buynevich I, Wilkens R, Raymond R, et al. Detailed investigation of sorted bedforms, or “rippled scour depressions”, within the Martha’s Vineyard Coastal Observatory, Massachusetts. Cont Shelf Res. 2005;25: 461–484.
- 66. Schwarzer K, Diesing M, Larson M, Niedermeyer R-O, Schumacher W, Furmanczyk K. Coastline evolution at different time scales—examples from the Pomeranian Bight, southern Baltic Sea. Mar Geol. 2003;194: 79–101.
- 67. Anthony D, Leth JO. Large-scale bedforms, sediment distribution and sand mobility in the eastern North Sea off the Danish west coast. Mar Geol. 2002;182: 247–263.
- 68. Sternberg RW, Nowell ARM. Continental shelf sedimentology: scales of investigation define future research opportunities. J Sea Res. 1999;41: 55–71.
- 69. Isaaks E., and Srivastava R. An Introduction to Applied Geostatisitcs. Oxford University Press; 1989.
- 70. Diesing M, Coggan R, Vanstaen K. Widespread rocky reef occurrence in the central English Channel and the implications for predictive habitat mapping. Estuar Coast Shelf Sci. Elsevier Ltd; 2009;83: 647–658.
- 71. Coggan RA, Diesing M. 33—Rock Ridges in the Central English Channel. In: Harris PT, Baker EK, editors. Seafloor Geomorphology as Benthic Habitat. Amsterdam: Elsevier; 2012. pp. 471–480.
- 72. Warwick RM, Uncles RJ. Distribution of benthic macrofauna associations in the Bristol Channel in relation to tidal stress. Mar Ecol Prog Ser. 1980;3: 97–103.
- 73. Calvert J, Strong JA, Service M, McGonigle C, Quinn R. An evaluation of supervised and unsupervised classification techniques for marine benthic habitat mapping using multibeam echosounder data. ICES J Mar Sci J du Cons. 2014;
- 74. Asa Strong J, Service M, Plets R, Clements A, Quinn R, Breen J, et al. Marine substratum and biotope maps of the Maidens/Klondyke bedrock outcrops, Northern Ireland. J Maps. Taylor & Francis; 2012;8: 129–135.
- 75. Guisan A, Zimmermann NE. Predictive habitat distribution models in ecology. Ecol Modell. 2000;135: 147–186.
- 76. Levins R. The strategy of model building in population biology. Am Sci. 1966;54: 421–431.
- 77. Johnson HD, Baldwin CT. Shallow clastic seas. In: Reading HG, editor. Sedimentary Environments: Processes, Facies and Stratigraphy. Third edit. Oxford: Blackwell Science Ltd; 1996. pp. 232–280.
- 78. Van der Molen J. The influence of tides, wind and waves on the net sand transport in the North Sea. Cont Shelf Res. 2002;22: 2739–2762.
- 79. Boero F, Kraberg AC, Krause G, Wiltshire KH. Time is an affliction: Why ecology cannot be as predictive as physics and why it needs time series. J Sea Res. 2014;
- 80. Woodroffe CD. Coasts: Form, process and evolution. Cambridge: Cambridge University Press; 2003.
- 81. Lambeck K, Rouby H, Purcell A, Sun Y, Sambridge M. Sea level and global ice volumes from the Last Glacial Maximum to the Holocene. Proc Natl Acad Sci. 2014;111: 15296–15303. pmid:25313072
- 82. Bradwell T, Stoker MS, Golledge NR, Wilson CK, Merritt JW, Long D, et al. The northern sector of the last British Ice Sheet: Maximum extent and demise. Earth-Science Rev. 2008;88: 207–226.
- 83. Clark CD, Hughes ALC, Greenwood SL, Jordan C, Sejrup HP. Pattern and timing of retreat of the last British-Irish Ice Sheet. Quat Sci Rev. 2012;44: 112–146.
- 84. Van Landeghem KJJ, Wheeler AJ, Mitchell NC. Seafloor evidence for palaeo-ice streaming and calving of the grounded Irish Sea Ice Stream: Implications for the interpretation of its final deglaciation phase. Boreas. Blackwell Publishing Ltd; 2009;38: 119–131.
- 85. Bekkby T, Moy FE, Kroglund T, Gitmark JK, Walday M, Rinde E, et al. Identifying Rocky Seabed Using GIS-Modeled Predictor Variables. Mar Geod. Taylor & Francis; 2009;32: 379–390.
- 86. Dunn DC, Halpin PN. Rugosity-based regional modeling of hard-bottom habitat. Mar Ecol Prog Ser. 2009;377: 1–11.