Figures
Abstract
Expanding spatial presentation from two-dimensional profile transects to three-dimensional ocean mapping is key for a better understanding of ocean processes. Phytoplankton distributions can be highly patchy and the accurate identification of these patches with the context, variability, and uncertainty of measurements on relevant scales is difficult to achieve. Traditional sampling methods, such as plankton nets, water samplers and in-situ vertical sensors, provide a snapshot and often miss the fine-scale horizontal and temporal variability. Here, we show how two autonomous underwater vehicles measured, adapted to, and reported real-time chlorophyll a measurements, giving insights into the spatiotemporal distribution of phytoplankton biomass and patchiness. To gain the maximum available information within their sensing scope, the vehicles moved in an adaptive fashion, looking for the regions of the highest predicted chlorophyll a concentration, the greatest uncertainty, and the least possibility of collision with other underwater vehicles and ships. The vehicles collaborated by exchanging data with each other and operators via satellite, using a common segmentation of the area to maximize information exchange over the limited bandwidth of the satellite. Importantly, the use of multiple autonomous underwater vehicles reporting real-time data combined with targeted sampling can provide better match with sampling towards understanding of plankton patchiness and ocean processes.
Citation: Mo-Bjørkelund T, Majaneva S, Fragoso GM, Johnsen G, Ludvigsen M (2024) Multi-vehicle adaptive 3D mapping for targeted ocean sampling. PLoS ONE 19(5): e0302514. https://doi.org/10.1371/journal.pone.0302514
Editor: Emmanuel S. Boss, University of Maine, UNITED STATES
Received: December 13, 2023; Accepted: April 7, 2024; Published: May 8, 2024
Copyright: © 2024 Mo-Bjørkelund et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the article and its Supporting information files.
Funding: The research presented in this paper was funded by the Research Council of Norway through the following: The Nansen Legacy project RCN #276730 (TMB, GJ, ML), https://arvenetternansen.com/nb/arven-etter-nansen/. The Center of Excellence Autonomous Marine Operations and Systems (AMOS), RCN #223254 (TMB and ML) https://www.ntnu.edu/amos. Arctic ABC-D, RCN #245923 (GF, GJ), https://prosjektbanken.forskningsradet.no/en/project/FORISS/245923?Kilde=FORISS&distribution=Ar&chart=bar&calcType=funding&Sprak=no&sortBy=date&sortOrder=desc&resultCount=30&offset=0&Prosjektleder=J%C3%B8rgen+Berge Light as a Cue for Life in Arctic and Northern Seas (LightLife), RCN #315728 (SM). https://prosjektbanken.forskningsradet.no/en/project/FORISS/315728?Kilde=FORISS&distribution=Ar&chart=bar&calcType=funding&Sprak=no&sortBy=date&sortOrder=desc&resultCount=30&offset=60&TemaEmne.1=LTP2%20Hav The sponsors did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Marine phytoplankton are responsible for approximately half of the global oxygen production [1]. However, accurately measuring and mapping phytoplankton blooms poses a significant challenge using conventional sampling techniques. Previous attempts to map the dynamic conditions of these blooms have utilized autonomous underwater vehicles (AUVs) equipped with adaptive sampling algorithms [2–4]. These algorithms allow the vehicle to adjust its path based on real-time measurements gathered during its operation, adapting their path to gather the most useful information. However, prior efforts have been limited to two-dimensional modeling of the environment and have often assumed a static underlying field. In contrast, our proposal aims to advance these methodologies by expanding modeling capabilities into three dimensions and incorporating temporal effects. This expansion enables a more comprehensive understanding of the complex dynamics of phytoplankton blooms. Moreover, increasing the number of deployed autonomous vehicles facilitates higher-resolution data collection. This elevated granularity empowers operators and vehicles alike with contextual information for informed decision-making in the field.
Adaptive robotic sampling, using chlorophyll a estimated from fluorescence (CF) as a proxy for phytoplankton concentrations in “hot spots” [2] has been suggested as a solution to provide information on their patchiness. CF measured in live phytoplankton cells using in-situ sensors can be used as a biomass proxy, but the values must be verified from in-vitro analyses of chlorophyll a from water samples. Therefore, the combination of adaptive robotic sampling and physical sampling of phytoplankton is needed [3, 5]. However, by not sharing the data collected by the AUVs in real time, the ocean sampling is still done deterministically. Adaptive sampling, using AUVs for targeted ocean measurements [2, 4, 6–14], is a commonly used method for capturing dynamic and transient phenomena. In adaptive methods, path planning is often done by deliberation by a utility function evaluated along potential paths. The utility function needs a prediction of the distribution of the environmental variables of interest, such as salinity, temperature, and CF. Common modeling approaches are Gaussian processes (GP) [15] and higher-order simulation models [16]. Estimation of the horizontal distribution of Chlorophyll a using a log-GP (ℓGP) has been explored previously [2, 4]. They also explored multi-vehicle communication using asynchronous surfacing.
While phytoplankton distributions are patchy and unpredictable, they are not without structure or spatial correlation. ℓGP modelling can exploit the spatial correlation in phytoplankton patches and attempt to estimate the CF at unmeasured locations while accounting for the patchy “hot-spot” nature of algal blooms [2].
The exploration problem with multiple sampling platforms was formalized in a sequential decision-theoretic planning under the uncertainty framework termed the multirobot adaptive sampling problem [2]. For efficient multi-vehicle adaptive sampling, data sharing between agents is essential for mission cooridnation and on-board model error reduction [17]. Communications-constrained multi-vehicle sampling was investigated and showed that error reduction of the on-board model is dependent on the quality of the communication between agents [17]. Communication-constrained multi-vehicle operations were carried out off the California coast, using a fleet of AUVs, communicating over the horizon by IRIDIUM® [18, 19], the Autonomous Ocean Sampling Network-II deployed a multi-month mission to map coastal waters [20], in an effort to develop a sustainable, portable, and adaptive ocean sampling system. The Adaptive Sampling And Prediction (ASAP) project [21] set out to monitor and coordinate a set of gliders using over-the-horizon communication.
Phytoplankton, as the trophic base of almost all marine ecosystems, acts as a climate regulator by converting CO2 to organic matter through photosynthesis, which can ultimately be exported to the deep ocean [22, 23]. Their community composition and timing of events can be sensitive to climate change and environmental perturbation, impacting higher trophic levels, which can eventually cause an irreversible shift in the ecosystem [24, 25]. The spatial distributions of phytoplankton are highly heterogeneous, or patchy, [26] and high density patches can occur at a wide range of spatial scales, from mesoscales to microscales [27]. The patchiness of phytoplankton is driven by physical, chemical, and biological processes [27–30]. The horizontal extent of these patches can be observed in surface waters using satellite images [31, 32]. Phytoplankton patchiness can exhibit as subsurface chlorophyll maxima, which is measured by vertical profiling sensors. Taking spurious vertical profiles limits the three-dimensional understanding of patchiness below the surface. Traditional ship-board net or water sampling for phytoplankton at predetermined locations and depths provides integrated, or averaged, data that can miss the high resolution structure of phytoplankton community structure, composition, and abundance. In the Arctic, the common haptophyte Phaeocystis pouchetii can occur as single cells or colonies up to three centimeters in diameter [33], as well as forming high-density blooms [24]. These colonies and blooms are often not fully resolved in space and time due to a lack of sufficient resolution in the sampling.
The goal of this work is to provide a method for informed physical sampling of Chlorophyll a and contextualize the samples. By adapting the vehicle path towards high concentrations of CF, the depth of the highest values of CF within an area can be estimated and sampled. Adaptive informative sampling can have an advantage over traditional “lawnmower” surveys [34]. An adaptive “uncertainty-aware” method was suggested [34], where a simulated vehicle attempted to synoptically map the distribution of phytoplankton off the coast of California. Their main finding was that the adaptive algorithm had an advantage over the classic “lawnmower” survey in reducing the uncertainty of the estimation, reducing biases incurred from human-designed surveys.
As spatiotemporal patchiness plays a fundamental role in the functioning of marine ecosystems [27], understanding the nature of this process is needed to disentangle predator-prey interactions and the flow of carbon through the food web. Here, we show how two AUVs reporting real-time data for targeted discrete water sampling can provide a better match with sampling actions and fine-scale plankton patchiness. Additionally, by having the AUVs adapt their path and share measurements, they are able to effectively map the three-dimensional extent and temporal variability of the patchy phytoplankton bloom, as tested in simulation (Fig 4). By combining adaptive robotic sampling with over the horizon satellite communication and a manned vessel, we created a heterogeneous sampling network for targeted sampling during a spring phytoplankton bloom (May 2022) in Kongsfjorden, Svalbard (Fig 5a and 5b). Adaptive sampling is achieved by modeling CF on board the AUVs using a ℓGP [2], in three dimensions, with the AUVs seeking regions of high CF and high uncertainty, following the sense-plan-act cycle [35]. Data is exchanged through the network over the IRIDIUM® [18, 19] satellite link and displayed to the operators in a central operator hub. The human-crewed vessel acted on the information gained by the autonomous agents to take targeted and contextualized water samples. This three-dimensional reconstruction of phytoplankton CF patchiness evolving in space and time offers a unique opportunity to understand the mechanism underlying their heterogeneous distributions.
Materials and methods
The method was built on the onboard algorithm used for adaptive behavior and data exchange, developed using simulation and field tested. The algorithm on board consists of (a) the predefined pilot survey, (b) the data exchange, and (c) the adaptation phase. The adaptive phase was further subdivided into (i) the on-board modelling and (ii) the path planning. For the AUVs, the mission was initialized by executing a pilot survey, followed by a data exchange. After this initial phase, the adaptive phase ran for a fixed period before a data exchange action was performed and the adaptive phase started again (Fig 1). The sampling network used the data collected by the AUVs for informed sampling from a manned craft. Adaptive path planning was carried out onboard the autonomous vehicles, providing robustness to communication loss and enabling in-situ decision making. No particular permits were needed for the experiments, as they were carried out in Norwegian territorial waters.
On-board algorithm: Pilot survey
A rectangular cuboid was selected as the operational volume shape for exploration, and the operational area is the projection on the horizontal plane of this volume. The size and orientation of the volume is commonly decided by the operator, depending on the nature of the mission, the number of vehicles available, their speed and the desired accuracy of the spatiotemporal model [20]. The pilot survey [4], is usually a pre-planned path, designed to gather data for the initialization of the on-board model. In this case, we set it as a crossing of the operational area by all AUVs in the system, where the path is determined by the number of vehicles, Nv, in the network. All vehicles were set to cross from y = 0 to ymax in parallel paths. The x value of the path was determined by
(1)
for vehicle nv of Nv, where nv is 0-indexed, and Vx was the x-extent of the operational area. The pilot survey path undulated between the surface and the maximal depth of the operational volume. After the pilot survey, the data collected was segmented and sent to the operator hub in a surfacing event, where the vehicle received data available from the other vehicles in a data exchange phase.
On-board algorithm: Data exchange
As data is collected at high rates (>1Hz) on different platforms moving at nominal speeds of 1.0–1.5m/s, it was segmented to fit the predefined message structure; enabling computational feasibility of the on-board model and data sharing over low bandwidths. The segmentation algorithm used the shared knowledge of the agents in the operational area, whereas the data exchange used segmented data for efficient low-bandwidth communication.
Segmentation.
For computational feasibility of the on-board model, the number of samples usually needs to be restrained to below ∼10000 [15]. Measurements were segmented into a nx × ny × nz grid, reducing the maximum sample size to nx ⋅ ny ⋅ nz and creating a 3D grid in the operational volume. The number of grid cells was decided by the operator and could be adjusted according to desired spatial resolution and the available on-board computing power. All agents shared a commonly defined grid. The measurements collected were assigned to their respective grid cell. The data value of that cell was set to the average of the measurements taken within that cell since the last segmentation. Previous measurements were discarded if new data was collected within that cell. We assumed that temporal de-correlation would make previous measurements irrelevant compared to the new measurements. Cells with previous held data values and no new measurements kept their data intact. Data segmentation was performed at each update step, before evaluation of the on-board model and adaption.
Exchange.
The AUVs surfaced at regular intervals, transmitting the latest segmented data, following the flowchart (Fig 2a). The surfacing event served two main purposes, (1) to exchange data with the other vehicles in the system and (2) to obtain a GPS fix to bound the navigational uncertainty of the vehicle. Data, vehicle position, and time were transmitted by IRIDIUM® [18, 19] short burst data (SBD) to a central operator hub, where the operator monitored the operation and data flow. The operator hub then responded to the vehicle with the most recent data and positions received from the other agents in the system. The IRIDIUM® SBD protocol limits the message size to 340 bytes [18, 19], while the vehicle-specific driver needed 20 bytes for overhead. Data for one grid cell took up nd = 3 + nc bytes, where nc is the number of data channels (e.g. fluorescence, dissolved oxygen, temperature, salinity), creating the message structure (Table 1). This enabled the sending of up to data points over IRIDIUM® SBD.
Flowcharts of the (a) on-board algorithm, starting at the pilot survey, followed by the cyclical data exchange and adaptive phases, and (b) adaptive phase, with data ingestion, ℓGP evaluation at potential waypoints, x′, and their evaluation.
Data message structure for nc = 1, cell number is sent as a UInt16, as the number of grid cells should be in the interval 256 to 65536, and where k ∈ [0, …, ld) is the data index.
If the queue of unsent data points was longer than ld, the message was truncated to contain the ld newest data points. The operator hub also sent a message containing the last known positions of the other vehicles in the network to the surfaced vehicle. This message was not as constrained in size as the data message, since the number of vehicles, Nv, is sufficiently small.
Operator hub.
As the communication center in the sampling network, the operator hub distributed the information throughout the sampling network, both to the AUVs and operators. Data was transmitted to the operator hub and presented to the operators both as raw messages, and the ℓGP realization, with the latest vehicle position. From there, the operators could interrogate the model for a predicted profile anywhere in the operational space along with its error bounds. Based on this, operators could choose to act on the information and perform targeted sampling, having a greater understanding of the contextual data field. The act of sending the data near real time through the operator hub enabled a ‘human-in-the-loop’ approach. When the data exchange was completed, the vehicles updated the model and began the adaptive phase.
On-board algorithm: Adaptive phase
In the adaptive phase (Fig 2b), the available data was used as input for a GP [15], in the form of a ℓGP [2, 4]. Potential paths were generated from a set of potential waypoints around the vehicle and deliberated upon using the predictive CF mean value and variance at the waypoint. Furthermore, its proximity to other vehicles in the area was taken into account. When the vehicle reached the desired waypoint, a new waypoint was chosen by the same means until it was time for a data exchange.
On-board model.
The ℓGP [2, 4, 15] was expanded to three physical dimensions and time-varying uncertainty was added to the measurements. We denoted the set of measurements yS at locations X at times τ, with S = [X, τ] and let zS = logeyS − mGP, where was the mean of the input data. When taking the logarithm of the measurements, they became unitless [36]. We assumed that
, such that a GP could be used for predictions of z at unobserved locations and times by the formulation in Eqs (2) to (4) [15].
(2)
(3)
(4)
Where S* = [X*, τnow] was a set of prediction locations at the time of the prediction, τnow,
was the predictive mean at the locations S*, and Ku,v was the kernel function evaluated at the locations u and v, and
was the nugget variance [37]. The kernel function (Eq 5), where M = diag([Mx, My, Mz, Mτ]) is the diagonal matrix of de-correlation lengths for each dimension of s, including time τ, where s is a row of S; one location.
(5)
Time was included in the kernel function to account for the temporal decay of accuracy in the measurements. This lead to the covariance matrix having to be updated at each evaluation step, further leading to a full evaluation of the GP at each iteration, in stead of step-wise ingestion of data [3, 14]. In order to produce a predictive mean for the value of CF, the predictive mean for zS must be transformed [2] as presented in Eqs (6) and (7).
(6)
(7)
From Eqs (6) and (7) we see that the variance, , is dependent on the predictive mean, unlike for the linear GP. This needs to be taken into account when evaluating the potential paths for optimal, adaptive sampling.
Path planning.
After data was ingested into the ℓGP, it was evaluated at potential waypoint locations, in order to find the optimal path. Using the current position of the AUV, xv, as the center, it created 3nθ potential waypoint locations, where nθ is the number of segments uniformly spread around a circle. These locations were generated as xi,j = xv + [r cos θi, r sin θi, Δzj], where Δz = [−zd, 0, zd], xv is the vehicle position at the current time, r is a radius, and zd is the vertical extent of one grid cell (Fig 3a). Each of the potential locations were screened to see if they were inside the operational volume, and the ones who fell outside were discarded. Then, the predictive mean and uncertainty at the potential locations were evaluated along with the distance from the location, xi,j, to the last known location of the other vehicles. The optimal waypoint maximized (Eq 8).
(8)
The constants kμ, kσ, and kl were configuration parameters, enabling the operator to choose the weight of each element of the evaluation function. Tuning of kμ and kσ was done in simulation, where the trade-off between uncertainty reduction and maxima-seeking could be set by the operators. Setting kl is a function of how close the operator wants the vehicles to come to each other, and in deciding its value, the speed of the vehicles and the size of the operational area needs to be taken into account [20]. Notice that only the horizontal distance between the vehicles was taken into account, as the vertical extent of the operational volume usually is far less than the horizontal. After the optimal waypoint was chosen, the AUV travelled towards it, collecting underway measurements. Then, when the waypoint was reached, the measurements were segmented, the ℓGP evaluated, and a new waypoint chosen.
Simulation and field deployment
The method was implemented in the Robot Operating System (ROS) [38] and tested using a simulated CF field along with the vehicle simulator in DUNE (DUNE: Unified Navigation Environment) [39]. Constellations of multiple AUVs were simulated using a virtual machine for each vehicle, using the host computer as the operator hub, and emulating satellite communication by reading and writing to a shared directory on the host computer. A simplified simulation, showing the basic functionality is available [40], along with the ROS packages [41]. A field deployment was carried out during the Arctic spring bloom, on the 28th of May of 2022 in Kongsfjorden, Svalbard, focusing on an area around sampling station KB3 [42]. Operator input values (Table 2) for the above presented method were chosen based on simulation trials and prior assumptions [17, 43]. Values are common to simulation and field deployment, except the operational volume x- and y-extent. De-correlation lengths are softly constrained by the grid cell size and the size of the operational area. Small correlation lengths resulting in a high variance in-homogeneous prediction, whereas high correlation lengths would smooth the predictions, possibly erasing important features. The choice of de-correlation lengths here is based on heuristics. A conceptual overview of the operational setup is presented in Fig 3b.
Vehicles and sensors.
The on-board algorithm was implemented on two Light AUVs (LAUVs) [39], using their backseat driver [44]. The LAUV concept was developed at the Underwater Systems and Technology Laboratory at the University of Porto, and commercially produced by OceanScan Marine Systems and Technology Lda. The vehicles, LAUVs “Harald” and “Roald” (AUR-Lab, NTNU), and are equipped with a scientific payload of a CTD sensor, and a chlorophyll a fluorometer. Specifically, LAUV “Roald” is equipped with a RBR Turner Cyclops Fluorometer and an AML Smart X CTD, while LAUV “Harald” carries a WetLabs EcoPuck Triplet, measuring chlorophyll a fluorescence, colored dissolved organic matter and optical backscatter, and a SBE 49 FastCAT CTD. The on-board software, DUNE, (also used for simulation) controls the navigation, sensors, and low-level control and guidance on the vehicle. A manned work boat, R/V Teisten (Kings Bay AS), was used as the platform for targeted sampling. It was equipped with a SAIV SD204 CTD with a Seapoint chlorophyll a in situ Fluorometer, 300m-capable winch and 10L Niskin bottle water samplers [46].
In vitro sampling.
A subsurface CF maximum could be biased by non-photochemical quenching (NPQ) [45], exhibiting as a subsurface peak, by quenching the signal close to the surface rather than representing the true distribution of chlorophyll a in the water column. We compared CF predictions from in situ AUV data and shipborne CF measurements with the in vitro concentrations of chlorophyll a from water samples. To validate the data retrieved from sensors, discrete water samples for measurement of in vitro chlorophyll a were collected using 10 L Niskin bottles [46] on board the work boat. Sample depths were selected based on layers of relatively high, medium and low chlorophyll a concentrations estimated by the model generated in the operator hub from data from the AUVs. Seawater was filtered (0.5 L) onto Whatman GF/F glass fiber filters [5]. In the laboratory, chlorophyll a was extracted in 100% methanol after 20 hours at −20°C in darkness. The chlorophyll a concentrations were determined using a Turner Designs Trilogy fluorometer (model: 7200–000) following the non-acidification method [47].
Results
Our approach is evaluated in three parts, (1) by simulating up to four AUVs in a common CF field, (2) by a field experiment, deploying the heterogeneous network and comparing the predictions from the AUV data to measurements and samples taken from a crewed vessel, and (3) by challenging the underlying assumptions of the distribution and statistical properties of the CF field. Hence, the results of convergence time in simulation, model performance compared to field measurements, and the statistics of the CF field are presented.
Simulation
For development and testing of the algorithm for multi vehicle adaptive mission planning, simulation trials were used, simulating a spring bloom, based on expected values and distributions. The system enabled each vehicle in the heterogeneous network to make decisions in-situ, mapping the phytoplankton plume finding a path that minimized uncertainty for the operation volume while seeking CF maxima. A single vehicle simulation over a 3000m × 3000m × 50m volume resulted in the on-board model predictive mean converging after 6000 − 7000 seconds, with the root mean squared error (RMSE) between the on-board model and the simulated field as the convergence criterion. When the single vehicle approach was compared with multi vehicle one, a significant gain in model accuracy and convergence time was observed when adding one vehicle. Further, the benefit of adding more vehicles diminished as the number of vehicles in the network increased. (RMSE results for one to four vehicles, Fig 4a). Using four vehicles, the RMSE converged to the nugget standard deviation of the models used in simulation, σn = 0.7.
Simulation results and simulated CF field, (a) RMSE of on board models compared to simulated ground truth, of one to four vehicle missions of the same operational area and underlying simulated field. Simulated (b) chlorophyll-a fluorescence field, plotted on a 3000m × 3000m × 50m operational volume used for simulation. Plotted using log-scale for easier visualization.
Field deployment
The goal of the field experiment was to estimate the three dimensional distribution of CF in the operational volume by using two AUVs, and to verify the adaptive on-board mapping algorithms and sensing network. Using the CF data from the AUVs, the ℓGP was evaluated at each grid cell in the operational volume (Fig 5d and 5e). A horizontal slice plotted for the interval y ∈ [1200, 1500]m (Fig 5c and 5f) to evaluate the area along the Northwestern edge of the operational volume near the point marked KB3. In both cases (Fig 5c–5f), a clear vertical structure in the predicted CF was visible, with a low concentration in the upper 15 − 20m, followed by a layer of higher concentration below 20m. In the horizontal plane, patchiness was also visible with a heterogeneous distribution without a clear trend. There is a sporadic discrepancy between the CF measurements and the underlying field, (a discrepancy feature can be seen in Fig 5c at 45m depth and x ≃ 900m), indicating noisy measurements and/or patchy distributions of CF at that position.
Satellite overview images of (a) Svalbard (courtesy of U.S. Geological Survey) and (b) Kongsfjorden (courtesy of U.S. Geological Survey), with the operational area marked in red, and station KB3 indicated by a white triangle. Predictive mean CF (c,d), and variance (e,f), of the ℓGP, of the operational volume (c,e) and a horizontal slice (d,f), 300m wide, over the interval y ∈ [1200, 1500]m. Plotted in log-scale for visualisation, with circles (c,f) and dots (e,d) indicating grid cells that contain measurements. All field deployment data from AUVs is from 28th of May 2022.
To asses the on-board model, it is re-run in post process, using input from both the AUVs. The averaged model uncertainty for the on-board model decreases for the first 3000 seconds, before leveling off at around 2.5μg2/L2 (Fig 6a). In addition to higher precision, the accuracy also increases with time (Fig 6b), with fewer and lower peaks of RMSE between instances of the ℓGP. The predicted CF profiles at station KB3 are plotted after each data exchange event (Fig 6c) as seen from the operator hub. The synthetic profiles follow the same pattern of low CF values in the upper 20m, with a wide maxima between 30 and 50m, showing small variations for the prediction as new data is gathered.
Averaged (a) prediction uncertainty of each grid cell, (b) prediction error in relation to next prediction for all cells, and synthetic (c) CF profiles at station KB3, after each data exchange event, along with the in-vitro chlorophyll a values from water samples. Names (a and b) “Harald” and “Roald” refer to the two AUVs used in the field deployment.
The ℓGP was evaluated at the sampling location (KB3 [42, 48]), producing a predicted CF profile. To validate this profile, it was compared to the in-situ CF measurements from the fluorometer attached to the shipborne CTD and the in-vitro chlorophyll a values (Fig 7a). These three independent profiles (CF prediction, CF measurement and in-viro measurement from water samples) all showed the same trend for the water column. A CTD cast down to 300m showed low CF values near the surface down to 20m, and a subsurface maxima between 28 and 50m. Additionally, the shipborne measurements were noisy, likely caused by the combination of a small sampling volume for the fluorometer and the bloom being dominated by the colony forming Phaeocystis pouchetii (colony in Fig 7b and individual cells in Fig 7c). Targeted water samples were taken at locations with both high and low CF predictions based on data from the AUVs (lowest 0 and 17m and highest 25, 37, and 50m). Values of in-vitro chlorophyll a from the water samples were approximate to the prediction from the ℓGP (Figs 6c and 7a). In the log-space (Fig 7a), the largest offset between the three profiles is in the upper 15m, where the values are the lowest. The prediction of CF, shipborne CF measurements and in-vitro chlorophyll a measurements follow each other closer below 15m. The uncertainty bounds on the predicted CF profile were wide in relation to the noise level on the ship borne CF measurements. This can be caused by the temporal and spatial offset between the AUV measurements and the shipborne measurements combined with the nugget effect noise. This indicates that the ℓGP was well configured and able to predict the in-situ and in-vitro values and uncertainties.
Phytoplankton bloom conditions, (a) a comparison of predicted profile of CF from the ℓGP and its one standard deviation, ship borne CF measurements on both down- and upcast and in-vitro chlorophyll a values from water samples at station KB3 at 11:47 local time on the 28th of May 2022. Colony (b) of Phaeocystis pouchetii and an (c) example of the phytoplankton community present in the bloom, both (b and c) microscope images. Photos: (b) Ane Cecilie Kvernvik and (c) Anette Wold.
Statistical properties
Due to the “hot-spot” characteristics [2, 4, 17, 49] CF measurements in bloom conditions, we assumed that they would have a log-normal distribution. This assumption was confirmed by a best-fit analysis [50], evaluating log-normal, gamma, beta, Burr [51], and normal distributions (Table 3 and Fig 8a), using the sum of squared errors and the Akaike information criterion [50, 52]. From these analyses, the log-normal distribution had the lowest sum of squared errors, making it the best fit for the data.
Statistical properties, (a) a histogram of raw CF data along with fitted log-normal, Burr, beta, and normal probability density functions for the entirety of the data set from the 28th of May 2022. Semivariogram (b), with data points plotted in blue dots and the a theoretical best fit function as a green line, and histogram (c) over number of samples for each lag bin, with each bar corresponding to a data point in (b). Semivariogram and histogram were generated using a random sample of 25% of the data.
In order to determine if the de-correlation lengths and variances used in the ℓGP were reasonable, a semivariogram analysis of the data was performed (Fig 8b and 8c). In the semivariogram, a best fit spherical theoretical function [53] was plotted. From the semivariogram, two sills are visible, one at 500m, and one at 1600m, whereas the nugget semivariance appears to be 0.8, whereas the spatio-temporal variance lies between 2 and 3. From the best fit theoretical function, the parameters were found to be Mx = My = 1537m, ,
. Future ℓGP variables can be picked from the semivariorgram, either from the fit theoretical function or the data, or an educated combination of the two.
Discussion
We have shown that a 3D adaptive sampling network can provide near real time data updates to the operators, both in simulation and in field deployments. For the duration of the mission, the underwater vehicles in the network were able to adapt to and monitor the CF field within the operational volume. Acting on the data sent from the AUVs, the operators were able to measure and verify the predicted CF profile at the sampling location KB3 [42] and lab analysis, in Kongsfjorden, Svalbard.
The simulation study showed increased model accuracy and decreased model convergence time when using more than one autonomous vehicle within the same operational area. However, this was done using a static underlying field which could skew the results towards higher accuracy and precision than a dynamic underlying field. Despite of this, the findings hold true, the model accuracy increases with the number of vehicles in the same area, with a smaller increase in model accuracy per vehicle for each vehicle added. Further, the simulated CF field contained two large scale patches (Fig 4b), these did not mimic the small scale heterogeneity of the CF measurements observed in the field deployments, yet, the large scale patchiness carry similar characteristics. Using a sufficient number of vehicles, the RMSE converged towards the inherent process and sensor noise, and a lower value would be hard to achieve without changing the model parameters. The convergence of the model uncertainty and reduced prediction variability (Fig 6a and 6b) after 3000s in the field deployment indicate that the model was able to fit the field data. By using a Gaussian method, the model naturally became smoothed and “compressed”, not exhibiting the extremes of the measurements in the predictive mean. This effect is also visible in the field deployment data (Fig 5c and 5d). Further, by using an adaptive mission planning method, rather than a pre-planned one, the AUVs were able to seek the regions of highest variance and predicted CF values from the on board model. In having log-normally distributed data as input, the ℓGPs posterior covariance, , is directly dependent on the predictive mean,
, and thus more exposed to variability in the field value, compared to the Gaussian random field [2, 54]. Adaptive methods have also been shown to be superior to the “lawnmower” maneuver [34] when measuring chlorophyll a.
Comparing the vertical profiles from AUVs, work boat, and in vitro measurements (Fig 7a), the measurements and predictions follow each other well below 20m. In the upper 15m, the work boat CF measurements fall out of the error bars of the prediction. For the discrepancy between model prediction and work boat CF measurements we suggest these possible origins; (1) inaccurate sensor calibration, (2) the model prediction uncertainty, erring on the positive side (Eq (6)), and (3) measurements were not co-located and contemporaneous. In comparing the predicted profiles (Fig 6c), there is little temporal development, and change in the prediction, indicating that even a small amount of data was sufficient to generate a representative prediction. The small surface discrepancy (Fig 7a), ≃ 0.1μg/L, between CF measruements and in vitro chlorophyll a can be explained by NPQ [45], a phenomenon in which the amount of measured CF and amount of intracellular chlorophyll a does not match the calibration of the sensor. Since the discrepancy is largest in the surface, where the light is strongest, NPQ is a likely candidate for explanation. There is also a temporal and vertical distance between work boat CF measurements and water sampling, increasing the probability of the discrepancy to originate from spatio-temporal process noise.
From the observations gathered during the field campaign, it was indicated that the predominant species of the algal bloom at the time of sampling was Phaeocystis pouchetii. Patches or colonies of P. pouchetii would be within one sampling envelope of a chlorophyll a fluorometer (≃ 3ml [55]), but would be integrated in the 10L Niskin bottle sample. This helps to explain the relative difference in noise level of the in situ CF measurements and the in vitro samples of chlorophyll a (Fig 7a). The values from in vitro samples also fits the predicted values from the in situ CF gathered by the AUVs, where the uncertainty reflects the measurement and process noise in the data. In addition, the in vitro samples were pseudoreplicated, with a variance of ≃ 0.01 [μg2/L2], within each sample. This is negligible in relation to the variance of the in situ CF measurements. The noise could also be caused by the low signal-to-noise ratio from fluorescence sensors, irradiance from the sun, different species of algae, photosynthetic rate, and health of the cells [56].
The assumption of a log-normal CF distribution [2, 17] was verified (Fig 8), however, the input to the ℓGP is the cell-averaged CF values, not the raw measurements. When attempting a best-fit analysis on the segmented data, the results were less clear, with no obvious best fit distribution. A criticism to the best-fit analysis, is that the data is collected adaptively, seeking to maximize the CF value, and reduction in uncertainty (dependent on the CF value), thereby skewing the data in favour of higher values, thus skewing the histogram.
From the fit theoretical function in the variogram (Fig 8b), the nugget and variance was found to be within a reasonable range from the initial heuristic values. The nugget effect used was smaller than the one presented in the data, while the variance used was higher than the one used in the model. However, both values were within a reasonable range of the initial estimates. The horizontal de-correlation length predicted from the variogram, 1537m, is substantially longer than the de-correlation length used in the on board model, 600m. However, from the variogram data, there seems to be two predominant de-correlation lengths, the longer at 1537m and a shorter at around 500m. In choosing a shorter horizontal length scale, the model might over-estimate the uncertainty, but is more apt at preserving heterogeneity in the data. In the semivariogram, time was not taken into account, as the total mission time was less than two hours, and thus we can assume it to be fairly static [3, 4, 16].
Conclusion
Our data shows that real-time data reported by multiple collaborating autonomous underwater vehicles combined with targeted water sampling can help increase our understanding of plankton patchiness. A method for adaptive tracking of CF based on a log-Gaussian process as a lightweight ocean model for a sensing network of AUVs and a boat was presented. The method models the chlorophyll a abundance, inferred by in-situ fluorescence measurements from made by the underwater vehicles. The AUVs were able to detect, explore, and measure the CF field inside the operational volume for the duration of the mission. The information relayed to the boat was acted upon and water samples and in-situ CF measurements were taken from the boat. By enabling on-board adaption, coupled with a small-bandwidth satellite communication, the vehicles were able to make decisions in-situ, targeting the desired combination of uncertainty reduction and CF maxima-seeking. By adding temporally increasing noise to the measurements, the on-board ℓGP model was able to increase the significance of the most recent measurements. For the multi vehicle adaptive sampling with data sharing, the vehicles are forced to communicate, with the added benefit of allowing the operator to observe the data and communications in real time. Having an active operator, acting upon the information received, it was showed that targeted sampling of the water-column were enriched by the field estimates. In future work, including covariates, such as CTD, currents, spectral radiance, irradiance, reflectance from in-situ and remote sensing should be considered, providing a more holistic picture of the region of interest. Longer duration deployments in varying bloom conditions should be undertaken to further robustify the method by collecting data on spatial and temporal length-scales in a diverse set of scenarios, leading to a more well-calibrated model and utility function. Further work should also include a more formal configuration of the path-finding parameters in (Eq (8)), optimizing for the desired behavior in a multitude of bloom scenarios.
More information on pelagic biodiversity is urgently needed as the demand on these resources increases from climate change and harvesting. Future integration with automated eDNA samplers, such as LRAUV-ESP [57], or imaging instruments [58], together with adaptive sampling technologies reporting in real-time can be used to contextualize and inform plankton community sampling. This will help to fill important knowledge gaps and observe complex biological processes at fine-scale temporal and spatial resolutions.
Acknowledgments
Thank you to Kings Bay AS for providing helpful service and instrumentation. The authors would also like to thank Emily Venables and Tomasz Piotr Kopec from the University of Tromsø for invaluable help in the field.
References
- 1. McQuatters-Gollop A, Reid PC, Edwards M, Burkill PH, Castellani C, Batten S, et al. Is there a decline in marine phytoplankton? Nature. 2011;472(7342):E6–E7. pmid:21490625
- 2.
Low KH, Dolan J, Schneider J, Elfes A. Multi-robot adaptive exploration and mapping for environmental sensing applications. PhD thesis, CMU. 2009;.
- 3.
Fossum TO, Fragoso GM, Davies EJ, Ullgren JE, Mendes R, Johnsen G, et al. Toward adaptive robotic sampling of phytoplankton in the coastal ocean. Science Robotics. 2019;4(27). https://doi.org/10.1126/scirobotics.aav3041 pmid:33137739
- 4.
Kemna S. Multi-robot strategies for adaptive sampling with autonomous underwater vehicles [thesis]. University of Southern California; 2018.
- 5. Fragoso GM, Davies EJ, Fossum TO, Ullgren JE, Majaneva S, Aberle N, et al. Contrasting phytoplankton-zooplankton distributions observed through autonomous platforms, in-situ optical sensors and discrete sampling. Plos one. 2022;17(9):e0273874. pmid:36067176
- 6.
McGann C, Py F, Rajan K, Thomas H, Henthorn R, McEwen R. T-rex: A model-based architecture for auv control. In: 3rd Workshop on Planning and Plan Execution for Real-World Systems. vol. 2007; 2007.
- 7. Zhang Y, Ryan JP, Bellingham JG, Harvey JB, McEwen RS. Autonomous detection and sampling of water types and fronts in a coastal upwelling system by an autonomous underwater vehicle. Limnology and Oceanography: Methods. 2012;10(11):934–951.
- 8.
Zhang Y, Bellingham JG, Ryan JP, Kieft B, Stanway MJ. Two-dimensional mapping and tracking of a coastal upwelling front by an autonomous underwater vehicle. In: 2013 OCEANS-San Diego. IEEE; 2013. p. 1–4.
- 9. Zhang Y, Bellingham JG, Ryan JP, Kieft B, Stanway MJ. Autonomous four-dimensional mapping and tracking of a coastal upwelling front by an autonomous underwater vehicle. Journal of Field Robotics. 2016;33(1):67–81.
- 10. Zhang Y, Kieft B, Hobson BW, Ryan JP, Barone B, Preston CM, et al. Autonomous tracking and sampling of the deep chlorophyll maximum layer in an open-ocean eddy by a long-range autonomous underwater vehicle. IEEE Journal of Oceanic Engineering. 2019;.
- 11.
Fossum TO. Adaptive Sampling for Marine Robotics [thesis]. Norwegian University of Science and Technology; 2019.
- 12. Fossum TO, Norgren P, Fer I, Nilsen F, Koenig ZC, Ludvigsen M. Adaptive Sampling of Surface Fronts in the Arctic Using an Autonomous Underwater Vehicle. IEEE Journal of Oceanic Engineering. 2021;.
- 13. Fossum TO, Travelletti C, Eidsvik J, Ginsbourger D, Rajan K. Learning excursion sets of vector-valued Gaussian random fields for autonomous ocean sampling. The Annals of Applied Statistics. 2021;15(2):597–618.
- 14. Stankiewicz P, Tan YT, Kobilarov M. Adaptive sampling with an autonomous underwater vehicle in static marine environments. Journal of Field Robotics. 2021;38(4):572–597.
- 15.
Rasmussen CE. Gaussian processes in machine learning. In: Summer School on Machine Learning. Springer; 2003. p. 63–71.
- 16. Berget GE, Fossum TO, Johansen TA, Eidsvik J, Rajan K. Adaptive sampling of ocean processes using an AUV with a Gaussian proxy model. IFAC-PapersOnLine. 2018;51(29):238–243.
- 17.
Kemna S, Caron DA, Sukhatme GS. Adaptive informative sampling with autonomous underwater vehicles: Acoustic versus surface communications. In: OCEANS 2016 MTS/IEEE Monterey. IEEE; 2016. p. 1–8.
- 18.
Maine K, Devieux C, Swan P. Overview of IRIDIUM satellite network. In: Proceedings of WESCON’95. IEEE; 1995. p. 483.
- 19. Pratt SR, Raines RA, Fossa CE, Temple MA. An operational and performance overview of the IRIDIUM low earth orbit satellite system. IEEE Communications Surveys. 1999;2(2):2–10.
- 20. Leonard NE, Paley DA, Lekien F, Sepulchre R, Fratantoni DM, Davis RE. Collective motion, sensor networks, and ocean sampling. Proceedings of the IEEE. 2007;95(1):48–74.
- 21.
Bhatta P, Fiorelli E, Lekien F, Leonard NE, Paley D, Zhang F, et al. Coordination of an underwater glider fleet for adaptive ocean sampling. In: Proc. International Workshop on Underwater Robotics, Int. Advanced Robotics Programmed (IARP), Genoa, Italy. Citeseer; 2005.
- 22. Falkowski PG. The role of phytoplankton photosynthesis in global biogeochemical cycles. Photosynthesis research. 1994;39(3):235–258. pmid:24311124
- 23. Trudnowska E, Lacour L, Ardyna M, Rogge A, Irisson JO, Waite AM, et al. Marine snow morphology illuminates the evolution of phytoplankton blooms and determines their subsequent vertical export. Nature communications. 2021;12(1):1–13. pmid:33990580
- 24.
Sakshaug E, Johnsen GH, Kovacs KM. Ecosystem Barents Sea. Tapir Academic Press; 2009.
- 25.
Falkowski PG, Raven JA. Aquatic photosynthesis. Princeton University Press; 2013.
- 26. Grünbaum D. The logic of ecological patchiness. Interface Focus. 2012;2(2):150–155. pmid:23565330
- 27. Robinson KL, Sponaugle S, Luo JY, Gleiber MR, Cowen RK. Big or small, patchy all: Resolution of marine plankton patch structure at micro-to submesoscales for 36 taxa. Science advances. 2021;7(47):eabk2904. pmid:34797707
- 28. Mitchell JG, Yamazaki H, Seuront L, Wolk F, Li H. Phytoplankton patch patterns: seascape anatomy in a turbulent ocean. Journal of Marine Systems. 2008;69(3-4):247–253.
- 29. Durham WM, Stocker R. Thin phytoplankton layers: characteristics, mechanisms, and consequences. Annual review of marine science. 2012;4:177–207. pmid:22457973
- 30. Van Gennip S, Martin AP, Srokosz MA, Allen JT, Pidcock R, Painter SC, et al. Plankton patchiness investigated using simultaneous nitrate and chlorophyll observations. Journal of Geophysical Research: Oceans. 2016;121(6):4149–4156.
- 31.
Schalles JF. Optical remote sensing techniques to estimate phytoplankton chlorophyll a concentrations in coastal. In: Remote sensing of aquatic coastal ecosystem processes. Springer; 2006. p. 27–79.
- 32. Dierssen HM. Perspectives on empirical approaches for ocean color remote sensing of chlorophyll in a changing climate. Proceedings of the National Academy of Sciences. 2010;107(40):17073–17078.
- 33. Wassmann P, Ratkova T, Reigstad M. The contribution of single and colonial cells of Phaeocystis pouchetii to spring and summer blooms in the north-eastern North Atlantic. Harmful algae. 2005;4(5):823–840.
- 34. Frolov S, Garau B, Bellingham J. Can we do better than the grid survey: Optimal synoptic surveys in presence of variable uncertainty and decorrelation scales. Journal of Geophysical Research: Oceans. 2014;119(8):5071–5090.
- 35. Gat E, Bonnasso RP, Murphy R, et al. On three-layer architectures. Artificial intelligence and mobile robots. 1998;195:210.
- 36. Matta CF, Massa L, Gubskaya AV, Knoll E. Can one take the logarithm or the sine of a dimensioned quantity or a unit? Dimensional analysis involving transcendental functions. Journal of Chemical Education. 2011;88(1):67–70.
- 37.
Cressie N, Wikle CK. Statistics for spatio-temporal data. John Wiley & Sons; 2015.
- 38.
Quigley, Morgan and Conley, Ken and Gerkey, Brian and Faust, Josh and Foote, Tully and Leibs, Jeremy and Wheeler, Rob and Ng, Andrew Y et.al. ROS: an open-source Robot Operating System ICRA workshop on open source software, Kobe, Japan, 2009;3.2:5
- 39. Sousa A, Madureira L, Coelho J, Pinto J, Pereira J, Sousa JB, et al. LAUV: The man-portable autonomous underwater vehicle. IFAC Proceedings Volumes. 2012;45(5):268–274.
- 40.
Mo-Bjørkelund, T. Python 3D Adaptive sampling of log Gaussian field using multiple vehicles. Available from: https://github.com/toremobjo/3dlgp_adaptive_sampling.
- 41.
Mo-Bjørkelund, T. ROS packages used in 3D Adaptive sampling of log Gaussian field using multiple vehicles. Available from: https://github.com/toremobjo/Multivehicle3DAdaptive.
- 42. Renaud PE, Tessmann M, Evenset A, Christensen GN. Benthic food-web structure of an Arctic fjord (Kongsfjorden, Svalbard). Marine Biology Research. 2011;7(1):13–26.
- 43. Fossum TO, Eidsvik J, Ellingsen I, Alver MO, Fragoso GM, Johnsen G, et al. Information-driven robotic sampling in the coastal ocean. Journal of Field Robotics. 2018;35(7):1101–1121.
- 44.
Pinto J, Mendes R, da Silva JC, Dias JM, de Sousa JB. Multiple Autonomous Vehicles Applied to Plume Detection and Tracking. In: 2018 OCEANS-MTS/IEEE Kobe Techno-Oceans (OTO). IEEE; 2018. p. 1–6.
- 45. Muller P, Li XP, Niyogi KK. Non-photochemical quenching. A response to excess light energy. Plant physiology. 2001;125(4):1558–1566. pmid:11299337
- 46.
Niskin SJ; Jan. 13, 1970. Available from: https://patents.google.com/patent/US3489012?oq=niskin+bottle.
- 47. Holm-Hansen O, Riemann B. Chlorophyll a determination: improvements in methodology. Oikos. 1978; p. 438–447.
- 48.
Hop H, Wiencke C. The Ecosystem of Kongsfjorden, Svalbard. In: The Ecosystem of Kongsfjorden, Svalbard. Springer; 2019. p. 1–20.
- 49. Campbell Janet W The lognormal distribution as a model for bio-optical variability in the sea Journal of Geophysical Research: Oceans. 1995;100(C7)13237–13254, Wiley Online Library.
- 50.
Cokelaer T. Python FITTER package; 2022. Available from: https://fitter.readthedocs.io/en/latest/index.html.
- 51. Burr IW. Cumulative frequency functions. The Annals of mathematical statistics. 1942;13(2):215–232.
- 52. Stoica P, Selen Y. Model-order selection: a review of information criterion rules. IEEE Signal Processing Magazine. 2004;21(4):36–47.
- 53.
Mälicke M, Möller E, Schneider HD, Müller S. mmaelicke/scikit-gstat: A scipy flavoured geostatistical variogram analysis toolbox; 2021.
- 54.
Eidsvik J, Mukerji T, Bhattacharjya D. Value of information in the earth sciences: Integrating spatial modeling and decision analysis. Cambridge University Press; 2015.
- 55.
Boss E, Haëntjens N. Primer regarding measurements of chlorophyll fluorescence and the backscattering coefficient with WETLabs FLBB on profiling floats.; 2016.
- 56. Johnsen G, Bricaud A, Nelson N, Prézelin B, Bidigare R. In vivo bio-optical properties of phytoplankton pigments. Phytoplankton pigments: Characterization, chemotaxonomy and applications in oceanography. 2011; p. 497–537.
- 57. Yamahara KM, Preston CM, Birch J, Walz K, Marin III R, Jensen S, et al. In situ autonomous acquisition and preservation of marine environmental DNA using an autonomous underwater vehicle. Frontiers in Marine Science. 2019;6:373.
- 58. Davies EJ, Brandvik PJ, Leirvik F, Nepstad R. The use of wide-band transmittance imaging to size and classify suspended particulate matter in seawater. Marine pollution bulletin. 2017;115(1-2):105–114. pmid:27931867