Assessing the Underwater Acoustics of the World's Largest Vibration Hammer (OCTA-KONG) and Its Potential Effects on the Indo-Pacific Humpbacked Dolphin (Sousa chinensis)

Anthropogenic noise in aquatic environments is a worldwide concern due to its potential adverse effects on the environment and aquatic life. The Hongkong-Zhuhai-Macao Bridge is currently under construction in the Pearl River Estuary, a hot spot for the Indo-Pacific humpbacked dolphin (Sousa chinensis) in China. The OCTA-KONG, the world's largest vibration hammer, is being used during this construction project to drive or extract steel shell piles 22 m in diameter. This activity poses a substantial threat to marine mammals, and an environmental assessment is critically needed. The underwater acoustic properties of the OCTA-KONG were analyzed, and the potential impacts of the underwater acoustic energy on Sousa, including auditory masking and physiological impacts, were assessed. The fundamental frequency of the OCTA-KONG vibration ranged from 15 Hz to 16 Hz, and the noise increments were below 20 kHz, with a dominant frequency and energy below 10 kHz. The resulting sounds are most likely detectable by Sousa over distances of up to 3.5 km from the source. Although Sousa clicks do not appear to be adversely affected, Sousa whistles are susceptible to auditory masking, which may negatively impact this species' social life. Therefore, a safety zone with a radius of 500 m is proposed. Although the zero-to-peak source level (SL) of the OCTA-KONG was lower than the physiological damage level, the maximum root-mean-square SL exceeded the cetacean safety exposure level on several occasions. Moreover, the majority of the unweighted cumulative source sound exposure levels (SSELs) and the cetacean auditory weighted cumulative SSELs exceeded the acoustic threshold levels for the onset of temporary threshold shift, a type of potentially recoverable auditory damage resulting from prolonged sound exposure. These findings may aid in the identification and design of appropriate mitigation methods, such as the use of air bubble curtains, “soft start” and “power down” techniques.


Introduction
Over the past few decades, anthropogenic (human-generated) noise in aquatic environments has generated worldwide concern due to its potential adverse effects on the environment and aquatic life [1][2][3][4][5]. Of particular concern are the intense impulsive sounds from explosive detonations, seismic surveys and pile driving, common activities in the construction of renewable-energy marine wind farms, docks and bridges. The effects on marine mammals have been of particular interest [6][7][8][9]. This concern is partly due to the protected status of marine mammals under state laws and international conventions, such as the Convention on International Trade in Endangered Species of Wild Fauna and Flora, as well as their vulnerability to ambient noise. Cetaceans have a sophisticated acoustic sensory system with wideband hearing sensitivity [10], and they are heavily dependent on the acoustic environment for many life functions. They have evolved sophisticated vocalizations and multiple sound-reception pathways, and they rely on acoustic stimuli for social interaction, navigation and foraging in the marine environment [10].
The Greater Pearl River Delta is one of the most economically developed regions in China [11]. However, land transport between its western (such as the Zhuhai and Macao Special Administrative Regions) and eastern regions (such as the Hongkong Special Administrative Region) is limited by the Pearl River Estuary (PRE). To increase the region's economic competitiveness and to facilitate economic collaboration, e.g., by reducing the costs involved in transporting people and goods between the regions, the Hongkong-Zhuhai-Macao Bridge (HZMB) is being constructed to connect these three cities. The HZMB Island Tunnel Project is a large-scale, cross-boundary sea crossing involving more than 300 supporting bridge piles, an underwater tunnel and two artificial islands (Fig. 1). Construction began on 15 December 2009 and is expected to continue into 2016 [12].
The PRE (22u169S; 113u439E) is a hot spot for the Indo-Pacific humpbacked dolphin (Sousa chinensis, locally called the Chinese white dolphin), which is distributed in shallow coastal waters from South Africa in the west to southern China in the east [13]. This species is currently assessed as Near Threatened; the chinensis-type geographic form (found from the east coast of India to China) is categorized as Vulnerable by the International Union for the Conservation of Nature Red List of Threatened Species [13] and as a Grade One National Key Protected Animal by China's Wild Animal Protection Law, issued in 1988. The population size of humpbacked dolphins in the PRE was estimated to be 2555 and 2517 during the wet and dry seasons, respectively [14], representing the largest known humpbacked dolphin population in China [15,16] and the world [14,17]. To better protect the dolphin population, the Pearl River Estuary Chinese White Dolphin National Nature Reserve (PRECWDNNR) (Fig. 1) was established in the PRE in 1999.
Unfortunately, the HZMB, which uses thousands of piles driven into the bottom of the estuary, is located across the PRECWDNNR. To minimize any adverse effects on protected species, the following strategies were adopted: (1) An underwater tunnel was designated to replace a bridge structure in the core area of the reserve (Fig. 1). (2) A marine mammal safety zone (an exclusion zone of 200 m radius [18]) was established in the vicinity of the bridge construction sites. Qualified marine mammal observers scan for the presence of marine mammals within the exclusion zone. If marine mammals are observed in the safety zone, operations halt until the animals have left the zone. (3) An acoustic deterrent device (Future Oceans 70 kHz Dolphin Pinger; Future Oceans, Queensland, Australia) that emits a 145 dB signal for 300 m every 4 s is used to warn any marine mammals away from the safety zone both before and during construction. (4) A hydraulic vibration hammer is used for pile driving in addition to an impact hammer, which generates substantially louder impulse sounds.
However, due to a limited understanding of the sound produced by the construction activities, the safety zone was not established based on robust experimental or theoretical information. Efforts to protect animals are generally hampered when only limited data are available for establishing criteria for interim protection. Research on the characteristics of the underwater sound field produced during bridge construction is needed. In particular, pile driving, which produces loud underwater sounds, requires study to improve environmental impact analyses and aid in the identification and design of appropriate mitigation methods [19].
Compared with the conventionally used impact hammers, the vibratory hammer is a much more economical tool for construction companies [20]. In addition to its ability to extract piles, other advantages include (1) a lighter weight than conventional hammers, (2) faster operation at a lower noise level than conventional hammers and (3) lack of requirement of a temporary guide frame for driving free-standing piles [20]. Accordingly, it represents an alternative tool, or a complementary tool, to impact hammers from a conservation perspective.
In the waters of western Hong Kong, S. chinensis has been observed to travel at higher speeds during percussive pile driving. Moreover, the animals tend to partially and temporarily abandon the pile driving area [21]. Given that the peak pressure levels produced by a normal vibration hammer during the driving of normal, cast-in-steel-shell piles range from approximately 175 dB to 205 dB [19], the OCTA-KONG (American Piledriving Equipment Inc., Kent, WA, USA), which is the world's largest vibration hammer and is capable of driving and extracting piles, may impose a substantial threat to marine mammals. The use of this hammer further emphasizes the need for an assessment of underwater noise in and around the HZMB.
The present study had two main purposes. The first was to characterize the acoustic properties of the operating sounds of the OCTA-KONG, including pile driving and extraction. The second was to assess the potential impacts of this anthropogenic noise on Sousa with respect to three factors: Sousa sounds (whistles and clicks) recorded in the same district during a previous dolphin acoustic survey by the first author;Sousa audiograms [22,23]; the safety exposure level established by the U.S. National Marine Fisheries Service (NMFS) [24] and the marine mammal noise exposure criteria proposed by a panel of experts from a wide range of disciplines in acoustic research [7] and the National Oceanic and Atmospheric Administration (NOAA) [25].

Vibration piling
The OCTA-KONG is the world's largest hydraulic vibratory driver/extractor. It consists of a Multiple Linked Vibro System with 8 APE 600's connected in a tandem combination (Table 1, Fig. 2), with each APE 600 powered by a Model 1200 power unit (Fig. 2). The OCTA-KONG was used to drive 22 m diameter steel shell piles (SSP 22 ) during the construction of the main wall of the two artificial islands ( Fig. 1 Vibration piling components. The major components of the hammer are as follows: (1) the suppressor housing (biasweight), with a rubber elastomer isolated suppressor; (2) the  vibrator gearbox, incorporating the phased high-amplitude eccentric weights; and (3) the clamping attachment (Fig. 2C).
Principles of vibration piling. During pile driving and extraction, the pile and hammer (except for the suppressor housing) are rigidly connected by the clamping attachment, forming a hammer-pile complex oscillating exciter (Fig. 2). Vibration is caused by the vertical movement produced by the centrifugal force that arises when the pairs of eccentrics are counter-rotated [20].The continuous pulses of energy transferred from the hammer-pile complex to the soil can temporarily change the stress-strain behavior, such as soil displacement (e.g., at the penetrating pile tip), and they can create excess pore water pressures or even complete fluidization of the soil. As a consequence, the frictional (i.e., both the internal friction of the soil and the pile-soil friction) and tip resistances are strongly reduced during vibratory driving, enabling the pile to penetrate under the low vertical thrust produced from the combined action of centrifugal force and the self-weight of the hammer-pile complex [20]. The detailed mechanics are not discussed further here.
Piling procedure. The SSP 22 were 38 m-60 m long and weighed 450 000 kg-600 000 kg (Table 1). During the initial stage of pile installation (i.e., the pre-OCTA-KONG driving session), one of the SSP 22 was rigidly connected to the clamping attachment of the OCTA-KONG, moved to the predesignated location by crane ( Fig. 2) and sunk approximately 20 m by the self-weight of the hammer-pile complex. During the OCTA-KONG driving session, the hammer was used to further drive down the hammerpile complex to the desired depth. The average sink depth during OCTA-KONG piling was 5 m (range 4 m to 6 m), depending on the substrate. During pile extraction, the OCTA-KONG was powered at the outset to reduce the pile-soil friction and to extract the pile using the line pull of the crane. At a certain point, the operation of the OCTA-KONG was stopped, and only the crane was used to extract the pile.

Ethical statement
Permission to conduct the study was granted by the Ministry of Science and Technology of the People's Republic of China. The research permit was issued to the Institute of Hydrobiology of the Chinese Academy of Sciences (Permit number: 2011BAG07B05).

Acoustic data recording system
Two sets of recording systems were adapted for underwater sound recording. The first was a boat-based system (hereafter referred to as BS) consisting of a Reson piezoelectric hydrophone (model TC-4013-1; Reson Inc., Slangerup, Denmark), a 1 MHz bandwidth EC6081 voltage pre-amplifier with a band-pass filter (model VP2000; Reson Inc.), a high-speed, 16 bit, multifunction data acquisition (DAQ) card (model NI USB-6251 BNC; National Instruments (NI), Austin, TX, USA),a laptop computer and LabVIEW 2011 SP1 (NI) software. Underwater signals were detected with a Reson hydrophone (sensitivity: 2211 dB re 1 V/ mPa at 1 m distance; frequency response: 1 Hz to 170 kHz +1/2 7 dB) and conditioned by a VP2000 pre-amplifier. Further highpass filtering at 10 Hz was conducted to reduce system and flow noise, and low-pass filtering at 250 kHz was conducted to prevent aliasing before inflow into the NI USB-6251 BNC DAQ card. The acoustic data were then stored directly on the hard drive of a computer in binary format with a sampling rate of 512 kHz, using LabVIEW software. The second recording system was a Song Meter Marine Recorder (hereafter referred to as SM2M), which included an HTI piezoelectric hydrophone (model HTI-96-MIN; High Tech, Inc., Long Beach, MS, USA) with a sensitivity of 2 165 dB re 1 V/mPa at 1 m distance and a frequency response of 2 Hz-48 kHz +/22 dB. It also included a programmable autonomous signal processing unit, integrated with a band-pass filter and a pre-amplifier, which can log data at a resolution of 16 bits and up to a 96 kHz sample rate, with a storage capacity of 512 GB (46128GB SDXC cards). The signal processing unit was sealed inside a waterproof PVC housing and was submersible to a depth of 150 m. The Reson hydrophone and the SM2M system were calibrated prior to shipment from the factory. The remaining components of the BS system, including the amplifier, filter, DAQ card, LabVIEW software and laptops, were lab-calibrated prior to the field survey by inputting a calibration signal generated by an OKI underwater sound level meter (model SW1020; OKI Electric Industry Co., LTD., Tokyo, Japan). Signal transmission was also simultaneously monitored with an oscilloscope (model TDS1002C; Tektronix Inc., Beaverton, OR, USA).

Data collection
Acoustic recordings were made on 5 days between 21 October, 2013 and 4 January, 2014 at the construction site of the HZMB, China (21u169-21u169S; 113u339-113u559E) ( Fig. 1, Table 2). Surveys were conducted from a 7.5 m recreational power boat with a 102 970 W (140 horsepower) outboard engine. Both stationary and floating recording methods were used during sound recording. For stationary recording, either peripheral static buoys were used to suspend the submersible SM2M or the research vessel was moored with an anchor to form a static platform for the boat-based BS recording system. For floating recordings, the vessel's engine was turned off after approaching a pile, allowing the boat to drift. The recording system was then deployed from the side of the boat. If the boat drifted too far from the pile, recording was stopped, and the boat was repositioned. During sound recording, the vessel's engine remained off. The hydrophones were deployed to 2 m depth using an attached weight to limit movement due to water flow. Furthermore, pile driving was performed primarily during the slack water period, when tidal influence on the water depth and currents were both minimal.
The distance to the construction site was measured using Nikon laser rangefinders (model Ruihao 1200S; Nikon Imaging (China) Sales Co., Ltd., Shanghai, China) with a performance range of 10 m to 1100 m and an accuracy of 61 m. The locations for both stationary recording and floating recording were also logged using a GPS receiver (model GPSMAP 60CSx; Garmin Corporation, Sijhih, Taiwan). The water depth and quality, including temperature, salinity and pH, was measured with a Horiba Multiparameter Water Quality Monitoring System (model W-22XD; Horiba, Ltd., Kyoto, Japan). Ambient noise was recorded before OCTA-KONG piling operations.

Acoustic data analysis
Acoustic signals, including OCTA-KONG vibration sounds and ambient noise, were continuously sliced into a time window segment of 1 s. Segments with obvious interference were deleted. Analysis was conducted with SpectraLAB 4.32.17 software (Sound Technology Inc., Campbell, CA, USA) and MATLAB 7.11.0 (The Mathworks, Natick, MA, USA) routines and custom programs.
Sound pressure levels (SPLs) and sound exposure levels (SELs). The measured parameters included sound pressure levels (SPLs) and sound exposure levels (SELs). SPLs were derived directly from the pressure metrics, including the zero-to-peak sound pressure (i.e., the maximum of the unweighted absolute instantaneous sound pressure in the measurement bandwidth (p max )) and the root-mean-square sound pressure (i.e., the average of the square of the unweighted instantaneous sound pressure (p(t)) in the measurement bandwidth integrated over the analyzed signal duration (T)). The zero-to-peak SPL (SPL zp ) is ten times the logarithm to the base 10 of the ratio of the square of the zero-topeak sound pressure to the square of the reference sound pressure of 1 mPa (p ref1 ). Similarly, the root-mean-square SPL (SPL rms ) is ten times the logarithm to the base 10 of the ratio of the square of the root-mean-square sound pressure to the square of the reference sound pressure of 1 mPa. The single SEL (SEL ss ) is ten times the logarithm to the base 10 of the ratio of the integral of the squared sound exposure of a signal of 1 s time window to the reference sound exposure of 1 mPa 2 s (p ref2 ). Absolute pressure levels were derived by subtracting the sensitivity of the hydrophone and the gain due to the amplifier [10].
Spectrogram, power spectral density and 1/3 octave band frequency spectrum The frequency composition of the signals was determined using spectrograms, which express a signal's amplitude, frequency and time, portraying amplitude as a graph plotted in a dark color on a two-dimensional time-frequency plane [10]. Power spectral density (PSD) level routines (dB re 1 mPa 2 Hz 21 ), i.e., narrowband spectra in 1 Hz bands, which represent the averaged sound power in each 1 Hz band, were applied to investigate detailed tonal signatures [26]. The 1/3 octave band frequency spectrum, i.e., the sum of the squared pressure of all 1 Hz bands within a 1/3 octave, was investigated to assess impacts on mammalian hearing, as 1/3 of an octave approximates the effective filter bandwidth of cetaceans [5]. Both the spectrograms and narrowband spectra were obtained using the fast Fourier transform (FFT) method, combining a Hanning smoothing window function with an overlap of 85% for the averaging. For the BS (sample rate 512 kHz) and SM2M data (sample rate 96 kHz), the FFT size was 262 144 samples and 65 536 samples, respectively, resulting in a frequency grid resolution of 1.95 Hz and 1.46 Hz, respectively, and a temporal grid spacing of 76.80 ms and 102.40 ms, respectively. Narrowband spectra were further normalized to PSD by dividing by the frequency grid resolution.
Cetacean auditory weighted SEL (SELws). As the damage risk criteria for marine mammals exposed to noise should incorporate the exposure frequency [27], cetacean auditory weighting (CA-weighting) [25] functions were used to incorporate the animals' auditory sensitivity to certain frequencies by emphasizing those frequencies where sensitivity to noise is high and de-emphasizing frequencies where sensitivity is low. The CAweighting function (W CA (f)) was merged with a marine mammal weighting function (W M (f), Equation 1) [7] and an equal-loudness weighting function curve (W EQL (f), Equation 2). Function W EQL (f) was derived from bottlenose dolphin (Tursiops truncatus) frequency-specific temporary threshold shift data [27,28] and equal-loudness contours [29]). Equal-loudness contours represent the SPLs of a sound that are perceived as equal in loudness magnitude in a testee as a function of sound frequency. They are considered to reveal the frequency characteristics of the testee's auditory system [30]. The contours are derived from subjective loudness experiments that ask candidates to judge the relative loudness of two tones of different frequencies [29,31]. At each frequency, the amplitude of the W CA (f) is defined using the larger value from the two component curves (Equation 3). The cetacean auditory-weighted SEL (SEL ws ) is ten times the logarithm to the base 10 of the ratio of the integral of the squared sound exposure of an CA-weighted signal of 1 s time window to the reference sound exposure of 1 mPa 2 s. This SEL ws can be simplified (Equation 4), as the integral of the squared sound exposure of an CA-weighted signal is equal to the overall energy of the CAweighted PSD contour (PSD W (f)) multiplying its frequency Table 2. Descriptions of OCTA-KONG vibration sites, sound recording equipment and method. resolution.
where W(f) is the weighting function amplitude (in dB) at frequency f (in Hz), a and b are constants related to the lower and upper hearing limits (the ''roll off'' and ''cut off'' frequencies), respectively, and K is a constant used to normalize the equation at a particular frequency [32]. For Sousa, which belongs to the midfrequency cetacean functional hearing group, K 1 , a 1 and b 1 are 2 16.5, 150 and 160 000, respectively, and K 2, a 2 and b 2 are 1.4, 7829 and 95 520, respectively [25,32]. Source levels and source SELs. OCTA-KONG source levels (SLs), including the zero-to-peak SL (SL zp , dB re 1 mPa), root-mean-square SL (SL rms , dB re 1 mPa), and source SELs (SSELs), including unweighted SSEL (SSEL ss , dB re 1 mPa 2 s) and CA-weighted SSEL (SSEL ws , dB re 1 mPa 2 s), were obtained by combining measures of received level (RL) and transmission loss (TL) (Equation 5). TL was estimated from the distance from the source (r) as a result of the depth-dependent spreading loss plus frequency-dependent absorption (Equation 6) [33].

SL~RLzTL
ð5Þ where r is the range in meters; A is the spreading loss coefficient, which generally varies from 10 (cylindrical spreading) to 20 (spherical spreading); and a is the frequency-dependent absorption coefficient in dB/m. As the dominant frequency of vibration pile driving was below 10 kHz [19], the absorption term does not significantly contribute to transmission loss and can generally be ignored for those recordings with greatest measurement ranges of less than 1 km [34,35]. Therefore, Equation 6 can be simplified to Equation 7 for the estimates of SL zp , SL rms , SSEL ss and SSEL ws . Sound propagation in shallow water environments (,200 m deep) is complex [33]. Attenuation may vary with depth depending on the sediment type, pressure and sediment porosity [36], and the frequency dependence of the acoustic response is sensitive to the details of the geoacoustic structure of the seabed [37]. Previous geophysical studies indicated that the surficial sediments of the bridge construction site were almost flat (TQ Zeng, personal communication Audibility range. Sound audibility is determined by both external conditions, such as the characteristics of received sound and background noise conditions, and internal conditions, such as the hearing capability of the receiving system (also called the hearing audiogram). As the lowest frequency of the available Sousa audiogram was 5.6 kHz [22,23], we were unable to analyze sound audibility for low-frequency sound by referencing the audiogram. Thus, the audible sound range was conservatively estimated as the range from which the sound source is attenuated by absorption and spreading loss with distance, measured at the point where the received sound is equal to the ambient noise level. The sound audible range was estimated using the transmission loss, Equation 9, to incorporate the deviation between the SL of the OCTA-KONG and the ambient noise level (the spreading loss coefficient derived above was adopted here).
Possible impacts on Sousa. The potential effects of anthropogenic noise on marine mammals include, but are not limited to, behavioral responses, auditory masking, and physiological effects [5]. Potential behavioral responses include exposure avoidance, behavioral disturbance or no response [7]. Auditory masking refers to the disruption of the reception of auditory signals by noise in the adjacent frequency bands (the so-called critical band) [7], resulting in partial or complete reduction in the audibility of the signals [7,38,39]. Physiological effects include temporary or (in extreme cases) permanent threshold shifts (TTS, PTS), a type of increase in the threshold of the audibility portion of an individual's hearing range or at a specified frequency above a previously established reference level producing states of temporary and recoverable shifts (TTS) or permanent, irreversible ones (PTS) [7]. As no Sousa were encountered during the recording period, documenting the behavioral responses was beyond the scope of the present study. Previous recordings of Sousa acoustics, including dolphin clicks and whistles recorded from within the same district, and Sousa audiograms [22,23] were used to analyze potential auditory masking. The Sousa whistles and clicks were recorded by following a focal group of dolphins (an aggregation of dolphins that were engaged in the same behavior and separated by less than 100 m) [40]. Using the vocalizations to determine the animals' location was difficult because only one hydrophone system was used. However, we can confirm that the recorded dolphin sound was from the focal group because no other groups of dolphins were present within approximately 1000 m. During the sound recording, the location of the dolphins was determined within a 50 m radius of our boat based on the successive sites at which they were observed to surface and breathe. The OCTA-KONG sound level was compared with both the cetacean safety exposure level and the proposed acoustic threshold levels for the onset of TTS and PTS for the analysis of potential physiological effects. The cetacean safety exposure level established by NMFS is 180 dB (SPL rms ) [24], and the proposed PTS and TTS acoustic threshold levels for Sousa (which are mid-frequency cetaceans) exposed to vibration driving noise (a non-impulsive sound source) are: (1) SL zp of 230 dB and 224 dB re 1 mPa, respectively; (2) SSEL cum of 195 dB and 215 dB re1 mPa 2 s, respectively; and (3) SSEL wcum of 178 dB and 198 dB re 1 mPa 2 s, respectively [25], using whichever level is first exceeded.

Statistical analysis
Statistical analyses were conducted using SPSS 16.0 (SPSS Inc., Chicago, IL, USA). Descriptive statistics of all measured SPLs (SPL zp and SPL rms ) and SELs (SEL ss and SEL ws ) were obtained, including means, standard deviations (SD) and ranges (minimummaximum values). The mean SPLs and SELs were calculated in Pa and converted to dB. A Levene's test and a Kolmogorov-Smirnov test were used to analyze the homogeneity of the variances and data normality, respectively. Nonparametric methods [41] were adopted for parameters that were non-normally distributed (Kolmogorov-Smirnov test: p,0.05). A Mann-Whitney U-test [41] was applied to analyze whether the SPLs and SELs of the OCTA-KONG varied significantly between pile driving and pile extraction (by comparing data recorded at the same distance to the pile and using the same system) and to test for differences in recorded noise level between the two recording systems. A Kruskal-Wallis test [41] was adopted to examine the overall ambient noise differences across different recording days. A Duncan's multiple comparison test [41] was used for post hoc comparisons of differences in ambient noise level among different recording days. Differences in the ambient noise level among different times within the same day were tested using a Mann-Whitney U-test. Differences were considered significant at p, 0.05.

Results
OCTA-KONG pile driving (Fig. 3) was recorded on 5 days at the sites between SSP 22 #32 and #41, and pile extraction was monitored on SSP 22 #26 (Table 2). One floating recording for both piling and extraction was obtained ( Table 2). Water depths at the recording sites were shallow, ranging from 7 to 8 m (Table 2).

SEL ws
The recorded SEL ws ranged from 112.74 dB to 128.86 dB and 111.92 dB to 138.07 dB for pile driving and pile extraction, respectively (Table 3).

SL zp , SL rms , SSEL ss and SSEL ws
The best-fit sound propagation models for SPL zp , SPL rms , SEL ss and SEL ws for pile driving and pile extraction are shown in Figure 7A and 7B. The estimated mean SL zp during pile driving and extraction ranged from 179.79 dB to 189.01 dB and 185.70 dB to187.49 dB, respectively. The estimated mean SL rms and SSEL ss during pile driving and extraction ranged from 168.90 dB to 179.96 dB and 173.00 dB to175.26 dB, respectively. The estimated mean SSEL ws during pile driving and extraction ranged from 142.95 dB to 157.20 dB and 157.00 dB to158.90 dB, respectively (Table 5).

Audibility range
The frequency-dependent sound absorption constant a was estimated at 0.0006 [35] for the specific pH of 8, a salinity of 33% and a water temperature of 20uC (measured at the piling sites during the sound recording period) at a frequency of 10 kHz; the majority of OCTA-KONG vibration noise power is found below this frequency (Figs. 4,5,6). The transmission loss equations with correlation to the distance r for SL zp , SL rms , SSEL ss and SSEL ws were 15.1 log 10 (r)+0.0006r, 15.0 log 10 (r)+0.0006r, 15.0 log 10 (r)+ 0.0006r and 15.4 log 10 (r)+0.0006r, respectively, for pile driving and 19.1 log 10 (r)+0.0006r, 19.4 log 10 (r)+0.0006r, 19.4 log 10 (r)+ 0.0006r and 19.7 log 10 (r)+0.0006r, respectively, for pile extraction. The estimated audible range of SL zp during pile driving and pile extraction ranged from 448 m to 1546 m and from 196 m to 236 m, respectively. The estimated audible range of SL rms and SSEL ss during pile driving and pile extraction ranged from 818 m to 3489 m and from 192 m to 229 m, respectively. The estimated audible range of SSEL ws during pile driving and pile extraction ranged from 483 m to 2954 m and from 557 m to 765 m, respectively (Table 5).

Impact on Sousa
Auditory masking. The Sousa audiogram was revised from the two available audiograms [22,23], with the lowest threshold at each frequency defining the merged audiogram curve. Both the OCTA-KONG vibration sound and the ambient noise level recorded in this study were above the threshold of the Sousa audiogram (Fig. 6). The 1/3 octave band sound pressure level of the Sousa click sound at a distance of less than 50 m with a dominant frequency range of 20 kHz to 200 kHz would not be masked by the OCTA-KONG vibration sound recorded at a distance of 200 m. However, the 1/3 octave band sound pressure level of the Sousa whistle recorded at a distance of less than 50 m with a dominant frequency range from 3 kHz to 6 kHz would be masked by the vibration sound recorded at a distance of 200 m (Fig. 6).
Cetacean safety exposure level. The maximum SL rms of SSP 22 driving (#32 and #36) and extraction (#26) exceeded the established cetacean safety exposure SPL rms level of 180 dB (Table 5). However, the maximum SL rms of SSP 22 pile driving of #38, #39 and #41 were lower than 180 dB (Table 5).
Physiological impact. All the calculated SL zp values of OCTA-KONG pile driving and pile extraction (with maximums of 193.23 dB and 193.15 dB, respectively) ( Table 5) were well below 224 dB, the proposed SL zp threshold for the onset of TTS for mid-frequency cetaceans exposed to non-impulsive sound. However, the calculated SSEL cum values for pile driving of SSP 22 #32, #39 and #36 and pile extraction of SSP 22 #26 were 201.33 dB, 195.05 dB, 199.75 dB and 207.59 dB re 1 mPa 2 s, respectively (Table 6), exceeding the proposed 195 dB threshold for the onset of TTS in mid-frequency cetaceans exposed to nonimpulsive sound. In addition, the calculated SSEL wcum for the pile driving of SSP 22 #39 and pile extraction of SSP 22 #26 was 179.05 dB and 191.41 dB re 1 mPa 2 s, respectively, greater than the proposed threshold of TTS onset at 178 dB. All the calculated SSELs were lower than the threshold of the onset PTS for midfrequency cetaceans exposed to non-impulsive sound.

Discussion
Inshore marine mammals are highly susceptible to habitat loss, fragmentation, and degradation [5]. Marine mammals have a well-developed sense of hearing, and the importance of sound reception to these mammals makes them susceptible to the effects of anthropogenic noise [5].
The impacts of anthropogenic noise on marine life have been widely assessed [43]. The St. Lawrence River beluga (Delphinapterus leticas) may change its vocalization SPLs in direct response to  changes in the noise field (Lombard effect) [44] or shift its frequency bands when exposed to vessel noise [45]. Killer whales (Orcinus orca) may adjust their vocal behavior, showing longer call durations [46], or exhibit a Lombard effect [47] to compensate for masking boat noise. Bowhead whales (Balaena mysticetus) may change migration routes and exhibit avoidance reactions when exposed to air gun noise [48] or travel at increased speeds in the presence of anthropogenic noise [5]. Harbor porpoises (Phocoena phocoena) tend to reduce their acoustic activity when exposed to the construction noise of pile driving [49] or reduce their buzzing activity when exposed to impulse noise from seismic surveys [50]. Bottlenose dolphins will significantly increase their whistle rate at the onset of an approach by a vessel [51], and Indo-Pacific bottlenose dolphins (Tursiops aduncus) tend to produce whistles with less frequency modulation at lower frequencies In habitats with greater ambient noise [52]. Acoustic impact models that estimate the effects of anthropogenic noise on the hearing and communication of fish and marine mammals by comparing noise spectra, audiograms and the vocalizations of the animal of interest have been widely applied, e.g., in research on the effects of ambient and boat noise on Chromis chromis, Sciaena umbra and Gobius cruentatus living in a marine protected area in Italy [53] and on the Lusitanian toadfish (Halobatrachus didactylus) in Portugal [54], on the impact of sounds resulting from construction and pipe-driving at an oil production island in Alaska on ringed seals (Phoca hispida) [34], on the potential effects of pile-driving at an offshore wind farm in the Moray Firth, NE Scotland on marine mammals [6], on the possible sensitivity of bottlenose dolphins to pile-driving noise [55], on the potential effects of underwater noise produced by whalewatching boats on killer whales in southern British Columbia and northwestern Washington State [56] and on the effects of the highspeed hydrofoil ferry in West Hong Kong waters on the Chinese white dolphin [57].
Transmission loss is correlated with bathymetry, substrate type, and sound speed profile along the direction of transmission [36], and the fit obtained for site-specific transmission loss may not apply to transmission in other directions from the source if these conditions are different in those directions [33]. Because the bathymetry and substrate type in the studied construction site are    consistent, the site-specific empirical fit method that we used to determine the transmission loss can be applied to transmission in other locations.

Spectrogram, PSD and 1/3 octave band frequency spectrum
The spectrograms and PSD levels allowed us to evaluate the detailed frequency composition of the signal (Figs. 4, 5); however, they did not consider the critical band theory of the mammalian auditory system. Therefore, they offer little insight into either how these mammals perceive noise or the extent of the masking effect of the noise PSD levels [9]. The 1/3 octave band sound pressure level information provided us with a starting point for evaluating the frequency components of the construction sounds that are audible to the dolphins [34]. Although there is little noise energy above the ambient noise levels between 20 kHz and 120 kHz, the Chinese white dolphin shows the greatest sensitivity to sound in this range, as is normal for toothed whales [5], and the majority of the noise increments above the ambient noise levels of 5.6 kHz to 20 kHz were greater than 15 dB (Fig. 6). Both the OCTA-KONG vibration noise and the ambient noise level were above the threshold of the Sousa audiogram at frequency bands between 5.6 kHz and 128 kHz (Fig. 6), indicating that sound detection in these frequency bands was limited by the ambient noise rather than by the Sousa audiogram.

Impacts on Sousa
Sound masking. The dominant noise level of the OCTA-KONG operation was below 20 kHz, suggesting that Sousa clicks were not adversely affected (Fig. 6). This interpretation is further supported by the finding that the peak frequency of Sousa clicks ranges from 43.5 kHz to 142.1 kHz [23]. By contrast, the Sousa whistle, with a fundamental frequency ranging from 520 Hz to 33 kHz [15], was most susceptible to auditory masking and could be completely masked at a distance of 200 m (Fig. 6). As whistles play a significant role in dolphin communication, such auditory masking may disrupt activities such as feeding and sexual behavior [5]. The adopted safety zone of approximately 200 m radius, as suggested by NOAA [18], should be enlarged to a more conservative region of 500 m radius, as recommended by the Joint Nature Conservation Committee [58]. Beyond this distance, the audibility of certain OCTA-KONG vibrations to Sousa is negligible (Table 5).
Physiological impact. Although the SL zp of the SSP 22 vibration was lower than the proposed physiological damage level, 60% (3 out of 5 piles) of the SSEL cum values during SSP 22 driving, the SSEL wcum values during SSP 22 #39 driving and both the SSEL cum and SSEL wcum values during SSP 22 extraction exceeded the acoustic threshold levels for the onset of TTS (Table 6). In general, the SSEL cum and/or SSEL wcum values could exceed the PTS or TTS threshold in a multitude of ways, depending on the exposure levels and durations [25]. The average SSEL ss values for all six SSP 22 sites were lower than the cetacean safety exposure level (180 dB) ( Table 6); therefore, the surpassed SSEL cum and SSEL wcum levels were due to the prolonged duration of the operation (as a function of 10 log(t)). The average durations of OCTA-KONG vibration during pile driving and pile extraction are 3 min and 30 min, respectively, with a range of 2 min to 6 min and 20 min to 40 min, respectively (YP Wang, personal communication).

Mitigation method
As the SSEL cum and SSEL wcum values were exceeded due to the prolonged sound exposure periods, the PTS could potentially be avoided by alternating the OCTA-KONG vibration with periods of inactivity, e.g., operations on nonconsecutive days to reduce the sound exposure. In addition, as several of the maximum SL rms values for OCTA-KONG vibration exceeded the cetacean safety exposure level, an air bubble curtain could be introduced. Such curtains can substantially reduce underwater noise at frequencies between 400 Hz and 6400 Hz [59]. During the present study, the power unit rotated primarily in the 1300 r/min-1500 r/min range; however, a maximum of 1700 r/min was used during the construction of the two artificial islands (YF Yang, personal communication),which may have introduced more intense operation noise. Moreover, in addition to the use of pings, ''soft start'' and ''power down'' techniques should be adopted [18]. Specifically, at the beginning of each pile installation or extraction, vibratory hammers should be activated at low power for 15 s, followed by a 1-min waiting period (i.e., at a duty cycle of 20%, repeated at least twice) before full power is achieved (i.e., a "soft start"). Additionally, if dolphins are observed within the exclusion zone during the in-situ vibration, operations should either cease or substantially reduce the vibration power (i.e., "power down"). Piledriving operations should occur during periods when threatened or endangered species are less abundant, as suggested by NOAA [19].

Limitations
The present study had two limitations: First, dolphin behavioral responses during pile driving and extraction were not addressed. In view of the limited current knowledge of the noise doseresponse relationship, we are unable to assess whether the noise generated by the OCTA-KONG may cause behavioral disruption. Second, although the adopted audiogram was derived from two Chinese white dolphins of different ages [22,23], there is individual variation in cetacean audiograms [60,61]. Therefore, the two audiograms used here should not be considered representative of the hearing sensitivity of this species. In addition, the Sousa audiogram data were sparse and did not extend below a lower frequency limit of 5.6 kHz, further limiting noise exposure assessment at lower frequencies. Audiograms covering a wider frequency range for Sousa are needed to quantitatively analyze the impact of the noise.

Conclusions and Future Research
The fundamental frequency of the OCTA-KONG vibration ranged from 15 Hz to 16 Hz, with noise increments below 20 kHz and a dominant frequency and energy below 10 kHz. The vibration zone detectable by Sousa extends beyond 3.5 km. Sousa clicks do not appear to be adversely affected, whereas Sousa whistles are susceptible to auditory masking; therefore, a safety zone of 500 m radius is proposed. Although the SL zp value of the OCTA-KONG was lower than the physiological damage level, the maximum SL rms value sometimes exceeded the cetacean safety exposure level, and the majority of SSEL cum and SSEL wcum values exceeded the acoustic threshold levels for the onset of TTS. Moreover, the TTS was due to the prolonged production of the vibration sound. These findings can help improve environmental impact analyses. Future research that evaluates the real-time noise conditions accompanying underwater construction and the associated behavioral responses of nearby dolphins is recommended to address, in a more direct and robust manner, the possible impacts of human-generated noise on these animals. An increased understanding of the dose effects of noise exposure will provide us with valuable information on how to mitigate possible impacts during the underwater project; this information is important for Sousa conservation. In addition, prey are a critical resource for cetaceans [62][63][64], but little is known about the effects of construction noise on fish [1,65,66]. Dolphins can identify and locate their prey through passive listening during the search phase of the foraging process [67,68]. Therefore, further research is needed to identify the potential adverse impacts on fish, including the masking of prey sounds by anthropogenic noise, particularly of those species that are important prey for marine mammals.