Introduction

Over the last 30 years the human impact on the seafloor of the North Sea has strongly increased. This includes the exploration of oil and gas fields, sand mining for construction, the installation of underwater cables and pipelines, the construction of foundations for offshore wind farms, as well as fishing by bottom trawls and dredging activities. In order to mitigate adverse effects on the seafloor, the European Union released the Marine Strategy Framework Directive (MSFD) in 2008, including 11 descriptors of “good environmental status”. One of these is “seafloor integrity”, defined as “a level that ensures that the structure and functions of the ecosystems are safeguarded and benthic ecosystems, in particular, are not adversely affected” (European Union 2008). The European Commission selected six indicators of seafloor integrity (cf. Rice et al. 2012), including the type, abundance, biomass and areal extent of relevant biogenic substrates. To date, regular monitoring is based largely on grab sampling, associated with time-consuming biological analyses. Another major drawback is the point-type nature of such information, the full spatial coverage of habitats being mostly extrapolated.

Over the last decade, developments in hydroacoustic methods have enabled extensive seafloor mapping in a relatively short time period—for sidescan sonar, up to 67 km2/day (Kenny et al. 2003). The backscatter signal reflects several abiotic properties such as surface roughness and grain size (e.g. Collins and Galloway 1998; Bornhold et al. 1999; Preston et al. 2004; Ferrini and Flood 2006; Markert et al. 2013), as well as benthic faunal elements such as mussel beds and shell debris (e.g. Quester Tangent Corporation 2003; Wienberg and Bartholomä 2005; van Overmeeren et al. 2009), coral reefs (e.g. Gleason et al. 2006; Gleason 2009), and macroalgae and seaweed (e.g. Preston 2006; Hass and Bartsch 2008; Mielck et al. 2014). In the Belgian sector of the southern North Sea, for example, Degraer et al. (2008) detected reefs of the polychaete Lanice conchilega by means of high-resolution sidescan sonar. This polychaete forms tubes composed of sand grains and shell fragments, which cause an increased reflectivity appearing as a patchy and grainy acoustic facies.

For the German Bight in the southern North Sea, the Marine Strategy Framework Directive-related project “WIMO” (“Scientific Monitoring Concepts for the German Bight”) aims to develop new concepts for the monitoring of marine habitats (for overviews, see Winter et al. 2014;Winter et al., Introduction article for this special issue). One topic focuses on the interactions of benthic macrofauna and hydroacoustic signals from single-beam echosounders, multi-beam echosounders and sidescan sonar. Based on an older sediment distribution map of Figge (1981) and survey results from earlier projects, the present study explores this aspect in the vicinity of the island of Helgoland, representing characteristic sediment types of the German Bight. Identifying surrogate hydroacoustic signatures would facilitate meaningful spatial extrapolation of point-type biological information, thereby reducing the time and costs of seafloor sampling from aboard ship. Within this context, the aims of this study are (1) to map the distribution of surficial sediments using different hydroacoustic devices in order to find the most appropriate system for this purpose, (2) to use automatic seafloor-classification approaches for seafloor segmentation into distinct classes, (3) to provide ground-truthing for the results of the classification, and (4) to compare different sediment types and backscatter signals with the spatial distribution of macrofauna communities.

Study area

According to the sediment distribution map of Figge (1981), the seafloor in the vicinity of Helgoland is the only location in the German Bight with outcropping bedrock of Palaeozoic to Mesozoic age (Spaeth 1990), interspersed with Quaternary muddy sand to sandy mud. It is typified by the well-known benthic community dominated by the brittle star Amphiura filiformis (e.g. Salzwedel et al. 1985; Rachor and Nehmer 2003; Kröncke et al. 2004, 2011).

The eastern border of the study area is located about 2 km west of the island of Helgoland (Fig. 1), extending approx. 17 km in a north–south direction and approx. 6 km in a west–east direction (approx. 98 km2). Water depth varies between 17 and 54 m (Fig. 2). The easternmost part of the study area is located within the nature preservation area “Helgoland Felssockel”, where human impact is restricted to lobster-pot fishing. The study area was surveyed twice from aboard the RV Heincke in February and November 2014, using a RoxAnn seafloor classification system (Table 1; cf. Hass et al. 2016, this volume). A subarea in the north-eastern sector of the study area (approx. 3 km long and 2 km wide, depth range 19–35 m; Fig. 3a) was selected for more detailed surveying because it includes many different habitats, some of them difficult to evaluate with hydroacoustic gear because of steep slopes and large boulders.

Fig. 1
figure 1

Location of the study area west of Helgoland, with the subarea (hatched box) selected for detailed surveying

Fig. 2
figure 2

Study area. Left panel Bathymetry (RoxAnn) and sample locations (with corresponding mean grain size). Right panel Interpreted and interpolated results of RoxAnn data analysis: colour categories see Fig. 5; A–C benthic communities (modified after Hass et al. 2016, this volume). Rectangle in both panels Subarea selected for detailed surveying

Table 1 Data acquisition between 2011 and 2014
Fig. 3
figure 3

Subarea. Bathymetry (2012) from multi-beam echosounder (a) and backscatter (2011) from sidescan sonar (b) with positions of 2011 grab samples and macrobenthos communities. cf Seafloor classification: c single-beam echosounder, RoxAnn GD-X (HE 364, 2011); d multi-beam echosounder (2012), QTC swathview; e sidescan sonar (2011), QTC swathview; f single-beam echosounder (2012), QTC 5.5/QTC Impact. Lower panels Corresponding areal percentages of seafloor classes and percentages of macrobenthic communities

Materials and methods

Acoustic data

The subarea was surveyed 11 times between 2011 and 2014 by means of seven acoustic devices from aboard the RV Senckenberg, RV Heincke and RV Haitabu (Table 1). Simultaneous data collection was by means of a Furuno FCV 295™ or FCV 1000™ single-beam echosounder (SBS), a Reson SeaBat 8125™ multi-beam echosounder (MBES) and a Benthos 1624™ sidescan sonar (SSS) from aboard the RV Senckenberg, a RoxAnn™ system from aboard the RV Heincke, and an Echoplus system from aboard the RV Haitabu.

Onboard the RV Senckenberg, the FCV 295 single-beam echosounder was operated with a pole-mounted Furuno 200–8B transducer. The FCV 1000 single-beam echosounder was connected to a hull-mounted Furuno 200–8B transducer with a beam angle of 7.4° at 6 dB, an emission power of 1 kW, and a pulse length of 0.5 ms. A recording depth range of 0 to 40 m (FCV 295) or 0 to 50 m (FCV 1000) was selected. Data acquisition was carried out by QTCView 5.5™ from Quester Tangent Corporation (Victoria, BC, Canada).

The 455 kHz multi-beam echosounder Reson SeaBat 8125 was also pole-mounted on the portside of the RV Senckenberg. The system generated 240 beams by electronic beamforming, with a beam size of 1° along track and 0.5° across track. The opening angle of 120° enables a swath coverage of up to 3.5 times the water depth. A selected line spacing of 125 m achieved full areal coverage. The data were acquired by means of QINSY version 7.1 software, the post-processing comprising a digital elevation model (DEM) with a grid size of 0.5 m. Further interpretation is based on the ArcGIS toolbox Bottom Terrain Modeller (Wright et al. 2005). For the subarea, this entailed calculations of bathymetric position indexes (BPIs), bottom slopes and terrain ruggedness from the bathymetric grid (Fig. 3a).

According to Verfaillie et al. (2007), the BPI is a “measure which allows calculating where a certain location with a defined elevation is relative to the overall landscape”. The BPI algorithm compares each cell elevation to the mean elevation of the surrounding cells (Verfaillie et al. 2007). It can be calculated on a broad scale (B BPI) or on a fine scale (F BPI). Negative BPI values correspond to depressions, whereas positive BPI values correspond to crests. BPIs of zero are found for constant slopes and flat areas.

The Benthos SIS 1624 sidescan sonar had a high-frequency chirp signal of 370–390 kHz and a low-frequency chirp signal of 110–130 kHz. The low-frequency beam size was 0.5° (horizontal) and 55° (vertical), whereas the high-frequency beam size was 0.5° (horizontal) and 35° (vertical). The low-frequency data were used for automatic seafloor classification. A range of 100 m (200 m swath coverage) was selected for the 2011 and 2012 surveys, and of 150 m (300 m swath) for the 2013 and 2014 surveys. Sidescan sonar data were collected along all multi-beam echosounder lines (line spacing of 125 m) at a tow speed of 5 knots and a constant tow depth of ca. 7 m below the sea surface. Tow-fish positioning was based on layback measurements. The horizontal range resolution of chirp systems is defined as (pulse length×speed of sound)/2 with pulse length=1/bandwidth. For the Benthos 1624 system (bandwidth 20 kHz, pulse length 0.00005 s) and a speed of sound of 1,500 m/s, a range resolution of 3.075 cm is calculated. The along-track resolution is defined by the vessel speed and ping rate of the sidescan sonar (BSH 2016). At a sidescan range of 100 m, 7–8 pings are emitted each second. This leads to an along-track resolution of ca. 0.35 m at a vessel speed of 5 knots (ca. 2.5 m/s). For the 150 m range setting, the maximum ping rate of five beams per second results in an along-track resolution of 0.5 m at a vessel speed of 5 knots (2.5 m/s). The backscatter data are displayed as a grey scale comprising 256 values, where 0 is assigned to black, corresponding to strong backscattering, and 255 to white, corresponding to low backscattering.

The RoxAnn (SONAVISION) system has a Furuno LS-6100 fish-finder equipped with a 200 kHz FCV 1100 transducer (beam angle: 10°) mounted on a frame in the moon pool of the RV Heincke. The depth range was set to 80 m. All other settings were according to the manufacturer’s recommendations. A calibration cycle was run prior to the measurements. The data were recorded using the RoxAnn acquisition software RoxMap 32 (SONAVISION, Aberdeen, UK) at a line spacing of 220 m.

The EchoPlus system is permanently installed onboard the RV Haitabu, connected to a Fahrentholz Lithugraph XL echosounder. The working frequencies were 30 and 200 kHz. The data were recorded using JEDI software that is customer programmed for the Landesamt für Landwirtschaft, Umwelt und ländliche Räume, Flintbek, Germany. In contrast to the other single-beam echosounder classification systems (RoxAnn and QTC5.5), the classification of EchoPlus data was in the supervised mode with a line spacing of 250 m.

In order to compare the results of seafloor classification systems quantitatively, two approaches are commonly used. As outlined in Schimel et al. (2010a, b), the conventional approach is to produce a case study map for each classification, followed by estimates and comparison of their accuracy based on ground-truthing data (see, for example, Brown et al. 2005; Lucieer 2008; Brown and Collier 2008). The other approach is a comparison of one map with another without referring to ground-truthing data. This approach is very valuable for the determination of similarities between the different classifications, but no information regarding map accuracy is gained. This kind of comparison of similarities is normally calculated using error matrices (Schimel et al. 2010a, b).

The present ground-truthing datasets are based on sidescan sonar data from the subarea (ca. 6 km2), and the classification patterns for the different methods look very similar (cf. above). Therefore, similarity was not calculated by means of error matrices but rather by the Bray-Curtis similarity index of degree of areal coverage (Fig. 3c–f) using the PAST software package (Hammer et al. 2001). The software computes a number of similarities between all pairs of rows entered. The results are given as a symmetric similarity matrix.

Grab sampling

In 2011 surface samples were collected from aboard the RV Senckenberg by means of a 0.1 m2 Van Veen grab at 39 sites along four transects in the subarea selected for more detailed exploration. At each site, three replicates were taken for macrofauna and sedimentological analyses (Fig. 3b). Transect locations were chosen using a preliminary sidescan sonar mosaic in order to make sure that all different backscatter areas were sampled.

The 2013 survey concentrated on sampling sites characterised by fine-grained sediments (e.g. muddy sand), with an additional transect at the southern border of the subarea. Here, three additional samples were retrieved from aboard the RV Heincke (11/2013) using a spade-box corer (50×50×60 cm) at sites where high abundances of A. filiformis were determined in 2011.

During the 2014 survey of the RV Senckenberg, 40 grab samples for sedimentological analyses were retrieved by a Shipek grab. During the 2014 survey with the RV Heincke, samples from 44 sites were taken from the total study area using a HELCOM 0.1 m2 Van Veen grab. Of these, 41 samples were retained for sediment analyses, three locations being characterised by stones.

Macrofauna and sediments

At all locations where samples for macrofauna analyses and sedimentological analyses were retrieved, three grabs were taken at each location. Two of the grabs were used for macrofauna analyses and one grab for sedimentological analyses. The average distance between replicates is estimated to range between 15 and 30 m (Capperucci and Bartholomä 2012).

For sediment analyses, a macroscopic description and photographic documentation of the sediment surface were conducted for each grab. Then, subsamples of surface sediments were taken for grain-size analyses. The upper 2 cm was chosen because this depth interval is close to the penetration of the high-frequency acoustic signal into sandy sediments (Huff 2008; 1.5 cm for 500 kHz). In the laboratory, the samples were desalted in semi-permeable hoses over 24 h. Each desalted sample was washed through a 0.063 mm mesh screen to separate the mud fraction (<0.063 mm). The coarse fraction was then dried and dry-sieved through a 2 mm mesh. Shell debris and gravel were weighed in order to determine the gravel content (>2 mm) as well as the content of shell debris >2 mm. The coarse fraction (>2 mm) was not further analysed. The sand size fraction (0.063 to 2 mm) was weighed, treated with hydrochloric acid, and weighed again to determine the content of sand-sized shell debris. For the total amount of shell debris of the samples (see ESM Table 1, parts a and b, in the electronic supplementary material available online for this article), the shell debris of the gravel size fraction and the sand size fraction were summed up. The sand fraction was split into 1/4 Φ units (Φ = –log2 diameter of particle in mm) from –1 to 4 Φ, using calculations from settling velocities measured by a MacroGranometer™ settling tube (Flemming and Thum 1978; Tucker 1988). The mud fraction was analysed using a Micromeritics SediGraph III 5120™ (Stein 1985).

Samples HE416-38 to 82 were photographed and macroscopically analysed. At every sampling location the underwater video set was deployed to produce high-definition videos of the seafloor at the sampling location. In the laboratory the samples were treated with acetic acid to remove carbonates and with hydrogen peroxide to remove organic compounds. After the chemical treatment the fraction >2,000 μm was separated and the fraction <2,000 μm was measured in 100 size fractions using the CILAS particle sizer. Then the data were processed and re-sampled in 10th phi steps.

For macrofauna analyses, each 0.1 m2 Van Veen grab was sieved onboard ship at 1 mm mesh size and the retained material was fixed with 4% buffered formaldehyde. Samples of coarse-grained sediments were decanted 10 times while sieving. In the laboratory, organisms were stained with Rose Bengal and identified to species level.

Data analyses

RoxAnn- and EchoPlus-based seafloor classification

The RoxAnn and EchoPlus theory of operation is to release an initial sound pulse and to take the energy received in the electronically gated last third of the first echo return (E1) as a measure of seafloor roughness. The rougher the seafloor, the more energy is received back and the longer becomes the tail of the echo. All of the energy received with the second echo return (E2, after reflection at the sea surface and again at the seafloor) is a measure of seafloor hardness. The amount of reflection is controlled by the difference in acoustic impedance between seawater and the seafloor (Chivers et al. 1990; Schlagintweit 1993; Dean et al. 2013). Adverse effects due to changing water depth, such as variations in the backscatter amplitude and the tail length of the return signals, are electronically compensated by a pulse-integration constant. This constant adjusts the total integral value of the E1 and E2 gate areas. RoxAnn uses TVG (time-varied gain) to compensate for spreading loss with increasing water depth (Chandu, SonaVision, pers. comm., 2014).

The E values are given in DC voltage, and measurements were taken every second. Each data line includes the geographic position, date and time, water depth, as well as E1 and E2 values. Whenever either one of the variables did not record or showed implausible data (outliers), the whole data line was removed from the dataset during data processing.

The resulting data were colour coded using 10,000 RGB bins in the E1/E2 space. Each colour band (R, G and B) was then separately interpolated using the “natural neighbour” method. After interpolation, the three colour bands were recombined, resulting in an RGB triplet for every data point on the interpolated grid. Since the colour values depend on the two variables E1 and E2, the legend for the map is also two-dimensional. All data processing and graphics were produced using self-programmed MATLAB-based scripts and Quantum GIS 2.4.0-Chugiak. The JEDI software serving to record the Echoplus E1 and E2 data also allows a supervised classification, based on ground-truthing data entered into the system. However, in the context of this study, only the raw E1 and E2 data were used.

QTC-based acoustic seafloor classification

Based on promising findings of previous work in the southern North Sea (Wienberg and Bartholomä 2005; Bartholomä 2006; Bartholomä et al. 2011), Quester Tangent™ software was used for the acoustic seafloor classification. Full wave forms of single-beam echosounder data were collected and analysed with QTC 5.5™/QTC Impact™. Sidescan sonar and multi-beam echosounder backscatter data were classified using QTC Swathview, with processing steps including data cleaning, and compensation for depth, range and insonification angles. After compensation, the time-series data from the single-beam echosounder were aligned by their bottom picks and summed in stacks of five. The analyses of the echo shape using QTC proprietary algorithms generated 166 so-called full feature vectors (FFVs). With sidescan sonar and multi-beam echosounder images, borders of rectangular patches were distributed over the unmasked regions of backscatter images. In order to shorten the processing time of the backscatter data, a reduced number of only 29 FFVs were extracted, comprising basic features (1–4), grey level co-occurrence matrix (GLCM) data (5–20), circular Fourier transformation data (21–28) and fractal dimension data (29): 1–4 mean, standard deviation, skewness, kurtosis; 5–8 GLCM correlation; 9–12 GLCM contrast; 13–16 GLCM entropy; 17–20 GLCM homogeneity; 21–28 circular Fourier transform; 29 fractal dimension (QTC, pers. comm.).

Further processing of the amplitude data for single-beam echosounder records as well as for image data for sidescan sonar and multi-beam echosounder data was exactly the same for all systems. For all hydroacoustic datasets, PCA (principal component analysis) was used to reduce the FFVs (166 FFVs for single-beam data, 29 FFVs for multi-beam and sidescan sonar data) to three components, i.e. Q1, Q2 and Q3. These were plotted in three-dimensional space (Q space) and used for clustering (Preston et al. 2004).

Segmentation was done in the three-dimensional space of the Q values. Starting with all records in the same class, a variant of the k-means clustering method separated them into clusters, i.e. acoustic classes. The result of the k-means clustering is presented graphically as score against number of classes. The score is a statistical measure of cluster size and tightness (Quester Tangent Corporation 2011). The statistically optimal number of classes is characterised by the minimum score. Each record, originating from a stack of single-beam echoes or from the amplitudes in a rectangle, was assigned to the closest class centre in the Q space in a process that was iterative and stable. According to QTC, each cluster represents an acoustically distinct area. The QTC seafloor classes files were gridded (nearest neighbour method) using a grid size of 1 m in QTC Clams™ software. This software also allows a categorical interpolation to extend point and line coverage to full coverage. The full coverage data was saved in the Geotif format for further analyses.

The final Geotif file was imported into a geographic information system (ArcGIS 10.3™) to generate maps of acoustic classes. In this GIS environment, the macrofauna clusters and sediment data were combined with the acoustic map for validation of the acoustic classes.

Macrofauna

The software PRIMER v6 (Plymouth Marine Laboratory; Clarke and Gorley 2006) was used for statistical analyses of macrobenthos data. Diversity was analysed according to Shannon and Weaver (1949) and the Pielou (1969) evenness index. For multivariate analyses, the data were fourth root transformed and the Bray-Curtis similarity coefficient (Bray and Curtis 1957) was calculated. Cluster analyses were performed to detect similarities between communities. Significant differences between communities were evaluated by means of the ANOSIM randomisation test (Clarke and Green 1988). SIMPER was used to detect the characteristic species per community (Clarke and Warwick 2001).

Results

Acoustic backscattering

In the subarea, the sidescan sonar backscatter mosaic of 2011 (Fig. 3b) shows two main zones of different seafloor backscatter. A zone of generally higher backscatter (darker grey tones) extends from the north-eastern boundary of the subarea and generally dominates the eastern sector, with grey values varying between ca. 47 and 200 (average of 160). The northern, north-western and southern sectors display lower backscattering intensity with grey values of ca. 170–240 (mostly 200–210). Within the zone of weaker backscatter, a slight increase in grey tones is visible from the northeast to the southwest (Fig. 3b). The grey values decrease from ca. 230 near the north-eastern boundary (A in Fig. 3b) to ca. 210 in the centre (B in Fig. 3b) and 170–180 near the south-western boundary (C in Fig. 3b) of the subarea (Table 2). Although the absolute grey values for the years 2012 and 2014 are slightly different because of different environmental settings such as wave height and wave direction, the general increase of backscatter strength from the northeast to the southwest is confirmed for both surveys of 2012 and 2014 (Table 2).

Table 2 Grey values (0=black, 255=white) for sidescan sonar mosaics from 2011 to 2014 in the subarea

Derivates from bathymetry

In the subarea, the broad-scale BPI values (Fig. 4, upper left) vary between –1 and 6. However, the strongest variation of BPIs is found in the north-eastern sector with a stronger backscattering of sidescan sonar data (Fig. 3b). This sector is also characterised by stronger variations in bathymetry (Fig. 3a). The remaining sectors show a BPI of zero, indicating a flat seafloor or a constant slope. The fine-scale BPI values (Fig. 4, upper right) show variations between –1 and 4, with most values being zero. Again, the strongest variations are visible in the north-eastern sector of the subarea.

Fig. 4
figure 4

Subarea. Broad-scale bathymetric position index (BPI, upper left), fine-scale BPI (upper right), bottom slope (lower left) and terrain ruggedness (lower right)

The bottom slopes (Fig. 4, lower left) vary between 0° and 59°. Again, the higher values are restricted to the northeast of the subarea, whereas gentler slopes (0–1.1°) were recorded in the remaining sectors. The calculated terrain ruggedness (Fig. 4, lower right) mirrors the results of the other bathymetry derivates (BPI, bottom slopes), with higher values in the north-eastern sector of the subarea and lower values in the remaining sectors.

Acoustic seafloor classification

QTC Swathview™ sidescan sonar

In the subarea, segmentation of the acoustic backscatter data from sidescan sonar by QTC Swathview™ resulted in six seafloor classes (Fig. 3e). The areal distribution of seafloor class green (coverage: 33.7% of the seafloor) coincides with the areas of light grey tones on the sidescan sonar mosaic (Fig. 3b). Therefore, this class is related to low backscatter values (grey tones: 210–250) of the sidescan sonar signal. Seafloor class red (coverage: 34% of the seafloor, Fig. 3e) correlates well with high backscatter values (grey tones: 168–200) of the sidescan sonar data in the central and south-eastern sectors of the subarea, but occurs also in the southern and south-western sectors where low backscatter values are visible on the sidescan sonar mosaic (Fig. 3b).

In the central and south-eastern sectors of the subarea, seafloor class blue (coverage: 23.3% of the seafloor) is visible near the nadir of the sidescan sonar data (Fig. 3b). This suggests either that class blue is a classification artefact, due to the generally stronger backscatter signal close to the nadir, or that variations in seafloor cover (e.g. different types/amounts of epifauna) may have generated this class. However, in the southern and south-western sectors, the blue class (Fig. 3e) correlates well with a slight increase in backscattering intensity seen in the sidescan sonar mosaics (Fig. 3b). Seafloor class cyan (coverage: 3.3% of the seafloor) is predominantly visible in the south-eastern sector with high backscatter values. The remaining two classes yellow (coverage: 0.5% of the seafloor) and pink (coverage: 5.2% of the seafloor) occur in small patches mostly in and around the areas of high backscatter in the sidescan sonar signal (Fig. 3b).

QTC Swathview™ multi-beam echosounder

In the subarea, backscatter data of the multi-beam echosounder were segmented into four acoustic seafloor classes using the QTC Swathview™ software. Most of the subarea is characterised by only two classes (Fig. 3d): red and green. Class green covers 28.6% of the seafloor, coinciding with areas of low backscatter values in the sidescan sonar mosaic (Fig. 3b). Seafloor class red covers 70.5% of the seafloor, coinciding with areas of strong seafloor backscattering in the central and south-eastern sectors of the subarea but also in the southern and south-western sectors where a gentle increase in backscatter values is visible. The seafloor class cyan (coverage: 0.7% of the seafloor) is constrained within the area of high backscatter values, whereas the remaining class pink (coverage: 0.2%) is disseminated over the whole subarea and might be due to classification of noise.

RoxAnn

The E1/E2 data from the RoxAnn system were processed by means of two classification approaches. In the subarea, fuzzy seafloor classification (Fig. 3c) reveals four classes. Class red covers 34.6% of the seafloor and coincides with the area of strong backscattering in the sidescan sonar data (Fig. 3b). Class green (coverage 33.2%) corresponds to the area of weak backscattering, whereas class blue (coverage 21.2%) is associated with slightly increased backscattering in the south-western sector of the subarea (Fig. 3c). Class yellow covers 11.0% of the subarea, and mainly co-occurs with strong backscatter in the central sector (Fig. 3b, c).

For the second approach of processing (RoxProgs), the RoxAnn data are displayed in the E1/E2 space (“RoxAnn box”) in Fig. 5 for the whole study area. Six acoustic classes can be distinguished. Since there are usually no sharp borders in nature, class borders were inserted neither in the RoxAnn box (Fig. 5) nor in the map (Fig. 2). If sharp borders occur, then they are clearly visible in both figures. Class I occurs exclusively above 30 m water depth. According to the RoxAnn box, the green colours suggest the softest and smoothest values of the entire dataset. Class I is largely congruent with QTC class green (single-beam echosounder as well as multi-beam and sidescan sonar) and with the areas of weakest backscatter of the sidescan sonar in the north-eastern part of the subarea. Class I demarcates a large curved sector in the upper northern part of the study area (Fig. 2).

Fig. 5
figure 5

Study area. RoxAnn box of roughness (E1) vs. hardness (E2). I–V Classes mentioned in the text. Inset E1 vs. E2 plotted as black dots superimposed on the chosen colour scheme (modified after Hass et al. 2016, this volume)

Class II (yellow) suggests a slightly harder and rougher seafloor. It is congruent with QTC class yellow but extends further into the southern part of QTC class cyan. Class III (orange) largely resembles QTC class blue; however, parts of the subarea in the south and southeast covered by QTC class blue belong more to RoxAnn class II (i.e. revealing lower roughness and hardness values than class III). Class III demarcates large sectors in the southern and western parts of the study area (Fig. 2), associated with slightly increased backscatter in the sidescan sonar records. Class IV has hardness values similar to class III but associated with clearly higher roughness values. Class IV shows reddish colours and is congruent with QTC class cyan. It covers a large part of the eastern sector of the study area. Classes V and VI are congruent with QTC class red. They encompass the hardness values of classes I, II and partly III. However, the roughness values are significantly higher and classes V and VI are clearly separated from classes I to III. They cover a distinct sector in the central eastern part of the study area.

QTC View 5.5™/QTC Impact™ single-beam echosounder

In the subarea, the single-beam echosounder data can be subdivided into six statistically significant different acoustic seafloor classes (Fig. 3f). The sector characterised by low backscatter values of the sidescan sonar mosaic is covered by seafloor classes green (coverage: 8.8% of the seafloor) and yellow (coverage: 10.8%). Seafloor classes red (coverage: 36.2%) and cyan (coverage: 11.9%) are predominantly restricted to the central and south-eastern sectors of the subarea where high backscatter values are visible on the sidescan sonar mosaic (Fig. 3b). The southern and south-western sectors, where the sidescan sonar mosaic shows slightly increased backscatter values, have seafloor class blue (coverage: 32.2%). Seafloor class pink covers only 0.4% of the seafloor and is mainly situated within seafloor class red in the central part of the subarea (Fig. 3f).

Sediments

In the subarea, the locations of sediment samples in 2011 were determined on the sidescan sonar backscatter mosaic in order to ensure full coverage of backscatter signatures (Fig. 3b). In assessing the whole study area, sample locations of survey HE416 in 2014 were chosen on the basis of RoxAnn measurements (Fig. 2).

According to the classification of Folk (1954), the seafloor samples range from gravel to sandy mud (ESM Table 1 in electronic supplementary material). In the subarea, most samples are classified as muddy sand (26 of 38 samples). For the whole study area, five samples are classified as gravel (samples 09, 45, 71, 72, 73), three as sandy mud (samples 10, 69, 70), two as muddy gravel (samples 46 and 47), one as gravelly muddy sand (sample 08) and one as muddy sandy gravel (sample 95). Grab samples characterised by high amounts of gravel contained not only granules (diameter 2–4 mm) but also pebbles (4–64 mm) and cobbles (64–256 mm; cf. Wentworth scale; Wentworth 1922; Krumbein and Aberdeen 1937).

Coarser-grained sediments were retrieved from the central and south-eastern sectors of the subarea, and finer-grained sediments from the northern, southern and south-western sectors (Fig. 3b). In the latter, there are only small variations in grain size distributions from the northeast to the southwest (Fig. 6). Using the Folk (1954) classification, all samples (2011_02, 2011_42, 2011_64, 2011_86) are muddy sand. However, using the phi mean of the sand size fraction of these samples (Fig. 6) and the Wentworth (1922) scale, sample 2011_86 (phi mean 2.601) can be classified as muddy fine sand. According to this classification, the remaining samples are muddy very fine sand (phi mean 3.463–3.491). Samples from almost the same locations in the 2013 survey (from northeast to southwest: 2013_20, 2013_17, 2013_14, 2013_02) yielded the same results. The southernmost sample (2013_02) is classified as muddy fine sand, whereas the remaining samples are muddy very fine sand (Fig. 7).

Fig. 6
figure 6

Subarea. Grain size distributions along the transect of 2013 from the northwest (sample 2013_20) to the southwest (sample 2013_02)

Fig. 7
figure 7

Subarea. Grain size distributions along the transect of 2011 from the northwest (sample 2011_02) to the southwest (sample 2011_86)

Regarding the whole study area, there are similar spatial distribution patterns of hardness- and roughness-based classification (ESM Table 1, part c, in electronic supplementary material; Fig. 2). However, this dataset also highlights the patchy character of the area. The sediment becomes generally slightly coarser from north to south. The elevation highs in the NE of the study area reveal pebbles and boulders that inhibited the use of a grab sampler (HE416_56, 57, 60), as well as medium sands (HE416_38–40) and muddy sediments (HE416_47–55). The deeper southern third of the study area is characterised by sandy sediments throughout.

Macrofauna

In the samples from 39 stations taken in 2011 in the subarea, 189 macrofauna taxa were identified, comprising mostly molluscs (49%) followed by annelids (23.7%), echinoderms (18.6%) and crustaceans (7%). The four transects T1 to T4 (Fig. 3b) show a variation of the main macrofauna groups. The abundance of molluscs decreases from T1 to T4, whereas that of echinoderms increases (Fig. 8). Despite the high sedimentary diversity, the cluster analyses provided only four significantly different macrofauna communities (Fig. 9) based on the ANOSIM test (global R value of 0.92, significance level of 0.1%).

Fig. 8
figure 8

Subarea. Percentage of major fauna groups to total taxa along transects T1 to T4 (for locations of T1–T4, see Fig. 3b)

Fig. 9
figure 9

Subarea. Hierarchical clustering of macrofauna samples

The three stations of community A (1–3) are in the northern part of the subarea (Fig. 3b). According to the cluster analyses, this is a Nucula nitidosaAbra alba community associated with polychaetes such as Lagis koreni and Scalimbregma inflatum. This community has a mean taxa number of 40/0.1 m2, a mean Shannon index of 2.18, a high mean evenness (0.59), and the highest mean abundance (11,627 ind./m2). Community A is associated with the acoustic class green (sidescan sonar, multi-beam echosounder, single-beam echosounder QTC) or class I (single-beam echosounder RoxAnn; Fig. 3c–f, ESM Table 2 in the electronic supplementary material).

Stations belonging to community B are in the centre of the subarea (Fig. 3b), earmarked by muddy sand to sandy mud. Dominant species are N. nitidosa, A. filiformis and Hyala vitrea. This community comprises species of both communities A and C, although in lower abundances—mean abundance of 5,160 ind./m2, mean taxa number of 37/0.1 m2, and highest mean Shannon index (2.48) and evenness (0.69; Fig. 3c–f, ESM Table 2 in electronic supplementary material). Community B is associated with the acoustic class green (QTC sidescan sonar, QTC multi-beam echosounder), the class yellow (QTC single-beam) and the RoxAnn class I.

Community C is an A. filiformisKurtiella bidentata community dominated by the brittle star (1,711/m2) and commensal bivalve species. The highest mean taxa number of 42/0.1 m2, mean abundance (5,702/m2), Shannon index (2.41) and evenness (0.64) were found. This community is associated with the acoustic class blue (QTC sidescan sonar), red (QTC multi-beam echosounder), QTC single-beam class blue and RoxAnn class III (Fig. 3c–f, ESM Table 2 in electronic supplementary material).

The fourth community included stations from an area of higher backscattering caused by coarser sediments with boulders and pebbles (Fig. 3b). There, a quantitative sampling with the Van Veen grab was impossible. Qualitative samples revealed typical hard-substrate organisms such as the polychaetes Pomatoceros triqueter and Sabellaria spinulosa (Hartmann-Schröder 1996), a high abundance of anthozoans, but also soft-bottom species such as Lagis koreni, Gattyana cirrhosa and Protodorvillea kefersteini. A similar community was found by Kühne and Rachor (1996) at the “Steingrund” near Helgoland.

To confirm the results of 2011, the A. filiformis data of the 2013 samples were analysed. Again, high abundances of A. filiformis were detected in the southern part of the subarea (see ESM Table 3 in the online electronic supplementary material).

Brittle star abundance vs. side-scan sonar grey values

An increase in the abundance of the brittle star A. filiformis was found from the northern to southern parts of the subarea (Fig. 8). The highest abundances were determined at stations with a water depth of about 30 m. In Fig. 10 (cf. ESM Table 3 in electronic supplementary material), the abundance of A. filiformis is plotted against grey values (0=black, 255=white) from the 2011 sidescan sonar mosaic (Fig. 3b). The Spearman rank order (Spearman 1904) shows a negative correlation between grey scale values and abundance of brittle stars. The variation increases from 2011 to 2013 (Fig. 10).

Fig. 10
figure 10

Subarea. Correlations (Spearman rank order) between grey values (0=black, 255=white) from sidescan sonar mosaic and total abundance of the brittle star A. filiformis in 2013 (upper panel) and 2011 (lower panel)

The 2013 samples again show similar high abundances of A. filiformis in the southern part of the subarea (ESM Table 3 in electronic supplementary material). The correlation between grey values and the abundance of the brittle star using the Spearman rank order reveals a correlation coefficient of –0.547 and P value of 0.00584. This confirms the results for the 2011 correlation. In contrast, the sample transect added near the southern border of the subarea in 2013 (samples WHGUG21–WHGUG31) shows average mean abundances of A. filiformis (ESM Table 3 in electronic supplementary material).

Discussion

Acoustic seafloor classification comparison

The physical configuration of swath-based and vertical down-looking single-beam echosounders implies different types of return signals. Therefore, the classification of swath systems is based on statistical segmentation of images, whereas the classification of single-beam echosounder data is based on the statistical analysis of the wave shape (QTC 5.5) and multiple-echo envelope analysis (RoxAnn). For all systems, a very similar classification pattern was generated (Fig. 3c–f, Table 3). The seafloor classifications obtained by the RoxAnn system and from the sidescan sonar data (QTC Swathview) show the highest similarity of 0.89, whereas there is a similarity of 0.63 for the RoxAnn and multi-beam echosounder data (QTC Swathview). The similarity between the single-beam echosounder classifications from RoxAnn and QTC 5.5 is 0.75.

Table 3 Similarity index of seafloor classifications based on Bray-Curtis

The minimum similarity value was computed for the classifications of the multi-beam echosounder (QTC Swathview) and the QTC 5.5 single-beam data (0.46). The remaining similarities vary between 0.70 (QTC 5.5 and sidescan sonar) and 0.64 (sidescan sonar and multi-beam echosounder).

Generally lower similarities between classifications from single-beam echosounder data and swath systems have been noted earlier (Anderson et al. 2008; Schimel et al. 2010a, b; Bartholomä et al. 2011). Schimel et al. (2010b) pointed out that this can be at least partly explained by the differences of grazing angles between the systems. Single-beam echosounders operate at very high grazing angles only (85–90°), whereas sidescan sonar systems have low to medium grazing angles (1–45°) and most multi-beam echosounders operate from low to very high grazing angles (25–90°; Schimel et al. 2010b). Indeed, Lurton (2002) showed that the effects of surface roughness and volume backscattering vary strongly with grazing angle.

The generally lower similarity between the multi-beam echosounder classification and the classification from RoxAnn, QTC 5.5 and sidescan sonar was not expected. Preston et al. (2003), Bartholomä et al. (2011), Micallef et al. (2012), Markert et al. (2013) and Lurton and Lamarche (2015) showed that classifications obtained from multi-beam echosounder data are of superior quality. This is, among others, due to the fact that for every single beam of the multi-beam system, data on backscatter, depth and grazing angle are available for further processing (e.g. compensation). Classifications from multi-beam echosounders and sidescan sonar systems are normally very similar because they both are based on the backscatter image (Schimel et al. 2010a).

The low similarity between the multi-beam classification (Fig. 3d) and all other classifications (Fig. 3c, e, f) is mainly caused by the areal coverage of classes green (28.6%, Fig. 3d) and red (70.5%, Fig. 3d). At the moment, it is not clear why the multi-beam echosounder classification is so different from the other classifications. The surveys were carried out in summer (July) 2011–2014, a season commonly earmarked by increased suspended particulate matter. Whether this partly impeded the acoustic energy of the high-frequency multi-beam echosounder (455 kHz) from reaching the seafloor is an aspect requiring further research. Mazel (1985) noted that suspended sediment scatters the signal on its way to the seafloor.

Influence of macrobenthos

All classification systems showed differences in acoustic results between the southern and northern parts of the subarea chosen for more detailed surveying (Fig. 3), reflected also by the occurrence of community C (ESM Table 2 in electronic supplementary material). This can be explained neither by changes in sediment composition (e.g. Riegl et al. 2007; Freitas et al. 2011) nor by variations in bathymetry.

Both single-beam systems clearly reveal changing seafloor properties between the areas inhabited by communities B and C (Figs. 2 and 3c, f). Slight differences between the results of the two systems might be due to different years and different months (seasons) in which the measurements were carried out. RoxAnn data suggest that community C also dominates large parts of the total study area (Fig. 2, orange colours). Evidently, changes in backscatter signal strength can be explained by changes in seabed roughness induced by varying abundance of the ophiuroid A. filiformis in the southern part of the study area at about 30 m depth.

The brittle star is the dominant species of soft sediments in the North Sea—e.g. the Oyster Ground (Cadée 1984; Kröncke et al. 2004, 2011) and the German Bight (Salzwedel et al. 1985; Kröncke et al. 2004, 2011). Associated species of A. filiformis communities were Harpinia antennaria and the bivalves K. bidentata and Corbula gibba (e.g. Rachor and Nehmer 2003; Reiss et al. 2006). A. filiformis is found at depths of 5 m to more than 200 m (e.g. Buchanan 1964; Künitzer 1989; Künitzer et al. 1992; Sköld et al. 1994; Rosenberg et al. 1997; Josefson 1998; Southward and Campbell 2006).

The size of the disk of A. filiformis can reach a diameter of 10 mm, and the slender, flexible arms a length of up to 100 mm. The specimens (disks) have been found buried at sediment depths of 3–8 cm (e.g. Rosenberg et al. 1997, 4 cm depth; Josefson 1998, 3–6 cm depth), depending on the age and therefore on the size of the disk and the arm length of the specimen. The three box cores taken by the RV Heincke in November 2013 from the area with high abundances of A. filiformis showed a burial depth of 5 cm. The bioturbation caused by the brittle stars may change the physical properties of the sediments, as reported by Rowden et al. (1998). They found that the shear strength of the upper 5 cm of sediment was reduced by up to 45% due to bioturbation. This also may influence the acoustic properties of sediments, like firmness.

A. filiformis can switch between two different feeding types. For deposit feeding, the arm tips explore the sediment surface for particles. For suspension feeding, the arms are held vertically to 2–4 cm above the sediment surface to trap particulate suspended matter and phytoplankton (Buchanan 1964; Woodley 1975; Loo et al. 1996). The number of active feeding arms seems to vary. Ockelmann and Muus (1978) described 3–4 active arms per individual, whereas Loo et al. (1996) and Rosenberg et al. (1997) observed only 1–2. In the present study, the mean abundance of brittle stars/m2 in community C would result in a minimum of 1,711 arms/m2 (cf. one active arm per individual) and a maximum of 6,844 arms/m2 (cf. four active arms). Since most of the specimens were adults, the mean length of the arms above the sediment surface probably reached 5–6 cm.

The high number of ophiuroid arms inferred for the subarea of the present study is expected to influence the surface roughness of the seafloor, causing a change in backscattering (cf. Lurton 2002). The roughness data (E1) from RoxAnn (Fig. 4) show higher values in the southern part of the subarea. This indicates that the high number of ophiuroid arms increased the roughness of the seafloor. Such changes in surface micro-roughness can influence the backscattering signal and acoustic classification (Bartholomä et al. 2011). Capperucci and Bartholomä (2012) confirmed that, for high-frequency hydroacoustic systems (>100 kHz), such roughness changes may influence the acoustic signals more than actual changes of sediment composition. It is also important to note that the seafloor classes are based on the acoustic response of the footprint of the sonar system from both the seafloor itself and the arms of the ophiuroids. For a single-beam echosounder, the seafloor footprint size is based on water depth. At a water depth of 30 m, the QTC 5.5 system ensonifies an area of 11.8 m2. Thus, the seafloor classification pattern of the southern part of the subarea of the present study would be influenced by changes in seafloor roughness and hardness caused by thousands of feeding arms of the brittle star A. filiformis.

Conclusions

Acoustic seafloor mapping and automatic classification is in general a valuable tool for the study of sediment distribution and also the EU MSFD descriptor “seafloor integrity”. In order to obtain reliable results, it is suggested to run at least two different systems simultaneously during a survey. One of these should be a swath system like a dual-frequency sidescan sonar or multi-beam echosounder. The other should be a single-beam echosounder-based acoustic seafloor classification system working in the 200 kHz range. The different grazing angles (high for single-beam systems and medium to low for sidescan sonar, and medium to high for multi-beam echosounder systems) reduce the effects of volume backscattering and surface roughness. A thorough ground-truthing program is required. Based on the similarities of classifications, the best choice for habitats like the northwest Helgoland area would be a sidescan sonar and the RoxAnn seafloor classification system.

Sidescan sonar data showed a slight increase of backscatter strength from the northeast to the southwest in the subarea chosen for more detailed surveying in the present study. Automatic seafloor classification methods segmented the area of increased backscatter intensity in the southern sector of the subarea as separate class, although the sediment texture was not significantly different from that in the north-western sector (muddy fine sand to muddy very fine sand). The abundance of molluscs decreases from the north to the south of the subarea, whereas that of Echinodermata increases (especially A. filiformis). The correlation between the abundance of A. filiformis feeding arms and backscatter intensity in the southern sector revealed that most likely the feeding arms increased the sediment surface roughness and, thus, the backscatter intensity. It is suggested that the A. filiformis community C covers large parts of the subarea. The number of grab samples for macrofauna studies can be reduced significantly by the use of hydroacoustic data and automatic seafloor classification, saving time and money. The seafloor sediment distribution from acoustic backscatter and automatic seafloor classification shows a much higher resolution than from extrapolation of surface samples alone.