Introduction

Since its introduction in 1951 [1] hand-held breast ultrasound (HHUS) has become an important instrument in complementary breast imaging and is accepted in assessing suspicious lesions of the breast [25]. Concerning breast cancer screening of asymptomatic women, especially in women with dense breast tissue sensitivity of breast imaging may be increased by a HHUS in addition to mammography [611].

The challenge is to obtain reproducible and standardized data with an ultrasound examination. Until today hand-held breast ultrasound (HHUS) is highly dependent on the examiner´s experience. There have been multiple attempts to solve these problems with automated breast ultrasound systems during the last decades [1222].

The new generation of automated breast ultrasound systems combines automation and three-dimensional (3D) scanning [2333]. The SomoV™ (U-Systems, Inc., Sunnyvale, CA, USA) system allows an automated recording of the whole breast 3D volume by a technician. The acquired 3D volume data can be evaluated time- and location-independent by different physicians.

The aim of this study was to evaluate and compare clinical performance of the automated SomoV™ system by analyzing 1,966 cases in comparison to the standard breast imaging methods and histology if available.

Materials and methods

Study design and procedures

Nine hundred and eighty three patients were included in this monocentric, explorative, prospective, institutional review board approved cohort study. The indication for examination had a wide range including routine check-up, follow-up, preoperative staging of breast cancer, evaluation of palpable lumps and work-up of abnormalities found in HHUS or mammography. The examination with SomoV™ was integrated in the routine of the breast clinic. Mammography, HHUS and SomoV™ were interpreted and classified according to the current American college of radiologists breast imaging reporting and data system (ACR BI-RADS®) [34]. The decision for further histological work-up (biopsy) was based on the conventional methods for breast imaging (mammography and/or ultrasound BI-RADS® category 4 or 5), not on the results of SomoV™. Thereby the standard methods (HHUS, mammography and—if indicated—histology) served as gold standard.

SomoV™ examination

Automated ultrasound of the breast was performed by a technician. Both breasts (thoracic wall after mastectomy respectively) were scanned with SomoV™. Nine hundred and eighty three patients and therefore 1,966 cases (=breasts) were included.

The SomoV™ system consists of a scanning unit and the diagnostic workstation. The scanning unit contains a 10 MHz high-frequency linear transducer. This transducer is able to capture a volume of 17 × 14.5 × 5 cm3 in a single scan. In the course of a single scan which takes about 60 s the SomoV™ generates 340 two-dimensional slices. The number of required scans to image the whole breast was determined by the patient´s breast size and ranged from one to five scans per breast (Fig. 1a, b).

Fig. 1
figure 1

SomoV™ examination three (a) and five (b) scans per breast

The acquired volume data were automatically sent from the SomoV™ scanning unit to the diagnostic workstation to process the 3D volume dataset in various multi-planar reconstructions and orientations. For the purpose of this study transverse, coronal and sagittal views were available. Technical details of the SomoV™ method and handling have already been described previously (see Fig. 2, SomoV™ diagnostic workstation) [28, 31, 35].

Fig. 2
figure 2

SomoV™ diagnostic workstation

All scans were interpreted by a physician specialized on breast diagnostics. SomoV™ interpretation took place independent of the patient´s work-up; the reviewer was blinded to the findings on the corresponding mammograms, HHUS and to all clinical information including the medical history of the patient.

Conventional breast imaging

Mammography and HHUS were evaluated in one session by the same breast diagnostics specialist knowing the patients´ medical history and clinical findings. Bilateral HHUS of the whole breast was performed by a physician specialized in breast imaging by using the Acuson Antares ultrasound system (Siemens Medical Solutions, Mountain View, CA, USA) equipped with a linear-array transducer with a bandwidth of 7.5–13.5 MHz.

Digital mammography was conducted by medical technical assistants using the Mammomat Novation DR (Siemens Medical Solutions, Mountain View, CA, USA). Usually the examination consisted of the cranio caudal (CC) and the medio latero oblique (MLO) projections. Whenever necessary based on the decision of a physician, additional positions as the medio lateral (ML) projection, spotfilms or magnification views were performed.

All mammography images were soft copy double-read by experienced physicians in breast diagnostics.

Data evaluation and statistical analysis

As an explorative study, all statistical analyses are descriptive. Reported p-values have no confirmatory character. Statistical analyses were performed with SPSS Statistics software Version 21.0.

At first, the study cohort is described by calculating absolute and relative frequencies for categorical variables and means and standard deviations for metric outcomes.

The main objective of this study was to evaluate the diagnostic potential of SomoV™ to truly differentiate cancer and normal/benign breast tissue without the knowledge of other clinical information. Therefore, we compared the BI-RADS® scores given for the SomoV™ examinations with those for HHUS, mammography and the gold standard. In order to do this, the BI-RADS® score results were dichotomized as follow:

For SomoV™, the BI-RADS® 1 and 2 were summarized as benign, the BI-RADS® scores 0, 4 and 5 were combined and rated as unclear/suspicious. BI-RADS® score 3 was not used to characterize findings with SomoV™ in this study simulating a screening situation. BI-RADS® 0 was given in all cases with unclear findings as for example possible scar tissue or artifacts.

For HHUS and mammography the BI-RADS® 1, 2 and 3 were summarized as benign, the BI-RADS® scores 4 and 5 were combined and rated as unclear/suspicious/malignant.

Kappa statistics were applied to measure agreement between the different diagnostic methods [36]. Moreover the corresponding 95 % confidence intervals are provided.

We also calculated sensitivity, specificity and positive and negative predictive values.

Results

Study population

In total, 983 patients were examined with SomoV™ which leads to 1,966 cases (=breasts/thoracic walls) to be evaluated. The mean age was 55.7 years (range 19–92 years). A number of 348 patients (35 %) came for further evaluation of lesions detected in an outpatient clinic, 283 patients (29 %) came for a routine checkup, 274 patients (28 %) attended their follow-up visit after breast cancer and 78 (8 %) came for re-evaluation of lesions that were under observation (further details see Table 1). Table 2 shows the given BI-RADS® scores for SomoV™, HHUS and mammography.

Table 1 Description of the study cohort
Table 2 BIRADS® assessment

Agreement of SomoV™ interpretation with HHUS and mammography

Table 3 shows the absolute numbers of agreement and disagreement for SomoV™ and HHUS yielding a kappa coefficient of 0.31 (95 % CI [0.26;0.36], p < 0.0001). In 1,638 cases (83 %) both methods came to the same interpretation. Mammography and SomoV™ agreed slightly worse (kappa coefficient 0.25 (95 % CI [0.2;0.3], p < 0.0001) (see Table 4).

Table 3 SomoV™ vs. HHUS
Table 4 SomoV™ vs. Mammography

Performance of SomoV™ compared with the gold standard

As some lesions were explicitly not seen in HHUS e.g. microcalcification (even not in a second-look US after MRI) the lesions were consequently biopsied under mammographical guidance (53 lesions). These cases are not expected to be seen in SomoV™ and therefore have been excluded from the further analysis.

The absolute numbers of agreement which result in a kappa coefficient of 0.3 and a total agreement rate of 84 % are shown in Table 5. In this group 119 breast cancers were detected. Sensitivity, specificity, positive and negative predictive values (PPV and NVP) are given as 74, 85, 24 and 98 %, respectively.

Table 5 SomoV™ vs. Goldstandard

SomoV™ detected breast cancer correctly in 88 cases (74 % sensitivity). Thirty one breast cancer cases were not detected with SomoV™. After reevaluating these cases another 12 were seen when rereading the original SomoV™ scans for evaluation (84 % sensitivity, 85 % specificity, 27 % PPV and 99 % NVP). A further eight cases were seen primarily with MRI or mammography and the US-guided biopsy was done after a second-look HHUS. Within the remaining 11 cases not seen by SomoV™, the cancer was located behind the nipple or very peripheral in five cases so that acquisition technique came to its limits. In two cases an ulcerated carcinoma made the examination difficult and in one case a recurrent focus near the chest wall after mastectomy and reconstruction was difficult to interpret without the clinical information. In a clinical situation when informed about the patients´ history and combining that knowledge with the clinical findings and breast imaging only 11 cancers would not have been found. Respectively 108 of 119 breast cancers would have been detected resulting in a sensitivity of 91 %.

Discussion

SomoV™ was at a disadvantage in comparison to HHUS and mammography due to the blinding since the examiners of HHUS and mammography had knowledge about the clinical situation (scars, lumps, history of the patient, etc.). In addition, the results of mammography were available during HHUS examination and interpretation. In contrast the examiners evaluating SomoV™ data were given no further information about the patients` medical history or results from any other examination. We were aware that this would lead to a diagnostic imbalance between the different methods and a disadvantage for SomoV™, but we aimed to analyze the diagnostic test performance in a situation similar to screening. By including diagnostic cases we intended to increase the number of potential findings in the SomoV™ data.

Nevertheless the study revealed kappa coefficients that indicate fair agreement (k = 0.25–0.31) with HHUS and mammography. Otherwise total agreement rates for dichotomized (benign vs. malignant) were even above 80 %.

The specificity rates are acceptable, which means that a negative result with SomoV™ is very likely to be truly negative. The sensitivity ranged from 74 to 91 % depending on the fact if second-look US and clinical information were included or not.

Challenges of the scanning technique

Some problems concerning the quality of the acquired data arise from the scanning unit´s construction. As soon as a breast contour is modified (for example after surgery) the contact between transducer and tissue can be inadequate and a lack of data and artifacts can be the consequence.

Strength of the study

The strength of this study is the large series of cases. Contrary to other studies, where nearly every examined breast showed a finding, the examiners in our study had a more heterogeneous group and were blinded to all clinical information. This made evaluation more difficult. Wenkel et al., for example, examined only suspicious breasts with SomoV™ so that there was a finding in nearly every data set [31]. Kotsianos-Hermle et al. included only suspicious findings in their study [37]. In the study of Prosch et al. the examiners did not know the results from HHUS and mammography but they were informed about the patients´ clinical history so that they knew about scars, etc. [28].

Outline: implementation of SomoV™ as a screening tool

It is very important for the application of a screening method that nearly every patient with a lesion is detected. From the 144 carcinomas confirmed by biopsy, 92 were detected with SomoV™. From the remaining 52 cases 21 cancers could not be seen with HHUS, neither with SomoV™. Excluding all cases where the failure was definitely not due to the SomoV™ technique (see results), only 19 cancers would not have been detected with SomoV™. Moreover the number of undetected lesions would decrease with the examiners gaining experience and a setting where the examiners are given additional information concerning the patients´ history. Therefore Somo V™ could be a valuable screening tool for either young women in addition to the clinical examination or women between 50 and 70 in combination with mammography. The problem of false positive results is that they cause unnecessary anxiety for patients on the one hand and further diagnostic procedures, as a second-look HHUS and in some cases core cut biopsies on the other hand. This means an emotional drain for the concerned patients. The false positive rate in our study would probably decrease with growing experience and when medical history and the clinical information can be taken into account. The BI-RADS® score 0 was given exceptionally frequently for SomoV™ data. It is likely that being given more information about the patients´ history and clinical findings and having more experience with SomoV™ examination, the examiners could decide more often whether a lesion is suspicious or not and the number of BI-RADS® score 0 would decrease, which would lead to decreasing false positive rates.

Conclusion

The results of this explorative study let us suggest, that automated 3D ultrasound might be a helpful new tool in breast imaging, especially in screening.