Introduction

The number of ultrahigh field (UHF) magnetic resonance (MR) system installations continues to increase, with currently approximately 70 research sites in operation worldwide. As a main advantage, this new technology provides a higher signal-to-noise ratio (SNR) and sensitivity [1], yet the artifact-to-noise ratio [2] can also be increased. This can degrade image quality and may differ at individual UHF sites, where system hardware differences could affect reproducibility. In UHF brain imaging, inhomogeneities due to wavelength effects are less pronounced than in body or cardiac imaging because of smaller object size relative to the radiofrequency (RF) wavelength [3]. Image quality and reproducibility, however, may still be compromised due to such RF effects. With increasing maturity of UHF MR, many studies have showed that brain imaging is one of the most beneficial applications at 7 T and clinical research is starting to exploit UHF advantages [4]. The next step is clinical approval of 7 T MRI for brain imaging and also musculoskeletal imaging, as currently announced by one manufacturer [5].

Inter-site reproducibility is not only a requirement for UHF application in clinical diagnosis but also for larger multicenter trials and quantitative data analysis. Such reproducibility has previously been established at 3 T [6] for multicenter studies of neurological diseases, i.e., Alzheimer’s disease [79], to reliably detect and quantify changes in brain volume, grey and white matter compartments, or cortical and subcortical structures (like hippocampus, cerebellum, etc).

The higher spatial resolution and sensitivity of 7 T may allow extension of previous 3 T group study results to single subject level in longitudinal studies and also open the possibility for 7 T multi-site studies. However, inter-site reproducibility has not yet been investigated for 7 T brain imaging. A recent study [10] showed the potential advantage of 7 T over 3 T for high-resolution Voxel-based morphometry (VBM) mapping, but noted that statistical methods are sensitive to image inhomogeneities. Other quantitative measures, such as cortical thickness, also benefit from the high spatial resolution achievable with 7 T, but reproducibility across sites needs to be demonstrated [11].

To establish the consistency and reproducibility of quantitative measures between 7 T systems, the “traveling heads” experiment was initiated in 2014 to compare human brain imaging at a number of 7 T installations [12]. A 7 T brain imaging protocol with state-of-the-art UHF sequences was applied and image quality compared between eight different 7 T MRI sites of the same vendor. To separate site and system differences from inter-subject variations, the same two “traveling heads” were imaged at all sites.

Materials and methods

Measurement setup

Two male subjects (33 and 37 years of age) were imaged at eight UHF sites, all operating a 7 T whole-body MRI system from the same vendor (Siemens Healthcare GmbH, Germany). Data were acquired at the following 7 T sites: 1: Berlin, Germany; 2: Bonn, Germany; 3: Essen, Germany; 4: Heidelberg, Germany; 5: Leipzig, Germany; 6: Maastricht, Netherlands; 7: Magdeburg, Germany; and 8: Vienna, Austria. The sites, all located in Germany and neighboring countries, cooperate within the framework of German Ultrahigh Field Imaging (GUFI).

These systems have differences in basic imaging components that might influence image quality (Table 1). Two sites (2, 6) have an actively shielded magnet installed, which has a smaller warm-bore size (83 vs. 90 cm) and is shorter (2.49 vs. 3.37 m). There are also two different gradient coil versions installed at the different sites: SC72D with 70 mT/m and AS95 with 38 mT/m. The high-performance version (SC72D) provides reduced gradient linearity and is installed at five sites (2, 5–8). At six of the eight sites, a commercial RF head coil (Nova Medical, Inc., Wilmington, MA, USA) with one transmit (TX) and 32 receive (RX) channels is available. A similar 24 RX channel coil from the same manufacturer is available at the other two sites. All systems had the same software version installed (VB17A, 20120320UHFF14).

Table 1 Comparison of 7 T sites

To facilitate data analysis, the measurement setup was kept as similar as possible between the sites, and the subjects were positioned as standardized as possible with the help of locally available cushions and pillows. In addition, “auto-align” localizer scans were applied to orient the slice position and angle fully automatically based on anatomical landmarks [13]. This method has been shown to work reliably also at 7 T and lead to higher slice reposition accuracy than manual positioning [14].

The imaging protocol consisted of a vendor-provided spin-echo-based B1 mapping sequence (TR 1200 ms, TE 14 ms, 5 × FA 45°–135°, #SL 11, TA 0:27, 8 × 8 × 5 mm3) for transmitter calibration, verified by a 3D DREAM [1517] B1 mapping sequence (TR 5000 ms, TE 0.8/1.5 ms, STE first timing, #SL 36, TA 0:05, 5 × 5 × 5 mm3). A second order B0 shimming routine of the vendor was applied with the system integrated routine in two iterations. The measurement volume was adjusted manually to include the whole brain. Subsequently, the following imaging sequences were performed: MP2RAGE in sagittal orientation (TR 6000 ms, TI 800/2700 ms, TE 3.0 ms, 3D, TA 9:38 min, 0.75 × 0.75 × 0.75 mm3), TSE in AC-PC orientation (TR 9000 ms, TE 16/130 ms, #SL 35, SL gap 1 mm, RF pulse length exc./ref.: 3/6 ms, TA 7:05 min, 0.3 × 0.3 × 2.0 mm3), modified time-of-flight angiography (TOF) [18] (TR 21 ms, TE 4.3 ms, FA 20°, 3D, #SL 112, PAT 4, AC-PC, TA 6:41, 0.2 × 0.2 × 0.4 mm3), and SWI (TR 29 ms, TE 15 ms, FA 15°, 3D, #SL 112, PAT 3, AC-PC, TA 9:05 min, 0.3 × 0.3 × 1.0 mm3). An EPI sequence with sinusoidal readout [19] (TR 2350 ms, TE 21 ms, ES 1.03 ms, #SL 46, PAT 3, AC-PC, TA 4:22 min, 1.3 × 1.3 × 1.5 mm3) was used to acquire T2*-weighted time series data during 4 min of scan time without task. All sequences except TOF and EPI had full brain coverage. For TOF, the imaging slices were centered at the height of the thalamus. The upper border of the EPI slab was set at the level of the most cranial borderline between frontal cortex and subdural space.

Different sequence parameters were chosen at the 24ch coil sites (Sites 1 and 4) and at Site 6 for TSE measurements. The repetition time was increased to 10,000 ms and the number of slices was reduced to 30 (26 at Site 1). At Site 6 an unknown power problem with the RF power amplifier during TSE forced the use of the same reduced duty cycle. For TOF the TR was also slightly increased from 21 to 23 ms at the 24ch coil sites. Due to sequence installation failures at Sites 4 and 6, a multi-shot reference scan was used for phase correction rather than the FLASH reference scan that was conducted otherwise. The FLASH reference scan is less sensitive to subject movements and improves temporal SNR (tSNR) [20]. During a second scan at Site 4, reinstallation of sequence sources allowed both options for the rescans.

The entire protocol was repeated for both subjects on different days at three sites [Sites 2, 3(2×), and 4] to assess intra-system variability. Data acquisition for the study was begun and ended at Site 3, with a time span of 12 months between first and last scan (Table 2).

Table 2 Chronology of the measurements for the traveling heads study at the different sites

Data analysis

All measurements were acquired within 1 h per subject, with all inline filters and subsequent data corrections deactivated as far as accessible through the user interface. Gradient distortion correction was applied to all image data except TSE. Registration and image processing was done with FSL [21, 22] (FMRIB Software Library v5.0, The University of Oxford). Brain extraction was performed based on the MP2RAGE de-noised uniform images [23] and brain segmentation was calculated based on the bias-corrected uniform images [24] (Fig. 1). The resulting VOI masks were applied to MP2RAGE images and T1 maps as well as dual-echo TSE. To measure total brain volume with high accuracy, a multi-stage brain extraction was employed based on the optiBET script [25]. This script was used to achieve a first approximate brain extraction. After bias correction with the FSL_anat script, a second run of optiBET with optimized settings was applied, and the extracted brain was registered to the MNI template. FAST [26] tissue type classification with initialization of a priori MNI tissue maps was used. Typical subcortical structures like hippocampus or thalamus were segmented with the run_first_all script [27]. An inferior brain mask was calculated as the sum of brain stem and cerebellum mask to analyze cerebrum and inferior brain separately. All segmented masks were also used for TSE contrast evaluation and were, therefore, transferred to TSE image orientation and resolution with FLIRT [28]. For qualitative analysis a second registration procedure was applied to the MP2RAGE and TSE images. For this purpose, a template for both subjects was created from all three MP2RAGE scans at Site 3. This template was used to linearly co-register images from the other sites with six degrees of freedom (DOF). TSE were also co-registered using transformation matrices gained from previous calculations. Images from TSE were analyzed for each RF coil in two separate subgroups, as sequence parameters had to be changed for the 24ch coil. The TSE data from Site 6 were excluded from group analysis as unknown power problems and adaption of sequence parameters to those of the 24ch sites resulted in different image contrast. The Michelson contrast definition as the quotient of difference and sum of two different tissue signal intensities was used for all contrast measures between tissues. Mean values, standard deviation between sites, and coefficient of variance (COV) were calculated.

Fig. 1
figure 1

Post-processing of MP2RAGE images. FSL was used for quantitative image analysis. Brain extraction and tissue type classification were done based on the de-noised uniform images. Subcortical structures were segmented in bias-removed images

Angiographic image data were co-registered to a single rescan at Site 3 and vessel-to-background contrast was measured for various equally distributed vessels and subcortical regions. For TOF analysis, images were brain-extracted and bias-corrected with FSL. Six volumes of interest were defined by thresholding all vessels in a 30 mm diameter spherical volume (MRIcron, v.6.6, Chris Rorden). Adjacent background tissues were selected manually. The seed points of VOIs were chosen at the most prominent vessels in volumes representing the frontal, left and right frontal, left and right dorsolateral, and central part of the brain (Fig. 2). TOF images of the other sites were co-registered to the latest rescan of Site 3 in two steps. After in-plane subsampling (2×), a 7-DOF registration was applied. For each vessel, the co-registered images were registered a second time using local weighting of the cost function with dilated vessel masks (12 DOF). Maximum intensity projections (MIP) were calculated (ImageJ, v. 1.50e, National Institutes of Health, USA) based on globally registered images for qualitative analysis. To assure that only overlapping brain regions are shown in MIPs, binary brain masks were created, transferred to reference image space, combined using multiplication (logical and) and applied. The vessel-to-background contrast was calculated and compared quantitatively. Statistical significance between sites was calculated with Sigmaplot (Sigmaplot 11, Systat Software Inc.). Friedman repeated measures analysis of variance on ranks, Mann–Whitney rank sum test, and Wilcoxon signed rank rest for rescans were used.

Fig. 2
figure 2

Selection of vessels and subcortical volumes for angiographic analysis. In (a) the colored overlay in MIP shows vessel selection for the TOF sequence. In (b) the position of vessels and subcortical regions is shown in SWI projections. For TOF, two sections of the pericallosal artery (red and blue), left and right deep branches of the middle cerebral artery (yellow and orange), and left and right cortical branches of the middle cerebral artery (turquoise and green) with surrounding vessels (30 mm) were chosen for analysis. For SWI, subcortical VOIs (blue) of left and right putamen, pallidus and red nucleus as well as the five veins with the largest diameter (marked green, e.g., internal cerebral veins or parietal superior cerebral vein) in different brain regions were chosen for contrast comparisons. For both subjects the same VOIs were selected

The SWI image data were also post-processed with FSL. The latest rescan at Site 3 was defined as the reference dataset, where five equally distributed vessels and surrounding tissue were manually segmented in each subject’s brain. The most prominent vessels with a minimum diameter of 15 mm were chosen, representing the frontal, left, right, and central part of the brain (Fig. 2). Additionally, six subcortical volumes and the adjacent background tissues were selected for analysis. Four of them (left/right putamen and globus pallidus) were derived from registration of the SWI images to the TSE (3 DOF, auto-align angle for scanning used) and MP2RAGE scans. Left and right red nuclei were manually segmented. From the vessel segmentation, a weighting mask was calculated for each vessel and used for optimization of the cost function of the individual FLIRT registration. Analogous to TOF analysis, a 12-DOF registration was applied for each vessel between the sites to minimize registration errors. The segmented vessel masks were transferred to each site’s original image space and vessel-to-background contrast was measured. A second global registration for each brain with 7 DOF and no weighting was also calculated for qualitative analysis. For both angiographic analyses, mean vessel contrast ratio and mean contrast difference of all segmented vessels to reference scan vessels were compared site-wise, and the range of contrast-ratio values was compared vessel-wise.

The EPI scans were co-registered with the help of gradient-echo (GRE) images from the B0 field mapping sequence. The GRE images of a rescan at Site 3 were used as reference, and all other sites were registered to it with FLIRT. Brain extraction was performed, and EPI images and B0 field maps were transferred to reference image space for qualitative analysis. To assure that overlapping brain regions were chosen for quantitative analysis, binary brain masking and a combination of masks was applied similar to TOF. Movement correction with MCFLIRT was applied to all datasets and tSNR was calculated. To analyze system-specific signal drift, tSNR was calculated with and without the application of a high-pass filter (cutoff/FWHM: 235 s). As scaling factors, employed during conversion to integer image data, were not optimally adjusted for the EPI data of Site 8, pixels with high signal level (in regions at the brain surface) were partially saturated at the maximum level. To accommodate this, a filter that excluded the 5 % of voxels with highest signal level was additionally applied to all image data. The brain was extracted with BET on the mean image of all time points, and tSNR was calculated in filtered masks.

B1 field maps were derived from the DREAM sequence. The brain was extracted from the GRE images of the sequence and applied to the flip angle map. Mean and standard deviation of flip angles were calculated in this volume. Similar analysis was done for the B0 maps, which were registered as described above. The B0 field maps have been calculated with FSL to obtain off resonance in Hertz. Standard deviation and quadratic mean have been analyzed.

Figures 1, 2, 3, 4, 5, 6, 7, 8 , 9 show image data of Subject 1. Tables 1, 2, 3, 4, 5, 6 contain detailed data for both subjects 1 and 2.

Fig. 3
figure 3

B0 maps of Subject 1. The same horizontal slice as for the EPI measurement (Fig. 9) is shown. Field homogeneity was slightly different between the measurements

Fig. 4
figure 4

Flip angle maps of Subject 1. Axial and sagittal views are shown for better depiction of differences in inferior brain regions. The 24ch RF coils achieve lower flip angles at the same applied power (Sites 1 and 4)

Fig. 5
figure 5

MP2RAGE uniform image. Horizontal slice through the brain of Subject 1 measured at different 7 T sites. 24ch coil sites (1 and 4) are indicated with a dark grey numeral. Data were co-registered and brain extracted. MP2RAGE images show high agreement in image contrast at different sites

Fig. 6
figure 6

PD-weighted TSE images from the same horizontal slice as for MP2RAGE of Fig. 5. Sites 1 and 4 (24ch coil) had slightly adapted sequence parameters (TR 10,000 vs. 9000 ms, 30 vs. 34 slices). High agreement was obtained for the sites with identical RF coils except for Site 6, where RF power problems led to a different appearance

Fig. 7
figure 7

Maximum intensity projections calculated for each site from TOF angiography of Subject 1. The TOF images were co-registered to the last rescan of Site 3 and masked with the overlapping portion of co-registered data from all sites

Fig. 8
figure 8

SWI horizontal slice of Subject 1. Data were registered to the last scan at Site 3. Nine equally distributed vessels and surrounding tissue were manually selected in the reference dataset. Each vessel mask was transferred back to original image space with individually optimized registration parameters

Fig. 9
figure 9

EPI slice of Subject 1. Local differences due to slightly different artifact levels can be seen. The corresponding slice from the B0 field map shown in Fig. 3 correlates well with the artifacts in the images

Table 3 EPI and fieldmap data for both subjects
Table 4 Excerpt of quantitative analysis of MP2RAGE sequence

Results

The B0 field maps are shown in Fig. 3. For both subjects the lowest absolute field deviation and standard deviation of field map were achieved at Site 7 (Table 3). Repeated measurements at Sites 2, 3 and 4 revealed that absolute field deviation and standard deviation changed by up to 3 Hz between rescans. The B1 maps show differences in the efficiency of the two RF coil types (Fig. 4; Table 3). The 24ch coils reached a mean flip angle of 26°, whereas the 32ch coils achieved 33° with the same transmit power. Differences of up to 4° were found between the sites with identical RF coil types, and rescans at the same site were within 1° difference. The 24ch coil at Site 4 had been replaced between the two scans, but the mean flip angle of the new RF coil did not differ significantly from the one replaced. The 24ch coil that was replaced had a higher relative standard deviation of flip angles (0.33) compared to all other RF coils (0.19–0.26). The flip angle maps of both subjects had similar global values and congruent findings. The calculation of transmit power led to a 14 % higher transmitter reference voltage for the 24ch coil systems compared to the 32ch sites (Table 3).

MP2RAGE images showed very high agreement in contrast and measured T1 values in all regions except the cerebellum and brain stem (Fig. 5; Table 4). Subjects 1 and 2 had a mean brain size of 1494 and 1423 cm3, respectively. Maximum variation of calculated brain size between the sites was 5 % for Subject 1 and 4 % for Subject 2. Rescans at three sites showed higher intra-site than inter-site reproducibility (max. 0.6 % difference). For smaller subcortical volumes, intra-site reproducibility was also higher than inter-site deviations (e.g., hippocampus intra-site vs. inter-site COV: 1.3 vs. 3.5 %). Signal levels at inferior brain regions like the cerebellum or brain stem dropped off very fast, and thus tissue segmentation or T1 measurements were more likely to fail in this region. This was observed for different scans of Subject 2. The volumes of gray matter (GM) and white matter (WM) scaled with the total brain volume, but inter-site differences could be reduced to 3 % for Subject 2, when the inferior brain regions were excluded from the analysis. The contrast ratio between GM and WM was very similar between the subjects (mean 0.46 vs. 0.44) and the variation between the sites was minimal (±0.02). Very high reproducibility was found for T1 maps generated from MP2RAGE images (Table 4). Inter-site variation was up to 3 % for all segmented volumes, except for inferior regions like the brainstem and the cerebellum, where T1 calculation failed at some sites due to low signal. With less than 4 %, inter-subject variation was similar. Differences in intra-site reproducibility between the first and the last scan at Site 3 were not found; these scans were the first and last measurements of the entire study and thus had the longest time span of ~12 months between repeated measurements.

The TSE images (Fig. 6) revealed contrast differences between the two different RF coils used. The 24ch coil demanded more transmit power (after flip angle calibration), and thus the SAR prognosis for the TSE exceeded the regulatory limits, because the scanner software used identical SAR limits for both RF coils; consequently, the sequence parameters had be changed (TR, #slices). The contrast ratio of GM to WM was much higher in T2-weighted images than in PD weighting; contrast deviations between the sites did not change for either echo. The maximum gray to white matter contrast differences between all sites were below 8 %. When excluding inferior regions (cerebellum and brainstem) from analysis and grouping the sites coil-wise, total contrast differences were less than 2 % for Subject 1 and 5 % for Subject 2 (Table 5). However, in central subcortical regions, larger contrast deviations up to 22 % were found between the sites with different RF coils, confirming qualitative image impressions as seen in Fig. 6. The coil-wise comparison of subcortical contrast ratios showed a much better agreement between the sites of less than 10 % difference. For inferior regions like the cerebellum and the brainstem, RF signal drop offs as in MP2RAGE caused high contrast differences between the sites (Table 5).

Table 5 TSE contrast measures

The TOF angiography (Fig. 7) analysis showed high agreement between all sites and subjects. Mean vessel contrast ratio was very similar between the subjects (0.43 vs 0.41), but higher deviations and lower mean vessel contrasts were found for sites with 24ch RF coils (Table 6). Statistically significant inter-site differences were found with Friedman repeated MANOVA on ranks for Sites 1, 4, and 8. A Mann–Whitney rank sum test (p < 0.001) revealed statistical differences between the vessel contrasts of different RF coils. Rescans at three sites showed high reproducibility, and no statistically significant intra-site contrast differences were found. Mean intra-site vessel contrast deviation was lower than mean inter-site vessel contrast deviation (2.3 vs. 3 %). When using the same type of RF coil, the maximum contrast differences between all individual vessels were 11 % for 32ch RF coils and 9 % for 24ch RF coils.

Table 6 Excerpt of angiography analysis

SWI image data (Fig. 8) of the vessels and the subcortical VOIs were analyzed separately in two subgroups. Variation of mean vessel contrasts and mean deviation of contrasts (Table 6) were comparable to TOF analysis, but did not show differences between the RF coils. Inter-subject variation was higher than inter-site variation. In contrast, only minor inter-site and inter-subject differences in mean contrast ratios were measured for subcortical volumes. For both groups, intra-site vessel contrast deviation was lower than mean inter-site vessel contrast deviation (vessels 3.6 vs. 4.1 %, VOIs 1.0 vs 1.6 %). The maximum difference in contrast ratios was 14 % for single vessels and 6 % for subcortical VOIs.

EPI images showed typical artifacts at tissue boundaries and air cavities. These artifacts differed in extent over the sites (Fig. 9). These distortion artifacts correspond well with the B0 maps of Fig. 3. The mean tSNR is 15–25 % lower for the sites with 24ch coils compared to those with 32ch coils (Table 3). Site 6 was excluded from the 32ch-coil means, as the tSNR there was similar to that achieved with the 24ch coils because the sequence parameters had to be adapted at this site. Masking the 5 % of voxels with highest signal also reduced the tSNR by about 7 % for Subject 2 and 2 % for Subject 1, but differences between sites were not affected. Subject 1 had a higher agreement of tSNR between the sites: all values were within 11 % difference for the 32ch coil sites. For Subject 2 the span of measurement values was 18 % due to higher tSNR achieved at Site 7. The high-pass filter increased the mean tSNR for the 32ch sites by 6 % but reduced inter-site variability only for Subject 1 by 4 %.

Discussion

The B0 field maps showed slight differences between the sites, but no correspondence to hardware differences like the magnet or the gradient coil type was found. Intra-site differences were on the same level as inter-site differences. Causes other than the actual hardware seem to have a stronger influence on the results of the shimming routines for whole-brain shimming. Possible sources for differences in the field homogeneity are the positioning of the subject in the coil relative to the magnet, but also differences in the chosen adjustment volume. Nevertheless, all B0 maps showed similar field patterns and corresponded well with distortion artifacts found in the EPI images. In contrast, the B1 maps revealed clear differences between RF coils. The transmitter reference amplitude of the 24ch coils was set about 14 % higher to reach a comparable flip angle in a central cubic (50 mm, isotropic) brain region. This was measured with the vendor provided transmitter calibration method. The DREAM validation revealed higher flip angle differences (26 %) between the two RF coils, while the mean flip angle was calculated for the whole segmented brain. The differences reflect the differently analyzed volumes and indicate that the 32ch coil is more effective in peripheral regions of the brain. Since there is no gold standard for B1 mapping at 7 T, there may be differences due to the used methods as well. At peripheral regions like the inferior brain, inter-site reproducibility of the flip angle maps were reduced. The cerebellum is at the edge of the transmit field of both RF coils, and large differences were observed in the B1 maps and in the actual imaging sequences. Small differences may also result from minor geometric differences in coil shape and resulting head position in the scanner. Differences between the systems were confirmed by rescans at three sites, where the reproducibility difference was small compared to the inter-site difference (1° vs. 4°). Inaccuracies in the power calibration provided by the system vendor or defective hardware may also influence the available RF power and, therefore, limit the maximum achievable flip angle. One system was found where the power calibration was incorrect, leading to reduced possible flip angles and higher SAR prediction than at the other sites. After recalibration of the system by the vendor, the measurement was repeated and B1 mapping results were similar to the other 32ch coil sites.

The MP2RAGE sequence is a fast 3-D gradient-echo sequence with two inversions and is often used as an anatomical T1-weighted scan of the brain to visualize cortical subregions and possibly to align other measurements. This sequence or its derivatives are used at different field strengths in most brain measurement protocols, and a high agreement between the 7 T sites is very important for future multicenter trials. At 7 T, optimized adiabatic inversion pulses (TR-FOCI [29]) are used to overcome limitations in B1 homogeneity. We found that MP2RAGE provided high image quality at all sites and was suitable for automatic analysis with FSL. Volumetric analysis based on MP2RAGE images revealed small inter-site differences of 5 %, but rescans at three sites showed almost 10-times higher intra-site reproducibility. The differences showed no tendency to correlate with the hardware of the systems, e.g., either the type of the gradient or RF coil, nor were differences in image quality or the image processing results found or tendencies in B0 or B1 field maps detected that would indicate systematic differences between the sites. The small discrepancy between inter-site and intra-site reproducibility may be caused by the current calibration of the gradient coils or by other site-specific effects. Nevertheless, the MP2RAGE images showed a very high agreement regarding contrast measures and provided very precise T1 relaxation maps for all reviewed structures of the brain except the inferior parts such as the cerebellum and brain stem, where signal levels dropped consistent with the B1 loss measured there. The 7 T systems we compared were all from the same vendor and shared a lot of common hardware and software, but there were also differences in the hardware components, e.g., the magnet designs or the gradient coils. This leads to lower inter-site than intra-site reproducibility. Of course, a comparison across different vendors of MR systems would have been advantageous to include more differences in gradient coil designs and sequence programming. However, it should be kept in mind that at all currently installed 7 T sites, independent of the MR system vendor (Philips, GE, Siemens), both the magnet and the head RF coil come from the same vendors (Agilent and Nova Medical). This leaves mainly the signal generation and reception as well as possible differences in sequence programming as remaining sources of variation between vendors, which may lead to even lower variability as compared to 3 T. A rough comparison to volumetric measurements at 1.5 and 3 T made by Huppertz et al. [30] can be made. However, they used SPM to calculate and compare the intracranial volume as well as GM, WM, and CSF maps for one subject who was measured three times on six different 1.5 and 3 T devices from two vendors. They reported mean intra-site coefficients of variation (COV) for brain, WM, GM, and CSF volumes of 0.5, 0.7, 0.7, and 4.4 % and respective inter-site COVs of 3.8, 3.7, 4.1, and 8.7 %. Our inter-site results show higher inter-site reproducibility, which may reflect that the systems used were all from the same vendor. The intra-site reproducibility, which can be extracted for Site 3 from two subjects with three repetitions (COV 0.2, 0.9, 1.3, and 1.9 %), shows similar results as described for the other field strengths. Differences may also result from the different segmentation methods used [7, 31, 32]. Other groups [33] also reported high intra-site reproducibility for 1.5 and 3 T, but differences across platforms and across field strengths were found. In contrast to lower field strengths, the performance of the setup employed at 7 T in our study cannot be used to reliably quantify the infratentorial brain region, as signal level is too low there. Further optimization of RF coils is needed to overcome these limitations to exploit the higher signal level achievable with 7 T. However, the supratentorial brain reproducibility is very high, which is a promising finding for the reliability of the compared systems.

The TSE sequences are the classical workhorses of MRI at most field strengths and a widely accepted robust imaging method. At UHF, different adaptations were made to overcome SAR limitations that arise at higher field strength due to the quadratic dependency of RF power on Larmor frequency. The sequence we employed utilizes hyperechos [34], allows regulation of pulse lengths, and uses the VERSE [35] algorithm to reduce peak pulse power and thereby the SAR. The SAR limit is always critical for TSE at 7 T and forced us to change the sequence parameter for the 24ch coils, as the scanner software did not account for the lower transmit efficiency of this older coil version compared to the 32ch coil, which was used to set up the imaging protocol. The different sequence parameters led to slightly different contrasts between the sites with different RF coils, especially in central subcortical VOIs. It cannot be assumed, however, that contrast differences would not have occurred if identical sequence parameters had been used because the B1 maps also differed between the RF coils. Site 6 was excluded from analysis, as unknown power restrictions did not allow similar sequence parameters as for the other 32ch coils. Results of the coil-wise comparisons showed strong inter-site correspondence and high reproducibility.

TOF measurements are also challenging at 7 T as the SAR limits are critical due to the applied saturation pulses; the measurements benefit, however, as contrast-to-noise ratio is much higher due to longer T1 relaxation times. The sequence used allowed the regulation of saturation flip angle independent of excitation flip angle and also had the VERSE pulse modification implemented to reduce SAR. At the 24ch coil sites, TR was increased slightly (23 vs. 21 ms) to meet SAR restrictions. Differences between the sites were found, as vessel contrast was slightly reduced for most vessels at the 24ch coil sites. For the two chosen dorsolateral branches of the middle cerebral artery, differences between the RF coils were similar to inter-site variations. This may be due to the different receive (or transmit) sensitivities of the RF coils, but may also originate from physiological parameters like heart rate or blood pressure. Other reasons like subject positioning can also not be excluded. The TOF images and the calculated MIPs show very high agreement, and the similarity in measured vessel contrasts indicates that vessel resolution is comparable between sites.

SWI benefits from higher susceptibility sensitivity at 7 T. Therefore, very small veins can be depicted with SWI. To reach whole-brain coverage in reasonable measurement time (~10 min), a slice resolution of 1 mm was chosen, and the resulting voxel size was bigger than for TOF [(0.3 × 0.3 × 1) vs. (0.2 × 0.2 × 0.4) mm3], where a smaller central slab was imaged, and more highly anisotropic. This led to higher interpolation errors and partial volume effects when relatively small vessels compared to bigger subcortical structures were chosen for quantitative comparison. Nevertheless, the larger veins we chose for analysis had only minimally larger deviations between sites than the arteries compared with TOF. For all subcortical structures, the differences between the sites were minor and much lower than for the TOF measurements. The differences in TOF may be caused by the different contrast mechanisms of the sequences, as indicated by RF coil differences found with TOF, but also physiological reasons like blood flow variations may play an important role.

EPI is very fast and therefore very useful in applications such as functional imaging, but prone to imaging artifacts due to B0 inhomogenities or susceptibility effects. The sequence we used had a modified reference scan for phase correction to account for the higher sensitivity of 7 T to susceptibility and also a sinusoidal readout to reduce gradient coil resonances and helium boil-off. At Site 6 and for the first scan at Site 4, this type of reference scan was not available due to sequence installation problems, and tSNR values were slightly lower than at comparable sites. The distortion artifacts seen in the EPI images correlated well with measured B0 variations. The 32ch coil sites had a higher mean tSNR than the 24ch coil sites. This also corresponds well with other findings at 3 and 7 T [3638]. It has been shown that tSNR benefits most from higher field strength or highly parallel arrays when thermal noise is dominant, e.g., for small voxel volumes and for higher acceleration factors, as in the applied sequence. Inter-site variation of tSNR was smaller for Subject 1 than for Subject 2, and this difference was even increased by the correction for scanner drift. This may indicate that corrections were disturbed by subject movement, physiological effects, or other side effects that could not be detected.

Conclusion

UHF MR systems are very sensitive measurement devices, but high sensitivity also applies to noise and susceptibility to hardware faults. Even for systems provided from the same vendor, there can be remarkable differences between basic imaging components of the individual MR systems and sites that potentially might influence image quality. Effects of these hardware differences were revealed in this study, where it could be shown that the RF coils had the greatest effect on inter-site reproducibility; the effect of the gradient coils and magnet type was more limited. Nevertheless, our results show that the 7 T systems used can deliver high multi-center reproducibility for the supratentorial brain with variability that is similar to that reported in the literature for 3 T. This is a promising finding for quantitative imaging, and a first step toward future large scale multi-site studies at 7 T.