Main

The proton is the dominant component of visible matter in the Universe. Consequently, determining the proton’s basic properties—such as its root-mean-square charge radius, rp—is of interest in its own right. Accurate knowledge of rp is also important for the precise determination of other fundamental constants, such as the Rydberg constant (R)2. The value of rp is also required for precise calculations of the energy levels and transition energies of the hydrogen atom—for example, the Lamb shift. In muonic hydrogen (μH atoms), in which the electron in the H atom is replaced by a ‘heavier electron’ (a muon), the extended proton charge distribution changes the Lamb shift by as much as 2%1. The first-principles calculation of rp from the accepted theory of the strong interaction (quantum chromodynamics, QCD), is notoriously challenging and currently cannot reach the accuracy demanded by experiments, but lattice QCD calculations are on the cusp of becoming precise enough to be tested experimentally9. Therefore, the precise measurement of rp is not only critical for addressing the proton radius puzzle but also important for determining certain fundamental constants of physics and testing lattice QCD.

Prior to 2010 the two methods used to measure rp were ep → ep elastic scattering measurements, in which the slope of the extracted proton (p) electric (E) form factor, \({G}_{{\rm{E}}}^{{\rm{p}}}\), as the four-momentum transfer squared (Q2) approaches zero, is proportional to \({r}_{{\rm{p}}}^{2}\); and Lamb shift (spectroscopy) measurements of ordinary H atoms, which, along with state-of-the-art calculations, can be used to determine rp. Although the e–p results can be somewhat less precise than the spectroscopy results, until 2010 the values of rp obtained from these two methods2,5 mostly agreed with each other10. Since that year, two new results based on Lamb shift measurements in μH were reported1,7. The Lamb shift in μH is several million times more sensitive to rp because the muon in a μH atom is about 200 times closer to the proton than is the electron in a H atom. To the surprise of both the nuclear and atomic physics communities, the two μH results1,7, displaying unprecedented precision with an estimated uncertainty of <0.1%, combined to be eight standard deviations smaller than the average value obtained from all previous experiments. This became known as the proton radius puzzle11, unleashing intensive experimental and theoretical efforts aimed at resolving the disagreement.

The discrepancy between the values of rp as measured in H and μH atoms remains unresolved. Moreover, the two most recent H spectroscopy measurements disagree with each other3,4, which has added a new dimension to and renewed the urgency of this problem. A fundamental difference between the e–p and μ–p interactions could be the origin of the discrepancy; however, there are abundant experimental constraints on any such ‘new physics’, although models that resolve the puzzle by invoking new force carriers have been proposed11,12. More mundane solutions continue to be explored: for example, it has been rigorously shown that the definition of rp used in all three major experimental approaches was consistent13. The effect of two-photon exchange on μH spectroscopy14,15, and form-factor nonlinearities in e–p scattering16,17,18 have also been examined. None of these studies has adequately explained the puzzle, reinforcing the need for additional high-precision measurements of rp that use new experimental techniques and different systematics.

The PRad collaboration at Jefferson Laboratory has developed and performed an e–p experiment as an independent measurement of rp to address the puzzle. The PRad experiment, in contrast with previous e–p experiments, was designed to use a magnetic-spectrometer-free, calorimeter-based method19. The design of the PRad experiment implemented three major improvements over previous e–p experiments. First, the large angular acceptance (0.7°–7.0°) of the hybrid calorimeter (HyCal) enabled large Q2 coverage, spanning two orders of magnitude (2.1 × 10−4 GeV2/c2 to 6 × 10−2 GeV2/c2, where c is the speed of light in a vacuum) in the low Q2 range. The fixed location of HyCal eliminated the many normalization parameters that plague magnetic-spectrometer-based experiments in which the spectrometer must be physically moved to many different angles to cover the desired range of Q2. In addition, the PRad experiment reached extreme forward-scattering angles of down to 0.7°, achieving a Q2 value of 2.1 × 10−4 GeV2/c2; this is, to our knowledge, the lowest Q2 obtained from e–p experiments and is an order of magnitude lower than that previously achieved5. Reaching a lower range for Q2 is critical because rp is determined from the slope of the electric form factor at Q2 = 0. Second, the extracted e–p cross-sections were normalized to the well-known quantum electrodynamics process ee → ee (Møller scattering from atomic electrons, e–e), which was measured simultaneously alongside e–p scattering, using the same detector acceptance. This led to a substantial reduction in the systematic uncertainties of measuring the e–p cross-sections. Third, the background generated from the target windows, one of the dominant sources of systematic uncertainty in all previous e–p experiments, was highly suppressed in the PRad experiment.

The PRad experimental apparatus consisted of four main elements (Fig. 1). (1) A 4-cm-long windowless cryo-cooled hydrogen gas flow target with an areal density of 2 × 1018 atoms per cm2, which eliminated the beam background from the target windows. (2) The high-resolution, large-acceptance hybrid electromagnetic calorimeter, HyCal20. The complete azimuthal coverage of HyCal for the forward-scattering angles enabled simultaneous detection of the pair of electrons from e–e scattering. (3) A plane made of two high-resolution XY gas electron multiplier (GEM) coordinate detectors located in front of HyCal. (4) A two-section vacuum chamber spanning the 5.5-m distance from the target to the detectors.

Fig. 1: The PRad experimental setup.
figure 1

A schematic layout of the PRad experimental setup in Hall B at Jefferson Laboratory, with the electron beam incident from the left. The key beam-line elements are shown along with the windowless hydrogen gas target, the two-segment vacuum chamber and the two detector systems (see the Methods for a brief overview and the Supplementary Information for a description of the target and individual detectors).

The PRad experiment was performed in Hall B at Jefferson Laboratory in May–June of 2016, using 1.1-GeV and 2.2-GeV electron beams. The standard Hall B beam line, designed for low beam currents (0.1–50 nA), was used in this experiment. The incident electrons that scattered off the target protons and the Møller electron pairs were detected in the GEM detector and HyCal. The energy and position of the detected electron(s) were measured by HyCal, and the transverse (XY) position was measured by the GEM detector, which was used to assign the Q2 for each detected event. The GEM detector, which has a position resolution of 72 μm, improved the measurement accuracy of Q2 compared to detection by HyCal alone. Furthermore, the GEM detector suppressed the contamination from photons generated in the target and other beam-line materials; HyCal is equally sensitive to electrons and photons, whereas the GEM detector is mostly insensitive to neutral particles. The GEM detector also helped to suppress position-dependent irregularities in the response of HyCal. A plot of the reconstructed energy versus the reconstructed angle for e–p and e–e events is shown in Fig. 2 for the 2.2-GeV beam energy.

Fig. 2: Event reconstruction.
figure 2

The reconstructed energy versus angle for e–p and e–e events for an electron beam energy of 2.2 GeV. The red and black lines indicate the event selections for e–p and e–e, respectively. The angles ≤3.5° are covered by the crystal PbWO4 modules of HyCal and the larger angles by the Pb glass modules. The colour bar shows the number of events.

The background was measured periodically with an empty target cell. To mimic the residual gas in the beam line, H2 gas at very low pressure was allowed in the target chamber during the empty target runs. The charge-normalized e–p and Møller scattering yields from the empty target cell were used to subtract the background contributions. The beam current was measured with the Hall B Faraday cup with an uncertainty of <0.1%21. Further details on the background subtraction can be found in the Supplementary Information.

A comprehensive Monte Carlo simulation of the PRad setup was developed using the Geant4 toolkit22. The simulation consists of two separate event generators built for the e–p and e–e processes23,24. Inelastic e–p scattering background events were also included in the simulation using a fit25 to the e–p inelastic world data. The simulation included signal digitization and photon propagation, which were critical for the precise reconstruction of the position and energy of each event in the HyCal. The details are described in the Supplementary Information.

The e–p cross-sections were obtained by comparing the simulated and measured e–p yield relative to the simulated and measured e–e yield (see Supplementary Information for details). The extracted reduced cross-section is shown in Fig. 3a. The e–p elastic cross-section is related to \({G}_{{\rm{E}}}^{{\rm{p}}}\) and the proton magnetic form factor, \({G}_{{\rm{M}}}^{{\rm{p}}}\), by the Rosenbluth formula19. In the very low Q2 region covered by the PRad experiment, the cross-section is dominated by the contribution from \({G}_{{\rm{E}}}^{{\rm{p}}}\). Thus, the uncertainty introduced from \({G}_{{\rm{M}}}^{{\rm{p}}}\) is negligible. In fact, when using a wide variety of parametrizations5,26,27,28 for \({G}_{{\rm{M}}}^{{\rm{p}}}\), the extracted \({G}_{{\rm{E}}}^{{\rm{p}}}\) varies by about 0.2% at Q2 = 0.06 GeV2/c2, the largest Q2 accessed by the PRad experiment, and by <0.01% in the Q2 < 0.01 GeV2/c2 region. The largest variation in rp arising from the choice of \({G}_{{\rm{M}}}^{{\rm{p}}}\) parametrization is 0.001 fm. \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\) as extracted from our data is shown in Fig. 3b, using the Kelly parametrization26 for \({G}_{{\rm{M}}}^{{\rm{p}}}\).

Fig. 3: The measured cross-section and form factor.
figure 3

a, The reduced cross-section, \({\sigma }_{{\rm{r}}{\rm{e}}{\rm{d}}{\rm{u}}{\rm{c}}{\rm{e}}{\rm{d}}}={\left({\textstyle \tfrac{{\rm{d}}\sigma }{{\rm{d}}\varOmega }}\right)}_{e\mbox{-}p}/\,[{\left({\textstyle \tfrac{{\rm{d}}\sigma }{{\rm{d}}\varOmega }}\right)}_{{\rm{p}}{\rm{o}}{\rm{i}}{\rm{n}}{\rm{t}}\mbox{-}{\rm{l}}{\rm{i}}{\rm{k}}{\rm{e}}}((4{M}_{{\rm{p}}}^{2}{E}^{^{\prime} }/E)/(4{M}_{{\rm{p}}}^{2}+{Q}^{2}))]\), (where E is the electron beam energy, E′ is the energy of the scattered electron, Mp is the mass of the proton and Ω is the solid angle subtended by the scattered electron detector), for the PRad e–p data. Dividing out the kinematic factor inside the square brackets, σreduced is a linear combination of the electromagnetic form factors squared. The bands at the bottom of the plot are the size of the systematic uncertainties, for 1.1 GeV (red) and 2.2 GeV (blue). The error bars show statistical uncertainties. b, \({G}_{{\rm{E}}}^{{\rm{p}}}\) as a function of Q2. The data points are normalized by the parameter n in equation (1) for the 1.1-GeV and 2.2-GeV data, labelled as n1 and n2, respectively. The error bars show statistical uncertainties. The bands are the systematic uncertainties as in a. The solid black curve shows \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\) as a fit to the function given by equation (1). Also shown is the fit from a previous e–p experiment5, giving rp = 0.883(8) fm (green dashed line) and another previous calculation30 giving rp = 0.844(7) fm (purple dot-dashed line).

The slope of \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\) as Q2 → 0 is proportional to \({r}_{{\rm{p}}}^{2}\). A common practice is to fit \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\) to a functional form and to obtain rp by extrapolating to Q2 = 0. However, each functional form truncates the higher-order moments of \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\) differently and introduces a model dependence that can bias the determination of rp. It is critical to choose a robust functional form that is most likely to yield an unbiased estimation of rp given the uncertainties in the data, and to test the chosen functional form over a broad range of parametizations29 of \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\). To simultaneously minimize possible bias in the determination of the radius and the total uncertainty, various functional forms were examined for their robustness in reproducing an input rp used to generate a mock dataset with the same statistical uncertainty as the PRad data. The robustness, quantified as the root-mean square error (RMSE), is defined as \({\rm{RMSE}}=\sqrt{{(\delta R)}^{2}+{{\sigma }}^{2}}\), where δR is the bias or the difference between the input and extracted radius and σ is the statistical variation of the fit to the mock data29. Previous studies29 show (see Supplementary Information) that consistent results with the smallest uncertainties can be achieved using a multi-parameter rational function, which we refer to as Rational(1, 1):

$$f({Q}^{2})=n{G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})=n\frac{1+{p}_{1}{Q}^{2}}{1+{p}_{2}{Q}^{2}}$$
(1)

where n is the floating normalization parameter, p1 and p2 are fit parameters and the proton charge radius is given by \({r}_{{\rm{p}}}=\sqrt{6({p}_{2}-{p}_{1})}\). The \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\), extracted from the 1.1-GeV and 2.2-GeV data, was fitted simultaneously using the Rational(1, 1) function. Independent normalization parameters n1 and n2 were assigned for the 1.1-GeV and 2.2-GeV data, respectively, to allow for differences in normalization uncertainties, but the Q2 dependence was identical. The parameters obtained from fits to the Rational(1, 1) function are n1 = 1.0002 ± 0.0002stat ± 0.0020syst; n2 = 0.9983 ± 0.0002stat ± 0.0013syst; and rp = 0.831 ± 0.007stat ± 0.012syst fm. The Rational(1, 1) function describes the data very well, with a reduced χ2 of 1.3 when considering only the statistical uncertainty. The values of rp for a variety of functional forms fitted to the PRad data are shown in Supplementary Fig. 15.

To determine the systematic uncertainty in rp, a Monte Carlo technique was used to randomly smear the cross-section and \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\) data points for each known source of systematic uncertainty. The value of rp was extracted from the smeared data and the process was repeated 100,000 times. The root-mean square of the resulting distribution of rp is recorded as the systematic uncertainty. The dominant systematic uncertainties of rp are those that are Q2-dependent, which primarily affect the lowest Q2 data: the Møller radiative corrections, the background subtraction for the 1.1-GeV data and event selection. The uncertainty in rp arising from the finite Q2 range and the extrapolation to Q2 = 0 was investigated by varying the Q2 range of the mock dataset as part of the robustness study of the Rational(1, 1) function29. This uncertainty was found to be much smaller than the relative statistical uncertainty, 0.8%. The total systematic relative uncertainty on rp was found to be 1.4%, and is detailed in Supplementary Table 1 and described in the Supplementary Information.

The value of rp obtained using the Rational(1, 1) function is shown in Fig. 4, with statistical and systematic uncertainties summed in quadrature. Our result, obtained from Q2 down to an unprecedented 2.1 × 10−4 GeV2/c2, is about three standard deviations smaller than the previous high-precision electron scattering measurement5, which was limited to higher Q2 (>0.004 GeV2/c2). However, our result is consistent with the μH Lamb-shift measurements1,7, and also with the recent 2S–4P transition-frequency measurement using ordinary H atoms3. Given that the lowest Q2 reached in the PRad experiment is an order of magnitude lower than in previous e–p experiments, and owing to the careful control of systematic effects, our result indicates that the proton radius is smaller than its previously accepted value from e–p measurements. Our result does not support any fundamental difference between e–p and μ–p interactions and is consistent with the updated value announced for the Rydberg constant by CODATA8.

Fig. 4: The proton charge radius.
figure 4

rp as extracted from the PRad data in this work, shown alongside other measurements of rp since 2010 and previous CODATA recommended values. Our result is 2.7σ smaller than the CODATA recommended value for e–p experiments6. The orange and blue vertical bands show the uncertainty bounds of the μH and CODATA values for e–p scattering, respectively.

The PRad e–p experiment covers Q2 over two orders of magnitude in one setting. The experiment also exploited the simultaneous detection of e–p and e–e scattering to achieve good control of systematic uncertainties, which were, by design, different from previous e–p experiments. The extraction of rp using functional forms with validated robustness is another strength of this result. Our result demonstrates a large discrepancy with contemporary, high-precision e–p experiments. The result also implies that there is consistency between proton charge radii as obtained from e–p scattering measurements on ordinary hydrogen and spectroscopy of muonic hydrogen1,7. The PRad experiment demonstrates the clear advantages of the calorimeter-based method for determining rp from e–p experiments and points to further possible improvements in the accuracy of this method. It is also consistent with the recently announced shift in the Rydberg constant8, which has profound consequences, given that the Rydberg constant is one of the most precisely known constants of physics.

Methods

The PRad experiment was conducted with 1.1-GeV and 2.2-GeV electron beams from the Continuous Electron Beam Accelerator Facility (CEBAF) accelerator incident on cold hydrogen atoms flowing through a windowless target cell. The scattered electrons, after traversing the vacuum chamber, were detected in the GEM detector and HyCal. They included electrons from elastic e–p scattering and e–e Møller scattering processes. The transverse (XY) positions measured by the GEM detector were used to calculate the Q2 value for each event. The e–p and e–e yields were obtained using appropriate cuts on the energy deposited in HyCal and the reconstructed angle. The e–p and e–e yields were binned as a function of Q2. A comprehensive Monte Carlo simulation of the PRad experiment was used to extract the next-to-leading order e–p cross-section from the experimental yields. The e–p cross-sections were obtained by comparing the simulated and measured e–p yield relative to the simulated and measured Møller scattering yield. The value of \({G}_{{\rm{E}}}^{{\rm{p}}}\) was extracted from the e–p cross-section using the Rosenbluth formula, and using a parametrization of \({G}_{{\rm{M}}}^{{\rm{p}}}\). The proton charge radius, rp, was obtained from the extracted \({G}_{{\rm{E}}}^{{\rm{p}}}({Q}^{2})\) by fitting to the Rational(1, 1) functional form and extrapolating to Q2 = 0. The Rational(1, 1) functional form was shown to be the most robust function for radius extraction from the PRad data, giving consistent results with the smallest uncertainties. See Supplementary Information for further details.