Main

Previous observations of neutrino oscillations have established that the three known neutrino flavour states, νe, νμ and ντ are mixtures of three mass states, ν1, ν2 and ν312,13,14,15. This mixing is described by a unitary matrix called the Pontecorvo–Maki–Nakagawa–Sakata (PMNS) matrix16,17, which can be parameterized by three mixing angles θ12, θ13 and θ23, and complex phases. Of these phases, neutrino oscillations are sensitive to δCP. The probabilities that the neutrinos will oscillate from one flavour state to another as they travel depend on these mixing parameters and the mass squared differences (\(\Delta {m}_{ij}^{2}={m}_{i}^{2}-{m}_{j}^{2}\)) between the neutrino mass states. The PMNS parameters and the mass squared differences are referred to as ‘oscillation parameters’. It is known that ν1 and ν2 lie close to each other in mass, with \(\Delta {m}_{21}^{2}=(7.53\pm 0.18)\times {10}^{-5}\,{{\rm{e}}{\rm{V}}}^{2}/{c}^{4}\), whereas \(|\Delta {m}_{32}^{2}|\) is approximately 30 times larger. However, it is not known whether m3 has a larger or smaller mass than m1 and m2 (ref. 2). The case where the mass of m3 is larger (smaller) is called the normal (inverted) ordering. The CP symmetry-violating effect in neutrino and antineutrino oscillations has a magnitude that depends on the Jarlskog invariant18,19:

$${J}_{{\rm{C}}{\rm{P}},{\rm{l}}}=\frac{1}{8}\cos {\theta }_{13}\sin (2{\theta }_{12})\sin (2{\theta }_{23})\sin (2{\theta }_{13})\sin {\delta }_{{\rm{C}}{\rm{P}}}$$
(1)

According to current measurements, this is approximately 0.033sinδCP (ref. 2). This value has the potential to be three orders of magnitude larger than the measured quark-sector CP violation (JCP,q = 3 × 10−5) (ref. 2). Prior to this work, no experiment has excluded any values of δCP (taking into account both mass orderings) at the 99.73% (3σ) confidence level. T2K is a long-baseline neutrino experiment that uses beams of muon neutrinos and antineutrinos, with energy spectra peaked at 0.6 GeV. We observe interactions of the neutrinos at a near detector facility 280 m from the beam production point that characterizes the beam and the interactions of the neutrinos before oscillations. The beam then propagates 295 km through the Earth to the T2K far detector, Super-Kamiokande (SK). SK measures the oscillated beam, which allows us to determine the oscillation parameters.

For this beam energy and propagation distance, the probability that muon neutrinos(antineutrinos) will oscillate to electron neutrinos (antineutrinos) is given approximately, including the CP-violating term but neglecting effects from propagation through matter, by:

$$\begin{array}{c}P({\nu }_{{\rm{\mu }}}\to {\nu }_{{\rm{e}}})\approx {\sin }^{2}(2{\theta }_{13}){\sin }^{2}{\theta }_{23}{\sin }^{2}\left(\frac{1.27\Delta {m}_{32}^{2}L}{E}\right)\\ \,\mp \frac{1.27\Delta {m}_{21}^{2}L}{E}8{J}_{{\rm{C}}{\rm{P}}}{\sin }^{2}\left(\frac{1.27\Delta {m}_{32}^{2}L}{E}\right)\end{array}$$
(2)

Here, E is the energy of the neutrino in gigaelectronvolts, the mass squared differences are given in units of eV2/c4, where c is the speed of light in vacuum, and L is the propagation baseline in kilometres. The second term in equation (2) has a negative sign for neutrinos and a positive sign for antineutrinos. The baseline and beam energy are optimized so that at our baseline, the probability to oscillate to electron neutrinos reaches a maximum at energies around the beam energy. Although the probability of oscillation to electron neutrinos is small, muon neutrinos also oscillate to tau neutrinos, which are not identifiable at SK because T2K’s beam energy is too low for a charged tau lepton to be produced. Overall, the probability that muon neutrinos and antineutrinos will maintain their initial flavour is:

$$\begin{array}{c}P({\nu }_{{\rm{\mu }}}\to {\nu }_{\mu })\approx 1-4{\cos }^{2}{\theta }_{13}{\sin }^{2}{\theta }_{23}\\ \times (1-{\cos }^{2}{\theta }_{13}{\sin }^{2}{\theta }_{23}){\sin }^{2}\left(\frac{1.27\Delta {m}_{32}^{2}L}{E}\right)\end{array}$$
(3)

Given that the probability of oscillation to tau neutrinos is large at our modal beam energy and baseline, there is a minimum in the muon neutrino energy spectrum. The position of this minimum gives the experiment sensitivity to the magnitude of \(\Delta {m}_{32}^{2}\) and the depth gives sensitivity to sin2(2θ23). The height of the peak in the electron neutrino energy spectrum at the oscillation maximum is, at leading order, determined by sin2θ23 and sin2(2θ13) (see equation (2)). However, it also has a sub-leading-order dependence on δCP and the neutrino mass ordering, giving sensitivity to these parameters. Owing to this interdependence, determining the other PMNS mixing parameters is important in measuring δCP. As can be seen from Fig. 1, changing δCP from +π/2 to −π/2 can lead to changes of the order of 40% in the number of electron neutrinos expected at SK. In our analysis we model the observed kinematic distributions of the final-state particles using the full oscillation probability, including the effect of the neutrinos propagating through matter, which is a perturbation of the order of 10% to the probability discussed in equations (2) and (3)20.

Fig. 1: Observed νe and \({\bar{{\boldsymbol{\nu }}}}_{{\bf{e}}}\) candidate events at SK.
figure 1

a, b, The reconstructed neutrino energy spectra for the SK samples containing electron-like events in neutrino-mode (a) or antineutrino-mode (b) beam running. The uncertainty shown around the data points accounts for statistical uncertainty. The uncertainty range is chosen to include all points for which the measured number of data events is inside the 68% confidence interval of a Poisson distribution centred at that point. The solid stacked chart shows the predicted number of events for the CP-conserving point δCP = 0, separated according to whether the event was from an oscillated neutrino or antineutrino or from a background process. The dashed lines show the total predicted number of events for the two most extreme CP-violating cases. c, The predicted number of events for δCP = −π/2 and the measured number of events in the three electron-like samples at SK. The predicted number of events is broken down into the same categories as in a and b and the systematic uncertainty shown is after the near-detector fit. In both a and b for all predictions, normal ordering is assumed, and sin2θ23 and \(\Delta {m}_{32}^{2}\) are at their best-fit values. sin2θ13, sin2θ12 and \(\Delta {m}_{21}^{2}\) take the values indicated by external world average measurements2. The parameters accounting for systematic uncertainties take their best-fit values after the near-detector fit.

The T2K neutrino beam is generated at the Japan Proton Accelerator Research Complex (J-PARC) by impinging a 30-GeV beam of protons onto a graphite target21. This interaction creates a large number of secondary hadrons, which are focused using magnetic horns. A neutrino (antineutrino)-enhanced beam is selected by focusing positively (negatively) charged particles—mostly pions—by choosing the polarity of the magnetic field produced by the horns, thereby enabling us to study the differences between neutrino and antineutrino oscillations. The beam axis is directed 2.5° away from the SK detector, taking advantage of the kinematics of the two-body pion decay to produce a narrow neutrino spectrum peaked at the expected energy of maximum oscillation probability22. The results reported here are based on SK data collected between 2009 and 2018 in neutrino (antineutrino) mode and include a beam exposure of 1.49 × 1021 (1.64 × 1021) protons hitting the T2K neutrino production target.

Neutrinos are detected by observing the particles they produce when they interact. At neutrino energies of 0.6 GeV the dominant interaction process is charged-current quasi-elastic (CCQE) scattering via the exchange of a W boson with a single neutron or proton bound in the target nucleus. In this process the neutrino (antineutrino) turns into a charged lepton (antilepton) of the same flavour. We are thereby able to identify the incoming neutrino’s flavour.

Our near detector facility consists of two detectors both located 280 m downstream of the beam production target21. The INGRID detector23, located on the beam axis, monitors the direction and stability of the neutrino beam. The ND280 detector24,25,26,27,28 is located at the same angle away from the beam axis as SK, and characterizes the rate of neutrino interactions from the beam before oscillations have occurred, thereby reducing systematic errors. ND280 is magnetized so that charged leptons and antileptons bend in opposite directions as they traverse the detector. This effect is used to measure the fraction of events in each beam mode that are from neutrino and antineutrino interactions. In this analysis, we select samples enriched in CCQE events and also several control samples enriched in interactions from other processes, allowing their rates to be measured separately. Here we use ND280 data that include a neutrino beam exposure of 5.8 × 1020 (3.9 × 1020) protons hitting the T2K neutrino production target in neutrino (antineutrino) mode. The explanation for the smaller dataset in ND280 and its impact on the analysis method is described in the Methods.

SK is a 50-kt water detector instrumented with photo-multiplier tube light sensors29. In SK, Cherenkov light is produced as charged particles above a momentum threshold travel through the water. This light is emitted in ring patterns that are detected by the light sensors. Owing to their lower mass, electrons scatter much more frequently (both elastically and inelastically) than muons, so their Cherenkov rings are blurred. We use this blurring to identify the charged lepton’s flavour, as illustrated in Fig. 2. More information on the event reconstruction technique for SK data and the systematic uncertainty on SK modelling can be found in the Methods Section. SK is not magnetized, so therefore relies on ND280’s measurement of the neutrino and antineutrino composition of the beam in each mode.

Fig. 2: Particle identification in the SK detector.
figure 2

Distribution of the particle identification (PID) parameter used to classify Cherenkov rings as electron-like and muon-like. Events to the left of the blue line are classified as electron-like and those to the right as muon-like. The filled histograms show the expected number of single ring events after neutrino oscillations, with the first and last bins of the distribution containing events with discriminator values below and above the displayed range, respectively. The vertical error bars on the data points represent the standard deviations due to statistical uncertainty. The PID algorithm uses properties of the light distribution such as the blurriness of the Cherenkov ring to classify events. The insets show examples of an electron-like (left) and muon-like (right) Cherenkov ring.

We form five independent samples of SK events. For both neutrino- and antineutrino-beam mode there is a sample of events that contain a single muon-like ring (denoted 1μ), and a sample of events that contain only a single electron-like ring (denoted 1e0de). These single-lepton samples are dominated by CCQE interactions. In neutrino-mode there is a sample containing an electron-like ring as well as the signature of an additional delayed electron from the decay of a charged pion and subsequent muon (denoted 1e1de). We do not use this sample in antineutrino-mode because charged pions from antineutrino interactions are mostly absorbed by a nucleus before they decay. Identifying both muon and electron neutrino interactions in both the neutrino- and antineutrino-mode beams allows us to measure the probabilities for four oscillation channels: νμνμ and \({\bar{\nu }}_{{\rm{\mu }}}\to {\bar{\nu }}_{{\rm{\mu }}}\), νμνe and \({\bar{\nu }}_{{\rm{\mu }}}\to {\bar{\nu }}_{{\rm{e}}}\).

We define a model of the expected number of neutrino events as a function of kinematic variables measured in our detectors with degrees of freedom for each of the oscillation parameters and for each source of systematic uncertainty. Systematic uncertainties arise in the modelling of neutrino-nucleus interactions in the detector, the modelling of the neutrino production, and the modelling of the detector’s response to neutrino interaction products. Where possible, we constrain the model using external data. For example, the solar oscillation parameters, \(\Delta {m}_{21}^{2}\) and sin2θ12, whose values T2K is not sensitive to, are constrained using world average data2. While we are sensitive to sin2θ13, we use the combination of measurements from the Daya Bay, RENO and Double Chooz reactor experiments to constrain this parameter2, as they make a much more precise measurement than using T2K data alone (see Fig. 4a). We measure the oscillation parameters by doing a marginal likelihood fit of this model to our near and far detector data. We perform several analyses using both Bayesian and frequentist statistical paradigms. Exclusive measurements of neutrino or antineutrino candidates in the near detector, one of which is shown in Fig. 3, strongly constrain the neutrino production and interaction models, reducing the uncertainty on the predicted number of events in the four single-lepton SK samples from 13–17% to 4–9%, depending on the sample. The 1e1de sample’s uncertainty is reduced from 22% to 19%.

Fig. 3: Event prediction model tuning to near-detector data.
figure 3

a, b, Reconstructed muon momentum in two of the ND280 CCQE-like event samples for both neutrino (a) and antineutrino (b) beam mode. The prediction with all parameters set to their best-fit value from a fit to the ND280 data are shown by the coloured histograms, split into true neutrino CCQE, antineutrino CCQE, neutral current and all other interactions. The dashed line shows the prediction before a fit to the ND280 data. The vertical error bars on the data represent the standard deviation due to statistical uncertainty. c, The ratio of the observed data to the best-fit Monte Carlo prediction in both neutrino-mode and antineutrino-mode samples.

A neutrino’s oscillation probability depends on its energy, as shown in equations (2) and (3). While the energy distribution of our neutrino beam is well understood, we cannot directly measure the energy of each incoming neutrino. Instead the neutrino’s energy must be inferred from the momentum and direction of the charged lepton that results from the interaction. This inference relies on the correct modelling of the nuclear physics of neutrino-nucleus interactions. Modelling the strong nuclear force in multi-body problems at these energies is not computationally tractable, so approximate theories are used30,31,32,33. The potential biases introduced by approximations in these theories constitute the largest sources of systematic uncertainties in this measurement. For scale, the largest individual source contributes 7.1% of the overall 8.8% systematic uncertainty on the single electron-like ring ν-mode sample. Furthermore, as well as CCQE interactions, there are non-negligible contributions from interactions where additional particles were present in the final state but were not detected by our detectors. To check for bias from incorrect modelling of neutrino-nucleus interactions, we performed fits to simulated data sets generated assuming a range of different models of neutrino interactions31,32. We compared the measurements of the oscillation parameters obtained from these fits with the measurement from a fit to simulated data generated assuming our default model. We observed no large biases in the obtained δCP best-fit values or changes in the interval sizes from any model tested. Biases are seen on \(\Delta {m}_{32}^{2}\), and these have been incorporated in the analysis through an additional error of 3.9 × 10−5 eV2/c4 on the \(\Delta {m}_{32}^{2}\) interval. More details of the systematic uncertainties on neutrino interaction modelling can be found in the Methods.

The observed number of events at SK can be seen in Fig. 1. The probability of observing an excess over prediction in one of our five samples at least as large as that seen in the electron-like charged pion sample is 6.9%, assuming the best-fit value of the oscillation parameters. We find that the data shows a preference for the normal mass ordering with a posterior probability of 89%, giving a Bayes factor of 8. We find \({\sin }^{2}{\theta }_{23}={0.53}_{-0.04}^{+0.03}\) for both mass orderings. Assuming the normal (inverted) mass ordering we find \(\Delta {m}_{32}^{2}=(2.45\pm 0.07)\times {10}^{-3}\,{{\rm{e}}{\rm{V}}}^{2}/{c}^{4}\) \((\Delta {m}_{13}^{2}=(2.43\pm 0.07)\times {10}^{-3}\,{{\rm{e}}{\rm{V}}}^{2}/{c}^{4})\). For δCP our best-fit value and 68% (1σ) uncertainties assuming the normal (inverted) mass ordering are \(-{1.89}_{-0.58}^{+0.70}\,\,(-{1.38}_{-0.54}^{+0.48})\), with statistical uncertainty dominating. Our data show a preference for values of δCP that are near maximal CP violation (see Fig. 4), while both CP conserving points, δCP = 0 and δCP = π, are ruled out at the 95% confidence level. Here we also produce 99.73% (3σ) confidence and credible intervals on δCP. In the favoured normal ordering the confidence interval contains [−3.41, −0.03] (excluding 46% of the parameter space). We have investigated the effect of the excess seen in the 1e1de sample on this interval and find that had the observed number of events in this sample been as expected for the best-fit parameter values the interval would have contained [−3.71, 0.17] (excluding 38% of parameter space). In the inverted ordering the confidence interval contains [−2.54, −0.32] (excluding 65% of the parameter space). The 99.73% credible interval marginalized across both mass orderings contains [−3.48, 0.13] (excluding 42% of the parameter space). The CP-conserving points are not both excluded at the 99.73% level. However, this experiment has reported closed 99.73% (3σ) intervals on the CP-violating phase δCP (taking into account both mass orderings), and a large range of values around +π/2 are excluded.

Fig. 4: Constraints on PMNS oscillation parameters.
figure 4

a, Two-dimensional confidence intervals at the 68.27% confidence level for δCP versus sin2θ13 in the preferred normal ordering. The intervals labelled T2K only indicate the measurement obtained without using the external constraint on sin2θ13, whereas the T2K + reactor intervals do use the external constraint. The star shows the best-fit point of the T2K + reactors fit in the preferred normal mass ordering. b, Two-dimensional confidence intervals at the 68.27% and 99.73% confidence level for δCP versus sin2θ23 from the T2K + reactors fit in the normal ordering, with the colour scale representing the value of negative two times the logarithm of the likelihood for each parameter value. c, One-dimensional confidence intervals on δCP from the T2K + reactors fit in both the normal and inverted orderings. The vertical line in the shaded box shows the best-fit value of δCP, the shaded box itself shows the 68.27% confidence interval, and the error bar shows the 99.73% confidence interval. We note that there are no values in the inverted ordering inside the 68.27% interval.

Methods

The measurement presented in this paper relies on the modelling of experimental apparatus to infer the parameters governing the oscillations of neutrinos. This modelling can be broken into three main categories: the modelling of the neutrino production in the beamline, the modelling of neutrino–nucleus interactions in the detectors, and the modelling of the detectors’ responses to final-state particles and the inference of particle properties from the detector response. The main sources of systematic uncertainty in the data analysis arise in these three areas, and here we provide a description of the models and associated systematic uncertainties.

The inference of neutrino oscillation parameters from the data also relies on the statistical methods applied. In this section, we also provide a detailed description of the statistical methods used to infer the favoured values and allowed regions for neutrino oscillation parameters.

Neutrino production modelling

The predicted neutrino and antineutrino fluxes, including the energies and flavours of neutrinos and antineutrinos, are estimated using a detailed simulation of the T2K beamline. Measurement of the proton beam orbit, transverse width and divergence, and intensity are used as the initial conditions before simulating the interactions of protons in the T2K target to produce the secondary particles that decay to neutrinos. Particle interactions and production inside the target are simulated with FLUKA201134,35, while particle propagation outside the target is simulated with GEANT336. Interaction rates and hadron production in the simulation are tuned with hadron interaction data from external experiments, primarily the NA61/SHINE experiment, which has collected data for T2K at the J-PARC proton beam energy of 30 GeV with the T2K target material of graphite37. Measurements of the currents of the magnetic horns during beam operation and of the magnetic fields of the horns before installation ensure accurate modelling of the charged particle focusing in the flux simulation. The simulated fluxes are used as inputs to simulations of neutrino interactions and particle detection in the ND280 and SK detectors. The spectrum of muon neutrinos or antineutrinos produced from the decays of focused charged pions peaks at an energy of 0.6 GeV for an off-axis angle of 2.5°. Near the peak energy, 97.2% (96.2%) of the neutrino-mode (antineutrino-mode) beam is initially νμ \(({\bar{\nu }}_{{\rm{\mu }}})\). The remaining components are mostly \({\bar{\nu }}_{{\rm{\mu }}}\) (νμ); contaminations of \({\nu }_{{\rm{e}}}+{\bar{\nu }}_{{\rm{e}}}\) are only 0.47% (0.49%).

Since the neutrino flux prediction depends on the measured beam and beamline properties, which may vary with time, different flux predictions are made for each run period, and they are combined with weights proportional to the number of protons-on-target (POT) accumulated during periods of nominal detector operation. The collected ND280 data corresponds to 5.8 × 1020 POT in neutrino mode and 3.9 × 1020 POT in antineutrino mode. This amount is less than the amount of data collected at SK owing to the lower efficiency of nominal data taking at ND280, and longer data processing times for ND280 data, limiting the available dataset. This POT difference between ND280 and SK is accounted for by combining the POT-weighted flux predictions for each run period based on the beam exposures for the data collected at each detector.

The uncertainty on the flux calculation is evaluated by propagating uncertainties on the proton beam measurements, hadronic interactions, material modelling and alignment of beamline elements, and horn current and field measurements. In each case, variations of the source of uncertainty are considered and the effect on the flux simulation is evaluated. The INGRID on-axis neutrino detector is not used to tune the beam direction during operation. Hence, it provides an independent measurement of the beam direction38, which is used to validate the flux simulation. The uncertainty on the INGRID beam direction measurement is propagated in the flux model. The variations are used to calculate covariances for the flux prediction in bins of energy, flavour, neutrino/antineutrino mode and detector (ND280 and SK). These covariances are used to propagate uncertainties on the flux prediction in the oscillation analysis. The dominant source of systematic uncertainty is from the hadron interaction data and models. The uncertainty on the flux normalization in this analysis near the peak energy of 0.6 GeV is 9%. In future analyses we aim to improve this to approximately 5% by using NA61/SHINE particle production data measured from a replica of the T2K target39,40. Uncertainties on the proton beam orbit and alignment of beamline elements correspond to an uncertainty on the off-axis angle at the ND280 and SK detectors, corresponding to uncertainties on the peak energy of the neutrino spectrum at those detectors.

Neutrino interaction modelling

The T2K detectors measure products of neutrinos and antineutrinos interacting on nuclei and free protons with energies ranging from about 0.1 GeV to 30 GeV. These interactions are modelled with the NEUT41 neutrino interaction generator, using version 5.3.2. NEUT uses a range of models to describe the physics of the initial nuclear state, the neutrino–nucleon(s) interaction, and the interactions of final-state particles in the nuclear medium.

The primary signal processes in SK are defined by the presence of a single charged lepton candidate with no other visible particles. The dominant process at the peak energy of 0.6 GeV is CCQE scattering. This process corresponds to the neutrino or antineutrino scattering on a single nucleon bound in the target nucleus. The neutrino–nucleon scattering in NEUT is implemented in the formalism of Llewellyn–Smith42. For the initial nuclear state, NEUT implements a relativistic Fermi gas model of the target nucleus, including long-range correlations evaluated using the random phase approximation43. NEUT includes an alternative initial-state model based on spectral functions describing the initial momentum and removal energy for bound nucleons44.

Additional processes that can produce a signal-like final state are modelled in NEUT. The two-particle, two-hole (2p-2h) model of Nieves et al.45,46 predicts the production of multinucleon excitations, where more than one nucleon and no pions are ejected in the final state. The ejected nucleons are typically below the detection threshold in a water Cherenkov detector, making this process indistinguishable from the CCQE process.

The signal candidate sample with one prompt electron-like ring and the presence of an electron from muon decay consists primarily of interactions where a pion is produced. These single-pion interactions can also populate the samples without an additional electron from muon decay if the pion is absorbed in the target nucleus or on a nucleus in the detector, or if it is not detected. Processes producing a single pion and one nucleon are described by the Rein–Sehgal model47. Processes with multiple pions are simulated with a custom model below 2 GeV of hadronic invariant mass and by PYTHIA48 otherwise. These processes may be selected as events with single Cherenkov rings if the pion is absorbed in the target nucleus or surrounding nuclei, or if it is not detected. The final-state interactions of pions and protons in the target nucleus are modelled with the NEUT intranuclear cascade model where the density dependence of the mean free path for pions in the target nucleus is calculated based on the Δ-hole model of ref. 49 at low momenta and from p–π scattering data from the SAID database at high momenta. The microscopic interaction rates for exclusive pion scattering modes are then tuned to macroscopic π–nucleus scattering data.

We consider two types of systematic uncertainties on neutrino interaction modelling in the oscillation measurement. In the first, parameters in the nominal interaction model are allowed to vary and are constrained by ND280 data. These parameters are then marginalized over when measuring oscillation parameters. They include uncertainties on nucleon form factors, the corrections for long-range correlations, the rates of different neutrino interaction processes, the final-state kinematics of the CCQE, 2p-2h and single-pion production processes, and the rates of pion final-state interactions. Most of these are parameters in the models with physical interpretations, and they modify the overall rate of interactions, the final-state topology, and the kinematics of final-state particles. After the constraint from ND280 data, these parameters are correlated with the systematic parameters in the neutrino production model, and their combined impact on the predicted event distributions in SK is evaluated. The constrained interaction model and neutrino production model parameters contribute a 2.7% uncertainty on the prediction of the relative number of electron neutrino and electron antineutrino candidates, the third-largest source of systematic uncertainty, as shown in Extended Data Table 1.

We also include an uncertainty on the νe and \({\bar{\nu }}_{{\rm{e}}}\) cross-sections relative to the νμ and \({\bar{\nu }}_{{\rm{\mu }}}\) cross-sections. This introduces a direct uncertainty on the relative prediction of νe and \({\bar{\nu }}_{{\rm{e}}}\) candidates, and is motivated by uncertainties in the neutrino–nucleon scattering cross-section arising from the charged lepton masses50. As shown in Extended Data Table 1, this introduces a relative uncertainty of 3.0%, the second-largest single source of systematic uncertainty in the CP asymmetry measurement.

The second type of systematic uncertainty is evaluated by introducing simulated data generated by an alternative model into the analysis and evaluating the impact on measured oscillation parameters. This approach is used to evaluate the effect of changing the nuclear initial-state model including the use of the spectral functions and changes to the removal energy for initial-state nucleons. This approach is also applied to evaluate the impact of changes to the 2p-2h interaction cross-section as a function of energy, using an alternative single-pion production model51,52, and applying alternative multi-pion production tuning53. The largest biases observed in this approach are on the measurement of \(\Delta {m}_{32}^{2}\), whereas the impact on other parameters is typically small compared to the total systematic uncertainty. In the case of \(\Delta {m}_{32}^{2}\), an additional uncertainty of 3.9 × 10−5 eV2/c4 is included by taking a convolution of a Gaussian of width 3.9 × 10−5 eV2/c4 with the likelihood. For the measurement of the other oscillation parameters, we found that the biases introduced by varying the removal energy by up to 18 MeV for initial-state nucleons were not negligible. An additional uncertainty equal to the bias in the predicted event distributions when varying the removal energy by 18 MeV was therefore added to the analysis. As shown in Extended Data Table 1, this introduces a 3.7% uncertainty on the relative prediction of electron neutrino and electron antineutrino candidates, the largest single source of systematic uncertainty in the analysis.

SK event reconstruction

Photosensors installed on the SK inner detector register Cherenkov light, produced as charged particles produced by neutrino interactions travel through the water volume29. Photosensor activity clustered in time, on the order of a microsecond, is termed an event. Events coincident with the T2K-beam timing are selected as candidate beam neutrino interactions.

Neutrino interaction events in SK often have multiple periods of photosensor activity separated in time within an event. The most frequent example is a muon decaying into an electron. A decay electron can be used to tag a muon even when the muon energy is below the Cherenkov threshold, for example, the case that the muon is produced by a charged pion decaying at rest. Such sub-events are searched for with a peak finding algorithm and reconstructed separately in later processes.

The kinematics of the charged particles are reconstructed from the timing and the number of detected photons of each photosensor signal by using a maximum likelihood algorithm54. The likelihood consists of the probability of each photosensor to detect photons or not and the charge and timing probability density functions of the hit photosensors. This new reconstruction algorithm makes use of the timing and charge information obtained by all the photosensors simultaneously, which leads to better kinematic resolutions and particle-classifying performances compared to the previously used reconstruction algorithm.

The five signal samples are formed by using the reconstructed event kinematics. All the selected events are required to have little photosensor activity in an outer veto detector, and the reconstructed neutrino interaction position is required to be inside the inner detector fiducial volume. The reconstruction improvement enabled us to extend the fiducial volumes used in the analysis. We performed a dedicated study to optimize the fiducial volume to maximize T2K sensitivities to oscillation parameters taking into account both the statistical and systematical uncertainties. The position-dependent SK detector systematics are estimated by using SK atmospheric neutrino interaction events. The fiducial volume expansion contributes to the increase of selected electron-like (muon-like) events by 25% (14%)12.

Systematic uncertainties regarding SK detector modelling were addressed by various control samples. Uncertainties on the position reconstruction bias and on the decay electron tagging are estimated by using cosmic-ray muons stopping inside the inner detector. Simulated atmospheric neutrino events are compared to data to evaluate systematic uncertainties on the modelling of signal selection efficiencies and the background contamination of the five analysis samples. Parameters describing possible mis-modelling of Cherenkov ring counting and particle identifications are introduced and constrained by a fit to the control samples. These parameters are varied according to the posterior distribution from the fit to the control samples, and the uncertainties on the T2K samples of interest are evaluated. The uncertainty on the modelling of the efficiency of selecting events with neutral pions, which is one of the dominant backgrounds in the electron neutrino CCQE-like event sample, is estimated by constructing a set of hybrid events that combine one data-derived electron-like Cherenkov ring and one simulated electron-like Cherenkov ring to imitate the decay of a neutral pion. The uncertainties on the numbers of selected total events in SK are 2–4% depending on the signal categories. As shown in Extended Data Table 1, the relative uncertainty on the predicted number of electron neutrino and electron antineutrino candidates for samples with no decay electrons is 1.5%.

Statistical methods

We use a binned likelihood-ratio method comparing the observed and predicted numbers of muon-neutrino and electron-neutrino candidate events in our five samples. In neutrino beam mode these are electron-like, muon-like and electron-like charged pion samples, whereas in antineutrino beam mode these are electron-like and muon-like samples. The samples are binned in reconstructed energy and, for the electron-neutrino-like samples, the angle between the lepton and the beam direction. In particular, best-fits are determined by minimizing the sum of the following likelihood function (marginalized over nuisance parameters) over all of our samples:

$$\begin{array}{c}-2\,{\rm{l}}{\rm{n}}\,\lambda (\bar{{\delta }_{{\rm{C}}{\rm{P}}}};{\bf{a}})=2\mathop{\sum }\limits_{i=1}^{N}\,\left[{n}_{i}^{{\rm{o}}{\rm{b}}{\rm{s}}}\,{\rm{l}}{\rm{n}}(\frac{{n}_{i}^{{\rm{o}}{\rm{b}}{\rm{s}}}}{{n}_{i}^{\exp }})+{n}_{i}^{\exp }-{n}_{i}^{{\rm{o}}{\rm{b}}{\rm{s}}}\right]\\ +{({\bf{a}}-{{\bf{a}}}_{0})}^{{\rm{T}}}{{\bf{C}}}^{-1}({\bf{a}}-{{\bf{a}}}_{0})\end{array}$$
(4)

where \(\overline{{\delta }_{{\rm{CP}}}}\) is the estimated value of δCP, a is the vector of systematic parameter values (including the remaining oscillation parameters), a0 is the vector of default values of the systematic parameters, C is the systematic parameter covariance matrix, N is the number of reconstructed energy and lepton angle bins, \({n}_{i}^{{\rm{obs}}}\) is the number of events observed in bin i and \({n}_{i}^{\exp }={n}_{i}^{\exp }(\bar{{\delta }_{{\rm{C}}{\rm{P}}}};{\bf{a}})\) is the corresponding expected number of events. Systematic parameters are marginalized according to their prior constraints from the fit to ND280 data.

We perform both frequentist and Bayesian analyses of our data. The measurement of δCP from each of the analyses is in agreement, with the presented confidence intervals coming from a frequentist analysis and the Bayes factors and credible intervals coming from a Bayesian analysis. In the frequentist analysis a fit is first performed to the near detector samples binned in the momentum and cosine of the angle between the lepton and the beam direction, with penalty terms for flux, cross-section and detector systematic parameters at the near detector. Systematic parameter constraints are then propagated from the near to the far detector via the covariance matrix, C, in equation (4) and their fitted values. The matrix is the combination of the posterior covariance from the near detector fit with the priors for the oscillation parameters, with some parameters affecting both detectors directly, whereas others that affect only the far detector are constrained through their correlation with near-detector-affecting parameters. Gaussian priors for sin2θ13, sin2θ12, and \(\Delta {m}_{21}^{2}\) are taken from the Particle Data Group’s world combinations2, while sin2θ23 and \(\Delta {m}_{32}^{2}\) (\(\Delta {m}_{13}^{2}\)) have uniform priors in normal (inverted) mass ordering. For the Bayesian analyses the prior for δCP is uniform, with an additional check applying a uniform prior in sinδCP producing the same conclusions. Furthermore, rather than fitting the near detector and propagating to the far detector as a two-step process, the Bayesian analysis directly includes the near detector samples in its expression for the likelihood and therefore performs a simultaneous fit of the near and far detector data.

The neutrino oscillation probability depends nonlinearly on the oscillation parameters, with different possible values of δCP corresponding to a bounded enhancement or suppression of the electron (anti)neutrino appearance probability. If statistical fluctuations in the data exceed these bounds, they are not accommodated by the model, and as a result the critical Δχ2 value for a given confidence level is often different from the asymptotic rule of Wilks55. To address this problem the frequentist analysis constructs Neyman confidence intervals using the approach described by Feldman and Cousins56 and thus critical values of Δχ2 vary as a function of δCP and the mass ordering. The critical values at a given confidence level are determined by fitting at least 20,000 simulated datasets for each given true value of δCP and the mass ordering. The remaining oscillation parameters are varied according to their priors. In particular, for sin2θ13, sin2θ12 and \(\Delta {m}_{21}^{2}\) these priors are taken from the Particle Data Group2, with sin2θ13 determined by the reactor experiments noted in the main text. For sin2θ23, and \(\Delta {m}_{32}^{2}\) (\(\Delta {m}_{13}^{2}\)) the priors take the form of likelihood surfaces produced from fits of simulated datasets. The simulated datasets are generated using oscillation parameter best-fits in normal and inverted mass orderings. The remaining systematic parameters are varied according to their prior constraints from the fit to ND280 data.

The Bayesian analysis uses Markov Chain Monte Carlo (MCMC) to take random samples from the likelihood. The particular MCMC algorithm used is Metropolis–Hastings57. For a sufficiently large number of samples the Markov chain achieves an equilibrium probability distribution. The number of steps in the chain with a particular value of a parameter is proportional to the posterior probability that the parameter will have that value marginalized over all the other parameters. Credible intervals are then formed on the basis of highest posterior density, with bins of equal width in the parameter under study. Given the arbitrary initial state of the Markov chain, a finite number of samples must be obtained to allow the chain to converge to a state in which it is correctly sampling from the distribution. These preliminary ‘burn-in’ samples are discarded.