A.I. for nuclear physics

Bedaque, Paulo; Boehnlein, Amber; Cromaz, Mario; Diefenthaler, Markus; Elouadrhiri, Latifa; Horn, Tanja; Kuchera, Michelle; Lawrence, David; Lee, Dean; Lidia, Steven; McKeown, Robert; Melnitchouk, Wally; Nazarewicz, Witold; Orginos, Kostas; Roblin, Yves; Scott Smith, Michael; Schram, Malachi; Wang, Xin-Nian

doi:10.1140/epja/s10050-020-00290-x

A.I. for nuclear physics

Review
Published: 22 March 2021

Volume 57, article number 100, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

The European Physical Journal A Aims and scope Submit manuscript

A.I. for nuclear physics

Download PDF

Paulo Bedaque¹,
Amber Boehnlein ORCID: orcid.org/0000-0002-6987-5994²,
Mario Cromaz³,
Markus Diefenthaler²,
Latifa Elouadrhiri²,
Tanja Horn⁴,
Michelle Kuchera⁵,
David Lawrence²,
Dean Lee⁶,
Steven Lidia⁶,
Robert McKeown²,
Wally Melnitchouk²,
Witold Nazarewicz⁶,
Kostas Orginos^2,7,
Yves Roblin²,
Michael Scott Smith⁸,
Malachi Schram⁹ &
…
Xin-Nian Wang³

2412 Accesses
32 Citations
1 Altmetric
Explore all metrics

Abstract

This report is an outcome of the workshop AI for Nuclear Physics held at Thomas Jefferson National Accelerator Facility on March 4–6, 2020

Nuclear physics midterm plan in Italy: introduction to the series

Article Open access 14 June 2023

NUSTAR – The teenage years

Article 20 February 2017

Nuclear Forces for Precision Nuclear Physics: A Collection of Perspectives

Article 10 September 2022

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Executive summary

Nuclear science is concerned with the understanding of the nature of matter, its basic constituents and their interaction to form the elements and the properties we observe. This includes the forms of matter we see around us and also exotic forms such as those that existed in the first moments after the Big Bang and that exist today inside neutron stars. The techniques, tools, and expertise needed for nuclear physics (NP) research are therefore diverse in nature. State-of-the art accelerators are being developed to illuminate the dynamical basis of the core of the atom in terms of the fundamental constituents called quarks and gluons and to increase the number of isotopes with known properties. This scientific infrastructure is reaching scales and complexities that require computational methods for tasks such as anomaly detection in operational data. New methodologies are needed to detect anomalies and to optimize operating parameters, predict failures as well as to discover new optimization algorithms.

Artificial Intelligence (AI) is a rapidly developing field focused on computational technologies that can be trained, with data, to augment or automate human skill. Over the last few decades AI has become increasingly prominent in all sectors of everyday life, largely due to the adoption of statistical and probabilistic methods, the availability of large amounts of data, and increased computer processing power.

The US government is initiating a broad-based, multidisciplinary, multi-agency program to build a sustained national AI ecosystem. Based upon two decades of research, development, and planning, the US government recognizes the importance of AI to advances in technology, national security and national infrastructure [1]. The national AI Initiative [2] provides a framework to establish a national strategy for US leadership in AI. Key areas of emphasis include: investments in AI research and development, unleashing AI data and resources, setting Government standards, and building the AI workforce. Several workshops and committees have identified the scientific opportunities for AI, as well as challenges from the intersection of AI with data-intensive science such as NP and high-performance computing. The present report is based on the “AI for Nuclear Physics Workshop” held in March of 2020 and outlines ongoing AI activities, possible contributions the NP community could make to identify and fill possible gaps in current AI technologies and needs to benefit NP research programs. The Workshop brought together the communities directly using AI technologies and provided a venue for identifying the needs and commonalities.

For the purpose of this report we define Artificial Intelligence (AI) to broadly represent the next generation of methods to build models from data and to use these models alone or in conjunction with simulation and scalable computing to advance scientific research. These methods include (but are not limited to) machine learning (ML)^{Footnote 1}, deep learning (DL)^{Footnote 2}, statistical methods, data analytics, and automated control.

AI has tremendous potential within NP Research. It can provide new insights and discoveries from both experimental and computational data produced at user facilities. All top priorities of the 2015 Long-Range Plan on Research Opportunities and Directions [3] can benefit from AI. A common theme is to investigate and apply AI methods with well-understood uncertainty quantification, both systematic and statistical, to accelerator science, NP experimentation, and NP theory. At the same time, a number of activities and technologies in the diverse NP research portfolio has the potential to contribute to the emerging AI programs. For example, NP presents data on short time scales and with many different configurations that expose the limitations of current methods and could contribute to making AI more interpretable for the long term.

A general characteristic of the application of AI in NP is the identification of small changes in patterns (statistical variations) in multi-dimensional and highly-correlated data (parameters, channels). This process includes the evaluation of models where one can use AI methods to identify the most promising computational pathways where AI determined parameterizations can be used to avoid performance-limiting sections. Traditional AI tools have been applied successfully to some of these problems, in particular image classification. However, NP data are very diverse and to address the most interesting challenges more science insight has to be built into current AI technologies and AI tools have to be tuned to optimize performance in each application domain. Furthermore, NP data volume and complexity is increasing at a rapid pace. To take full advantage of AI in NP will thus require investments and changes in methodology for the provisioning of computing and handling of data. This in turn will require adequate computing resources, e.g., access to GPU computing and disk storage at appropriate scales.

AI has the potential to transform NP. However, to fully realize AI contributions to NP, and vice versa, close collaboration among universities, technology companies, national laboratories, and other government agencies will be essential. Such collaboration will be required to bring, for example, state-of-the art AI techniques to the NP community. Workforce development is key to increase the level of AI-literacy in NP. The challenges are similar to those outlined in the NSAC Report on ‘Nuclear Physics and Quantum Information Science’ [4] and include educational activities, creation of a community of AI knowledgeable researchers, and collaboration between NP and AI experts. Cross-disciplinary partnerships can help facilitate these connections. The list of Community identified Needs and Commonalities for AI Research essential for NP Applications as identified at this Workshop are presented below and also appear in more detail in Sect. 2 of the report:

(i)
Need for workforce development: There is a need to develop and sustain an AI capable workforce within NP.
- Need for educational activities in AI: The goal is to retain talented students in AI-related fields and to help them to secure employment in a wide range of careers, thus ensuring that the new techniques and concepts developed in NP laboratories are widely disseminated.
- Need for broader community: It is essential to have a community of researchers knowledgeable in AI technologies.
- Need for collaborations: Long term commitment to partnerships between NP researchers and experts in AI/ML/Data Science is crucial as it takes time–for all parties involved–to learn the language and methods.
(ii)
Need for uncertainty quantification: The evaluation and comparison of uncertainty predictions using different modalities is required for widespread use of AI in NP.
(iii)
Need for appropriate use of industry standard tools: significant effort is required in the careful tuning of ML tools (hyperparameter determination) to optimize performance in each application domain.
(iv)
Need for problem-specific tools: the most interesting challenges that can be approached in NP with AI will require approaches that go beyond industry standard tools.
(v)
Need for comprehensive data management: To maximize the usefulness of the data, it will be important to have standards on the processing of data, the application of theoretical assumptions, and the treatment of systematic uncertainties that will be used as training samples or as part of combined analysis. This meta-data will be encoded in the datasets.
(vi)
Need for adequate computing resources: AI techniques are computationally intensive and success in using these techniques will require access to GPU computing and disk storage at appropriate scales.

2 Priority research directions

One aspect of the workshop was to explore areas where the application of AI could have a profound impact on Nuclear Physics Research. This section summarizes those research directions. Additional detail can be found in the Summary of Workshop Sessions.

2.1 Future prospects

Accelerator design and operations Many areas of accelerator design and operations will benefit from investments in AI and ML technologies.

Optimized design of accelerator systems. Development and validation of virtual diagnostics (e.g. longitudinal phase space monitors or predictors); Design and simulation of novel accelerators, and advanced engineered materials; Optimized diagnostic design and deployment; Improvement to beam sources and injector performance.
Improving facility performance and user experience. Data-driven beam generation, transport, delivery optimization; Automated learning for operator support; Hardware acceleration of ML in distributed control systems; Anomaly detection and mitigation (eg. LLRF, beam diagnostics); System health monitoring (e.g., targets, cryoplant); Data driven system maintenance.

Holistic approach to experimentation As a long-term vision, disparate data sources (such as accelerator parameters, experimental controls, and the detector data) would be intelligently combined and interpreted to improve experiments. Real time analysis and feedback will enable the quick diagnostics and optimization of experimental setups. ML expert systems can increase the scientific output from the beam time allocated to each experiment.
Experiment design not limited by computation Future experimental advances in accelerator-based NP research hinges on increased luminosity, which provides the statistics necessary to observe rare processes. ML methods will reduce computational barriers to reach this goal. Intelligent decisions about data reduction and storage are required to ensure the relevant physics is captured. Depending on the experiment, AI can improve the physics content of the data taken through data compactification, sophisticated triggers (both software and hardware-based), and fast-online analysis.
Improving simulation and analysis Improving simulation and data analysis using ML techniques is proceeding with two general aims: (i) to use these new techniques to improve the sensitivity of current instruments and accuracy of the data, and (ii) to decrease the time simulations and analyses takes allowing for faster turnaround time to produce scientific results. Improving sensitivity allows more information to be extracted from datasets, which decreases uncertainty in results and increases discovery potential. Decreasing simulation and analysis time, saves costs and ultimately allows for a higher volume of scientific output by accelerating the feedback loop between experiment, analysis, and theory.
Game changer in nuclear theory A number of case studies have been identified. They are listed in the following.

Sign problem in LQCD	The application of Monte Carlo techniques to systems at finite density (as in nuclear matter), real-time evolution (transport coefficients) and light-cone evolution (parton distribution functions) are hindered by the fermionic sign-problem. AI methods have begun to be applied, both in supervised and unsupervised learning modes. Potentially radical advances can be expected along this direction once the full power of AI is unleashed in this problem
Extraction of physical observables	To extract quantities of interest from correlation functions computed in LQCD in some cases requires the solution of an ill-defined inverse problem. AI methods now being applied to tackling the relevant inverse problems are showing great promise for achieving important milestones in our understanding of hadron structure from first principles

Propagator inversion in LQCD	The computation of observables in LQCD requires the calculation of quark propagators in the background of a large number of gauge configurations. Mathematically this requires the inversion of a large matrix. ML methods are beginning to be used to take cheaper inversions, done with low precision, and recovering the full precision propagator, with enormous savings in computer resources
Bayesian inference and global QCD analysis	Recent progress in ML with deep learning is enabling the development of new tools to advance the science of femtography, which shows great promise for high-precision determination of hadronic structure combining all available experimental data. Such approaches will be necessary for determining 3D nucleon tomography
Identifying rare events	In the current approach to data taking and analysis, rare events, which can often represent major discoveries, can be easily overlooked when analysing data with preset ideas about what one is looking for. AI/ML can be used to generate events, with known theoretical parameters and models, and then compare the experimental stream readout with the pre-prepared theory expectations, to identify unusual or unexpected events that can be set aside for more focused study later
Microscopic description of nuclear fission	Various ML tools will help by dramatically speeding-up many-body simulations of nuclear fission by means of fast emulators for constrained density functional theory calculations in many-dimensional collective spaces; action minimization in the classically forbidden regions; new tools for dissipative dynamics; and computing of missing fission data
Origin of elements	A quantitative understanding of astrophysical processes responsible for the existence of elements requires knowledge of nuclear properties and reaction rates of thousands of rare isotopes, many of which cannot be reached experimentally. The missing nuclear data for astrophysical network simulations must be provided by massive extrapolations based on nuclear models. For some quantities such as nuclear masses, Bayesian ML has shown promise when aiming at informed predictions including both a reduction of extrapolation errors and quantified bounds

Quantified computations of heavy nuclei using realistic inter-nucleon forces	Predictions for heavy and very heavy nuclei such as Pb-208 using A-body approaches based on realistic two- and three-nucleon interactions with full uncertainty quantification will be enabled by Bayesian calibration using pseudo-data from microscopic calculations with supervised ML
Discovering correlations and emergent phenomena	Unsupervised learning can be used to discover correlations in nuclear wave functions based on microscopic Hamiltonians. There are terabytes of data from calculations with nucleonic degrees of freedom that can be data mined to discover emergent phenomena such as clustering, superfluidity, and nuclear collective modes such as rotations and vibrations
Development of a spectroscopic-quality nuclear energy density functional	Predictive and quantified nuclear energy density functional rooted in many-nucleon theory is needed. This development constitutes a massive inverse problem involving a variety of AI tools. The resulting spectroscopic-quality functional–crucial for understanding of rare isotopes–will properly extrapolate in mass, isospin, and angular momentum to provide predictions in the regions where data are not available
Equation of state of quark-gluon plasma in heavy-ion collisions	To move extracting equation of state (EoS) with ML from simulated data to real experimental data, techniques need to be developed for mapping with discretized data with detector acceptance and efficiencies. Accelerated simulations with finite baryon density are needed for classifying EoS in heavy-ion collisions at the beam energy scan (BES) at RHIC
Neutron star and dense matter equation of state	Data from intermediate-energy heavy-ion collisions and neutron-star merger events can be explored using AI tools to deduce the nuclear matter equation of state. ML classification tools can also be used in conjunction with calculations of infinite nucleonic matter to map out the phase diagram and associated order parameters

2.2 Community identified needs and commonalities

AI has tremendous potential within the context of NP Research. However, the current AI tools and methodologies have limitations that have to be addressed for the long term.

Need for workforce development There is a need to develop and sustain an AI capable workforce within NP. This challenge is similar to the workforce development challenge for Quantum Information Sciences, outlined in the ‘NP and Quantum Information Science’ Report.

Educational activities in AI: To this end, there is an urgent need to develop a range of outreach, recruitment, and educational activities. These activities will serve to raise interest in AI-related fields. The goal is to retain talented students in AI-related fields and to help them to secure employment in a wide range of careers, thus ensuring that the new techniques and concepts developed in NP laboratories are widely disseminated.
- \(\bullet \) University-wide AI courses: There is a need for inter-disciplinary AI courses involving Applied Mathematics, Statistics, and Computer Science experts, as well as domain scientists.
- \(\bullet \) Graduate Fellowships are proven tools that enable the development of a well-educated workforce and could be used to good effect in the area of AI.
Need for broader community: To achieve the goals outlined by the community, it is essential to have a community of researchers knowledgeable in AI technologies.
- \(\bullet \) A centralized community based forum could provide a common foundation to build our technologies, allow for quick dissemination of new techniques, and provide a bridge from available AI resources to NP related applications.
- \(\bullet \) Successful inter-disciplinary research require mechanisms such as the ability to create joint faculty/staff appointments. Given the wide range of use cases, such appointments would be beneficial at many institutions engaged in the NP Research Portfolio.
Need for collaborations: Collaboration with ML/AI/Data Science experts over a long-term is essential to successfully bring state of the art AI techniques to the NP community. Long term commitment to partnerships between NP researchers and experts in AI/ML/Data Science is crucial as it takes time – for all parties involved – to learn the language and methods.

Need for problem-specific tools The current surge in AI has provided great advances in software tools and hardware that can provide the basis of ML systems used in data processing. Readily available off the shelf solutions are well suited for several types of problems, particularly image classification. However, NP applications are unique in that they are often aimed at accelerating calculation, whether in the evaluation of models where one can use AI techniques to identify the most promising calculative pathways to simulation where AI-determined parametrizations can be used to circumvent performance-limiting elements. While traditional ML tools may be applied to these problems, significant effort is required in the careful tuning of ML tools (hyperparameter determination) to optimize performance in each application domain.

Enabling Infrastructure for AI in NP Taking full advantage of AI for NP will require investments and changes in methodology for the provisioning of computing and handling of data. Two particular areas concern data management and provisioning for resources.

Need for standardized frameworks: The development of standardized frameworks such as ExaLearn and CANDLE have been extremely beneficial in other domains, and could provide a model for NP. It may be possible to adapt existing frameworks.

Need for comprehensive data management: AI techniques are reliant on large volumes of data for training and the subsequent evaluation of models. For this reason, applications of AI are dependent on effective data management. Such data could be sourced from theoretical calculation, simulation, or experiment. Providing accessibility of the data to the wider NP community and increasing uniformity in data representation would create a connectivity across experiments that could increase collaboration and accelerate the development of AI techniques and tools. Such AI techniques could also facilitate near real-time calibration and analysis. To maximize the usefulness of the data, it will be important to have standards on the processing of data, the application of theoretical assumptions, and the treatment of systematic uncertainties that will be used as training samples or as part of combined analysis. This meta-data will be encoded in the datasets.

Need for adequate computing resources AI techniques are computationally intensive and success in using these techniques will require access to GPU computing and disk storage at appropriate scales.
Need for uncertainty quantification A common theme is to investigate and apply AI methods with well-understood uncertainty quantification, both systematic and statistical, to accelerator science, NP experimentation, and NP theory. The commonly used ML algorithms do not provide error estimations with model predictions, which are essential to understand outcomes. In addition, an evaluation of metrics for the evaluation and comparison of uncertainty predictions using different modalities is required for widespread use of AI in NP.

3 Workshop overview

The AI for Nuclear Physics Workshop was held at Thomas Jefferson National Accelerator Facility March 4–6, 2020. The intent of the workshop was to make a broad survey of current AI projects in NP and to gather community driven input towards establishing priority research directions, areas of commonality across the NP community (and beyond), and general needs, including workforce development. The agenda focused on plenary sessions in the morning with topical working sessions in the afternoon, with most of the presentations available from the agenda. 184 people attended the workshop. The AI for Nuclear Physics Workshop Agenda focused on summaries of status of the usage of AI in Nuclear Theory, Nuclear Experiment and Accelerator Science and operations. The connection between the scientific goals outlined in the Nuclear Science Advisory Committee long range plan [3] and AI was presented by Tim Hallman, Department of Energy Associate Director for the Office of Nuclear Physics. A second focus was the connection to broader efforts within DOE, including overview talks from the DOE Artificial Intelligence Technology Office, a summary of the AI for Science Townhall process, and a summary of the NeuroData without Borders Project [5] and the Exascale Computing Project applications ExaLearn and CANDLE [6] projects.

An adjunct hackathon event was held on March 3, 2020. 8 teams each with four members participated. The challenge problem was drawn from a common task in NP, measuring the properties of charged particles traversing a detector. The challenge was structured as progressive, with five sub-challenges. To enable evaluation of the success of the teams, an automated scoring system and leader board was developed, with the top two scoring teams being awarded prizes. The computational approaches and tools used by the teams had significant variation, demonstrating that creativity in problem solving remains a feature of research undertaken with AI Events such as this can be useful for furthering skills in AI for participants who already have basic knowledge.

4 Summary of workshop sessions

To serve as a record of the discussions, the conveners of the working group sessions have prepared summaries based on the workshop discussions and presentations. The discussions reflect independent deliberations, and consequently some differences of opinion. A list of the working groups and conveners are listed in the Appendix B. As a note, due to conflicting workshops, some NP communities were not properly represented at this workshop. Where possible, contributions from those communities were solicited and appear at the end of this section.

4.1 Lattice QCD and other quantum field theories

Lattice field theory is a cornerstone of all subfields of NP, from nuclear structure to hadronic physics, heavy-ion collisions, and neutron stars. It is based on the Monte Carlo evaluation, in one guise or another, of the quantum path integral. Despite enormous successes achieved in the last few years, computing power currently prevent us from addressing many of the central questions of NP.

Lattice calculations are divided into the generation of gauge configurations, calculation of the observables of interest and data analysis. Artificial intelligence techniques have begun to be applied to all these stages as well as extending the applicability regime of lattice techniques.

4.1.1 Case studies and future prospects

Sign problem: The application of Monte Carlo techniques to systems at finite density (as in nuclear/neutron matter), real-time evolution (transport coefficients) and light-cone evolution (parton distribution functions) are hindered by the famous sign-problem. It has been realized recently that the sign problem can be solved or ameliorated by evaluating the path integral not over real fields but over a manifold deformed into complex space instead. Up to now, the choice of manifolds has been guided by either impossibly expensive calculations or (human) insight into particular models. AI methods have begun to be applied, both in supervised and unsupervised learning modes [7,8,9,10,11,12]. Potentially radical advances can be expected along this direction once the full power of AI is unleashed in this problem.

Configuration generation: The usually most expensive part of a Monte Carlo calculation is the generation of configurations through the use of a Markov chain where, at each step, a new configuration is proposed and accepted or rejected with a probability depending on the new and old configurations. The practical feasibility of the method relies on being able to propose configurations that are significantly different from the old one while at the same time are likely to be accepted. The method used almost universally in QCD is the hybrid Monte Carlo algorithm (invented by the lattice QCD community and now widely used in all branches of science) becomes extremely expensive as the continuum limit is approached. A significant effort is being put into using different AI techniques to create algorithms to make better proposals, more decorrelated and more “acceptable”, in order to speed up the process [13, 14]. The training of the algorithms is accomplished either by the use of configurations generated by standard algorithms or, more ambitiously, through fully unsupervised learning. The basic ideas of such algorithms are already developed by the AI community and used for various applications in the engineering and software industry.

Propagator inversion: The computation of observables in lattice QCD requires the calculation of quark propagators in the background of a large number of gauge configurations. Mathematically this requires the inversion of a large matrix and, in some applications, like the extraction of nuclear forces, it can be the most expensive part of the calculation. Machine learning methods are beginning to be used to take cheaper inversions [15], done with low precision, and recovering the full precision propagator, with enormous savings in computer resources.

Observables: The extraction of physical observables from correlation functions computed in lattice QCD in some cases requires the solution of an ill-defined inverse problem. Such problems include the computation of parton distribution functions, generalized parton distribution functions, and transverse momentum dependent distribution functions, as well as the extraction of spectral densities and scattering phase shifts. These observables are the prime objective of the JLab 12 GeV program where the 3D structure and spectrum of hadrons are studied, as well as the heavy-ion physics community. AI methods are now being applied to tackling the relevant inverse problems to address these physics goals showing great promise for achieving important milestones in our understanding of hadron structure from first principles.

4.1.2 Enabling discoveries/what is needed

All the work summarized in this section is exploratory. The potential is enormous although at this time, the AI techniques are not yet competitive with the standard in the field numerical investigations of quantum field theories. That said, success in a single one of the approaches has revolutionary potential in the field. The approach for AI studies is based on toy models and small lattices where novel ideas can go through the cycle of implementation/testing/improvement very quickly. This requires a model of support that favors small, flexible groups, fosters informal communication between researchers both within NP and the AI community while keeping the field attractive to young people who may have options to pursue a career in the private sector.

4.2 Low-energy nuclear theory

4.2.1 Current status

ML applications of layered feed-forward networks to modeling nuclear masses and other observables were carried out in the early 1990s [16, 17]. But it is only fairly recently that the AI tools have been more broadly adopted by nuclear theorists and applied to various problems in nuclear structure and reactions. The main areas of modern AI applications are the following: fast emulation for big simulations; revealing the information content of measured observables with respect to current theory; identifying crucial experimental data for better constraining theory; revealing the structure of theoretical models by means of advanced parameter estimation and model reduction; uncertainty quantification of theoretical results; and improving the predictive capability by assessing extrapolations, as theoretical models are often applied to entirely new nuclear systems and conditions that are not accessible to experiment.

A variety of AI/ML tools have been used: various flavors of neural networks, Bayesian calibration, Bayesian model averaging, radial basis function, and support for vector machines. The application areas include interpolation and extrapolation of nuclear masses [19,20,21,22,23,24,25,26,27,28], charge radii [29, 30], excited states [31,32,33], beta decay [34, 35], alpha decay [36, 37], fission yields [38, 39], nucleon–nucleon phase shifts [40], pion-nucleon coupling constants from nucleon-nucleon scattering [41], scattering in the unitary limit [42], three-nucleon scattering [43], neutron-alpha scattering [44], nuclear reaction cross sections [45,46,47,48,49], estimates of effective field theory truncation errors [18, 50, 51], estimates of basis truncation errors [52, 53], model calibration and reduction [54,55,56,57,58,59,60], and variational calculations [61].

The low-energy nuclear theory community has been involved in educational efforts in the area of AI. Examples are summer schools, courses, and conferences, including a series of annual meetings on enhancing the interaction between nuclear experiment and theory through information and statistics (ISNET).

4.2.2 Case studies and future prospects

The following case studies are examples of high-impact science that can be enabled by AI.

Microscopic description of fission. Modern many-body approaches to fission [62], aided by AI, will provide a predictive description of fission that will produce data for heavy-element research, nuclear astrophysics, and stockpile stewardship. Here, AI-tools will help on several levels, including: development of emulators for constrained density functional theory calculations in many-dimensional collective spaces [54, 63], action minimization in the classically forbidden regions, description of dissipative dynamics, and the use of neural networks to compute incomplete fission data [38, 39].

Origin of heavy elements. The astrophysical rapid neutron capture r-process responsible for the existence of many heavy elements is predicted to involve many elements that are close to the neutron drip line; the structure of these very exotic nuclei thus directly impacts how elements are produced in stellar nucleosynthesis [64]. A quantitative understanding of the r-process requires knowledge of nuclear properties and reaction rates of \(\sim \)3,000 very neutron-rich isotopes, many of which cannot be reached experimentally. The missing nuclear data for astrophysical simulations must be provided by massive extrapolations based on nuclear models augmented by the most recent experimental data. Here, ML, with its unified statistical treatment of all uncertainties, can make informed predictions for some of the relevant quantities that reduce extrapolation errors and quantified bounds [21, 23, 24, 27].

Quantified computations of heavy nuclei using realistic inter-nucleon forces. Predictions for heavy and very heavy nuclei such as \(^{208}\)Pb using A-body approaches based on realistic two- and three-nucleon interactions with full uncertainty quantification will be enabled by Bayesian calibration using pseudo-data from microscopic calculations with supervised ML [57, 58].

Development of a spectroscopic-quality nuclear energy density functional: Predictive and quantified nuclear energy density functional rooted in many-nucleon theory [65] will be developed. This task constitutes a massive inverse problem [66] involving a variety of AI tools. The resulting spectroscopic-quality functional—crucial for understanding of rare isotopes—will properly extrapolate in mass, isospin, and angular momentum to provide predictions in the regions where data are not available.

Discovering nucleonic correlations and emergent phenomena. Unsupervised learning can be used to discover correlations in calculations of nuclear wave functions that use a microscopic Hamiltonian. There are terabytes of data from calculations with nucleonic degrees of freedom that can be data mined to discover emergent phenomena such as clustering [67,68,69,70], superfluidity [71], and nuclear rotation [72].

Neutron star and dense matter equation of state Data from intermediate-energy heavy-ion collisions and neutron-star merger events can be explored using AI tools to deduce the nuclear matter equation of state [73,74,75]. ML classification tools can also be used in conjunction with calculations of infinite nucleonic matter to map out the phase diagram and associated order parameters.

4.2.3 Enabling discoveries/what is needed

The low-energy nuclear theory community is eager to embrace the diverse toolbox offered by AI. Progress in the field could be accelerated by deploying additional resources to meet the most important needs.

Need for collaborations. Many barriers can be overcome by establishing collaborations that have long-term perspective. Considering the low level of AI literacy in the community, access to ML/AI/Data science experts is essential. (Semi-)Permanent access to experts in AI/ML/Data Science is crucial as it takes time—for all parties involved—to learn the language and methods. The best solution is to hire a AI/ML/Data Science expert as a joint faculty (or postdoc). Funding mechanisms should be defined to support local and national collaborations in NP and ML/AI/Data science.

Need for inter-disciplinary research. Inter-disciplinary research is popular but making it succeed is difficult. Disciplinary boundaries mitigate against hiring ML/AI/Data Science experts involved in NP research. The silo mentality, especially in academia, is a serious problem and is hurting innovation. Formal mechanisms must address the issues of how scholarship is assessed and how teaching is assigned and evaluated, particularly before tenure. Programs should be established to fund AI/NP bridge positions at universities; this would help to create joint faculty appointments at many institutions.

Need for a comprehensive approach to AI education. There is, at present, only a patchwork of AI educational efforts in the low-energy nuclear-physics community. A coherent approach to AI education, involving multiple university departments, such as Physics, Statistics, and Computer Science, is needed. While online courses can be effective, they cannot replace regular in-person lectures. Establishing graduate fellowships in the area of ML/AI/Data science applied to NP problems would enable the development of a well-educated workforce in this area. Some universities have “dual Ph.D.” programs that allow individual students to work within two different graduate programs. Certificates in AI/ML are a less intensive but still beneficial approach to this problem.

4.3 Accelerator science and operations

We identify three distinct areas where AI/ML could improve the reliability and performance of the NP accelerator facilities while reducing the operational cost. These areas are:

Accelerator and material design optimization
Provenance and prognostication for accelerator sub-systems
Dynamic optimization of real time operation controls

Although these areas can be investigated independently, providing an “optimal automated accelerator” would require all areas.

4.3.1 Accelerator and material design optimization

Computational techniques lay at the center of accelerator design. Modern simulation codes are capable of self-consistent tracking \(10^{9}\) charged particles through complex, nonlinear external field environments, and in modeling interactions with materials. Highly developed and benchmarked engineering codes are employed to design and optimize acceleration structures, high power beam targets, vacuum systems, plasma and solid-state devices for instrumentation.

ML/AI techniques are coming into common use during the design stage to facilitate studies of complex beam dynamics in search of optimum lattices and working point tunes, to study novel schemes for cooling hadron beams, to improve diagnostic schemes for beam measurements, to create performance gains in high intensity and high brightness beam sources, to name but a few [76].

Reinforcement learning and Bayesian optimization are techniques that can be used to explore large design parameter spaces. However, in order for these techniques to provide reliable and optimal solutions they need to be configured and tuned for the specific application. An incorrect kernel selection used in a Gaussian Process technique can lead to disastrous results. Similarly, using a sub-optimal search strategy and/or policy model architecture in reinforcement learning will converge to sub-optimal result. Therefore, it’s critical to build or leverage a framework, such as CANDLE and ExaRL, to improve the chances of an optimal solution.

4.3.2 Provenance and prognostication for accelerator sub-systems

Scientific productivity at accelerator-based NP facilities is directly impacted by unscheduled losses of beam time. The trip rate (see Fig. 3) is attributable to multiple causation factors that vary in frequency and severity. Some of the main causes are due to excessive beam losses detected by the Machine Protection System (MPS) and to loss of RF cavity control (RF). Machine learning tools for anomaly detection have been deployed at CEBAF [77], and other laboratories [78] to monitor trends in system behaviors precursor to faults.

Design of beam loss monitor networks using Correlation and Principal Component Analysis (PCA) [79] is used to determine optimum locations to place beam loss diagnostics to monitor for all known loss mechanisms in specific beamlines. Unsupervised learning techniques are used to detect faulty beam position monitors that determine beam trajectories [80].

Beyond effects that directly influence beam delivery to experiments, ML techniques are being considered to assist in other critical operational aspects. Predictive schemes for equipment maintenance can be used to proactively identify components requiring attention prior to critical need. Cryogenic production and distribution will benefit from online monitoring and predictive capabilities provided by supervised and unsupervised learning by quickly detecting unplanned helium losses and alerting operators.

The current efforts leverage existing ML frameworks and tools. However, a detailed integration for verification, validation, and reproducibility have not been developed. Additionally, there are no current efforts to integrate uncertainty quantification into the machine learning pipeline. Finally, implementing domain aware ML, when appropriate, could provide better forward prediction models for failure and anomaly detection. These components will be critical to provide a a full featured and reliable monitoring and prognostication system.

As more sub-systems are integrated into a comprehensive monitoring/logging framework, managing the data-load will become increasingly important. These large-scale online data sets faces a range of challenges, including multi-modal and multi-frequency high-dimensional, noisy, and uncertain input data.

4.3.3 Dynamic optimization of real time operation controls

Frontier accelerator facilities such as FRIB and EIC will require years of operational experience to fully develop functional capabilities at their design level. AI/ML techniques are in use to improve the control over particle beams, incorporating Reinforcement Learning (RL) techniques within the accelerator control system [Schram FNAL]. Particle Swarm techniques have been tested to optimize the tuning of aperiodic ion transport lines, and are in development for advanced particle separators [81]. Bayesian Gaussian Processes (GP) and Neural Network (NN) methods are in use to train laser-driven photoinjector facilities in one or more degrees of freedom (Fig. 4).

AI/ML activities are being pursued at many NP accelerator facilities and at associated universities. These activities are mainly oriented towards addressing local issues, and are performed by individual scientists or small teams with or without direct support from data science experts. Strategic development and deployment of AI/ML techniques across the DOE complex has high leverage of performance for investment.

Similar to the design optimization effort, techniques such as reinforcement learning can be use to explore the large control parameter space to dynamically optimize for real time system. Leveraging existing frameworks, such as CANDLE and ExaRL, to optimize the learning will be important, however, additional safeguards will be required to ensure that the policy network model doesn’t diverge while in a real time system. The ability to process the data in a timely manner will be critical to the applicability of these techniques. Leverage and making advancement in cutting edge technology will provide the ability to deploy better models in real time systems.

4.3.4 Summary and Final Thoughts

We identify specific areas of accelerator design and operations that would benefit from investments in AI and ML technologies.

Data capture and streaming Developing a comprehensive data capturing and streaming framework will be critical to maximize the utility of the AI/ML tools. Having enough time series data from relevant sensors will be be required to build causal models that properly account for system lags, etc. As we gain confidence and understanding in these AI/ML models, moving them closer to the sensor will allow facilities to automate parts of the operations yielding reduced downtime and operational cost. Development in AI/ML at the edge (FPGAs, etc) and model robustness will be vital. As the NP community expands its use in AI/ML will require access to greater resources to train AI/ML models. Data aggregation and distribution to these compute resources will be an important factor.
Uncertainty quantification and robustness The need to associate uncertainties with the AI/ML predictions is critical for all efforts. However, it’s particularly important when applied to Scientific User Facilities. AI/ML applications for anomaly detection and fault prediction require a quantifiable estimation of uncertainty to determine the proper coarse of action and trade-off (false positives, etc.).
Optimized design of accelerator systems

Development and validation of virtual diagnostics (eg. longitudinal phase space monitors or predictors). Design and simulation of novel accelerators, and advanced engineered materials. Optimized diagnostic design and deployment and improvement to beam sources and injector performance.
Improving facility performance and user experience

Data-driven beam generation, transport, delivery optimization. Automated learning for operator support. Hardware acceleration of ML in distributed control systems. Anomaly detection and classification and mitigation (eg. LLRF, beam diagnostics); System health monitoring (eg. targets, cryoplant); Data driven system maintenance.
Benchmark techniques on standard models; dedicated accelerator studies Dedicated studies on machines and diagnostic support. Identify specific beamlines, injectors and accelerator facilities to facilitate design and implementation of technologies, algorithms, data pipeline structures.
Develop capability in AI/ML for computing at the edge (FPGA, etc.) Moving AI/ML workflows closer to the sensor will allow for computing resources to be leveraged and distributed where necessary, allowing for high density data transfers to be conducted locally with reduced load on facility networks.
AI cookbook of techniques and Data Science workshops/training Development of a community standardized toolkit for training AI/ML scientists and provide answers to commonly encountered issues.

4.4 Experimental methods

4.4.1 Current status

AI applications to experimental applications are being developed across the subfields of NP. In some experiments which like those depending on image analysis, AI techniques have been successfully applied. This includes the time projection chamber experiments and neutrino experiments [82,83,84,85]. Work has also been done to analyze jet substructure [86], and in detector rejection methods [87]. Current efforts expand upon this work, building on existing AI technologies.

Significant AI endeavors in experimental NP have been in tracking in various detector setups, as highlighted above. Two examples are track classification in the Active-Target Time Projection Chamber at the FRIB and track selection in the CLAS12 drift chambers at Jefferson Lab. Figures 5 and 6 demonstrate two benefits that AI leverages over traditional methods. In the first, classification machine learning methods were used to improve data selection over traditional cut methods. In the second, equivalent accuracy was achieved with AI methods, but with significant (6x) speedup over traditional fitting methods.

4.4.2 Case Studies and Future Prospects

Holistic approach to experimentation As a long-term, “moonshot” vision, disparate data sources would be intelligently combined and interpreted to improve experiments. Data sources include accelerator parameters, experimental controls, and the detector data itself. Real time analysis and feedback enables the quick diagnostics and optimization of experimental setups. Accelerator-based, quick-turnaround experiments are a unique challenge in NP. ML expert systems can increase the scientific output from the beamtime allocated to each experiment. Ideally, this holistic approach can be applied to the design of the experiment itself by optimizing machine and detector properties as a single system.

Experiment design not limited by computation Future experimental advances in accelerator-based NP research hinges on increased luminosity, which provides the statistics necessary to observe rare processes. ML methods will reduce computational barriers to this goal. Intelligent decisions about data storage is required to ensure the relevant physics is captured. Depending on the experiment, AI can improve the data taken through data compactification, sophisticated triggers (both software and hardware-based), and fast-online analysis.

An example would be the incorporation of neural networks in the FPGAs which comprise the front-end triggers of complex experiments. The very large channel counts afforded by modern semiconductor detectors combined with high beam luminosity yield data rates that can be prohibitively demanding. Incorporating intelligent triggers with very low latency early in the signal processing chain makes this data challenge more manageable. Furthermore such triggers could act as classifiers allowing for anomaly detection on the data stream prior to the trigger decision flagging interesting events that would normally be silently discarded.

Improving analysis As seen in Sect. 4.4.1, improving data analysis using ML techniques is currently proceeding with two general aims:

to use these new techniques to improve the sensitivity of current instruments and accuracy of the data, and
to decrease the time such analysis takes, allowing for faster turnaround time to produce scientific results.

Improving sensitivity creates more accurate datasets, which decreases uncertainty in results and increases the potential for discovery. Decreasing analysis time saves costs and allows for a higher volume of scientific output.

Uncertainty quantification A near term goal is to apply AI methods with well-understood uncertainty quantification, both systematic and statistical, to experimental methods. The dominant ML algorithms used in experimental HEP and NP do not provide error estimations with model predictions, which are essential to understand experimental results. In addition, an evaluation of metrics for the evaluation and comparison of uncertainty predictions from different models is required for widespread use of AI in experimental NP.

4.4.3 Enabling discoveries/what is needed

Educate and build a broader community To achieve the experimental goals outlined by the community, we must build a community of researchers knowledgeable in AI technologies. This would be greatly facilitated by centrally located, NP-supported and maintained educational resources and tutorials. Centralized resources allows for: a common foundation to build our technologies, quick dissemination of new techniques, and a bridge from available AI resources to NP related applications.

Build an infrastructure for AI/ML scientists in the NP community. This includes laboratory positions, the establishment of university collaborations, and joint positions.

Standardized data formats In order to collaborate and use AI tools effectively it is important to standardize the way we present data to these systems. Most AI tools in current use are created by industry or large open source projects with established communities. Taking on common data formats and workflows allows us to move with these communities (and each other) more quickly and effectively.

4.5 Event generation and simulation

4.5.1 Current status

Simulations of physics processes and detector response are required in NP to design experiments, develop and verify analyses, and compare to theory. They are also used in theory and phenomenology to simulate data and investigate theory advances. High-precision measurements at CEBAF, RHIC, the upcoming EIC and other NP facilities require simulations with high-precision and high accuracy. Achieving the statistical accuracy needed is often computationally intensive with the simulation of the shower evolution in calorimeters being a prime example. As alternative, fast simulations with parameterizations of detector response or other computationally efficient approximations are pursued. However, they still lack the accuracy required for high-precision measurements. Here, AI provides a promising alternative. Fast generative models, e.g., GANs or VAEs, are being utilized to model physics processes and detector responses accurately and accelerate simulations. Beyond that, Bayesian optimization is applied for tuning simulations and detector design, with AI-optimized detector design being emerging for the EIC.

4.5.2 Case studies and future prospects

Accelerate simulations High-Energy Physics has used AI, in particular GAN-based architectures, to successfully accelerate detector simulations. In some cases, in particular in case of calorimeters, the models can be directly applied to fast simulations in NP. In many cases, e.g., for particle identification detectors, new approaches to fast particle identification can be developed as, e.g., shown for Cherenkov detectors [89]. The resulting fast turnaround time for simulations with high-precision and high-accuracy will allow for rapid improvements of the physics reach and detection capabilities of NP experiments.

HPC utilization NP experiments have few payloads appropriate to the Leadership Computing Facilities, in particular for the upcoming exascale supercomputers where accelerator technologies are being applied extensively. AI is the best near-term prospect for using accelerated hardware efficiently. Physics and detector simulations based on AI would be an ideal payload for the Exascale Computing Project.

AI-driven detector design Advanced detector design requires performing computationally intensive simulations as part of the detector-design optimization process. Nowadays there are various AI-based global optimization procedures, e.g., reinforcement learning or evolutionary algorithm. Among these, Bayesian Optimization has gained popularity for its ability of performing global optimization of black-box functions which additionally can be noisy and non-differentiable. For example, an automated, highly-parallelized, and self-consistent framework based on Bayesian Optimization has been recently developed [90], where a PID detector for the future EIC has been considered as a case study. These studies showed an improvement in performance and provided useful hints on the relevance of different features of the detector. The same procedure can be applied to any other detector, or even combination of detectors. Also, costs can be added as parameter in the detector-design optimization process.

AI for event generators Monte Carlo event generators describe collision processes through a combination of theory and phenomenological models. AI approaches can be applied to experimental data and map out the underlying probability distributions governing the spectrum of final-state particles in a given process. This information can be used to construct event generators in a model-independent way, providing unique ways to quantitatively test the validity of theoretical assumptions or models. Such an event generator would store the same information as that contained in the experimental data and can be viewed as compact data storage utility. A prototype event generator is currently being developed with the ETHER (Empricailly Trained Hadronic Event Regenerator) project, as illustrated in Fig. 7 for a comparison of Pythia generated electron-proton scattering events with those produced by a Feature-Augmented and Transformed (FAT) GAN [91].

4.5.3 Enabling Discoveries/What is Needed

AI research is multidisciplinary. An interplay of applied mathematics, computer science, and NP will facilitate the development of AI approaches to the unique questions of NP. This will allow, e.g., to design activation functions particular to NP applications or to build efficient neural networks no more complex than necessary. The multidisciplinary approach will also be helpful to understand the requirements for explainable AI and uncertainty quantification for NP simulations. To cultivate multidisciplinary AI development, access to reference data sets, as well as supplementary information for non-experts on what the NP data entails is essential.

4.6 Bayesian inference for quantum correlation functions

Determining the 3-dimensional “tomographic” structure of the proton and nuclei in terms of the elementary quark and gluon (or parton) degrees of freedom of QCD remains one of the central challenges in modern NP. A fundamental complication in this endeavor is the fact that quarks and gluons always remain confined inside hadrons and never observed directly in experiments. This constitutes a classic “inverse problem”: how to reliably infer the quantum correlation functions (QCFs) that characterize hadron structure and the emergence of hadrons in terms of partons from the experimental data — Fig. 8.

Existing approaches to extract QCFs, such parton distribution functions (PDFs), fragmentation functions (FFs), transverse momentum dependent distributions (TMDs) or generalized parton distributions (GPDs), from data rely on Bayesian likelihood inference, coupled with suitable parametrizations of the distribution functions on the internal parton momenta. The complexity of mapping between the large quantities of high-precision data expected from JLab 12 GeV (as well as from the future EIC) and the multidimensional QCFs, many of which have never been been explored, will require the creation of a new paradigm in order to assess the impact of the data. An important opportunity therefore exists for utilizing AI/ML techniques to develop the next generation of QCD analysis tools that can more efficiently map between observables and QCFs and maximum the science output from future facilities.

4.6.1 Current status

Historically the extraction of 1-dimensional QCFs, such as PDFs or FFs, has relied on the maximum likelihood method, which is adequate for cases involving a small number of distributions, but can introduce significant bias and error when applied to more complicated problems involving multidimensional functions. Current state-of-the-art analyses seek to overcome these problems by employing Monte Carlo sampling (NNPDF [92] and JAM [93] Collaborations) to take into account the multiple solutions, and simultaneously determining various types of QCFs which appear in different observables to account for feedback effects [94].

Other examples of state-of-the-art techniques currently employed for 1-D QCF studies include the use of neural net methodology for proton PDFs [92], and the application of generative adversarial networks (GANs) for mapping PDFs [95]. In the transverse momentum sector, the first global TMD analysis was performed recently [96] using the JAM MC methodology extended to the 3-D sector. Exploratory studies of fitting GPDs with neural networks were made for a limited set of deeply-virtual Compton scattering data [97], and recently the more general approach of parametrizing Compton form factors (integrals of GPDs) with neutral nets has been explored [98]. Finally, as lattice QCD simulations at physical quark parameters are becoming more feasible, synergies between global QCD analysis of experimental data and lattice results are being actively explored [99], including the first attempts to perform simultaneous fits to measured cross sections and lattice matrix elements of nonlocal operators, whose Fourier transforms are related to PDFs.

4.6.2 Case studies and future prospects

The history of applying ML tools to study the hadron substructure is rather brief. A recent example used neural nets to construct a universal Monte Carlo event generator (UMCEG) for electron-proton scattering, that is free of theoretical assumptions about underlying particle dynamics [91]. This project, funded by the Jefferson Lab LDRD program, applied generative adversarial network (GAN) technology to simulate particle production at the event level. A new feature-augmented and transformed GAN (FAT-GAN) was developed to select a set of transformed features from particle momenta (generated directly by the generator), and use these to produce a set of augmented features that improve the sensitivity of the discriminator. The new FAT-GAN was tested on pseudodata generated by the Pythia event generator [100], and was able to faithfully reproduce the distribution of final state electron momenta in inclusive electron scattering. The FAT-GAN strategy can be generalized to GANs for simulating other reactions under different conditions, as well as learning exclusive events, and alternative strategies, for example using convolutional neural networks (CNNs), can also be explored.

Another important recent application of AI has been to the development of inverse mapping methodology using machine learning for Bayesian inference of QCFs — see Fig. 9. Two machine learning prototypes have been explored, based on a mixture density network and a parameter-supervised autoencoder, which have been tested and validated first on a toy model for inclusive DIS, and subsequently on a real global analysis of DIS data. The prototypes were found to be capable of mapping PDFs to within 1-\(\sigma \) CL, consistent with those found in recent global Monte Carlo fits [93]. Extension of the methodology to the 3-D sector remains an important future challenge.

4.6.3 Enabling discoveries/what is needed

To maximize the potential benefit from AI for QCF inference studies, collaboration between QCD physicists and machine learning experts is needed in order to translate the domain knowledge of QCD into generic problem definitions that can be addressed with cutting-edge AI technology. To this end, the creation of joint positions between NP and AI will promote cross-disciplinary fluency in both fields.

The development of an interactive web-based global analysis platform to perform global QCD analysis “on the fly” will allow users to study how different setups (choice of specific data sets or kinematic regions, or improvements on data uncertainties from future facilities, such as the EIC) can affect the inferred QCFs. The vision is to move from the limited paradigm where QCFs are numerically tabulated at interpolation grids, with rigid connections between the data and QCFs, to a more flexible paradigm where QCFs can be generated dynamically from user input.

The creation of such web-based analysis infrastructure would be a valuable tool for the NP community, but will require identifying the most efficient computing platform to host such a service and computing resources for its realization. There is also a critical need for production-level hardware resources to enable the analysis of the large quantities of high-precision data expected from new experimental facilities, in order to understand the deep connections between the data and the QCFs.

4.7 Additional contributions received

4.7.1 Relativistic heavy ions

At extremely high temperature or density, quarks and gluons become deconfined and form a new state of matter – Quark Gluon Plasma (QGP). One can study this matter through high energy nuclear collisions at Relativistic heavy ion collider (RHIC), Large hadron collider (LHC) and other facilities, as well as computer simulations by analyzing the four-momenta and species of final state particles produced in each single collision. The dynamical evolution of the collision systems can be described by hybrid models with relativistic hydrodynamics and hadronic cascades at different stages of the collision. One can infer the initial state of the collision and the intermediate evolution from comparisons the data on final state particles from experiments and simulations.

AI plays an important role in compressing the high dimensional heavy-ion collision data to low dimensions, extracting the model parameters and their uncertainties with Bayesian analysis, classifying the equation state, regressing the initial nuclear structure or in solving partial differential equations of relativistic hydrodynamics using deep neural networks. These AI applications are described in the following.

Compressing data to low dimensions Many experimental observables are designed to compress complex high energy nuclear data to low dimensions using simple projection, statistical mean, variance and correlations along a few directions. Unsupervised learning algorithms such as PCA is widely used in the field of high energy nuclear physics, to automatically extract the most informative features in data. PCA can be used to determine the magnitude of different longitudinal fluctuation modes [101], which helps to constrain the initial state entropy deposition along the beam direction in heavy-ion collisions. Since the initial state fluctuations of entropy density in the transverse plane is converted to final state correlations of particles in momentum space, the collectivity and anisotropy of final state particles along the azimuthal angle direction are quantified by the flow harmonics \(v_n\). The \(v_3\) factorization breaking is well described using 2 initial state fluctuation modes given by PCA and a linear hydrodynamic response [102]. PCA also rediscovers flow harmonics [103] which are originally computed from Fourier decomposition.
Bayesian analysis to extract QGP properties Bayesian analysis uses the likelihood between low dimensional experimental data and model output to constrain model parameters, such as the QCD equation of state [104]. The prior QCD EoS used in hydrodynamics is parameterized to cover the physical equation of state functional space.The posterior distribution of the EoS agree with lattice QCD calculations. To take into account the effect of other entangled parameters, Trento + iEBE-VISHNU + UrQMD model is used to do a global fitting using Bayesian analysis [105,106,107]. The clear peak structure in the posterior distributions of model parameters indicates non-zero shear and bulk viscosity of the QGP. When high energy partons traverse through QGP, they loss energy by elastic scattering and gluon radiations. The Bayesian analysis is also used to constrain the heavy quark diffusion coefficients [108], the light quark \(\hat{q}\) [109] and the jet energy loss distribution [110].
Jet classification in heavy ion collisions The applications of neural network was used in 1996 to determine the impact parameter of heavy-ion collisions [111], with a one-hidden layer neural network. Various architectures of deep neural network are used in jet flavor classification for proton-proton collisions. However, the applications to heavy-ion jet classification is rare. The classification performance worsens due to soft gluon radiations affecting soft jet substructure [112]. Recently a point-cloud-like network called particle/energy flow network is employed in jet flavor classification [113] and is used to design new physical observables for heavy-ion jets [86].
Classification for nuclear phase transition Beam energy scan (BES) project aims to locate the QCD critical point that separates the first order phase transition and smooth crossover in the QCD phase diagram by colliding heavy ions at various energies. Deep convolution neural network is used to classify these two different nuclear phase transition regions [114] using relativistic hydrodynamic simulations of heavy ion collisions. The phase transition type used in the equation of state is encoded in the evolution and deep neural network helps to decode this information from the complex final state output of heavy-ion collisions. Although there is entropy production and information loss, the network succeeds in classifying nuclear phase transition types with approximately 93% accuracy. Deep convolution neural network uses images as input, a more natural representation of the heavy-ion data of a list of particles with their four momenta, pid and charge information. Point cloud network is a perfect architecture for this data structure. A recent study uses point cloud network to classify Spinodal and Maxwell constructions of the first order phase transition [115].
Regression for nuclear-shape deformation Most heavy ions used at RHIC and LHC are deformed. The collisions of deformed nuclei produce complex correlations between charged multiplicity and anisotropic flow. Using Monte Carlo simulation data, a 34-layer residual network is used to predict the values of nuclear shape deformations [116]. The network succeeds in predicting the magnitude of nuclear shape deformations but not their signs, which indicates that there is a degeneracy between high-energy collisions of prolate-prolate and oblate-oblate nuclei.
Interpretation and explanation Interpretation is important in understanding what has been learned by the black box deep neural network. In the classification task for nuclear phase transition, a prediction difference analysis algorithm is used to locate the most important phase space regions in the input for classification. In the regression task for nuclear shape deformation, a regression attention mask algorithm is developed to highlight the regions that are important for the decision making.
Accelerate relativistic hydrodynamic simulations Accumulating data in heavy-ion collisions is slow. Stacked-UNet is used to solve relativistic hydrodynamic equations [117]. The time evolution of the energy density and fluid velocity from neural network method agree with 2+1D viscous hydrodynamics. The trained network can solve hydrodynamic equations 600 times faster than numerically solving partial differential equations. As a comparison, the GPU parallelization brings 60 to 100 times speed up.

Current study of heavy-ion collisions with machine learning have used data set generated with model simulations. To apply these techniques to real experimental data, one has to taken into account the acceptance and efficiencies of the detectors. This can be accomplished through incorporation of the characteristics of the detector in the model simulations which are used to train the network for final application to real experimental data. In the meantime, advance in the accelerated model simulations with more realistic physics scenarios are needed for more robust AI studies.

4.7.2 Project 8

The Project 8 collaboration is developing an experiment to measure the absolute neutrino mass with cyclotron radiation emission spectroscopy (CRES). The event reconstruction process for Project 8 can be framed as a challenge of feature recognition in noisy data, where the features to find are the electron tracks and how they are grouped together. The Project 8 collaboration has studied two uses of machine learning to improve track and event reconstruction. The first application was to differentiate different types of tracks by their characteristics [118]. Figure 10 shows an electron event with five visible tracks. The four sideband tracks and one visible main-carrier track are labeled. We first analyzed individual tracks and extract parameters like slope and power density, and then applied a Support Vector Machine to distinguish three track populations: main carrier tracks with high pitch angles (the angle of the electron’s momentum relative to the magnetic field in the experiment), main carrier tracks with low pitch angles, and sidebands. Having this information can help in reconstructing events, avoiding problems that might occur when particular tracks are not observed, like the missing main carrier in Fig. 10.

Machine learning can be applied to Project 8 data to identify tracks, as well. We are developing a method for identifying tracks using a Convolutional Neural Network (CNN). This particular task is a straightforward application of a CNN with a U-Net architecture. Such a tool, once optimized, will be used to do the initial optimization of the tracks in events such as Fig. 10. While the initial application of the CNN to Project 8 data is straightforward, there are a variety of details to establish, such as accounting for all of the necessary track topologies, and understanding the efficiency of detection.

4.7.3 NEXT

The NEXT neutrinoless double beta decay program has as its primary physics goal discovering or severely limiting parameter space for the Majorana nature of neutrinos in 136Xe decays. NEXT will undertake this search in a staged program of high pressure gas xenon Time Projection chambers (HPgXeTPCs), culminating in a multi-ton detector that will be effectively background-free.

HPgXeTPCs, because of the benefits of gaseous xenon (perhaps with a He additive) including its small Fano factor, allow to see the topology of the double beta decay while achieving sub 1\(\%\) energy resolution. The technology and the path to the necessary low background model has been demonstrated in a small detector NEXT-NEW. A future design of the 1-ton-scale High Definition (HD) design is shown below, along with a typical double beta event in simulation with its Bragg peaks at the end of each track. The exquisite topological information in these detectors calls out for Deep Convolutional Neural Nets (DNNs) to perform tasks such as signal and background classification and, in fact, full semantic segmentation-based event reconstruction.

The collaboration has already published [119] work on DNNs applied to NEXT-NEW data. A team is now at work on its Summit allocation to extract optimal sensitivity from simulated ton-scale designs. The team has already shown its effective use of sparse DNNs in similar highly-parallel applications on Summit, and early work is already bringing benefits to the extremely promising (multi) ton NEXT program.

4.7.4 WANDA

The sequence of steps whereby nuclear data is compiled, evaluated, processed, and incorporated into applications is referred to as the “Nuclear Data Pipeline””. The pipeline provides the critical connection between laboratory measurements and their eventual use in models of reactors, isotope production, detectors for non-proliferation, supernova explosions, radchem networks, and many other systems. To improve and supercharge this pipeline, the nuclear data community has extensive needs, including: more rapid, accurate, and robust evaluations; quicker compilation of data and accompanying contextual information from published experimental work; robust methods to optimize experimental design for verification, validation, and benchmarking; wider use of realistic physics models in transport simulations; and reproduction of the results of complex multi-physics codes via fast-execution surrogate models. AI/ML tools have tremendous potential to address all of these critical needs. During the recent Workshop for Applied Nuclear Data Activities (WANDA) [120], the nuclear data community has recently identified a number of key areas in which AI/ML advances have already made significant impacts and show substantial promise both in the short term and long into the future. Targeted investments are needed now to fully realize the potential of AI/ML in nuclear data, preferably by leveraging AI/ML advances in other areas for use in nuclear physics and simultaneously driving AI/ML innovations. Some of the many areas to emphasize include:

Using AI/ML tools to identify systematic trends in nuclear data that were missed by human evaluators, and developing AI/ML emulators to incorporate complex physics models into evaluations, so that evaluations can be more robust, new physics can be uncovered, and predictive power can be improved.
Exploiting AI/ML tools to process complex relationships between nuclear data and integral experiments to develop rigorous validation approaches, so that that AI/ML tools can be confidently deployed in nuclear energy, nuclear security, and other applications where safety is paramount.
Quantifying the intrinsic uncertainties of AI/ML tools, so that their results can be fully integrated into the nuclear data UQ process that is critical to the validation, verification, benchmarking, and normalization activities widely utilized across nuclear data activities.
Developing a new, standardized, QA-vetted, well-characterized database of nuclear information including UQ that can be easily input into AI/ML codes, so that AI/ML advances can be more quickly used across the nuclear data pipeline.
Collecting and sharing fitted models, training data, and notes on their applicability and limitations, so that the reproducibility of AI/ML results can be enhanced and advances from across disciplines can be best leveraged for the widest utilization.
Using AI/ML tools to both develop surrogate physics models and use them to sequentially search and optimize over a wide space of experimental design, so that the most impactful data are targeted and collected more efficiently, and so that specific deficiencies in data needed for robust evaluations are avoided.
Developing natural language processing (NLP) tools to automatically compile new results, so that errors in data entry can be reduced, consistency checks can be facilitated, expert validation and verifications can be quickly done, and database insertion can be seamlessly performed.
Fostering collaborations between nuclear researchers and AI/ML experts, so that appropriate algorithms can be efficiently determined for a given problem and subsequently trained, tuned, and deployed for maximum scientific impact while minimizing biases or unphysical results.

5 Cross cutting topics

The breakout sessions of the workshop focused on topics in NP with the knowledge that there are commonalities between these topics, and indeed, across many scientific domains. These ’cross cutting’ areas span the spectrum from the development of methodologies and mathematics for AI approaches, the need for sophisticated data management and curation, and means to establish an AI cognizant workforce. The following section outlines a number of these cross cutting topics.

5.1 Statistical methods and tools

Statistics and statistical methods are based on probability spaces, defined in terms of sets and probability measures. They aim to provide a better understanding and quantified characterization of a given set of data. Data mining uses statistics as well as other methods to find patterns in order to explain phenomena. Machine learning uses data mining and other learning algorithms in order to predict future outcomes, and AI uses models based by machine learning to make intelligent decisions. As the application of AI to NP is in the early stages, the integration of statistical methods and uncertainty quantification into more advanced AI applications is still under development.

5.1.1 Overview of approaches in NP

In lattice QCD and other lattice field theories, AI has been used for configuration generation, propagator inversion, observables, and overcoming the sign problem. Among the various statistical methods utilized, Jensen–Shannon divergences have been used to distinguish gauge field ensembles using deep neural networks [121]. Bayesian neural networks have been used for spectral reconstruction [122] and reconstructing parton distribution functions [123]. Machine learning regression errors for parton distribution functions has also been quantified using bias correction and bootstrap resampling [124]. Bayesian inference and other statistical methods applied to model parameterizations of partonic structure are key to extracting parton distribution functions, fragmentation functions, transverse momentum dependent distributions, and generalized parton distributions.

In low-energy nuclear theory, Bayesian methods have been used across a variety of different problems for uncertainty quantification. This includes Bayesian calibration for nucleon–nucleon phase shifts [40] and direct nuclear reactions [45,46,47]; Bayesian Gaussian processes for truncation errors in effective field theory [18, 51] and uncertainties in neutron-alpha scattering and three-body parameters [44]; Bayesian calibration for A-body calculations [59] and mass models [54, 60]; Bayesian extrapolations [23]; Bayesian model averaging [19, 24, 25]; and Bayesian neural networks for r-process beta decays [34], alpha decays [36, 37], and spallation cross sections [48]. Bayesian regularization as well as other approaches have been used for uncertainty quantification in applying neural networks to applications such as the extrapolation of truncation errors in nuclear structure calculations [52, 53] and variational methods [61].

AI applications to experimental NP are being developed across the subfields of NP. Experiments that map well to existing AI technologies, such as image analysis problems, have demonstrated success in NP. Examples include time projection chamber experiments and neutrino experiments [82,83,84,85]. Work has also been done to analyze jet substructure [86], and in detector rejection methods [87]. Current efforts are expanding upon this work, building on existing AI technologies. In the future one would like to apply AI methods to experimental methods with systematic and statistical uncertainty quantification. Similarly, Bayesian optimization will also be extremely useful for tuning event simulations and detector design.

AI techniques in accelerator science and operations have been adopted and utilized for some time. Early usage of simulated annealing and genetic algorithm techniques [125, 126] were applied to optimize the distribution of pure permanent magnets in extended undulator assemblies for SASE FELs. SVD techniques have been used to optimize steering control in storage rings [127]. Genetic algorithms and optimization techniques have been used to brightness of electron photoinjectors [128] and electron synchrotrons and storage rings [129]. Recent reviews [76, 130] have identified uses of artificial neural networks, convolutional neural networks, Bayesian optimization, reinforcement learning, random forest, and other methods in accelerator controls [131], longitudinal phase space prediction [132], anomaly detection in SRF cavities and beam diagnostics [77, 80], FEL performance enhancement [133], etc. Current efforts are expanding in all areas of accelerator control, optimization and design, diagnostics and prognostics.

5.1.2 Use of current tools

The current surge in AI has provided great advances in software tools and hardware that can provide the basis of machine learning systems used in data processing. Readily available off the shelf solutions are well suited for basic classification problems, particularly for images. Analysis of experimental data however, requires regression networks that often need careful tuning to specific problems and data sets. In addition, scientific results require well understood systematic uncertainties in values obtained from any analysis. For example, charged particle tracking requires not only a 5 parameter state vector, but also a 15 parameter diagonal covariance matrix to represent its uncertainties. These are needed as inputs to kinematic fitting routines which combine constraints imposed by physics with the experimentally measured values in order to achieve optimal resolutions. Scientific results also require study to ensure no bias is introduced by the analysis technique. More so than is needed by industrial applications.

5.2 Collaborations and collaborative activities

The importance of community and collaboration were a cross cutting theme in the workshop. There are distinct types of collaboration, each of which is beneficial.

5.2.1 NP communities of practice

Relative to other communities, the NP scientists have relatively few communities of practice that enable knowledge exchange on technical topics or the ability to articulate the requirements for community developed and supported tools. This is in contrast to, for example, the HEP community, which has several sanctioned or funded activities that focus on computing, including the HEP Software Foundation (HSF) (sponsored by CERN), the IRIS-HEP collaboration (funded by NSF) and the Forum for Computational Excellence (funded by OHEP). HSF in particular was instrumental in developing computing focused white papers for the the European Strategy for Particle Physics.

The scientific expertise to approach the challenges in NP lies within the NP community. For that reason, communities of practice within NP to share knowledge computing knowledge could be invaluable tools towards addressing common challenges. These are happening on a small scale, such as the Jefferson Lab AI lunch series and the Monthly Computing Round Table hosted jointly by BNL and Jefferson Lab. Community based groups could serve as a clearing house for training opportunity announcements and similarly to the HSF, as a tool for organizing community white papers. One of the outcomes of the AI for NP Workshop is the establishment of a proto-community that came together to produce this report.

Several concrete actions that could be undertaken by a community of practice is developing a portal for community based A.I. training resources and the development of AI recipe books. Another topic could be a discussion around data management standards. Extending these activities to include AI experts would be beneficial to creating a much needed community to leverage the rapid advancement of methods and tools in the AI/ML communities.

5.2.2 Engagement with data science community

The NP community recognizes the importance of engaging the data science community to develop technologies that enable innovation in NP. Research groups have begun collaboration with computer scientists with demonstrated success [91]. However, a broader effort to formally collaborate with the AI scientists can advance AI technologies in NP while taking advantage of unique aspects of data in NP to inform innovation in AI. Fostering such collaboration is essential for long term success in developing AI techniques that realize the potential for impacting NP challenges.

In order to best interact with the AI community, both parties must identify and engage in mutually beneficial research topics. This requires education and interaction of the two fields. To maximize collaboration, laboratories and institutions can create an infrastructure in which AI scientists are an integral part of the field. This can be accomplished through joint projects that includes well-defined metrics of success for an AI scientist working in a physics field. The collaboration will be mutually beneficial, with the AI work not considered a service, but as a true collaboration. This can be evidenced by nuclear physicists and AI scientists publishing together, whether in physics or AI journals. Building a merged community of physicists and data scientists brings challenges in nuclear physics data analysis to the consideration of AI researchers as they develop new methods. This will allow AI technology to advance in line with our community’s needs.

6 Engagement with ASCR

For the past few years, the U.S. Department of Energy, Office of Science program in Advanced Scientific Computing Research (ASCR) has been conducted several workshops directly and indirectly focused on AL/ML which resulted in several reports.

In January 2018, the ASCR Basic Research Needs workshop on Scientific Machine Learning [134] identified six priority research directions (PRDs). The first three focused on the foundation research themes: (1) Domain-awareness, (2) Interpretable, and (3) Robust. Within the NP community, the use of domain aware ML to leveraging scientific domain knowledge by enforcing physical conservation law and governing equations was identified. Additionally, providing robust ML solutions is important for scientific research and critical when deployed at scientific user facilities (SUFs). The last three focused on capability research themes: (4) Data-Intensive, (5) Enhanced Modeling and Simulation, (6) Intelligent Automation and Decision Support. All three PRDs of these items have clear applications within the NP community. For example the semi-automation of emerging SUFs could significantly reduce operational cost and downtime.

Although not explicitly focused on ML, ASCR convened a workshop on in situ data management (ISDM) on January 28–29, 2019 [135]. The goal of the ISDM workshop was to consider in situ data management to support traditional and future scientific computing needs. Six PRDs were identified: (1) Pervasive, (2) Co-designed, (3) In Situ Algorithms, (4) Controllable, (5) Composable, and (6) Transparent. These priorities are of particular interest to DOE NP since they could directly feed into the existing and future facilities, such as FRIB and the EIC. In particular, the need for provenance and reproducibility was explicitly linked to the development and use of ML. Additionally, in situ algorithm and controllable ISDM would enable semi-automated SUFs.

On June 5, 2019, the DOE Office of Science (SC) organized a one day workshop centered on the topic of Data and Models: A Framework for advancing AI in science report [136]. Three priority opportunities were identified: (1) democratize access to benchmark science data, (2) make AI operational in science with composable services and (3) address open questions in AI with frameworks. Providing a Findable, Accessible, Interoperable, and Reusable (FAIR) dataset and composable tools would accelerate the ability for members of the NP community to develop new algorithms and efficiently train them using these services.

Between July and October 2019, four town hall meetings dubbed “AI for Science” were conducted to discuss and identify the scientific needs and opportunities across a diverse collection of domains (biology, physics, mathematics, accelerators, computing, etc.). Some of the most notable grand challenges for NP included: Automate and/or optimize the operation of accelerators and detector systems; Improve experimental design and real time tuning.

Finally, it was identified that the NP community could benefit from using existing AI/ML solutions by leveraging existing ASCR investments. For example, the Exascale Computing Project (ECP) has created tools that can accelerate computationally expensive tasks. For example, using CANDLE to perform large scalable hyper-parameter optimizing scans could potentially significantly improve on existing results. Similarly, the ECP ExaLearn project is now developing scalable tools to address common AI/ML challenges such as developing surrogate models, inverse problems, and automated design and controls challenges. These tools could save a significant amount of development time and allow the NP community to focus on solving domain specific challenges.

As the NP community expands its use in AI/ML it will require access to greater computing resources to train AI/ML models. The NP community should leverage the existing ASCR computing facilities and develop a data aggregation and distribution community plan.

7 The importance of data management

AI techniques are reliant on the quantity and quality of the data and for this reason, applications of AI are likely to result in a paradigm shift in data management. Accessibility of the data to the wider NP community would create a connectivity across experiments that could increase collaboration. Viewing data as a valuable commodity impacts decisions on how data from experiments and simulations is collected, cataloged and accessed. AI techniques could also facilitate near real-time calibration and analysis.

As mentioned in several of the summaries, current analysis techniques often ’flatten’ the experimental data. To maximize the usefulness of the data, it will be important to have agreements and documentation on ’processing’ of experimental data, the application of theoretical assumptions and the treatment of systematic uncertainties that will be used as training samples or as part of combined analysis. All relevant information about the data will have to be stored with the data. This should trend towards the development of appropriate standards consistent with FAIR data principles and frameworks that capture data and metadata.

8 Workforce development

8.1 Education

There are only 26,000 AI researchers currently in the US. This is estimated to represent only a fifth of the current demand. There is an urgent need for training in AI, at a variety of educational levels and for diverse audiences. To this end,there is an urgent need to develop a range of outreach, recruitment, and educational activities. NP research will serve to raise interest in AI-related fields. The goal is to retain talented students in AI-related fields and to help them to secure employment in a wide range of careers, thus ensuring that the new techniques and concepts developed in NP laboratories are widely disseminated. Unfortunately, the current educational efforts in AI in NP—while extremely valuable—are patchwork. They include summer schools, topical programs, workshops, and conferences.

A coherent inter-disciplinary approach is needed. Several mechanisms were discussed at the Workshop aiming at improving the situation.

University-wide AI courses There is a need for inter-disciplinary AI courses involving Applied Mathematics, Statistics, and Computer Science experts, as well as domain scientists. Online courses play important role, but the in-person approaches are superior.
Graduate Fellowships Establishing graduate NP/AI fellowships, similar to, e.g., DOE Computational Science Graduate Fellowship or DOE NNSA Stewardship Science Graduate Fellowship, would enable the development of a well-educated workforce in this area.
Dual Ph.D. Programs Some universities allow “dual Ph.D.” programs that allow individual students to work within two different graduate programs. Students start graduate school in their primary department, and then enter such a program by arranging a secondary affiliation upon choice of a research project and advisor. Certificates in AI/ML are a less intensive but still beneficial approach to this problem.
Educational Outreach Opportunities AI practice is inherently interdisciplinary and an effort should be made to introduce the AI field to young physicists, computer and data scientists, mathematicians, and others in related fields as they choose their career paths. Conferences and Workshops play an important role in this cross-pollination. For example, workshop organizers received a grant from the National Science Foundation that funded travel for 18 graduate- and undergraduate-students and early career professionals, most of whom indicated they would not have been able to attend the workshop without this support. The pre-workshop hackathon provided students with an opportunity to creatively collaborate on a problem solving competition related to AI.

9 The level of AI literacy

The interest in the workshop was very good: as many as 184 scientists came to the meeting and many attended remotely. According to the data gathered by the Workshop’s questionnaire, around 40% participants are new to AI, 70% would like to apply techniques from this workshop, and 40% actively working on project using AI. These numbers well reflect the current situation: many nuclear physicists understand the potential benefits of AI, but there is a steep learning curve.

Considering the current efforts, more sophistication in using AI tools is needed. Indeed, majority of NP users apply off-the-shelf tools; fewer understand the AI glossary and make informed choices about the modern AI tools that suit their problem best. Even fewer practitioners are advanced users or innovators who consider uncertainty quantification to be an essential part of the answer and/or consider the full feedback between AI and physics problem (AI application is modified depending on the physics outcome).

In short, at this point, NP community at large does not fully grasp the depth of the AI universe with the majority of work being carried out by users often helped by enthusiastic undergraduate and graduate students. But the foundations are there: nuclear physicists have good technical background and they are used to problem-driven approaches to tool selection. This helps in choosing the best/right tools for the problems. One has to remember, however, that the newest AI tools are almost always largely untested. It takes some experience to know which tools to use. Simply understanding that this is true will help nuclear physicists avoid dangerous pitfalls.

How can the level of AI literacy be improved? As discussed in Sect. 5.2 the fastest route to an AI-educated community involves easy access to ML/AI/Data science experts. In the long-term, education of younger generation is essential. Several mechanism to improve the situation in this area are proposed in Sect. 8.

Data Availability Statement

This manuscript has no associated data or the data will not be deposited. [Authors’ comment: This paper is the proceedings of a workshop. All data presented is available in the original sources.]

Notes

Machine learning enables computers to learn from experience or examples.
Deep learning is a class of ML algorithm that are composed of multiple hidden layers.

References

The National Artificial Intelligence Research and Development Strategic Plan. https://www.nitrd.gov/pubs/national_ai_rd_strategic_plan.pdf. Accessed 11 Mar 2020
Artificial intelligence for the American People website. https://trumpwhitehouse.archives.gov/ai/. Accessed 11 Mar 2020
Reaching for the Horizon: The 2015 Long Range Plan for Nuclear Science (2015). https://www.osti.gov/biblio/1296778
Nuclear Physics and Quantum Information Science. https://science.osti.gov/-/media/np/pdf/Reports/NSAC_QIS_Report.pdf la=en&hash=91703C70429F2B7D634CBC10573079858926141D. Accessed Oct 2019
Neurodata without Borders project web site. https://www.nwb.org
CANDLE project web site. https://candle.cels.anl.gov
A. Alexandru, P.F. Bedaque, H. Lamm, S. Lawrence, N.C. Warrington, Fermions at finite density in \(2+1\) dimensions with sign-optimized manifolds. Phys. Rev. Lett. 121, 191602 (2018a). https://doi.org/10.1103/PhysRevLett.121.191602
Article ADS Google Scholar
Y. Mori, K. Kashiwa, A. Ohnishi, Toward solving the sign problem with path optimization method. Phys. Rev. D. 96, 111501 (2017)
ADS MathSciNet Google Scholar
A. Alexandru, P.F. Bedaque, H. Lamm, S. Lawrence, Finite-density Monte Carlo calculations on sign-optimized manifolds. Phys. Rev. D 97, 094510 (2018b)
ADS Google Scholar
F. Bursa, M. Kroyter, A simple approach towards the sign problem using path optimisation. JHEP 12, 054 (2018)
ADS MathSciNet MATH Google Scholar
A. Ohnishi, Y. Mori, K. Kashiwa, Path Optimization for the sign problem in field theories using neural network. JPS Conf. Proc. 26, 024011 (2019)
Google Scholar
J.-L. Wynen, E. Berkowitz, S. Krieg, T. Luu, J. Ostmeyer, “Leveraging machine learning to alleviate hubbard model sign problems” (2020), arXiv:2006.11221 [cond-mat.str-el]
M.S. Albergo, G. Kanwar, P.E. Shanahan, Flow-based generative models for Markov chain Monte Carlo in lattice field theory. Phys. Rev. D 100, 034515 (2019). https://doi.org/10.1103/PhysRevD.100.034515
Article ADS MathSciNet Google Scholar
G. Kanwar, M.S. Albergo, D. Boyda, K. Cranmer, D.C. Hackett, S. Racanière, D.J. Rezende, P.E. Shanahan, “Equivariant flow-based sampling for lattice gauge theory. Phys. Rev. Lett. 125, 121601 (2020)
ADS MathSciNet Google Scholar
G. Pederiva, A. Shindler, Machine Learning for Hadron Correlators from lattice QCD to be published (2020)
S. Gazula, J.W. Clark, H. Bohr, Learning and prediction of nuclear stability by neural networks. Nucl. Phys. A 540, 1–26 (1992)
ADS Google Scholar
K.A. Gernoth, J.W. Clark, J.S. Prater, H. Bohr, Neural network models of nuclear systematics. Phys. Lett. B 300, 1–7 (1993)
ADS Google Scholar
J.A. Melendez, R.J. Furnstahl, D.R. Phillips, M.T. Pratola, S. Wesolowski, Quantifying correlated truncation errors in effective field theory. Phys. Rev. C 100, 044001 (2019)
ADS Google Scholar
L. Neufcourt, Y. Cao, S.A. Giuliani, W. Nazarewicz, E. Olsen, O.B. Tarasov, Quantified limits of the nuclear landscape. Phys. Rev. C 101, 044307 (2020a)
ADS Google Scholar
R. Utama, J. Piekarewicz, H.B. Prosper, Nuclear mass predictions for the crustal composition of neutron stars: a Bayesian neural network approach. Phys. Rev. C 93, 014311 (2016a)
ADS Google Scholar
R. Utama, J. Piekarewicz, Refining mass formulas for astrophysical applications: A Bayesian neural network approach. Phys. Rev. C 96, 044308 (2017)
ADS Google Scholar
Z.M. Niu, H.Z. Liang, Nuclear mass predictions based on Bayesian neural network approach with pairing and shell effects. Phys. Lett. B 778, 48–53 (2018)
ADS Google Scholar
L. Neufcourt, Y. Cao, W. Nazarewicz, F. Viens, Bayesian approach to model-based extrapolation of nuclear observables. Phys. Rev. C 98, 034318 (2018)
ADS Google Scholar
L. Neufcourt, Y. Cao, W. Nazarewicz, E. Olsen, F. Viens, Neutron drip line in the Ca region from Bayesian model averaging. Phys. Rev. Lett. 122, 062502 (2019)
ADS Google Scholar
L. Neufcourt, Y. Cao, S. Giuliani, W. Nazarewicz, E. Olsen, O.B. Tarasov, Beyond the proton drip line: bayesian analysis of proton-emitting nuclei. Phys. Rev. C 101, 014319 (2020b)
ADS Google Scholar
Z.M. Niu, J.Y. Fang, Y.F. Niu, Comparative study of radial basis function and bayesian neural network approaches in nuclear mass predictions. Phys. Rev. C 100, 054311 (2019a)
ADS Google Scholar
T. M. Sprouse, R. Navarro Perez, R. Surman, M. R. Mumpower, G. C. McLaughlin, N. Schunck, Propagation of statistical uncertainties of Skyrme mass models to simulations of \(r\)-process nucleosynthesis, Phys. Rev. C 101, 055803 (2020)
A. Pastore, D. Neill, H. Powell, K. Medler, C. Barton, Impact of statistical uncertainties on the composition of the outer crust of a neutron star. Phys. Rev. C 101, 035804 (2020)
ADS Google Scholar
R. Utama, W.C. Chen, J. Piekarewicz, “Nuclear charge radii: density functional theory meets Bayesian neural networks. J. Phys. G 43, 114002 (2016b). https://doi.org/10.1088/0954-3899/43/11/114002
Article ADS Google Scholar
Y. Ma, S. Chen, J. Liu, Z. Ren, X. Chang, Y. Gao, “Predictions of nuclear charge radii and physical interpretations based on the naive Bayesian probability classifier. Phys. Rev. C 101, 014304 (2020a)
ADS Google Scholar
S. Akkoyun, N. Laouet, F. Benrachi, Improvement Studies of an Effective Interaction for N=Z sd-shell Nuclei by Neural Networks” (2020), arXiv:2001.08561 [nucl-th]
S. Akkoyun, H. Kaya, First excited 2+ energy state estimations of even–even nuclei by using artificial neural networks (2020), arXiv:2002.08218 [nucl-th]
R.-D. Lasseri, D. Regnier, J.-P. Ebran, A. Penon, Taming nuclear complexity with a committee of multilayer neural networks. Phys. Rev. Lett. 124, 162502 (2020)
ADS Google Scholar
Z.M. Niu, H.Z. Liang, B.H. Sun, W.H. Long, Y.F. Niu, Predictions of nuclear \(\beta \)-decay half-lives with machine learning and their impact on \(r\)-process nucleosynthesis. Phys. Rev. C 99, 064307 (2019b)
ADS Google Scholar
N. J. Costiris, E. Mavrommatis, K. A. Gernoth, J. W. Clark, and H. Li, Statistical global modeling of Beta–Decay halflives systematics using multilayer feedforward neural networks and support vector machines, in 17th Symposium of the HELLENIC NUCLEAR PHYSICS SOCIETY ON NUCLEAR Physics Ioannina, Greece, May 30-31, 2008 (2008) arXiv:0809.0383 [nucl-th]
U.B. Rodríguez, C.Z. Vargas, M. Gonçalves, S.B. Duarte, F. Guzmán, Alpha half-lives calculation of superheavy nuclei with \(Q\alpha \)-value predictions based on the Bayesian neural network approach. J. Phys. G46, 115109 (2019a)
ADS Google Scholar
U.B. Rodríguez, C.Z. Vargas, M. Gonçalves, S.B. Duarte, F. Guzmán, Bayesian Neural Network improvements to nuclear mass formulae and predictions in the SuperHeavy Elements region”. EPL 127, 42001 (2019)
ADS Google Scholar
Z.-A. Wang, J. Pei, Y. Liu, Y. Qiang, Bayesian evaluation of incomplete fission yields. Phys. Rev. Lett. 123, 122501 (2019)
ADS Google Scholar
A. Lovell, A. Mohan, P. Talou, M. Chertkov, “Constraining fission yields using machine learning. EPJ Web Conf. 211, 04006 (2019)
Google Scholar
S. Wesolowski, R.J. Furnstahl, J.A. Melendez, D.R. Phillips, Exploring Bayesian parameter estimation for chiral effective field theory using nucleon-nucleon phase shifts. J. Phys. G 46, 045102 (2019)
ADS Google Scholar
P. Reinert, H. Krebs, E. Epelbaum, Precision determination of pion-nucleon coupling constants using effective field theory (2020), arXiv:2006.15360 [nucl-th]
B. Kaspschak, U.-G. Meißner, How machine learning conquers the unitary limit ( 2020), arXiv:2003.09137 [physics.comp-ph]
E. Epelbaum et al., Towards high-order calculations of three-nucleon scattering in chiral effective field theory. Eur. Phys. J. A 56, 92 (2020)
ADS Google Scholar
K. Kravvaris, K.R. Quinlan, S. Quaglioni, K.A. Wendt, P. Navrátil, Quantifying uncertainties in neutron-\({\alpha }\) scattering with chiral nucleon-nucleon and three-nucleon forces. Phys. Rev. C 102, 024616 (2020)
ADS Google Scholar
G.B. King, A.E. Lovell, L. Neufcourt, F.M. Nunes, Direct comparison between Bayesian and frequentist uncertainty quantification for nuclear reactions. Phys. Rev. Lett. 122, 232502 (2019). https://doi.org/10.1103/PhysRevLett.122.232502
Article ADS Google Scholar
A.E. Lovell, F.M. Nunes, Constraining Transfer Cross Sections Using Bayes. Theorem” Phys. Rev. C 97, 064612 (2018)
ADS Google Scholar
M. Catacora-Rios, G.B. King, A.E. Lovell, F.M. Nunes, Exploring experimental conditions to reduce uncertainties in the optical potential. Phys. Rev. C 100, 064615 (2019). https://doi.org/10.1103/PhysRevC.100.064615
Article ADS Google Scholar
C.-W. Ma, D. Peng, H.-L. Wei, Z.-M. Niu, Y.-T. Wang, R. Wada, Isotopic cross-sections in proton induced spallation reactions based on the Bayesian neural network method. Chinese Phys. C 44, 014104 (2020b). https://doi.org/10.1088/1674-1137/44/1/014104
Article ADS Google Scholar
S. Akkoyun, “Estimation of fusion reaction cross-sections by artificial neural networks. NIM B 462, 51–54 (2020). https://doi.org/10.1016/j.nimb.2019.11.014
Article ADS Google Scholar
M. R. Schindler, D. R. Phillips, Bayesian Methods for Parameter Estimation in Effective Field Theories, Ann. Phys. 324, 682–708 (2009), [Erratum: Annals Phys. 324, 2051–2055 (2009)]
J. A. Melendez, S. Wesolowski, R. J. Furnstahl, Bayesian truncation errors in chiral effective field theory: nucleon–nucleon observables. Phys. Rev. C 96, 024003 (2017), arXiv:1704.03308
W.G. Jiang, G. Hagen, T. Papenbrock, Extrapolation of nuclear structure observables with artificial neural networks. Phys. Rev. C 100, 054326 (2019)
ADS Google Scholar
G.A. Negoita, J.P. Vary, G.R. Luecke, P. Maris, A.M. Shirokov, I.J. Shin, Y. Kim, E.G. Ng, C. Yang, M. Lockner, G.M. Prabhu, Deep learning: Extrapolation tool for ab initio nuclear theory. Phys. Rev. C 99, 054308 (2019)
ADS Google Scholar
J.D. McDonnell, N. Schunck, D. Higdon, J. Sarich, S.M. Wild, W. Nazarewicz, Uncertainty quantification for nuclear density functional theory and information content of new measurements. Phys. Rev. Lett. 114, 122501 (2015). https://doi.org/10.1103/PhysRevLett.114.122501
Article ADS Google Scholar
S. Yoshida, N. Shimizu, T. Togashi, T. Otsuka, Uncertainty quantification in the nuclear shell model. Phys. Rev. C 98, 061301 (2018)
ADS Google Scholar
S. Yoshida, Nonparametric Bayesian approach to extrapolation problems in configuration interaction methods. Phys. Rev. C 102, 024305 (2020)
ADS Google Scholar
A. Ekström, C. Forssén, C. Dimitrakakis, D. Dubhashi, H.T. Johansson, A.S. Muhammad, H. Salomonsson, A. Schliep, Bayesian optimization in ab initio nuclear physics. J. Phys. G 46, 095101 (2019)
ADS Google Scholar
A. Ekström, G. Hagen, Global sensitivity analysis of bulk properties of an atomic nucleus. Phys. Rev. Lett. 123, 252501 (2019). https://doi.org/10.1103/PhysRevLett.123.252501
Article ADS Google Scholar
A. Ekström, Ab initio models of atomic nuclei: challenges and new ideas” (2019), arXiv:1912.02227 [nucl-th]
V. Kejzlar, L. Neufcourt, W. Nazarewicz, P.-G. Reinhard, Statistical aspects of nuclear mass models. J. Phys. G 47, 094001 (2020)
ADS Google Scholar
J.W.T. Keeble, A. Rios, Machine learning the deuteron. Phys. Lett. B 809, 135743 (2020)
Google Scholar
N. Schunck, L.M. Robledo, Microscopic theory of nuclear fission: a review. Rep. Prog. Phys. 79, 116301 (2016)
ADS Google Scholar
S. Akkoyun, T. Bayram, S.O. Kara, N. Yildiz, Consistent empirical physical formulas for potential energy curves of 38–66ti isotopes by using neural networks. Phys. Part. Nucl. Lett. 10, 528–534 (2013)
Google Scholar
C.J. Horowitz et al., r-process nucleosynthesis: connecting rare-isotope beam facilities with the cosmos. J. Phys. G 46, 083001 (2019)
ADS Google Scholar
R. Navarro Pérez, N. Schunck, A. Dyhdalo, R. J. Furnstahl, S. K. Bogner, Microscopically based energy density functionals for nuclei using the density matrix expansion. ii. full optimization and validation, Phys. Rev. C 97, 054304 (2018)
M. Kortelainen, J. McDonnell, W. Nazarewicz, E. Olsen, P.-G. Reinhard, J. Sarich, N. Schunck, S.M. Wild, D. Davesne, J. Erler, A. Pastore, Nuclear energy density optimization: shell structure. Phys. Rev. C 89, 054314 (2014). https://doi.org/10.1103/PhysRevC.89.054314
Article ADS Google Scholar
S. Elhatisari, N. Li, A. Rokash, J.M. Alarcón, D. Du, N. Klein, B.-N. Lu, U.-G. Meißner, E. Epelbaum, H. Krebs, T.A. Lähde, D. Lee, G. Rupak, Nuclear binding near a quantum phase transition. Phys. Rev. Lett. 117, 132501 (2016)
ADS Google Scholar
S. Elhatisari, E. Epelbaum, H. Krebs, T.A. Lähde, D. Lee, N. Li, B.-N. Lu, U.-G. Meißner, Gautam Rupak, Ab initio Calculations of the Isotopic Dependence of Nuclear Clustering. Phys. Rev. Lett. 119, 222505 (2017)
ADS Google Scholar
M. Freer, H. Horiuchi, Y. Kanada-En’yo, D. Lee, U.-G. Meißner, Microscopic clustering in light nuclei. Rev. Mod. Phys. 90, 035004 (2018)
ADS MathSciNet Google Scholar
W.G. Dawkins, J. Carlson, U. van Kolck, A. Gezerlis, Clustering of four-component unitary fermions. Phys. Rev. Lett. 124, 143402 (2020)
ADS Google Scholar
M. Sambataro, N. Sandulescu, Pairing correlations and eigenvalues of two-body density matrix in atomic nuclei. Ann. Phys. 413, 168061 (2020)
MathSciNet MATH Google Scholar
M.A. Caprio, P.J. Fasano, P. Maris, A.E. McCoy, J.P. Vary, Probing ab initio emergence of nuclear rotation. Eur. Phys. J. A 56, 120 (2020)
ADS Google Scholar
P. Morfouace, C.Y. Tsang, Y. Zhang, W.G. Lynch, M.B. Tsang, D.D.S. Coupland, M. Youngs, Z. Chajecki, M.A. Famiano, T.K. Ghosh, G. Jhang, Jenny Lee, H. Liu, A. Sanetullaev, R. Showalter, J. Winkelbauer, Constraining the symmetry energy with heavy-ion collisions and bayesian analyses, Phys. Lett. B 799, 135045 (2019)
C.Y. Tsang, M.B. Tsang, Pawel Danielewicz, F.J. Fattoyev, W.G. Lynch, Insights on Skyrme parameters from GW170817. Phys. Lett. B 796, 1–5 (2019)
Y. Lim, J.W. Holt, Bayesian modeling of the nuclear equation of state for neutron star tidal deformabilities and GW170817. EPJA 55, 209 (2019). https://doi.org/10.1140/epja/i2019-12917-9
Article ADS Google Scholar
A. Edelen, N. Neveu, Y. Huber, M. Frey, C. Mayes, A. Adelmann, “Machine learning for orders of magnitude speedup in multiobjective optimization of particle accelerator systems. Phys. Rev. Accel. Beams 23, 044601 (2020). https://doi.org/10.1103/PhysRevAccelBeams.23.044601
Article ADS Google Scholar
A.D. Solopova, A. Carpenter, T. Powers, Y. Roblin, C. Tennant, K. Iftekharuddin, and L. Vidyaratne, SRF Cavity Fault Classification Using Machine Learning at CEBAF” in Proc. 10th International Particle Accelerator Conference (IPAC’19), Melbourne, Australia, 19-24 May 2019, International Particle Accelerator Conference No. 10 ( JACoW Publishing, Geneva, Switzerland, 2019) pp. 1167–1170
M. Rescic, R. Seviour, W. Blokland, Predicting particle accelerator failures using binary classifiers. Nucl. Instrum. Methods. Phys. Res. A 955, 163240 (2020)
Google Scholar
Z. Liu, Z. He, S.M. Lidia, D. Liu, Q. Zhao, “Optimization of beam loss monitor network for fault modes, in Proc. 6th International Particle Accelerator Conference (IPAC’15), Richmond, VA, USA, May 3-8, 2015, International Particle Accelerator Conference No. 6 (JACoW, Geneva, Switzerland, 2015) pp. 1356–1358
E. Fol, J.M. Coello de Portugal, and R. Tomás, Unsupervised machine learning for detection of faulty beam position monitors, in Proc. 10th International Particle Accelerator Conference (IPAC’19), Melbourne, Australia, 19-24 May 2019, International Particle Accelerator Conference No. 10 (JACoW Publishing, Geneva, Switzerland, 2019) pp. 2668–2671
A. Amthor, Z.M. Schillaci, D. Morrissey, M. Portillo, S. Schwarz, M. Steiner, C.S. Sumithrarachchi, Experimental test of an online ion-optics optimizer. Nucl. Instrum. Methods Phys. Res. A 895, 90–99 (2018)
ADS Google Scholar
A. Aurisano, A. Radovic, D. Rocco, A. Himmel, M.D. Messier, E. Niner, G. Pawloski, F. Psihas, A. Sousa, P. Vahle, A convolutional neural network neutrino event classifier. J. Instrum. 11, P09001 (2016) http://stacks.iop.org/1748-0221/11/i=09/a=P09001. Accessed Mar 11 2020
M.P. Kuchera, R. Ramanujan, J.Z. Taylor, R.R. Strauss, D. Bazin, J. Bradt, R. Chen, Machine learning methods for track classification in the AT-TPC. Nucl. Instrum. Methods Phys. Res. A (2019). https://doi.org/10.1016/j.nima.2019.05.097
Article Google Scholar
S. Delaquis et al., Deep neural networks for energy and position reconstruction in EXO-200. J. Instrum. 13, P08023–P08023 (2018)
Google Scholar
C. Adams et al., (MicroBooNE Collaboration), Deep neural network for pixel-level electromagnetic particle identification in the microboone liquid argon time projection chamber. Phys. Rev. D 99, 092001 (2019)
ADS Google Scholar
Y. S. Lai, Automated discovery of jet substructure analyses. (2018), arXiv:1810.00835 [nucl-th]
F. Barbosa et al., A new transition radiation detector based on GEM technology. Nucl. Instrum. Meth. A942, 162356 (2019). https://doi.org/10.1016/j.nima.2019.162356
Article Google Scholar
G. Gavalian, P. Thomadakis, A. Angelopoulos, V. Ziegler, N. Chrisochoides, Using Artificial Intelligence for Particle Track Identification in CLAS12 Detector (2020), arXiv:2008.12860 [cs.CV]
C. Fanelli, J. Pomponi, DeepRICH: Learning Deeply Cherenkov Detectors” Sci. Technol. 1, 015010 (2019), arXiv:1911.11717 [physics.data-an]
E. Cisbani et al., AI-optimized detector design for the future electron-ion collider: the dual-radiator RICH case. J. Instrum. 15, P05009–P05009 (2020)
Google Scholar
Y. Alanazi, N. Sato, T. Liu, W. Melnitchouk, M. P. Kuchera, E. Pritchard, M. Robertson, R. Strauss, L. Velasco, Y. Li, Simulation of electron-proton scattering events by a Feature-Augmented and Transformed Generative Adversarial Network (FAT-GAN). arXiv:2001.11103 (2020), arXiv:2001.11103 [hep-ph]
R. Abdul Khalek et al. (NNPDF), Parton distributions with theory uncertainties: general formalism and first phenomenological studies. Eur. Phys. J. C 79, 931 (2019)
N. Sato, C. Andres, J. J. Ethier, W. Melnitchouk (Jefferson Lab Angular Momentum (JAM) Collaboration), Strange quark suppression from a simultaneous monte carlo analysis of parton distributions and fragmentation functions, Phys. Rev. D 101, 074020 (2020)
J.J. Ethier, N. Sato, W. Melnitchouk, First simultaneous extraction of spin-dependent parton distributions and fragmentation functions from a global QCD analysis. Phys. Rev. Lett. 119, 132001 (2017)
ADS Google Scholar
J. Rojo, Machine Learning tools for global PDF fits” in 13th Conference on Quark Confinement and the Hadron Spectrum (Confinement XIII) Maynooth, Ireland, July 31-August 6, 2018 (2018) arXiv:1809.04392 [hep-ph]
J. Cammarota, L. Gamberg, Z.-B. Kang, J. A. Miller, D. Pitonyak, A. Prokudin, T. C. Rogers, N. Sato (Jefferson Lab Angular Momentum (JAM) Collaboration), Origin of single transverse-spin asymmetries in high-energy collisions. Phys. Rev. D 102, 054002 (2020). https://doi.org/10.1103/PhysRevD.102.054002
K. Kumericki, D. Mueller, A. Schafer, Neural network generated parametrizations of deeply virtual Compton form factors. JHEP 07, 073 (2011)
ADS MATH Google Scholar
K. Kumericki, S. Liuti, H. Moutarde, GPD phenomenology and DVCS fitting. Eur. Phys. J. A 52, 157 (2016)
ADS Google Scholar
H.-W. Lin et al., Parton distributions and lattice QCD calculations: a community white paper. Prog. Part. Nucl. Phys. 100, 107–160 (2018)
ADS Google Scholar
T. Sjostrand, S. Mrenna, P. Z. Skands, A brief introduction to PYTHIA 8.1. Comput. Phys. Commun. 178, 852–867 (2008)
R.S. Bhalerao, J.-Y. Ollitrault, S. Pal, D. Teaney, Principal component analysis of event-by-event fluctuations. Phys. Rev. Lett. 114, 152301 (2015). https://doi.org/10.1103/PhysRevLett.114.152301
Article ADS Google Scholar
A. Mazeliauskas, D. Teaney, Subleading harmonic flows in hydrodynamic simulations of heavy ion collisions. Phys. Rev. C 91, 044902 (2015)
ADS Google Scholar
Z. Liu, W. Zhao, H. Song, Principal component analysis of collective flow in relativistic heavy-ion collisions. Eur. Phys. J. C 79, 870 (2019). https://doi.org/10.1140/epjc/s10052-019-7379-y
Article ADS Google Scholar
S. Pratt, E. Sangaline, P. Sorensen, H. Wang, constraining the eq. of state of super-hadronic matter from heavy-ion collisions. Phys. Rev. Lett. 114, 202301 (2015)
J.E. Bernhard, J.S. Moreland, S.A. Bass, J. Liu, U. Heinz, Applying Bayesian parameter estimation to relativistic heavy-ion collisions: simultaneous characterization of the initial state and quark-gluon plasma medium. Phys. Rev. C 94, 024907 (2016)
ADS Google Scholar
J.E. Bernhard, J.S. Moreland, S.A. Bass, Bayesian estimation of the specific shear and bulk viscosity of quark-gluon plasma. Nature Phys. 15, 1113–1117 (2019)
ADS Google Scholar
J. F. Paquet et al. (JETSCAPE), Revisiting Bayesian constraints on the transport coefficients of QCD (2020), arXiv:2002.05337 [nucl-th]
X. Yingru, J.E. Bernhard, S.A. Bass, M. Nahrgang, S. Cao, Data-driven analysis for the temperature and momentum dependence of the heavy-quark diffusion coefficient in relativistic heavy-ion collisions. Phys. Rev. C 97, 014907 (2018)
ADS Google Scholar
R. Soltz (Jetscape), Bayesian extraction of \(\hat{q}\) with multi-stage jet evolution approach. Proceedings, 9th International Conference on Hard and Electromagnetic Probes of High-Energy Nuclear Collisions: Hard Probes 2018 (HP2018): Aix-Les-Bains, France, October 1–5, 2018, PoS HardProbes2018, 048 (2019)
Y. He, L.-G. Pang, X.-N. Wang, Bayesian extraction of jet energy loss distributions in heavy-ion collisions. Phys. Rev. Lett. 122, 252302 (2019)
ADS Google Scholar
S. A. Bass, A. Bischoff, J. A. Maruhn, Horst Stoecker, W. Greiner, Neural networks for impact parameter determination. Phys. Rev. C 53, 2358–2363 (1996)
Y.-T. Chien, Probing heavy ion collisions using quark and gluon jet substructure with machine learning, Nucl. Phys. A 982, 619–622 (2019) (the 27th International Conference on Ultrarelativistic Nucleus-Nucleus Collisions: Quark Matter 2018)
P.T. Komiske, E.M. Metodiev, J. Thaler, Energy flow polynomials: a complete linear basis for jet substructure. JHEP 2018, 13 (2018)
Google Scholar
L.-G. Pang, K. Zhou, S. Nan, H. Petersen, H. Stöcker, X.-N. Wang, An equation-of-state-meter of quantum chromodynamics transition from deep learning. Nature Commun. 9, 210 (2018)
ADS Google Scholar
J. Steinheimer, L. Pang, K. Zhou, V. Koch, J. Randrup, H. Stoecker, A machine learning study to identify spinodal clumping in high energy nuclear collisions. JHEP 12, 122 (2019)
ADS Google Scholar
L.-G. Pang, K. Zhou, X.-N. Wang, Interpretable deep learning for nuclear deformation in heavy ion collisions, (2019), arXiv:1906.06429 [nucl-th]
H. Huang, B. Xiao, H. Xiong, W. Zeming, M. Yadong, H. Song, Applications of deep learning to relativistic hydrodynamics. Nucl. Phys. A 982, 927–930 (2019)
ADS Google Scholar
A.A. Esfahani et al., Cyclotron radiation emission spectroscopy signal classification with machine learning in Project 8. New J. Phys. 22, 033004 (2020)
ADS Google Scholar
J. Renner et al., Background rejection in NEXT using deep neural networks. J. Instrum. 12, T01004–T01004 (2017). https://doi.org/10.1088/1748-0221/12/01/t01004
Article Google Scholar
V. Sobes, M. Grosskopf, K. Wendt, D. Brown, M. S. Smith, P. Talou, WANDA: AI/ML for nuclear data. summary of the session on AI/ML at the workshop on applied nuclear data activities 2020, March 3–5, 2020, ORNL/TM-2020/1535. https://conferences.lbl.gov/event/292/
E. Phiala Shanahan, D. Trewartha, W. Detmold, Machine learning action parameters in lattice quantum chromodynamics. Phys. Rev. D 97, 094506 (2018)
L. Kades, J. M. Pawlowski, A. Rothkopf, M. Scherzer, J. M. Urban, S. J. Wetzel, N. Wink, F. Ziegler, Spectral Reconstruction with Deep Neural Networks(2019) arXiv:1905.04305 [physics.comp-ph]
J. Karpie, K. Orginos, A. Rothkopf, S. Zafeiropoulos, Reconstructing parton distribution functions from Ioffe time data: from Bayesian methods to neural networks. JHEP 04, 057 (2019), arXiv:1901.05408
R. Zhang, Z. Fan, R. Li, H.-W. Lin, B. Yoon, Machine-learning prediction for quasiparton distribution function matrix elements. Phys. Rev. D 101, 034516 (2020)
ADS Google Scholar
S. Lidia, R. Carr, “Faster magnet sorting with a threshold acceptance algorithm. Rev. Sci. Instrum. 66, 1865–1867 (1995)
ADS Google Scholar
O. Chubar, O. Rudenko, C. Benabderrahmane, O. Marcouille, J.M. Filhol, M.E. Couprie, Application of genetic algorithms to sorting, swapping and shimming of the soleil undulator magnets. AIP Conf. Proc. 879, 359–362 (2007). https://doi.org/10.1063/1.2436074
Article ADS Google Scholar
W.J. Corbett, B. Pong, M. Lee, V. Ziemann, Optimum steering of photon beamlines in SPEAR. Proceedings of International Conference on Particle Accelerators 1, 1483–1484 (1993)
Google Scholar
I.V. Bazarov, C.K. Sinclair, Multivariate optimization of a high brightness dc gun photoinjector. Phys. Rev. ST Accel. Beams 8, 034202 (2005)
ADS Google Scholar
W. Gao, L. Wang, W. Li, Simultaneous optimization of beam emittance and dynamic aperture for electron storage ring using genetic algorithm. Phys. Rev. ST Accel. Beams 14, 094001 (2011)
ADS Google Scholar
A. Hofler, B. Terzić, M. Kramer, A. Zvezdin, V. Morozov, Y. vRoblin, Fanglei Lin, Colin Jarvis, “Innovative applications of genetic algorithms to problems in accelerator physics. Phys. Rev. ST Accel. Beams 16, 010101 (2013)
A.L. Edelen, S.G. Biedron, B.E. yucChase, D. Edstrom, S.V. Milton, P. Stabile, “Neural networks for modeling and control of particle accelerators. IEEE Trans. Nucl. Sci. 63, 878–897 (2016)
C. Emma, A. Edelen, M.J. Hogan, B. O’Shea, G. White, V. Yakimenko, “Machine learning-based longitudinal phase space prediction of particle accelerators. Phys. Rev. Accel. Beams 21, 112802 (2018)
ADS Google Scholar
A. Scheinker, D. Bohler, Sergey Tomin, Raimund Kammering, Igor Zagorodnov, Holger Schlarb, Matthias Scholz, Bolko Beutner, Winfried Decking, “Model-independent tuning for maximizing free electron laser pulse energy. Phys. Rev. Accel. Beams 22, 082802 (2019)
ADS Google Scholar
“Workshop Report on Basic Research Needs for Scientific Machine Learning: Core Technologies for Artificial Intelligence” ( 2019)
“ASCR Workshop on In Situ Data Management: Enabling Scientific Discovery from Diverse Data Sources”, (2019)
“Data and Models: A Framework for Advancing AI in Science”,(2019)

Download references

Author information

Authors and Affiliations

University of Maryland, College Park, MD, USA
Paulo Bedaque
Thomas Jefferson National Accelerator Facility, Newport News, VA, USA
Amber Boehnlein, Markus Diefenthaler, Latifa Elouadrhiri, David Lawrence, Robert McKeown, Wally Melnitchouk, Kostas Orginos & Yves Roblin
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Mario Cromaz & Xin-Nian Wang
Catholic University, Washington, D.C., USA
Tanja Horn
Davidson College, Davidson, NC, USA
Michelle Kuchera
Michigan State University, East Lansing, MI, USA
Dean Lee, Steven Lidia & Witold Nazarewicz
College of William & Mary, Williamsburg, VA, USA
Kostas Orginos
Oak Ridge National Laboratory, Oak Ridge, TN, USA
Michael Scott Smith
Pacific Northwest National Laboratory, Richland, WA, USA
Malachi Schram

Authors

Paulo Bedaque
View author publications
You can also search for this author in PubMed Google Scholar
Amber Boehnlein
View author publications
You can also search for this author in PubMed Google Scholar
Mario Cromaz
View author publications
You can also search for this author in PubMed Google Scholar
Markus Diefenthaler
View author publications
You can also search for this author in PubMed Google Scholar
Latifa Elouadrhiri
View author publications
You can also search for this author in PubMed Google Scholar
Tanja Horn
View author publications
You can also search for this author in PubMed Google Scholar
Michelle Kuchera
View author publications
You can also search for this author in PubMed Google Scholar
David Lawrence
View author publications
You can also search for this author in PubMed Google Scholar
Dean Lee
View author publications
You can also search for this author in PubMed Google Scholar
Steven Lidia
View author publications
You can also search for this author in PubMed Google Scholar
Robert McKeown
View author publications
You can also search for this author in PubMed Google Scholar
Wally Melnitchouk
View author publications
You can also search for this author in PubMed Google Scholar
Witold Nazarewicz
View author publications
You can also search for this author in PubMed Google Scholar
Kostas Orginos
View author publications
You can also search for this author in PubMed Google Scholar
Yves Roblin
View author publications
You can also search for this author in PubMed Google Scholar
Michael Scott Smith
View author publications
You can also search for this author in PubMed Google Scholar
Malachi Schram
View author publications
You can also search for this author in PubMed Google Scholar
Xin-Nian Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Amber Boehnlein.

Additional information

Communicated by Ulf Meissner

Disclaimer: This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise, does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

This report is an outcome of the workshop AI for Nuclear Physics held at Thomas Jefferson National Accelerator Facility on March 4–6, 2020. The workshop brought together 184 scientists to explore opportunities for Nuclear Physics in the area of Artificial Intelligence. The workshop consisted of plenary talks, as well as six working groups.

This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Nuclear Physics under contract DE-AC05-06OR23177. Participation of students and early career professionals was supported by NSF, Division of Physics, under the Grant ‘Artificial Intelligence (AI) Workshop in Nuclear Physics,’ Award Number 2017170. Support for the Hackathon was provided by the University of Virginia School of Data Sciences and by Amazon Web Services.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bedaque, P., Boehnlein, A., Cromaz, M. et al. A.I. for nuclear physics. Eur. Phys. J. A 57, 100 (2021). https://doi.org/10.1140/epja/s10050-020-00290-x

Download citation

Received: 09 September 2020
Accepted: 07 October 2020
Published: 22 March 2021
DOI: https://doi.org/10.1140/epja/s10050-020-00290-x

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A.I. for nuclear physics

Abstract

Similar content being viewed by others

Nuclear physics midterm plan in Italy: introduction to the series

NUSTAR – The teenage years

Nuclear Forces for Precision Nuclear Physics: A Collection of Perspectives

Explore related subjects

1 Executive summary

2 Priority research directions

2.1 Future prospects

2.2 Community identified needs and commonalities

3 Workshop overview

4 Summary of workshop sessions

4.1 Lattice QCD and other quantum field theories

4.1.1 Case studies and future prospects

4.1.2 Enabling discoveries/what is needed

4.2 Low-energy nuclear theory

4.2.1 Current status

4.2.2 Case studies and future prospects

4.2.3 Enabling discoveries/what is needed

4.3 Accelerator science and operations

4.3.1 Accelerator and material design optimization

4.3.2 Provenance and prognostication for accelerator sub-systems

4.3.3 Dynamic optimization of real time operation controls

4.3.4 Summary and Final Thoughts

4.4 Experimental methods

4.4.1 Current status

4.4.2 Case Studies and Future Prospects

4.4.3 Enabling discoveries/what is needed

4.5 Event generation and simulation

4.5.1 Current status

4.5.2 Case studies and future prospects

4.5.3 Enabling Discoveries/What is Needed

4.6 Bayesian inference for quantum correlation functions

4.6.1 Current status

4.6.2 Case studies and future prospects

4.6.3 Enabling discoveries/what is needed

4.7 Additional contributions received

4.7.1 Relativistic heavy ions

4.7.2 Project 8

4.7.3 NEXT

4.7.4 WANDA

5 Cross cutting topics

5.1 Statistical methods and tools

5.1.1 Overview of approaches in NP

5.1.2 Use of current tools

5.2 Collaborations and collaborative activities

5.2.1 NP communities of practice

5.2.2 Engagement with data science community

6 Engagement with ASCR

7 The importance of data management

8 Workforce development

8.1 Education

9 The level of AI literacy

Data Availability Statement

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation