Welcome to visit Communications in Theoretical Physics,
Nuclear Physics

Solving nuclear Dirac Woods-Saxon potential with a physics-informed neural network

  • Xiao-Kai Du ,
  • Shuang-Quan Zhang , *
Expand
  • State Key Laboratory of Nuclear Physics and Technology, School of Physics, Peking University, Beijing 100871, China

Author to whom any correspondence should be addressed.

Received date: 2025-05-15

  Revised date: 2025-09-30

  Accepted date: 2025-10-09

  Online published: 2025-12-15

Copyright

© 2025 Institute of Theoretical Physics CAS, Chinese Physical Society and IOP Publishing. All rights, including for text and data mining, AI training, and similar technologies, are reserved.
This article is available under the terms of the IOP-Standard License.

Abstract

A physics-informed neural network (PINN) is built to solve the nucleonic Dirac equation. The PINN employs the residual of the Dirac equation as the objective function instead of the variation of the energy expectation value, thereby avoiding the variational collapse problem. By integrating the automatic differentiation techniques, the PINN also overcomes the Fermion doubling problem. A constraint term in the loss function of the PINN is designed to avoid trivial solutions and an orthogonality constraint term is used to search for the excited states. The performance of the unsupervised PINN is evaluated by solving the orbitals below the Fermi surface of 16O and 208Pb in the Dirac Woods-Saxon (WS) potential. Compared to the results obtained by the traditional shooting method, obtained energies have relative errors on the order of 10-3 and the root-mean-square errors of the corresponding wave functions are also on the order of 10-3.

Cite this article

Xiao-Kai Du , Shuang-Quan Zhang . Solving nuclear Dirac Woods-Saxon potential with a physics-informed neural network[J]. Communications in Theoretical Physics, 2026 , 78(3) : 035303 . DOI: 10.1088/1572-9494/ae1a5c

1. Introduction

Presently, one of the most popular and successful approaches to many quantum many-body systems is the density functional theory (DFT), rooted in the Hohenberg-Kohn theorem [1]. Especially, for nuclear systems with strong spin-orbital coupling, the relativistic density functional theory (RDFT) that takes into account the Lorentz symmetry at the very beginning has achieved great success in describing the ground state properties of almost all nuclei on the nuclide chart [2-12]. Although technically more complicated, there are several essential advantages compared to the non-relativistic density functionals [2, 13, 14]: automatically giving the spin degree of freedom and the large spin-orbital potentials; providing the relativistic saturation mechanism; natural explanation of hidden pseudo-spin symmetry in nuclear single-particle spectrum [15-17]; discovery of spin symmetry for anti-nucleon [17, 18]; success of relativistic Brueckner calculations [10, 19]; self-consistent treatment of time-odd fields important for nuclear properties associated with rotation [5, 7].
To date, most practical DFT applications were performed with the Kohn-Sham scheme [2]. Thus, the essential part of RDFT is to solve the corresponding Kohn-Sham equations, i.e., the Dirac equations of nucleons. One can solve the Dirac equations either in basis expansion method like harmonic oscillator [7, 20, 21], Schrödinger Woods-Saxon (WS) [22] and Dirac WS [22, 23] bases or directly in the coordinate space like the shooting method [24, 25], the finite element method [25-27], the Green's function method [28-30] and the imaginary time method [31]. In contrast to the former method that may be limited on the basis space, the latter one would be more efficient for nuclear phenomena with large spatial distributions such as halo phenomena or nuclear fissions [4]. It should be noted that solving the Dirac equation in the coordinate space may face the well-known variational collapse [31] and Fermion doubling [32, 33] problems. Up to now, in literature, the inverse Hamiltonian method [33, 34] and the preconditioned conjugate gradient method with a filtering function [35, 36] have been used to overcome the variational collapse problem, while the Wilson term [33], the Fourier spectral method [34-36] and the asymmetric difference formula [37] to overcome the Fermion doubling problem.
On the other hand, with the great improvement of computer hardware and the rapid development of various algorithms, artificial intelligence nowadays has a profound impact on basic scientific research. In nuclear physics, various machine learning (ML) algorithms have been utilized to solve many practical problems and achieved remarkable successes, for recent reviews, see [38-40].
The ML algorithms are mainly divided into three categories: supervised learning, unsupervised learning and reinforcement learning. Notably, the unsupervised learning algorithms have received extensive attention in scientific research mainly due to their independence of available labeled data. For instance, aiming to attack the nuclear many-body problems, Keeble and Rios used the artificial neural networks to determine the deuteron wave function [41]; Adams et al performed variational Monte Carlo calculations with artificial neural networks for light nuclei [42, 43]; Yang and Zhao developed the FeynmanNet which reaches a high accuracy of nuclear ground-state energy in a variational way and scales polynomially with the number of nucleons [44].
In fact, the ML algorithms have been applied to directly solve the differential equations in quantum mechanics. In 1997, Lagaris et al used a single-hidden-layer neural network (NN) to solve the Schrödinger and Dirac equations of μ atoms [45]. Later, Jin et al used NNs to solve the ground state and excited states of the Schrödinger equation in the harmonic oscillator potential [46, 47]. Pu et al used a deep NN to represent the cumulative distribution function which can avoid integral operations, and solved the ground state of the Schrödinger equation [48]. Most recently, applying the inverse Hamiltonian method to avoid variational collapse and invoking the variational principle, Wang et al successfully solved the Dirac equation in Coulomb and WS potentials by using a deep neural network approach [49].
In recent years, physics-informed neural networks (PINNs) have emerged as novel ML tools, arousing great interest of researchers [50]. The PINNs are neural networks that encode the model equations into the neural network itself and have been used to solve various differential equations due to the prominent characteristics [51-54]: incorporating physical laws and domain-specific knowledge to enhance predictive capabilities, the ability to learn from sparse data input, meshless alternative to classical methods, opening up new avenues for solving inverse problems. The PINN has been extensively applied to solve ordinary and partial differential equations in fluid dynamics, material science, quantum mechanics, and so on [51-55]. Indeed, the method in [45] can be also regarded as the early PINN [53]. Therefore, it is believed that the PINN can be used to solve the nuclear Dirac equation. More importantly, the PINN employs the residual of the Dirac equation as the objective function rather than the variation of energy expectation value, so that the well-known variational collapse problem [31] can be naturally avoided. It is easy to know, if the residual equals to zero, the wave functions G(r) and F(r) automatically satisfy the Dirac equation exactly, i.e., the eigen solutions are obtained. Furthermore, by calculating the derivatives with automatic differentiation (AD) techniques [56-58], the PINN can overcome the Fermion doubling problem as well.
In this paper, a PINN will be constructed for the first time to solve the nuclear Dirac WS potential [59]. To test the performance of the PINN, the energies and wave functions of neutron orbitals below the Fermi surface for 16O and 208Pb thus obtained are compared with those obtained by the traditional shooting method. In section 2, the theoretical framework including the Dirac equation and the PINN methodology is briefly introduced. In section 3, numerical details are given, and in section 4, results and discussions are presented. Finally, a summary is given in section 5.

2. Theoretical framework

2.1. Dirac equation

In the framework of the relativistic density functional theory, instead of the Schrödinger equation, the nucleus is described as a system of Dirac nucleons that satisfy the following Dirac equation [2]
$\begin{eqnarray}\left[{\boldsymbol{\alpha }}\cdot ({\boldsymbol{p}}-{\boldsymbol{V}})+{V}^{0}+\beta (M+S)\right]{\psi }_{i}({\boldsymbol{r}})={\epsilon }_{i}{\psi }_{i}({\boldsymbol{r}}),\end{eqnarray}$
where p and M are the nucleon momentum and mass, (V0(r), V(r)) is the repulsive vector field and S(r) is the attractive scalar field, εi and ψi(r) are eigen energy and wave function of the single-particle state i, respectively.
In the case of spherical symmetry, equation (1) can be reduced as the radial equation, which depends only on the radial coordinate r. As for now, the wave function ψi(r) is characterized by the radial quantum number n, the angular momentum quantum numbers l, j and m, as well as the parity, which can be formulated as
$\begin{eqnarray}{\psi }_{n\kappa m}({\boldsymbol{r}},s)=\frac{1}{r}\left(\begin{array}{c}{\rm{i}}{G}_{n\kappa }(r){Y}_{jm}^{l}(\theta ,\phi ,s)\\ -{F}_{n\kappa }(r){Y}_{jm}^{\tilde{l}}(\theta ,\phi ,s)\end{array}\right),\end{eqnarray}$
where κ = (-1)j+l+1/2(j + 1/2) is the relativistic quantum number, G(r) and F(r) are the large and small components of the wave function, ${Y}_{jm}^{l}$ and ${Y}_{jm}^{\tilde{l}}$ are the spinor spherical harmonics with $l=j+\frac{1}{2}{\rm{sgn}}(\kappa )$ and $\tilde{l}=j-\frac{1}{2}{\rm{sgn}}(\kappa )$, respectively. The radial wave functions with the same quantum number κ but different quantum number n are orthogonal, thus the Dirac equation can be resolved with different κ blocks. With wave function (2), one can obtain the corresponding radial Dirac equations
$\begin{eqnarray}\begin{array}{l}\left(\frac{{\rm{d}}}{{\rm{d}}r}-\frac{\kappa }{r}\right){F}_{n\kappa }(r)+[{E}_{n\kappa }-U(r)]{G}_{n\kappa }(r)=0,\\ \left(\frac{{\rm{d}}}{{\rm{d}}r}+\frac{\kappa }{r}\right){G}_{n\kappa }(r)-[{E}_{n\kappa }+2M-W(r)]{F}_{n\kappa }(r)=0,\end{array}\end{eqnarray}$
where U(r) = V(r) + S(r), W(r) = V(r) - S(r) and E = ε - M.
For the potentials, Koepf and Ring have shown that the smooth WS shaped potentials can approximate the self-consistently determined relativistic scalar- and vector-potentials without changing the resulting single-particle spectra below the Fermi surface [59]. For the neutron, the WS shaped potentials are written as
$\begin{eqnarray}\begin{array}{rcl}{U}_{{n}} & = & \frac{{V}_{{n}}^{0}}{1+{{\rm{e}}}^{(r-{R}_{{n}})/{a}_{{n}}}},\\ {W}_{{n}} & = & \frac{-{\lambda }_{{n}}{V}_{{n}}^{0}}{1+{{\rm{e}}}^{(r-{R}_{{n}}^{ls})/{a}_{{n}}^{ls}}},\end{array}\end{eqnarray}$
with ${V}_{{n}}^{0}=V\left(1-{\kappa }_{{\rm{WS}}}\frac{N-Z}{N+Z}\right)$, ${R}_{{n}}={r}_{0}^{{n}}{A}^{\frac{1}{3}}$ and ${R}_{{n}}^{ls}={r}_{0}^{ls,{n}}{A}^{\frac{1}{3}}$, where the parameters V = -71.28 MeV, κWS = 0.462, ${\lambda }_{{n}}=11.12,{r}_{0}^{{n}}=1.233\,{\rm{fm}}$, an = 0.615 fm, ${r}_{0}^{ls,{n}}=1.144\,{\rm{fm}}$ and ${a}_{{n}}^{ls}=0.648\,{\rm{fm}}$ [59].

2.2. Physics-informed neural network

The starting point of the PINN is using a neural network N(θ) (here θ denotes all the NN parameters) to approximate the solution of a differential equation, according to the universal approximation theorem [60, 61]. For instance, here we can use NN to approximate the eigen wave functions G(r) and F(r) of the radial Dirac equations (3). Then, one can calculate the derivatives of the NN's outputs with respect to the inputs by the automatic differentiation [56], and results in the representation of the residual of the differential equation, i.e., the so-called physics-informed neural network. To find the final solutions of the differential equation, the loss function ${ \mathcal L }(\theta )$ is constructed as a weighted sum of the residual of the differential equation ${{ \mathcal L }}_{{\rm{DE}}}$, the residual of the boundary/initial condition ${{ \mathcal L }}_{{\rm{BI}}}$ and/or the deviations from the available data ${{ \mathcal L }}_{{\rm{data}}}$:
$\begin{eqnarray}{ \mathcal L }(\theta )={\omega }_{{\rm{DE}}}{{ \mathcal L }}_{{\rm{DE}}}(\theta )+{\omega }_{{\rm{BI}}}{{ \mathcal L }}_{{\rm{BI}}}(\theta )+{\omega }_{{\rm{data}}}{{ \mathcal L }}_{{\rm{data}}}(\theta ).\end{eqnarray}$
Finally, the differential equation is solved by minimizing the loss function ${ \mathcal L }(\theta )$ with optimization algorithms such as the gradient descent method.
In this work, in order to explicitly consider the asymptotic behaviors of the bound states at r = 0 and r = , the wave functions G(r) and F(r) are parametrized as [45]
$\begin{eqnarray}\left(\begin{array}{c}{G}_{n\kappa }(r)\\ {F}_{n\kappa }(r)\end{array}\right)\approx r{{\rm{e}}}^{-\beta r}N(r,\theta ),\end{eqnarray}$
where β > 0 is a trainable parameter and N(r, θ) is a neural network. The input of N(r, θ) is the one-dimensional radial coordinate r, and the output is a vector with two components. With the embedding of the boundary conditions in the NN and no available data being introduced, the weights in equation (5) are exactly: ωDE = 1.0, ωBI = 0.0 and ωdata = 0.0. In this sense, our PINN framework is actually an unsupervised method.
With the AD, the derivatives in the radial Dirac equations (3) can be calculated, thus lead to the residuals of the radial Dirac equations:
$\begin{eqnarray}\begin{array}{rcl}{f}_{1} & := & \left(\frac{{\rm{d}}}{{\rm{d}}r}-\frac{\kappa }{r}\right)F(r)+[E-U(r)]G(r),\\ {f}_{2} & := & \left(\frac{{\rm{d}}}{{\rm{d}}r}+\frac{\kappa }{r}\right)G(r)-[E+2M-W(r)]F(r),\end{array}\end{eqnarray}$
where $f(r,\theta ,\beta ,E)={({f}_{1},{f}_{2})}^{T}$ is regarded as the physics-informed neural network, and the energy E is a trainable parameter the same as θ and β. As a powerful tool, the AD uses exact expressions with floating-point values rather than expression strings as in the symbolic differentiation which makes it memory-efficient and involves no approximation error as in numerical differentiation [57, 58]. Also thanks to this characteristic, the Fermion doubling problem in the Dirac equation [32, 33] can be addressed by using the AD technique.
With the PINN (7), the radial Dirac equations (3) are solved by minimizing the following loss function,
$\begin{eqnarray}{ \mathcal L }(\theta ,\beta ,E)={{ \mathcal L }}_{{\rm{DE}}}+{{ \mathcal L }}_{{\rm{conz}}}+{{ \mathcal L }}_{{\rm{orth}}},\end{eqnarray}$
where
$\begin{eqnarray}\begin{array}{rcl}{{ \mathcal L }}_{{\rm{DE}}} & = & \frac{1}{{N}_{f}}\displaystyle \sum _{i=1}^{{N}_{f}}\left[| {f}_{1}({r}_{f}^{i}){| }^{2}+| {f}_{2}({r}_{f}^{i}){| }^{2}\right],\\ {{ \mathcal L }}_{{\rm{conz}}} & = & {\left[{G}_{{\rm{\max }}}({r}_{f}^{i})-1\right]}^{2},\\ {{ \mathcal L }}_{{\rm{orth}}} & = & {\int }_{0}^{{r}_{\max }}\left[{G}_{{\rm{sum}}}(r)G(r)+{F}_{{\rm{sum}}}(r)F(r)\right]{\rm{d}}r.\end{array}\end{eqnarray}$
Here, the term ${{ \mathcal L }}_{{\rm{DE}}}$ is the mean square error of equation (7) evaluated at the collocation points ${\{{r}_{f}^{i}\}}_{i=1}^{{N}_{f}}$. If the ${{ \mathcal L }}_{{\rm{DE}}}$ equals to zero, the wave functions G(r) and F(r) satisfy the radial Dirac equations (3) exactly, i.e., the eigen solutions. The term ${{ \mathcal L }}_{{\rm{conz}}}$ is a constraint to avoid the trivial zero solutions by restricting the maximum of the wave function G(r) at the collocation points to 1. The term ${{ \mathcal L }}_{{\rm{orth}}}$ is applied to search the excited states, which requires the current wave functions G(r) and F(r) to be orthogonal to the sum of the solved wave functions Gsum(r) and Fsum(r) [47]. The wave functions are normalized in the final step after being solved by the PINN.

3. Numerical details

Here, the PINN approach is established for solving the Dirac equation with a given mean field potential like equation (4). As examples, all the neutron orbitals below the Fermi surface will be solved for a light nucleus 16O and a heavy nucleus 208Pb. In order to assess its performance, the eigen energies and wave functions thus obtained will be compared with the conventional shooting method [62]. Noted that it has been shown in literature that in calculating the single-particle energies the consistency between the shooting method and other numerical methods, such as the Green's function method [28], the preconditioned conjugate gradient method with a filtering function [35] and the five-point asymmetric finite difference method [37], is at least 10-3 MeV. Here, the results from the shooting method are used as a benchmark.
For the PINN construction, a fully-connected neural network is used to express the N(r, θ) in wave functions equation (6). The Sigmoid activation function is adopted for hidden layers. The weights of the NN are initialized with the Xavier normal distribution and the biases are initialized as 0, which are the same recipe as adopted in the original PINN [50, 63]. For other trainable parameters, the parameter β is initialized as 1.0 fm-1, and the energy E is initialized as -60.0 MeV which is close to the bottom of Fermi sea in the Dirac WS potential. First, to choose an appropriate NN architecture, we calculated the 1s1/2 orbital of 208Pb under various combinations of the number of hidden layers Nlay and the number of neurons per layer Nneu as an example. For the training of NN, a sufficient number of iterations, 80000 epochs, are used to ensure high precision. The other numerical details are the same for different NN architectures. Table 1 shows the RMS errors of the wave functions G(r) obtained by PINN with respect to the shooting method, which are evaluated on the 200 uniform mesh points in the range of (0.0, 20.0] fm. It can be seen that as Nlay or Nneu increases, the RMS error generally shows a decreasing trend, with the increase in Nlay demonstrating a more pronounced effect. In this test, the NN with Nlay = 3 and Nneu = 80 achieved the smallest RMS error and was consequently selected as the architecture for subsequent calculations.
Table 1. The impact of NN architectures on the accuracy of the PINN. Taking the 1s1/2 orbital of 208Pb as an example, we show the root-mean-square (RMS) errors of the wave functions G(r) obtained by PINN with respect to the shooting method, which are evaluated on the 200 uniform mesh points in the range of (0.0, 20.0] fm.
Nlay\Nneu 10 20 40 80 100
1 0.019468 0.020548 0.015200 0.012666 0.014222
2 0.005728 0.000322 0.000266 0.000236 0.000109
3 0.000141 0.000522 0.000055 0.000051 0.000070
4 0.000112 0.000083 0.000574 0.000060 0.000073
For the evaluation of loss function (8), the collocation points ${\{{r}_{f}^{i}\}}_{i=1}^{{N}_{f}}$ are selected in a range of (0.0, 20.0] fm. The residual-based adaptive refinement is used to improve the distribution of the collocation points and the performance of the PINN [64]. In practice, we adopt an initial set of 200 mesh points with an equal interval of ∆r = 0.1 fm. Every 2000 epochs, 10 new collocation points corresponding to the largest residuals are added to the set. The total number of collocation points in the set is capped at 400.
The loss function is minimized by the Adam optimizer [65]. A preliminary learning rate is set as 10-3. The mean values of the losses within 2000 epochs, denoted as Lmean, are calculated. A conditional threshold ε = 5 × 10-5 is then set. If Lmean drops below the threshold ε, a learning rate scheduler is employed to enhance the accuracy of the PINN. Specifically, the learning rate is multiplied by a factor of $\gamma$ = 0.8 every 1000 epochs, and this adjustment process is carried out for a maximum of 20 times.

4. Results and discussion

4.1. Light nucleus 16O

The light nucleus 16O has eight protons and eight neutrons, and its ground state is spherical. There are only three single-particle orbitals below the proton or neutron Fermi surface, namely 1s1/2, 1p3/2 and 1p1/2, which are occupied by 2, 4 and 2 nucleons respectively. Therefore, in the solution of PINN, only the lowest energy level in each of their respective κ blocks needs to be solved, which also means that the third term ${{ \mathcal L }}_{{\rm{orth}}}$ in the loss function (8) does not need to be introduced.
Figure 1 shows the losses and the energies varying with epochs for the three neutron orbitals 1s1/2, 1p3/2 and 1p1/2 of 16O. As seen in figure 1(a), all the three curves of losses basically decrease with the increasing epoch although many irregular peaks emerged during the optimization. After closely analyzing the relationship between the loss and energy changes, we found that when the loss exhibits irregular peaks, the energy values simultaneously appear as small fluctuations (about 10-3 MeV). Therefore, it is inferred that both the energy and wave functions contribute to these peaks, which is the natural consequence in solving the Dirac equation. After 16818, 36138 and 41499 epochs, respectively, the mean values of the losses decrease below the conditional threshold ε for 1s1/2, 1p3/2 and 1p1/2. Finally, after further optimization, the loss for each orbital decreases to the level of 10-6 or even below, which means that a PINN solution with high precision to the Dirac equation has been achieved. The lowest final loss 2.52 × 10-7 is obtained for the 1p3/2 orbital. After comparing the different contributions to the loss, it is found that the ${{ \mathcal L }}_{{\rm{DE}}}$ term in equation (8), i.e., the residuals of the Dirac equations, contribute most to the final loss. As seen in figure 1(b), starting from the same initial value -60 MeV, the single-particle energies of the three orbitals gradually converge to the exact eigenvalues with the increasing epoch. Comparing the curves in figures 1(a) and (b), the energies vary continuously during the sharp decrease of the losses and converge to the eigenvalues when the losses steadily approach zero. Noted that, when the mean value of the losses reaches the threshold ε, the energy has also approached the eigenvalue very closely. Because the eigenvalue of the 1s1/2 orbital is closer to the initial value of the energy, the loss and energy of the 1s1/2 orbital converge the fastest.
Figure 1. The losses (a) and the energies (b) as functions of epochs, for the neutron orbitals 1s1/2 (blue), 1p3/2 (green), and 1p1/2 (orange) of 16O in the Dirac WS potential. The dashed lines in (b) are the values obtained by the shooting method.
Figure 2 shows the obtained wave functions by the PINN, in comparison with the results by the shooting method. Here, the box size Rbox = 20.0 fm and step size ∆r = 0.1 fm are used, which can guarantee the solution's accuracy of the shooting method. As seen in figures 2(a)-(c), the wave functions of all the three orbitals obtained by the PINN are consistent with those by the shooting method. Figures 2(d)-(f) show the differences of the obtained wave functions between the PINN and the shooting method. It can be seen that the maximum deviations ∆G(r) and ∆F(r) are mainly located at the tails of the peaks of the wave functions, rather than near the peaks. Overall, the differences for the wave functions F(r) are smaller than those of the wave functions G(r). Nevertheless, considering the magnitudes of F(r) and G(r), the relative differences from the true wave functions are of the same order of magnitude.
Figure 2. The obtained wave functions of the neutron orbitals 1s1/2 (a), 1p3/2 (b) and 1p1/2 (c) of 16O by the PINN (dashed lines), in comparison with the shooting method (solid lines). (d), (e) and (f) are the corresponding differences of the wave functions between the PINN and the shooting method.
To have a closer quantitative comparison, table 2 summarizes the differences of the energies and the wave functions between the PINN and the shooting method. It can be seen that the relative errors of energies are of the order of 10-4~10-3 while the absolute deviations for the three orbitals are within 0.065 MeV. For the wave functions, the RMS errors from the results of the shooting method are about 10-3 for G(r) and 10-4 for F(r). As discussed above, the relative deviations of G(r) and F(r) from the true wave functions are of the same order of magnitude. It is noted that the highest accuracy of energy is obtained for the lowest 1s1/2 orbital, whereas the best descriptions for the wave functions G(r) and F(r) are obtained for the 1p3/2 orbital in which the lowest loss is achieved.
Table 2. The results obtained by the PINN for the three neutron orbitals below the Fermi surface of 16O in Dirac WS potential, in comparison with those by the shooting method. The third and fourth columns are the energies (in MeV) obtained by the shooting method and by the PINN respectively. The value ∆E = EShooting - EPINN denotes the energy difference between the two methods and the σE = |∆E/EShooting| denotes the relative deviation. σG and σF are the RMS errors of the wave functions G(r) and F(r) evaluated at 200 mesh points. Bold values indicate the minimum errors among all orbitals.
Orbital κ EShooting EPINN E σE σG σF
1s1/2 -1 -43.1891 -43.1805 -0.0086 1.99 × 10-4 3.48 × 10-3 3.21 × 10-4
1p3/2 -2 -24.6645 -24.6978 0.0333 1.35 × 10-3 1.73 × 10-3 1.24 × 10-4
1p1/2 1 -19.0191 -19.0828 0.0637 3.35 × 10-3 2.64 × 10-3 2.58 × 10-4

4.2. Heavy nucleus 208Pb

In order to further test the performance of PINN, the neutron orbitals in the Dirac WS potential for heavy nucleus 208Pb are calculated. For 208Pb, it has 82 protons and 126 neutrons, and its ground state is also spherical. Below the neutron magic number N = 126, there exist 22 neutron orbitals in total. Unlike the light nucleus 16O, for most κ blocks, more than one orbital needs to be solved, which is realized by the introduction of the third term ${{ \mathcal L }}_{{\rm{orth}}}$ in the loss function (8).
Figure 3 shows the loss and the energy varying with the epochs for the three s1/2 orbitals of 208Pb. As seen in figure 3, the loss and energy gradually vary at the beginning and at the 33938th epoch the mean value of the losses satisfies the conditional threshold ε. After further improvement with the learning rate scheduler, the 1s1/2 orbital is obtained and the corresponding wave functions are saved. Then the orthogonality constraint ${{ \mathcal L }}_{{\rm{orth}}}$ is introduced with Gsum and Fsum being the obtained 1s1/2 wave functions. Correspondingly, the loss shows a sudden increase. With the optimization continuing, the PINN is driven to search the next orbital. At the 70592nd epoch, the conditional threshold ε for the excited 2s1/2 orbital is achieved. Later, the Gsum and Fsum take the summation of wave functions of 1s1/2 and 2s1/2 orbitals, and the PINN is driven to search the third orbital. At the 115044th epoch, the conditional threshold ε for the excited 3s1/2 orbital is achieved. In this way, the PINN can find not only the lowest sate but also the excited states with different n in the same κ block.
Figure 3. Same as figure 1, but for the neutron orbitals 1s1/2, 2s1/2 and 3s1/2 of 208Pb.
Figure 4 shows the obtained wave functions for the s1/2 orbitals of 208Pb by the PINN, in comparison with the results by the shooting method. As seen in figures 4(a)-(c), the wave functions obtained by the PINN are in good agreement with those by the shooting method. In figures 4(d)-(f), similar to figure 2, the maximum deviations ∆G(r) and ∆F(r) mainly locate at the tails of the peaks of the wave functions. The absolute differences for the wave functions F(r) are smaller than those of the wave functions G(r), but their relative differences are of the same order of magnitude.
Figure 4. Same as figure 2, but for the neutron orbitals 1s1/2, 2s1/2 and 3s1/2 of 208Pb.
Figure 5 shows the neutron spectrum of 208Pb obtained by the PINN, in comparison with the shooting method. The levels obtained by the PINN show good agreement with those by the shooting method. Table 3 lists the detailed comparisons for all the 22 orbitals below the Fermi surface of 208Pb between the PINN and the shooting method. It can be seen that the relative errors of energies are of the order of 10-3 and the absolute errors are all within 80 keV, showing a satisfying description of PINN to the eigen energies. It is interesting to note that the single-neutron energies obtained by PINN fluctuate above and below those from the shooting method. This indicates that compared to the shooting method, PINN exhibits lower accuracy and larger uncertainties. This characteristic may arise from initialization, random collocation point selection, etc. More importantly, since PINN employs the residual of the Dirac equation as the objective function rather than the variational principle, it allows both overestimation and underestimation of the eigen energies. In contrast, one always yields results not lower than the exact solution if the variational principle for energy expectation values is applied. As for wave functions, the RMS errors of PINN are also of the order of 10-3. The highest energy accuracy is 5.27 × 10-5 for the 1p1/2 orbital, whereas the best descriptions for the wave functions G(r) and F(r) are obtained 2.72 × 10-4 for the 1h11/2 orbital and 5.88 × 10-5 for the 1p3/2 orbital, respectively.
Figure 5. Energy spectrum of neutron for 208Pb in the Dirac WS potential obtained by the PINN, in comparison with the shooting method.
Table 3. Same as table 2, but for the neutron orbitals below the Fermi surface of 208Pb.
Orbital κ EShooting EPINN E σE σG σF
1s1/2 -1 -58.0103 -58.0272 0.0169 2.91 × 10-4 1.32 × 10-3 8.18 × 10-5
1p3/2 -2 -52.1132 -52.1209 0.0077 1.48 × 10-4 8.26 × 10-4 5.88 × 10-5
1p1/2 1 -51.1935 -51.1908 -0.0027 5.27 × 10-5 3.11 × 10-3 2.30 × 10-4
1d5/2 -3 -45.2784 -45.2669 -0.0115 2.54 × 10-4 3.96 × 10-3 2.67 × 10-4
1d3/2 2 -43.2777 -43.2745 -0.0032 7.39 × 10-5 3.69 × 10-3 2.65 × 10-4
2s1/2 -1 -41.1063 -41.1041 -0.0022 5.35 × 10-5 5.50 × 10-4 7.07 × 10-5
1f7/2 -4 -37.7301 -37.7414 0.0113 2.99 × 10-4 9.00 × 10-4 6.91 × 10-5
1f5/2 3 -34.4320 -34.3547 -0.0773 2.25 × 10-3 8.13 × 10-3 6.25 × 10-4
2p3/2 -2 -31.4324 -31.4182 -0.0142 4.52 × 10-4 3.82 × 10-4 9.05 × 10-5
2p1/2 1 -30.4267 -30.4508 0.0241 7.92 × 10-4 1.71 × 10-3 1.06 × 10-4
1g9/2 -5 -29.6381 -29.6543 0.0162 5.47 × 10-4 7.60 × 10-4 6.49 × 10-5
1g7/2 4 -24.9820 -25.0208 0.0388 1.55 × 10-3 5.73 × 10-4 6.35 × 10-5
2d5/2 -3 -21.5867 -21.6073 0.0206 9.54 × 10-4 2.40 × 10-3 1.38 × 10-4
1h11/2 -6 -21.1411 -21.1569 0.0158 7.47 × 10-4 2.72 × 10-4 8.64 × 10-5
2d3/2 2 -19.9730 -19.9616 -0.0114 5.71 × 10-4 6.98 × 10-4 1.14 × 10-4
3s1/2 -1 -18.8057 -18.8206 0.0149 7.92 × 10-4 7.77 × 10-4 1.40 × 10-4
1h9/2 5 -15.2317 -15.2815 0.0498 3.27 × 10-3 3.32 × 10-3 2.60 × 10-4
1i13/2 -7 -12.3623 -12.3852 0.0229 1.85 × 10-3 7.64 × 10-4 6.06 × 10-5
2f7/2 -4 -11.9215 -11.9617 0.0402 3.37 × 10-3 1.77 × 10-3 1.24 × 10-4
2f5/2 3 -9.8662 -9.8404 -0.0258 2.61 × 10-3 3.05 × 10-3 4.10 × 10-4
3p3/2 -2 -8.6514 -8.6284 -0.0230 2.66 × 10-3 5.09 × 10-4 1.26 × 10-4
3p1/2 1 -7.9710 -7.9310 -0.0400 5.02 × 10-3 2.11 × 10-3 2.69 × 10-4
Finally, we add a short note on computational cost. When benchmarked on identical CPU hardware and using the present deep NN architecture (Nlay = 3, Nneu = 80), the computation time required by PINN to solve for an orbital is about 107 times longer than that of the shooting method. This is because the PINN has to train a neural network with more than 13000 parameters. One of PINN's key advantages lies in handling high-dimensional complex problems [53], however for the solving of the radial Dirac equation, this advantage has not yet been exploited.
It is also mentioned that some other ML approaches have been employed to solve quantum mechanics problems in nuclear physics. For instance, in [42, 43], the variational Monte Carlo calculations with an artificial neural network (VMC-ANN) achieved a precision better than 0.5 MeV in calculating the ground-state energy of 4He. In [44], The deep-neural-network FeynmanNet reached within 30 keV accuracy for the ground-state energy of 4He. Note that both VMC-ANN and FeynmanNet solve nuclear many-body problems by using the variational principles and serve as ab initio calculations. Here the PINN approach gives an accuracy within 0.1 MeV for solving the single-particle energies. More recently, by using the variational principles, a deep neural network has achieved an accuracy within 10-3 MeV for the single-particle energies [49]. Nevertheless, the PINN approach is not based on the variational principles, and how to improve the accuracy and to address many-body problems in the PINN framework remain problems that warrant further investigation.

5. Summary

In summary, an unsupervised PINN model has been built to solve the nuclear Dirac WS potential, and taking 16O and 208Pb as examples, the energies and wave functions of the neutron orbitals below the Fermi surface have been solved with a satisfactory accuracy. Unlike traditional approaches that rely on the variation of energy expectation value, the PINN directly employs the residual of the Dirac equation as its objective function, thereby avoiding the variational collapse problem naturally. By integrating automatic differentiation techniques, the model overcomes the Fermion doubling problem typically encountered when solving the Dirac equation with the central difference method. An orthogonality constraint has been incorporated into the model to automatically find the excited states. Finally, in comparison with the standard results obtained from the shooting method, the relative errors of the calculated energies and the RMS errors of the wave functions for all three orbitals of 16O and 22 orbitals of 208Pb are on the order of 10-3, demonstrating the satisfactory level of accuracy and reliability of the PINN model.
Although the current testing of the model has shown the feasibility of PINN in solving Dirac equations with potentials that can closely approximate the real nuclear potentials. There are several aspects that demand further exploration and optimization.
First, an important issue lies in further improving the accuracy of PINN. In this paper, the accuracy of PINN is on the order of 10-3 in solving the Dirac equation and stable for both ground states and excited states, which is comparable to the accuracy of the original PINN in solving the nonlinear Schrödinger equation [50]. In recent reviews [52-54], advancements in PINN methodologies to boost efficiency and accuracy have been summarized, including adopting more sophisticated network architectures (e.g., convolutional neural networks/recurrent neural networks) and optimization algorithms (e.g., the L-BFGS-B optimizer), etc. In the next step, incorporating these enhancements is expected to improve the accuracy of PINN in solving the Dirac equation.
Second, the present tests are only on the deeply bound orbitals below the Fermi surface. Further evaluations to the weakly bound orbitals above the Fermi surface for stable nuclei and to the states in exotic nuclei are necessary. It is also interesting to explore the single-particle resonance states above the continuum threshold in the framework of PINN.
Third, the search for excited states may fail sometimes and the computational costs of the PINN is currently expensive. It is worth exploring more robust ways to search for the excited states and reducing the computational costs by refining the optimization process.
Fourth, the currently solved Dirac equation is merely the radial equation, i.e., the spherical symmetry is imposed. Extending the PINN to solve the Dirac equations in three-dimensional coordinates is straightforward and not overly challenging, as NN can simultaneously and efficiently represent the residuals of multiple coupled equations. This extension enables the calculations of more realistic deformed nuclear potentials. Moreover, the PINN can also be applied to solve the time-dependent Dirac equation for dynamical evolution, given its prior success in handling the time-dependent Schrödinger equation [50].
Last but not the least, given that atomic nuclei are self-bound systems, a crucial query is whether PINN can obtain a self-consistent solution of the nuclear density functional. In principle, PINN provides a viable option, but issues regarding self-consistent iteration, solution accuracy and computational efficiency must be resolved. Since self-consistent iterations inherently require substantial computational costs. Therefore, the approach to obtaining self-consistent solutions warrants careful consideration, for instance, exploring whether NN's representational capacity can be leveraged to solve the self-consistent potential field and single-particle wave functions in a synchronized manner.

The helpful discussions with P. Guo, X. F. Jiang, Z. Z. Li and C. Zhou are highly appreciated. XKD is indebted to the inspiration and prodding of J. Meng and P. W. Zhao. This work was supported by the National Key R&D Program of China (Grant No. 2024YFE0109803), the National Natural Science Foundation of China (Grants Nos. 12435006, 12481540215, 12475117, and 12141501), the State Key Laboratory of Nuclear Physics and Technology, Peking University (Grant No. NPT2023ZX03), the National Key Laboratory of Neutron Science and Technology (Grant No. NST202401016), and the High-performance Computing Platform of Peking University. Hospitality at APCTP during the program the 8th International Workshop on DRHBc Mass table is kindly acknowledged.

[1]
Hohenberg P, Kohn W 1964 Inhomogeneous electron gas Phys. Rev. 136 B864

DOI

[2]
Meng J 2016 Relativistic Density Functional for Nuclear Structure World Scientific

[3]
Ring P 1996 Relativistic mean field theory in finite nuclei Prog. Part. Nucl. Phys. 37 193

DOI

[4]
Meng J, Toki H, Zhou S-G, Zhang S Q, Long W H, Geng L S 2006 Relativistic continuum Hartree-Bogoliubov theory for ground-state properties of exotic nuclei Prog. Part. Nucl. Phys. 57 470

DOI

[5]
Vretenar D, Afanasjev A V, Lalazissis G A, Ring P 2005 Relativistic Hartree-Bogoliubov theory: static and dynamic aspects of exotic nuclear structure Phys. Rep. 409 101

DOI

[6]
Nikšić T, Vretenar D, Ring P 2011 Relativistic nuclear energy density functionals: mean-field and beyond Prog. Part. Nucl. Phys. 66 519

DOI

[7]
Meng J, Peng J, Zhang S Q, Zhao P W 2013 Progress on tilted axis cranking covariant density functional theory for nuclear magnetic and antimagnetic rotation Front. Phys. 8 55

DOI

[8]
Meng J, Zhou S-G 2015 Halos in medium-heavy and heavy nuclei with covariant density functional theory in continuum J. Phys. G 42 093101

DOI

[9]
Zhou S-G 2016 Multidimensionally constrained covariant density functional theories——nuclear shapes and potential energy surfaces Phys. Scr. 91 063008

DOI

[10]
Shen S, Liang H, Long W H, Meng J, Ring P 2019 Towards an ab initio covariant density functional theory for nuclear structure Prog. Part. Nucl. Phys. 109 103713

DOI

[11]
Geng K-P, Du P X, Li J, Fang D-L 2023 Calculation of microscopic nuclear level densities based on covariant density functional theory Nucl. Sci. Tech. 34 141

DOI

[12]
Jiang X F, Wu X H, Zhao P W, Meng J 2024 Nuclear level density from relativistic density functional theory and combinatorial method Phys. Lett. B 849 138448

DOI

[13]
Ring P 2012 Energy density functional theory in nuclei: does it have to be relativistic? Phys. Scr. T150 014035

DOI

[14]
Meng J, Zhao P 2021 Relativistic density functional theory in nuclear physics AAPPS Bulletin 31 2

DOI

[15]
Ginocchio J N 1997 Pseudospin as a relativistic symmetry Phys. Rev. Lett. 78 436

DOI

[16]
Meng J, Sugawara-Tanabe K, Yamaji S, Ring P, Arima A 1998 Pseudospin symmetry in relativistic mean field theory Phys. Rev. C 58 R628

DOI

[17]
Liang H, Meng J, Zhou S-G 2015 Hidden pseudospin and spin symmetries and their origins in atomic nuclei Phys. Rep. 570 1

DOI

[18]
Zhou S-G, Meng J, Ring P 2003 Spin symmetry in the antinucleon spectrum Phys. Rev. Lett. 91 262501

DOI

[19]
Anastasio M R, Celenza L S, Pong W S, Shakin C M 1983 Relativistic nuclear structure physics Phys. Rep. 100 327

DOI

[20]
Gambhir Y K, Ring P, Thimet A 1990 Relativistic mean field theory for finite nuclei Ann. Phys. (NY) 198 132

DOI

[21]
Stoitsov M, Ring P, Vretenar D, Lalazissis G A 1998 Solution of relativistic Hartree-Bogoliubov equations in configurational representation: spherical neutron halo nuclei Phys. Rev. C 58 2086

DOI

[22]
Zhou S-G, Meng J, Ring P 2003 Spherical relativistic Hartree theory in a Woods-Saxon basis Phys. Rev. C 68 034323

DOI

[23]
Zhang K Y, Pan C, Zhang S Q 2022 Optimized Dirac Woods-Saxon basis for covariant density functional theory Phys. Rev. C 106 024302

DOI

[24]
Horowitz C J, Serot B D 1981 Self-consistent Hartree description of finite nuclei in a relativistic quantum field theory Nucl. Phys. A 368 503

DOI

[25]
Meng J 1998 Relativistic continuum Hartree-Bogoliubov theory with both zero range and finite range Gogny force and their application Nucl. Phys. A 635 3

DOI

[26]
Pöschl W, Vretenar D, Rummel A, Ring P 1997 Application of finite element methods in relativistic mean-field theory: spherical nucleus Comput. Phys. Commun. 101 75

DOI

[27]
Almanasreh H, Salomonson S, Svanstedt N 2013 Stabilized finite element method for the radial Dirac equation J. Comput. Phys. 236 426

DOI

[28]
Sun T T, Zhang S Q, Zhang Y, Hu J N, Meng J 2014 Green's function method for single-particle resonant states in relativistic mean field theory Phys. Rev. C 90 054321

DOI

[29]
Sun T-T, Qian L, Chen C, Ring P, Li Z P 2020 Green's function method for the single-particle resonances in a deformed Dirac equation Phys. Rev. C 101 014321

DOI

[30]
Qu X Y, Tong H, Zhang S Q 2022 Canonical states in relativistic continuum theory with the Green's function method: neutrons in continuum of zirconium giant-halo nuclei Phys. Rev. C 105 014326

DOI

[31]
Zhang Y, Liang H, Meng J 2010 Avoid the tsunami of the Dirac sea in the imaginary time step method Int. J. Mod. Phys. E 19 55

DOI

[32]
Salomonson S, Öster P 1989 Relativistic all-order pair functions from a discretized single-particle Dirac Hamiltonian Phys. Rev. A 40 5548

DOI

[33]
Tanimura Y, Hagino K, Liang H Z 2015 3D mesh calculations for covariant density functional theory Prog. Theor. Exp. Phys. 2015 073D01

DOI

[34]
Ren Z X, Zhang S Q, Meng J 2017 Solving Dirac equations on a 3D lattice with inverse Hamiltonian and spectral methods Phys. Rev. C 95 024313

DOI

[35]
Li B, Ren Z X, Zhao P W 2020 Efficient solution for the Dirac equation in 3D lattice space with the conjugate gradient method Phys. Rev. C 102 044307

DOI

[36]
Xu F F, Li B, Ren Z X, Zhao P W 2024 Tetrahedral shape of 110Zr from covariant density functional theory in 3D lattice space Phys. Rev. C 109 014311

DOI

[37]
Zhang Y, Bao Y, Shen H, Hu J 2022 Resolving the spurious-state problem in the Dirac equation with the finite-difference method Phys. Rev. C 106 L051303

DOI

[38]
Bedaque P 2021 A.I. for nuclear physics Eur. Phys. J. A 57 100

DOI

[39]
Boehnlein A 2022 Colloquium: machine learning in nuclear physics Rev. Mod. Phys. 94 031003

DOI

[40]
He W, Li Q, Ma Y, Niu Z, Pei J, Zhang Y 2023 Machine learning in nuclear physics at low and intermediate energies Sci. China Phys. Mech. Astron. 66 282001

DOI

[41]
Keeble J W T, Rios A 2020 Machine learning the deuteron Phys. Lett. B 809 135743

DOI

[42]
Adams C, Carleo G, Lovato A, Rocco N 2021 Variational Monte Carlo calculations of A ≤ 4 nuclei with an artificial neural-network correlator ansatz Phys. Rev. Lett. 127 022502

DOI

[43]
Gnech A, Adams C, Brawand N, Carleo G, Lovato A, Rocco N 2022 Nuclei with up to A = 6 nucleons with artificial neural network wave functions Few-Body Syst. 63 7

DOI

[44]
Yang Y L, Zhao P W 2023 Deep-neural-network approach to solving the ab initio nuclear structure problem Phys. Rev. C 107 034320

DOI

[45]
Lagaris I E, Likas A, Fotiadis D I 1997 Artificial neural network methods in quantum mechanics Comput. Phys. Commun. 104 1

DOI

[46]
Jin H, Mattheakis M, Protopapas P Unsupervised neural networks for quantum eigenvalue problems arXiv:2010.05075

[47]
Jin H, Mattheakis M, Protopapas P 2022 Physics-Informed Neural Networks for Quantum Eigenvalue Problems. 2022 International Joint Conference on Neural Networks (IJCNN) 18 July, 2022

[48]
Pu K-F, Li H-L, H-L, Pang L-G 2023 Solving Schrodinger equations using a physically constrained neural network Chin. Phys. C 47 054104

DOI

[49]
Wang C, Naito T, Li J, Liang H 2025 A deep neural network approach to solve the Dirac equation Eur. Phys. J. A 61 162

DOI

[50]
Raissi M, Perdikaris P, Karniadakis G E 2019 Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations J. Comput. Phys. 378 686

DOI

[51]
Karniadakis G E, Kevrekidis I G, Lu L, Perdikaris P, Wang S, Yang L 2021 Physics-informed machine learning Nat. Rev. Phys. 3 422

DOI

[52]
Lawal Z K, Yassin H, Lai D T C, Idris A C 2022 Physics-informed neural network (PINN) evolution and beyond: a systematic literature review and bibliometric analysis Big Data Cogn. Comput. 6 140

DOI

[53]
Cuomo S, Di Cola V S, Giampaolo F, Rozza G, Raissi M, Piccialli F 2022 Scientific machine learning through physics-informed neural networks: where we are and what's next J. Sci. Comput. 92 88

DOI

[54]
Farea A, Yli-Harja O, Emmert-Streib F 2024 Understanding physics-informed neural networks: techniques, applications, trends, and challenges AI 5 1534

DOI

[55]
Chen X, Peng W-Q 2025 PINN for solving forward and inverse problems involving integrable two-dimensional nonlocal equations Commun. Theor. Phys. 77 025002

DOI

[56]
Paszke A, Gross S, Chintala S, Chanan G, Yang E, DeVito Z, Lin Z, Desmaison A, Antiga L, Lerer A 2017 Automatic differentiation in PyTorch 31st Conference on Neural Information Processing Systems (NIPS 2017)

[57]
Neidinger R D 2010 Introduction to automatic differentiation and MATLAB object-oriented programming SIAM Rev. 52 545

DOI

[58]
Baydin A G, Pearlmutter B A, Radul A A, Siskind J M 2018 Automatic differentiation in machine learning: a survey J. Mach. Learn. Res. 18 1

[59]
Koepf W, Ring P 1991 The spin-orbit field in superdeformed nuclei: a relativistic investigation Z. Phys. A 339 81

DOI

[60]
Hornik K, Stinchcombe M, White H 1989 Multilayer feedforward networks are universal approximators Neural Netw. 2 359

DOI

[61]
Cybenko G 1989 Approximation by superpositions of a sigmoidal function Math. Control Signals Systems 2 303

DOI

[62]
Burden R L, Faires J D, Burden A M 2015 Numerical Analysis 10th edn Cengage Learning

[63]

[64]
Yu J, Lu L, Meng X, Karniadakis G E 2022 Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems Comput. Method. Appl. M. 393 114823

DOI

[65]
Kingma D P, Ba J 2015 Adam: a method for stochastic optimization 3rd International Conference on Learning Representations (ICLR) arXiv:1412.6980

Outlines

/