Welcome to visit Communications in Theoretical Physics,
Mathematical Physics

Data-driven fusion and fission solutions in the Hirota–Satsuma–Ito equation via the physics-informed neural networks method

  • Jianlong Sun ,
  • Kaijie Xing ,
  • Hongli An *
Expand
  • College of Sciences, Nanjing Agricultural University, Nanjing, 210095, China

*Author to whom any correspondence should be addressed.

Received date: 2023-08-14

  Revised date: 2023-09-19

  Accepted date: 2023-09-27

  Online published: 2023-11-10

Copyright

© 2023 Institute of Theoretical Physics CAS, Chinese Physical Society and IOP Publishing

Abstract

Fusion and fission are two important phenomena that have been experimentally observed in many real physical models. In this paper, we investigate the two phenomena in the (2+1)-dimensional Hirota–Satsuma–Ito equation via the physics-informed neural networks (PINN) method. By choosing suitable physically constrained initial boundary conditions, the data-driven fusion and fission solutions are obtained for the first time. Dynamical behaviors and error analysis of these solutions are investigated via illustratively numerical figures, which show that good results are achieved. It is pointed out that the PINN method adopted here can be effectively used to construct the data-driven fusion and fission solutions for other nonlinear integrable equations. Based on the powerful predictive capability of the PINN method and wide applications of fusion and fission in many physical areas, it is hoped that the data-driven solutions obtained here will be helpful for experts to predict or explain related physical phenomena.

Cite this article

Jianlong Sun , Kaijie Xing , Hongli An . Data-driven fusion and fission solutions in the Hirota–Satsuma–Ito equation via the physics-informed neural networks method[J]. Communications in Theoretical Physics, 2023 , 75(11) : 115002 . DOI: 10.1088/1572-9494/acfd9c

1. Introduction

With the explosive development of science and technology, deep learning with neural network methods has been widely applied in areas such as cognitive science, computer vision, image recognition, recommendation systems and natural language processing [15]. Many traditional deep learning algorithms are designed to only solve specific learning tasks [3, 6]. The huge cost of data acquisition makes the use of traditional deep learning algorithms challenging when applied to complex physics and engineering analysis. In 2019, a new deep learning framework based on physical constraints, namely the physics-informed neural network (PINN), was proposed by Raissi et al [7]. The PINN is able to provide accurate solutions even if only a small amount of data is available. Importantly, since some physical constraints are usually explicitly involved, the PINN method can provide a better physical explanation of the predicted solution. Therefore, many scholars have devoted themselves to investigating the PINN method and its variants. For example, by using the PINN method, Chen's group constructed data-driven soliton solutions for three nonlinear evolution equations [810]. Subsequently, they derived data-driven breather waves and rogue waves for the nonlinear Schrödinger (NLS) and Chen–Lee–Liu equations [11, 12]. Yan's group obtained data-driven rogue waves for the defocusing NLS equation with a potential [13]. Dai's group constructed data-driven femtosecond optical soliton excitations for the higher-order NLS equation [14]. Li's group obtained data-driven rogue waves for the NLS equation by using a mix-training PINN method [15]. Several other interesting works have also been done in [1620].
Solitons, an important type of solution for nonlinear integrable equations, have been widely used in fluid, plasma, optics, condensed matter physics and so on [21]. Therefore, many scholars have investigated the soliton solutions of nonlinear systems and a number of interesting results have been achieved. For example, Fan's group discussed the soliton resolution and large-time behavior of solutions to the Cauchy problem for the Novikov equation [22]. Yan's group analyzed soliton formation and its dynamic behaviors in quintic nonlinear media [23]. Chen's group obtained higher-order soliton solutions to the inhomogeneous variable-coefficient Hirota equation [24]. Tian's group derived the soliton solutions of the general n-component and focused on NLS equations [25, 26]. An's group acquired rich soliton solutions of the Sawada–Kotera equation via the variable separation approach [27]. Usually, the interactions between soliton solutions are elastic as their velocity, amplitude and wave shapes remain unchanged during the interaction process. However, for some nonlinear integrable equations, inelastic interactions may occur when the velocities and wave vectors of the solitons satisfy certain conditions. For example, at a certain time, one soliton divides into two or more solitons. In contrast, two or more solitons may fuse into one. The above two phenomena are called soliton fission and soliton fusion in the terminology of soliton theory. Interestingly, is has been found that fusion and fission phenomena exist in many physical areas, such as fluid, plasma, nuclear physics, even-clumping DNA, organic membranes and macromolecule materials, as well as Sr–Ba–Ni oxidation crystals and waveguides [2831]. Therefore, it would be of great interest to look for fusion and fission solutions in nonlinear integrable equations.
After surveying the existing literature, we found that no work has been done on data-driven fusion and fission solutions of nonlinear integrable equations. In particular, no such work has yet been done using the PINN method. Therefore, in this paper, we plan to investigate data-driven fusion and fission solutions in the (2+1)-dimensional Hirota–Satsuma–Ito (HSI) equation by the PINN method, which is described by:
$\begin{eqnarray}\begin{array}{l}{v}_{t}+{u}_{{xxt}}+3{({uw})}_{x}+\alpha {u}_{x}=0,\\ {u}_{y}={v}_{x},\\ {u}_{t}={w}_{x}.\end{array}\end{eqnarray}$
This equation was introduced by Hirota and Satsuma to model unidirectional shallow water waves [32]. Due to its importance and wide applications, the HSI equation has been studied extensively and various exact solutions have been constructed, such as complexiton solutions [33], semi-rational solutions [34], rational localized waves and their absorb–emit interactions [35], multi-wave, breather-wave and hybrid solutions [36], resonant multi-soliton solutions [37], and symmetric invariant solutions [38]. Unlike the above work, our aim is to construct data-driven fusion and fission solutions for the HSI equation using the PINN method.
The outline of this paper is organized as follows. In section 2, we briefly give the PINN deep learning method for the HSI equation (1.1). In section 3, by choosing suitable physically constrained initial boundary conditions, abundant data-driven fusion and fission solutions of the HSI equation are obtained via the PINN method. Numerical simulations show that such solutions take the shape of the capital letter Y in spatial structures. The error analysis reveals that our obtained data-driven fusion and fission solutions can rapidly approximate the exact ones derived by Tian and his coauthors [39]. Therefore, we conclude that the PINN method is a very effective algorithm for constructing the data-driven fusion and fission solutions for nonlinear equations. Finally, a short conclusion is provided.

2. The PINN method

The general form of a (2+1)-dimensional coupled system with three components can be written as
$\begin{eqnarray}\begin{array}{l}{u}_{t}+{{ \mathcal N }}_{u}(u,v,w,{u}_{x},{v}_{x},{w}_{x},{u}_{y},\\ {v}_{y},{w}_{y},{u}_{{xx}},{v}_{{xx}},{w}_{{xx}},\cdots )=0,\\ {v}_{t}+{{ \mathcal N }}_{v}(u,v,w,{u}_{x},{v}_{x},{w}_{x},{u}_{y},\\ {v}_{y},{w}_{y},{u}_{{xx}},{v}_{{xx}},{w}_{{xx}},\cdots )=0,\\ {w}_{t}+{{ \mathcal N }}_{w}(u,v,w,{u}_{x},{v}_{x},{w}_{x},{u}_{y},\\ {v}_{y},{w}_{y},{u}_{{xx}},{v}_{{xx}},{w}_{{xx}},\cdots )=0,\end{array}\end{eqnarray}$
where u, v and w are real-valued functions with variables x, y and t, and ${{ \mathcal N }}_{u}$, ${{ \mathcal N }}_{v}$and ${{ \mathcal N }}_{w}$ are the functions associated with the real-valued solutions and the derivatives of each order with respect to x and y, respectively. According to the principles of the PINN method, we define the physics-informed neural networks fu, fv and fw as
$\begin{eqnarray}\begin{array}{l}{f}_{u}:= {u}_{t}+{{ \mathcal N }}_{u}(u,v,w,{u}_{x},{v}_{x},{w}_{x},\\ {u}_{y},{v}_{y},{w}_{y},{u}_{{xx}},{v}_{{xx}},{w}_{{xx}},\cdots ),\\ {f}_{v}:= {v}_{t}+{{ \mathcal N }}_{v}(u,v,w,{u}_{x},{v}_{x},{w}_{x},\\ {u}_{y},{v}_{y},{w}_{y},{u}_{{xx}},{v}_{{xx}},{w}_{{xx}},\cdots ),\\ {f}_{w}:= {w}_{t}+{{ \mathcal N }}_{w}(u,v,w,{u}_{x},{v}_{x},{w}_{x},\\ {u}_{y},{v}_{y},{w}_{y},{u}_{{xx}},{v}_{{xx}},{w}_{{xx}},\cdots ),\end{array}\end{eqnarray}$
where ${{ \mathcal N }}_{u}$, ${{ \mathcal N }}_{v}$and ${{ \mathcal N }}_{w}$ are the physical models given in (2.1) and u(x, y, t; ω, b), v(x, y, t; ω, b) and w(x, y, t; ω, b) are latent functions of deep neural networks with the weight parameter ω and bias parameter b, which can be used to approximate the exact solution u(x, y, t), v(x, y, t) and w(x, y, t) of the objective equations. Then, by using multi-hidden-layer deep neural networks, the network parameters of the latent functions u, v and w and networks fu, fv and fw can be constantly trained.
In the following, in order to obtain the optimum training results, we adopt the limited-memory Broyden–Fletcher–Goldfarb–Shanno optimization algorithm [40] to minimize the whole mean squared error, namely, the loss function
$\begin{eqnarray}\begin{array}{ccc}{\mathrm{Loss}}_{{\rm{\Theta }}} & = & {\mathrm{Loss}}_{u}+{\mathrm{Loss}}_{v}+{\mathrm{Loss}}_{w}\\ & & +{\mathrm{Loss}}_{{f}_{u}}+{\mathrm{Loss}}_{{f}_{v}}+{\mathrm{Loss}}_{{f}_{w}},\end{array}\end{eqnarray}$
where
$\begin{eqnarray}\begin{array}{c}{\mathrm{Loss}}_{u}=\frac{1}{{N}_{u}}\displaystyle \sum _{i=1}^{{N}_{u}}| u({x}_{u}^{i},{y}_{u}^{i},{t}_{u}^{i})-{u}^{i}{| }^{2},\\ {\mathrm{Loss}}_{v}=\frac{1}{{N}_{v}}\displaystyle \sum _{i=1}^{{N}_{v}}| v({x}_{v}^{i},{y}_{v}^{i},{t}_{v}^{i})-{v}^{i}{| }^{2},\\ {\mathrm{Loss}}_{w}=\frac{1}{{N}_{w}}\displaystyle \sum _{i=1}^{{N}_{w}}| w({x}_{w}^{i},{y}_{w}^{i},{t}_{w}^{i})-{w}^{i}{| }^{2},\\ {\mathrm{Loss}}_{{f}_{u}}=\frac{1}{{N}_{f}}\displaystyle \sum _{i=1}^{{N}_{f}}| {f}_{u}({x}_{f}^{i},{y}_{f}^{i},{t}_{f}^{i}){| }^{2},\\ {\mathrm{Loss}}_{{f}_{v}}=\frac{1}{{N}_{f}}\displaystyle \sum _{i=1}^{{N}_{f}}| {f}_{v}({x}_{f}^{i},{y}_{f}^{i},{t}_{f}^{i}){| }^{2},\\ {\mathrm{Loss}}_{{f}_{w}}=\frac{1}{{N}_{f}}\displaystyle \sum _{i=1}^{{N}_{f}}| {f}_{w}({x}_{f}^{i},{y}_{f}^{i},{t}_{f}^{i}){| }^{2}.\end{array}\end{eqnarray}$
In the above, $\{{x}_{u}^{i},{y}_{u}^{i},{t}_{u}^{i};{u}^{i}\}{}_{i=1}^{{N}_{u}}$, $\{{x}_{v}^{i},{y}_{v}^{i},{t}_{v}^{i};{v}^{i}\}{}_{i=1}^{{N}_{v}}$ and $\{{x}_{w}^{i},{y}_{w}^{i},{t}_{w}^{i};{w}^{i}\}{}_{i=1}^{{N}_{w}}$ are the sampled initial and boundary training data of u, v and w. While the collocation points for fu, fv and fw are recorded as $\{{x}_{f}^{i},{y}_{f}^{i},{t}_{f}^{i}\}{}_{i=1}^{{N}_{f}}$. It is noticed that the loss function (2.3) is composed of the losses of initial boundary condition constraints and physical equation constraints. The first three terms on the right-hand side of (2.3) represent the loss of the initial boundary condition constraints, which measure the data-fitting condition. The last three terms on the right-hand side of (2.3) are the loss of the physical equation constraint, which measure whether the equation is satisfied.
Note that, without loss of generality, this paper selects the hyperbolic function $\tanh $ as the activation function (if there are no special instructions). All codes are based on Python 3.6 and tensorflow 1.10, and numerical experiments are run on an HP Pavilion Laptop 14-bf1xx computer with a 1.60 GHz Intel Core i5-8250U processor and 8 GB memory.

3. Data-driven fusion and fission solutions of the HSI equation

The initial boundary value problem with a Dirichlet condition for the (2+1)-dimensional HSI equation takes the following form:
$\begin{eqnarray}\left\{\begin{array}{l}{v}_{t}+{u}_{{xxt}}+3{({uw})}_{x}+\alpha {u}_{x}=0,\\ {u}_{y}={v}_{x},\\ {u}_{t}={w}_{x},\\ u(x,y,{t}_{0})={u}_{0}(x,y,{t}_{0}),v(x,y,{t}_{0})={v}_{0}(x,y,{t}_{0}),w(x,y,{t}_{0})={w}_{0}(x,y,{t}_{0}),\forall (x,y)\in {\rm{\Omega }},\\ u(x,y,t)={u}_{0}(x,y,t),v(x,y,t)={v}_{0}(x,y,t),w(x,y,t)={w}_{0}(x,y,t),\forall (x,y)\in \partial {\rm{\Omega }},\quad t\in [{t}_{0},{t}_{1}],\end{array}\right.\end{eqnarray}$
where u0(x, y, t), v0(x, y, t) and w0(x, y, t) are usually taken as the known solutions of the HSI equation (1.1), Ω = [x0x1] × [y0, y1] represents the spatial region and T = [t0, t1] denotes the time region.
With the aid of the Hirota bilinear method [4143], the N-soliton solution of the HSI equation (1.1) can obtained via
$\begin{eqnarray}u=2{(\mathrm{ln}f)}_{{xx}},\quad v=2{(\mathrm{ln}f)}_{{xy}},\quad w=2{(\mathrm{ln}f)}_{{xt}},\end{eqnarray}$
where the function f is given by
$\begin{eqnarray}f={f}_{N}=\displaystyle \sum _{\mu =0,1}\exp \left(\displaystyle \sum _{i=1}^{N}{\mu }_{i}{\eta }_{i}+\displaystyle \sum _{i\lt j}^{N}{\mu }_{i}{\mu }_{j}{A}_{{ij}}\right),\end{eqnarray}$
with
$\begin{eqnarray}\begin{array}{rcl}{\eta }_{i} & = & {k}_{i}x+{p}_{i}y+{q}_{i}t+{\eta }_{i}^{0},\\ {q}_{i} & = & -\displaystyle \frac{\alpha {k}_{i}^{2}}{{k}_{i}^{3}+{p}_{i}},\\ {{\rm{e}}}^{{A}_{{ij}}} & = & -\displaystyle \frac{{\left({k}_{i}-{k}_{j}\right)}^{3}({q}_{i}-{q}_{j})+\alpha {\left({k}_{i}-{k}_{j}\right)}^{2}+({p}_{i}-{p}_{j})({q}_{i}-{q}_{j})}{{\left({k}_{i}+{k}_{j}\right)}^{3}({q}_{i}+{q}_{j})+\alpha {\left({k}_{i}+{k}_{j}\right)}^{2}+({p}_{i}+{p}_{j})({q}_{i}+{q}_{j})},\\ i,j & = & 1,2,\cdots ,N.\end{array}\end{eqnarray}$
In the above ∑μ=0,1 is the summation of all possible combinations of μi = 0, 1.
As shown in [39, 44], when requiring the parameters ${{\rm{e}}}^{{A}_{{ij}}}$ given in (3.4)
$\begin{eqnarray}\begin{array}{l}{{\rm{e}}}^{{A}_{{ij}}}=0,\quad (1\leqslant i\lt j\leqslant M,\\ M\lt i\lt j\leqslant N,\quad N=M+L),\end{array}\end{eqnarray}$
which is equivalent to
$\begin{eqnarray}\begin{array}{c}{p}_{j}=\displaystyle \frac{-{k}_{i}{k}_{j}{\left({k}_{i}-{k}_{j}\right)}^{3}+{k}_{i}{k}_{j}^{3}({k}_{i}-{k}_{j})+2{k}_{i}{k}_{j}{p}_{i}\pm {k}_{i}{k}_{j}({k}_{i}-{k}_{j})\sqrt{-3{k}_{i}({k}_{i}^{3}+4{p}_{i})}}{2{k}_{i}^{2}},\end{array}\end{eqnarray}$
then expression (3.2) represents the fusion (or fission) solution consisting of M-fusion (or fission) and L-fusion (or fission) solitons.
In the following, we shall adopt the PINN method given in the above section to construct the data-driven fusion and fission solutions for the HSI equation with Dirichlet condition (3.1).

3.1. Data-driven pure fusion (fission) solution with N = 2

Here we consider the case of N = 2. When setting L in (3.5) to be zero and selecting the parameters as
$\begin{eqnarray}\begin{array}{l}\alpha =1,{k}_{1}=-\displaystyle \frac{3}{4},{k}_{2}=\displaystyle \frac{1}{2},{p}_{1}=\displaystyle \frac{2}{3},\\ {p}_{2}=-\displaystyle \frac{571}{576}+\displaystyle \frac{5}{192}\sqrt{1293},{\eta }_{0}^{1}={\eta }_{0}^{2}=0,\end{array}\end{eqnarray}$
we can obtain a pure exact fusion solution, which is described by
$\begin{eqnarray}\left\{\begin{array}{l}{u}_{1}=\displaystyle \frac{2({k}_{1}^{2}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{k}_{2}^{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t})}{1+{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t}}-\displaystyle \frac{2{\left({k}_{1}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{k}_{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t}\right)}^{2}}{{\left(1+{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t}\right)}^{2}},\\ {v}_{1}=\displaystyle \frac{2({k}_{1}{p}_{1}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{k}_{2}{p}_{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t})}{1+{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t}}-\displaystyle \frac{2({k}_{1}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{k}_{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t})({p}_{1}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{p}_{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t})}{{\left(1+{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t}\right)}^{2}},\\ {w}_{1}=\displaystyle \frac{2({k}_{1}{q}_{1}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}}+{k}_{2}{q}_{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t})}{1+{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t}}-\displaystyle \frac{2({k}_{1}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{k}_{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t})({q}_{1}{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{q}_{2}{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t})}{{\left(1+{{\rm{e}}}^{{k}_{1}x+{p}_{1}y+{q}_{1}t}+{{\rm{e}}}^{{k}_{2}x+{p}_{2}y+{q}_{2}t}\right)}^{2}},\end{array}\right.\end{eqnarray}$
with ${q}_{i}=-\tfrac{\alpha {k}_{i}^{2}}{{k}_{i}^{3}+{p}_{i}}\ (i=1,2)$. By changing the parameter ${p}_{2}=-\tfrac{571}{576}-\tfrac{5}{192}\sqrt{1293}$ and keeping the other parameters unchanged, we can obtain a pure exact fission solution.
Let the spatial region Ω = [−15, 15] × [−15, 15] and time region [t0, t1] = [ − 5, 5]. By substituting u0(x, y, t) = u1(x, y, t), v0(x, y, t) = v1(x, y, t) and w0(x, y, t) = w1(x, y, t) into (3.1), the initial boundary value problem can be solved. To acquire the training data, the finite difference method is employed, in which the spatial and time regions are divided into 65 × 65 and 33 points, respectively. Then, we randomly pick Nu = Nv = Nw = 100 points from the original initial boundary data and Nf = 10000 collocation points to generate a small training data set by using the Latin hypercube sampling (LHS) method [45]. According to the obtained training data, by using a nine-layer feedforward neural network with 20 neurons in each hidden layer, the data-driven pure fusion and fission solutions are successfully learned by regulating the network parameters and minimizing the loss function.
Below, we present numerical simulations that validate the effectiveness and accuracy of the obtained solution. The dynamic behaviors of the data-driven pure fusion solution by the PINN method are depicted in figure 1 and behaviors of the data-driven pure fission solution are exhibited in figure 2. From these figures, one can easily see that with x increasing, the phenomenon of fusion occurs in figure 1(a) and the phenomenon of fission happens in figure 2(a), which takes the shape of the capital letter Y. In addition, observations of figure 1(b) and figure 2(b) show that the errors between the predicted solutions and exact ones are quite small. In fact, for the fusion solution, the ${{\mathbb{L}}}_{2}$ relative error between the learning solution and the exact one is 8.569428 × 10−3, which takes 280.1288 s learning time. For the fission solution, the ${{\mathbb{L}}}_{2}$ relative error is 1.708373 × 10−2, which speeds up the 402.3156 s learning time. Therefore, we can conclude that its prediction performance is rather good. In other words, the PINN method is an effective algorithm for constructing the data-driven pure fusion and fission solutions for the HSI equation (3.1).
Figure 1. The data-driven fusion solution of the HSI equation at t = 0. (a) Density plot. (b) Comparisons of the data-driven fusion solution with the exact one at different y-values.
Figure 2. The data-driven fission solution of the HSI equation at t = 0. (a) Density plot. (b) Comparisons of the data-driven fission solution with the exact one at different y-values.

3.2. Data-driven X-type soliton solution and two interaction solutions with N = 3

Here we consider the case of N = 3. On setting the parameters as
$\begin{eqnarray}\begin{array}{l}\alpha =1,{k}_{1}=-\displaystyle \frac{3}{4},{p}_{1}=\displaystyle \frac{2}{3},\\ {k}_{2}=\displaystyle \frac{1}{2},{p}_{2}=-\displaystyle \frac{571}{576}-\displaystyle \frac{5}{192}\sqrt{1293},\\ {k}_{3}=\displaystyle \frac{2}{5},{p}_{3}=-\displaystyle \frac{12817}{18000}+\displaystyle \frac{23}{1200}\sqrt{1293},\\ {\eta }_{0}^{1}={\eta }_{0}^{2}={\eta }_{0}^{3}=0,\end{array}\end{eqnarray}$
and inserting them into (3.3), then we can obtain an exact X-type soliton solution, which takes a form of
$\begin{eqnarray}\begin{array}{l}{u}_{3}=\displaystyle \frac{2({f}_{{xx}}f-{f}_{x}^{2})}{{f}^{2}},\quad {v}_{3}=\displaystyle \frac{2({f}_{{xy}}f-{f}_{x}{f}_{y})}{{f}^{2}},\\ {w}_{3}=\displaystyle \frac{2({f}_{{xt}}f-{f}_{x}{f}_{t})}{{f}^{2}},\end{array}\end{eqnarray}$
with
$\begin{eqnarray}f=1+{{\rm{e}}}^{{\eta }_{1}}+{{\rm{e}}}^{{\eta }_{2}}+{A}_{23}{{\rm{e}}}^{{\eta }_{2}+{\eta }_{3}}.\end{eqnarray}$
In the above, ηi (i = 1, 2, 3) is described by (3.4).
Taking the spatial region Ω = [−15, 15] × [−15, 15] and time region T = [−10, 10]. On inserting them into the expression of (3.10), the initial boundary conditions of the HSI equation (3.1) can be obtained. After that, we discretize the spatial region [−15, 15] × [−15, 15] into 65 × 65 points and the time region [−10, 10] into 33 points with the aid of MATLAB. Then, the initial boundary value data set, namely the training data set, can be generated by randomly selecting Nu = Nv = Nw = 500 points from the original data set and choosing Nf = 15000 collocation points by using the sampling method given in section 3.1. Inputting the training data set into a nine-layer feedforward neural network with 40 neurons in each hidden layer, we successfully get a data-driven solution of the HSI equation (3.1), which has a ${{\mathbb{L}}}_{2}$ relative error of 3.170034 × 10−2 after 2356 times iteration in 2184 s.
The dynamic behaviors of the data-driven solution of the HSI equation by using the PINN method are depicted in figures 3(a) and (b). From these two figures, one can easily see that with x increasing, the learning solution fuses first and then splits rapidly. Since this kind of solution takes the shape of capital letter X, we can call it the data-driven X-type soliton solution. Figure 3(c) exhibits the dynamic behaviors of the exact X-type solution. A comparison of the predicted solution with the exact one is shown in figure 3(d). From the comparison, one can easily find that a very good prediction performance is achieved in the data-driven X-type solution.
Figure 3. The data-driven X-type soliton solution of the HSI equation at t = 0. (a) Density plot. (b) 3D diagram of the predicted solution. (c) 3D diagram of the exact solution. (d) Comparisons of the predicted solution with the exact at different y-values.
Now, we shall show that when N = 3, except the data-driven X-type soliton solution given in the above, another two kinds of data-driven interaction solution can be generated once appropriate parameters are chosen. To show this, we set the parameters as
$\begin{eqnarray}\begin{array}{l}\alpha =1,{k}_{1}=-\displaystyle \frac{3}{4},{p}_{1}=\displaystyle \frac{2}{3},\\ {k}_{2}=\displaystyle \frac{1}{2},{p}_{2}=-\displaystyle \frac{571}{576}+\displaystyle \frac{5}{192}\sqrt{1293},\\ {k}_{3}=\displaystyle \frac{2}{5},{p}_{3}=\displaystyle \frac{5}{6},{\eta }_{0}^{1}={\eta }_{0}^{2}={\eta }_{0}^{3}=0.\end{array}\end{eqnarray}$
According to (3.3), (3.4) and (3.5), the exact interaction consisting of fusion and a single soliton is obtained, which is expressed by
$\begin{eqnarray}{u}_{3}=2{(\mathrm{ln}f)}_{{xx}},\,{v}_{3}=2{(\mathrm{ln}f)}_{{xy}},\,{w}_{3}=2{(\mathrm{ln}f)}_{{xt}},\end{eqnarray}$
where
$\begin{eqnarray}\begin{array}{rcl}f & = & 1+{{\rm{e}}}^{{\eta }_{1}}+{{\rm{e}}}^{{\eta }_{2}}+{A}_{13}{{\rm{e}}}^{{\eta }_{1}+{\eta }_{3}}\\ & & +{A}_{23}{{\rm{e}}}^{{\eta }_{2}+{\eta }_{3}},\end{array}\end{eqnarray}$
with ηi and Aij given in (3.4). When changing the parameter ${p}_{2}=-\tfrac{571}{576}-\tfrac{5}{192}\sqrt{1293}$ and keeping the other parameters in (3.12) unchanged, the expression of (3.13) denotes an interaction solution mixed by fission and a single soliton.
By implementing the same data acquisition and training procedures as that for the X-type data-driven solution, we can get the data-driven interaction solution between fusion and a single soliton, which has a ${{\mathbb{L}}}_{2}$ relative error of 2.226418 × 10−2 after 3297 times iteration in 2861 s. In addition, we can obtain the data-driven interaction solution mixed by fission and a single soliton with ${{\mathbb{L}}}_{2}\,=2.263019\times {10}^{-2}$ after 2880 times iteration in 1299 s.
Figure 4 shows the dynamic behaviors of the data-driven solution mixed by fusion and a single soliton via the PINN method as well as the comparison of the predicted solution with the exact one. Figure 5 displays the dynamic behaviors of the data-driven solution consisting of fission and a single soliton as well as the comparison of the learning solution and the exact one. From the comparison of figure 4(b) and figure 5(b), we can find that the learning effect is quite good. Therefore, we conclude that the PINN method is very effective to construct data-driven solutions of fusion and fission types.
Figure 4. The data-driven interaction solution between fusion and a single soliton of the HSI equation at t = 0. (a) Density plot. (b) Comparisons of the predicted solution with the exact at different y-values.
Figure 5. The data-driven interaction solution between fission and a single soliton of the HSI equation at t = 0. (a) Density plot. (b) Comparisons of the predicted solution with the exact at different y-values.

3.3. Two different types of data-driven interaction solution with N = 4

It is known that when two different solitons move towards each other, interactions usually occur, which is an important and interesting phenomenon worthy of investigation. In the following, we shall study whether such interactions can occur between two different data-driven fusion solutions and between data-driven fusion and a soliton molecule. For this purpose, we take N = 4 and require the parameters to be chosen as
$\begin{eqnarray}\begin{array}{l}\alpha =\displaystyle \frac{1}{2},{k}_{1}=-\displaystyle \frac{3}{10},{p}_{1}=\displaystyle \frac{7}{10},\\ {k}_{2}=\displaystyle \frac{3}{5},{p}_{2}=-\displaystyle \frac{361}{200}+\displaystyle \frac{9}{100000}\sqrt{249570000},\\ {\eta }_{0}^{1}=3,{\eta }_{0}^{2}=-5,\\ {k}_{3}=-\displaystyle \frac{1}{2},{p}_{3}=1,{k}_{4}=\displaystyle \frac{4}{5},\\ {p}_{4}=-\displaystyle \frac{673}{250}+\displaystyle \frac{13}{200}\sqrt{1488},{\eta }_{0}^{3}=-10,{\eta }_{0}^{4}=4.\end{array}\end{eqnarray}$
Inserting these parameters into (3.3) and (3.4) yields that
$\begin{eqnarray}\begin{array}{rcl}f & = & 1+{{\rm{e}}}^{{\eta }_{1}}+{{\rm{e}}}^{{\eta }_{2}}+{{\rm{e}}}^{{\eta }_{3}}+{{\rm{e}}}^{{\eta }_{4}}\\ & & +{A}_{13}{{\rm{e}}}^{{\eta }_{1}+{\eta }_{3}}+{A}_{14}{{\rm{e}}}^{{\eta }_{1}+{\eta }_{4}}\\ & & +{A}_{23}{{\rm{e}}}^{{\eta }_{2}+{\eta }_{3}}+{A}_{24}{{\rm{e}}}^{{\eta }_{2}+{\eta }_{4}},\end{array}\end{eqnarray}$
with ηi and Aij given in (3.4). Accordingly, we obtain an exact interaction solution consisting of two different fusions:
$\begin{eqnarray}{u}_{4}=2{(\mathrm{ln}f)}_{{xx}},\,{v}_{4}=2{(\mathrm{ln}f)}_{{xy}},\,{w}_{4}=2{(\mathrm{ln}f)}_{{xt}}.\end{eqnarray}$
Using the same data discretization method in the above, we get the initial boundary value data set with the spatial region [−20, 20] × [−20, 20] dividing into 65 × 65 points and time region [−10, 10] into 33 points. Then, with the aid of LHS, we obtain a training data set by randomly sampling Nu = Nv = Nw = 1000 in the original data set and selecting Nf = 20000 collocation points. Inputting the training data set into a nine-layer neural network with 40 neurons per layer, we successfully generate a data-driven interaction solution combined with two different fusion solitons, in which the ${{\mathbb{L}}}_{2}$ error is 6.369453 × 10−2 compared with the exact solution. The whole learning process takes about 2419 s and iterates 1977 times.
Figure 6 displays the two-dimensional cross-sectional view of the data-driven interaction solution composed of two fusions, three-dimensional profiles and the corresponding error between the predicted solution and exact one. From figure 6(a), one can clearly see the ‘fusing' features and the ‘interacting' features in the predicted solutions. In addition, we can also find from figure 6(c) that excellent learning performance has been achieved.
Figure 6. The data-driven interaction solution between two fusions of the HSI equation at t = 0. (a) Density plot. (b) 3D diagram of the predicted solution. (c) 3D diagram of the exact solution. (d) Comparisons of the predicted solution with the exact at different y-values.
In the following, we shall derive the data-driven interactions between fusion and a soliton molecule in the HSI equation. Thus, we require that one pair of solitons satisfies the velocity resonant principle [46] and the other pair of solitons satisfies the condition described by (3.5), namely:
$\begin{eqnarray}{{\rm{e}}}^{{A}_{12}}=0,\quad \displaystyle \frac{{k}_{3}}{{k}_{4}}=\displaystyle \frac{{p}_{3}}{{p}_{4}}.\end{eqnarray}$
On setting the parameters as
$\begin{eqnarray}\begin{array}{c}\alpha =\frac{1}{2},{k}_{1}=-\frac{3}{10},{p}_{1}=\frac{7}{10},{k}_{2}=\frac{3}{5},\\ {p}_{2}=-\frac{361}{200}+\frac{9}{100000}\sqrt{249570000},{\eta }_{0}^{1}=-1,{\eta }_{0}^{2}=0,\\ {k}_{3}=1,{p}_{3}=1,{k}_{4}=\frac{1}{2},{p}_{4}=\frac{1}{2},\\ {\eta }_{0}^{3}=-20,{\eta }_{0}^{4}=-2,\end{array}\end{eqnarray}$
then a special exact interaction mixed by fusion and a soliton molecule is generated, which is governed by
$\begin{eqnarray*}{u}_{4}=2{(\mathrm{ln}f)}_{{xx}},\,{v}_{4}=2{(\mathrm{ln}f)}_{{xy}},\,{w}_{4}=2{(\mathrm{ln}f)}_{{xt}},\end{eqnarray*}$
where
$\begin{eqnarray}\begin{array}{rcl}f & = & 1+{{\rm{e}}}^{{\eta }_{1}}+{{\rm{e}}}^{{\eta }_{2}}+{{\rm{e}}}^{{\eta }_{3}}+{{\rm{e}}}^{{\eta }_{4}}\\ & & +{A}_{13}{{\rm{e}}}^{{\eta }_{1}+{\eta }_{3}}+{A}_{14}{{\rm{e}}}^{{\eta }_{1}+{\eta }_{4}}+{A}_{23}{{\rm{e}}}^{{\eta }_{2}+{\eta }_{3}}+{A}_{24}{{\rm{e}}}^{{\eta }_{2}+{\eta }_{4}}\\ & & \quad \quad +{A}_{34}{{\rm{e}}}^{{\eta }_{3}+{\eta }_{4}}+{A}_{13}{A}_{14}{A}_{34}{{\rm{e}}}^{{\eta }_{1}+{\eta }_{3}+{\eta }_{4}}\\ & & +{A}_{23}{A}_{24}{A}_{34}{{\rm{e}}}^{{\eta }_{2}+{\eta }_{3}+{\eta }_{4}},\end{array}\end{eqnarray}$
with ηi and Aij (i, j = 1, 2, 3, 4) described by (3.4). It is noted that the form of expression (3.20) is quite different to (3.16) for the interaction solution mixed by two fusions.
By performing the same data acquisition and training procedures as used in this subsection, we can successfully obtain the data-driven interaction solution between fusion and a soliton molecule, whose dynamic behaviors are displayed in figures 7(a) and (b). From these two figures, one can easily see that there indeed exists two solitons in the learned solution, which are bounded to form the soliton molecule, while the other solitons fuse to one, just as what happens in the exact solution depicted in figure 7(c). Their comparisons are given in figure 7(d). In fact, the ${{\mathbb{L}}}_{2}$ relative error for the learned solution is 5.363313 × 10−2, which means a good performance is achieved. Therefore, we conclude that the PINN method is an excellent method to predict the hybrid solution composed of fusion and a soliton molecule for the HSI equation.
Figure 7. The data-driven interaction solution between fusion and a soliton molecule of the HSI equation at t = 0. (a) Density plot. (b) 3D diagram of the predicted solution. (c) 3D diagram of the exact solution. (d) Comparisons of the predicted solution with the exact at different y-values.

4. Conclusion

Fusion and fission are two very important physical phenomena, which have been observed in many fields, such as fluid physics, plasma physics, biophysics, nuclear physics, organic membranes and life science. In this paper, we investigate the fusion and fission phenomena in the (2+1)-dimensional HSI equation. By introducing suitable initial boundary values conditions and physical constraints, we successfully obtain the data-driven pure fusion and fission solutions by the PINN deep learning approach. Numerical simulations show that the data-driven fusion and fission solutions take the shape of the capital letter Y in the spatial structures. In addition, we also derive some data-driven interaction solutions, including the interactions between fusion and a single soliton, between two different fusion waves, and between fusion and a soliton molecule. Numerical error analysis reveals that the data-driven fusion and fission solutions as well as the interaction solutions can rapidly converge to the exact ones given by Tian and his coauthors. Therefore we demonstrate that the PINN deep learning method is a very effective algorithm to solve fusion and fission solutions of nonlinear integrable equations, especially for the HSI equation. Due to the excellent predictive performance of the PINN method, we hope that the data-driven fusion and fission solution derived here can elucidate qualitative features of the HSI model and the physical situation itself. In addition, based on the wide applications of fusion and fission in many physical areas, it is hoped that the data-driven solutions obtained here can be helpful for experts to explain or predict some related physical phenomena.

The authors would like to express their sincere thanks to the reviewers for their kind comments and valuable suggestions. This work is supported by the National Natural Science Foundation of China under grant Nos. 12371250 and 12205154, Jiangsu Provincial Natural Science Foundation under grant Nos. BK20221508 and BK20210380 and Jiangsu Qinglan High-level Talent Project and High-level Personnel Project under grant No. JSSCBS20210277.

1
Lake B M Salakhutdinov R Tenenbaum J B 2015 Human-level concept learning through probabilistic program induction Science 350 1332 1338

DOI

2
He K Zhang X Ren S Sun J 2016 Deep residual learning for image recognition IEEE Conference on Computer Vision and Pattern Recognition 770 778

3
Krizhevsky A Sutskever I Hinton G E 2017 ImageNet classification with deep convolutional neural networks Commun. ACM 60 84 90

DOI

4
Heaton J Goodfellow I Bengio Y Courville A 2018 Deep learning Genet. Program. Evolvable Mach. 19 305 307

DOI

5
LeCun Y Bengio Y Hinton G 2015 Deep learning Nature 521 436 444

DOI

6
Zaremba W Sutskever I Vinyals O 2014 Recurrent neural network regularization arXiv:1409.2329v5

7
Raissi M Perdikaris P Karniadakis G E 2019 Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations J. Comput. Phys. 378 686 707

DOI

8
Li J Chen Y 2020 Solving second-order nonlinear evolution partial differential equations using deep learning Commun. Theor. Phys. 72 105005

DOI

9
Li J Chen Y 2020 A deep learning method for solving third-order nonlinear evolution equations Commun. Theor. Phys. 72 115003

DOI

10
Li J Chen Y 2020 A physics-constrained deep residual network for solving the sine-Gordon equation Commun. Theor. Phys. 73 015001

DOI

11
Pu J C Li J Chen Y 2021 Solving localized wave solutions of the derivative nonlinear Schrödinger equation using an improved PINN method Nonlinear Dyn. 105 1723 1739

DOI

12
Peng W Q Pu J C Chen Y 2022 PINN deep learning for the Chen–Lee–Liu equation: rogue wave on the periodic background Commun. Nonlinear Sci. Numer. Simul. 105 106067

DOI

13
Wang L Yan Z Y 2021 Data-driven rogue waves and parameter discovery in the defocusing nonlinear Schrödinger equation with a potential using the PINN deep learning Phys. Lett. A 404 127408

DOI

14
Fang Y Wu G Z Wang Y Y Dai C Q 2021 Data-driven femtosecond optical soliton excitations and parameters discovery of the highorder NLSE using the PINN Nonlinear Dyn. 105 603 616

DOI

15
Li J H Li B 2022 Mix-training physics-informed neural networks for the rogue waves of nonlinear Schrödinger equation Chaos Solit. Fractals 164 112712

DOI

16
Lin S N Chen Y 2023 Physics-informed neural network methods based on Miura transformations and discovery of new localized wave solutions Physica D 445 133629

DOI

17
Wang L Zhou Z J Yan Z Y 2023 Data-driven vortex solitons and parameter discovery of 2D generalized nonlinear Schrödinger equation with a ${ \mathcal P }{ \mathcal T }$-symmetric optical lattice Comput. Math. Appl. 140 17 23

DOI

18
Tian S F Li B 2023 Solving complex nonlinear problems based on gradient-optimized physics-informed neural networks Acta Phys. Sin. 72 100202

DOI

19
Wen X K Jiang J H Liu W Dai C Q 2023 Abundant vector soliton prediction and model parameter discovery of the coupled mixed derivative nonlinear Schrödinger equation Nonlinear Dyn. 111 13343 13355

DOI

20
Yuan L Ni Y Q Deng X Y Hao S 2022 A-PINN: auxiliary physics informed neural networks for forward and inverse problems of nonlinear integro-differential equations J. Comput. Phys. 462 111260

DOI

21
Ablowitz M J Clarkson P A 1991 Soliton, Nonlinear Evolution Equations and Inverse Scatting Cambridge University Press

22
Yang Y L Fan E G 2023 Soliton resolution and large time behavior of solutions to the Cauchy problem for the Novikov equation with a nonzero background Adv. Math. 426 109088

DOI

23
Li X Wang L Yan Z Y 2023 Soliton formation and dynamics in the quintic nonlinear media with ${ \mathcal P }{ \mathcal T }$-invariant harmonic-Gaussian potential Phys. Lett. A 459 128607

DOI

24
Zhou H J Chen Y 2023 High-order soliton solutions and their dynamics in the inhomogeneous variable coefficients Hirota equation Commun. Nonlinear Sci. Numer. Simul. 120 107149

DOI

25
Li Y Tian S F Yang J J 2022 Riemann-Hilbert problem and interactions of solitons in the n-component nonlinear Schrödinger equations Stud. Appl. Math. 148 577 605

DOI

26
Yang J J Tian S F Li Z Q 2022 Riemann–Hilbert problem for the focusing nonlinear Schrödinger equation with multiple high-order poles under nonzero boundary conditions Physica D 432 133162

DOI

27
Sun J L Li Z K An H L Zhu H X 2023 Novel soliton molecules and interaction wave solutions in a (2+1)-dimensional Sawada–Kotera equation: a multi-linear variable separation method Nonlinear Dyn. 111 12541 12552

DOI

28
Stoitcheva G Ludu L Draayer J P 2001 Antisoluiton model for fission model Math. Comput. Simul. 55 621 625

DOI

29
Hisakado M 1997 Breather trapping mechanism in piecewise homogeneous DNA Phys. Lett. A 227 87 93

DOI

30
Serkin V N Chapela V M Percino J Belyaeva T L 2001 Nonlinear tunneling of temporal and spatial optical solitons through organic thin films and polymeric waveguides Opt. Commun. 192 237 244

DOI

31
Kip D Wesner M Herden C Shandarov V 1999 Interaction of spatial photorefractive solitons in a planar waveguide Appl. Phys. B 68 971 974

DOI

32
Hirota R Satsuma J 1981 Soliton solutions of a coupled Korteweg–de Vries equation Phys. Lett. A 85 407 408

DOI

33
Zhou Y Manukure S 2019 Complexiton solutions to the Hirota–Satsuma–Ito equation Math. Meth. Appl. Sci. 42 2344 2351

DOI

34
Liu W Wazwaz A M Zheng X X 2019 High-order breathers, lumps and semi-rational solutions to the (2+1)-dimensional Hirota–Satsuma–Ito equation Phys. Scr. 94 075203

DOI

35
Zhou Y F Wang C J Zhang X X 2020 Rational localized waves and their absorb–emit interactions in the (2+1)-dimensional Hirota–Satsuma–Ito equation Mathematics 8 1807

DOI

36
Liu J G Zhu W H Zhou L 2020 Multi-wave, breather wave, and interaction solutions of the Hirota–Satsuma–Ito equation Eur. Phys. J. Plus 135 20

37
Kuo C K Ma W X 2020 A study on resonant multi-soliton solutions to the (2+1)-dimensional Hirota–Satsuma–Ito equations via the linear superposition principle Nonlinear Anal. 190 111592

DOI

38
Kumar S Nisar K S Kumar A 2021 A (2+1)-dimensional generalized Hirota–Satsuma–Ito equations: lie symmetry analysis, invariant solutions and dynamics of soliton solutions Results Phys. 28 104621

DOI

39
Shen Y Tian B Zhou T Y Gao X T 2022 Shallow-water-wave studies on a (2+1)-dimensional Hirota–Satsuma–Ito system: X-type soliton, resonant Y-type soliton and hybrid solutions Chaos Solit. Fractals 157 111861

DOI

40
Liu D C Nocedal J 1989 On the limited memory BFGS method for large scale optimization Math. Program 45 503 528

DOI

41
Liu Y Q Wen X Y Wang D S 2019 The N-soliton solution and localized wave interaction solutions of the (2+1)-dimensional generalized Hirota–Satsuma–Ito equation Comput. Math. Appl. 77 947 966

DOI

42
Hirota R 2004 The Direct Methods in Soliton Theory Cambridge University Press

43
Hu X B 1993 Generalized Hirota's bilinear equations and their soliton solutions J. Phys. A: Math. Gen. 26 L465

DOI

44
Li Y H An H L Zhang Y Y 2022 Abundant fission and fusion solutions in the (2+1)-dimensional generalized Calogero–Bogoyavlenskii–Schiff equation Nonlinear Dyn. 108 2489 2503

DOI

45
Stein M L 1987 Large sample properties of simulations using Latin hypercube sampling Technometrics 29 143 151

DOI

46
Lou S Y 2019 Soliton molecules and asymmetric solitons in three fifth order systems via velocity resonance J. Phys. Commun. 4 041002

DOI

Outlines

/