On examining the predictive capabilities of two variants of the PINN in validating localized wave solutions in the generalized nonlinear Schr&ouml;dinger equation

K Thulasidharan; N Sinthuja; N Vishnu Priya; M Senthilvelan

doi:10.1088/1572-9494/ad6854

2024 , Vol. 76 >Issue 11: 115801

DOI: https://doi.org/10.1088/1572-9494/ad6854

Others

On examining the predictive capabilities of two variants of the PINN in validating localized wave solutions in the generalized nonlinear Schrödinger equation

K Thulasidharan ¹ ,
N Sinthuja ² ,
N Vishnu Priya ³ ,
M Senthilvelan ^,⁴

Expand

¹Department of Physics, Vellore Institute of Technology, Vellore—632 014, Tamilnadu, India
²Department of Physics, Anna University, Chennai—600 025, Tamilnadu, India
³Department of Mathematics, Indian Institute of Science, Bangalore—560012, Karnataka, India
⁴Department of Nonlinear Dynamics, Bharathidasan University, Tiruchirappalli—620 024, Tamilnadu, India

Received date: 2024-03-20

Revised date: 2024-06-03

Accepted date: 2024-07-29

Online published: 2024-09-27

Copyright

Fold

Abstract

We introduce a novel neural network structure called strongly constrained theory-guided neural network (SCTgNN), to investigate the behaviour of the localized solutions of the generalized nonlinear Schrödinger (NLS) equation. This equation comprises four physically significant nonlinear evolution equations, namely, the NLS, Hirota, Lakshmanan–Porsezian–Daniel and fifth-order NLS equations. The generalized NLS equation demonstrates nonlinear effects up to quintic order, indicating rich and complex dynamics in various fields of physics. By combining concepts from the physics-informed neural network and theory-guided neural network (TgNN) models, the SCTgNN aims to enhance our understanding of complex phenomena, particularly within nonlinear systems that defy conventional patterns. To begin, we employ the TgNN method to predict the behaviour of localized waves, including solitons, rogue waves and breathers, within the generalized NLS equation. We then use the SCTgNN to predict the aforementioned localized solutions and calculate the mean square errors in both the SCTgNN and TgNN in predicting these three localized solutions. Our findings reveal that both models excel in understanding complex behaviour and provide predictions across a wide variety of situations.

Key words： generalized nonlinear Schrödinger equation; soliton; rogue waves; breathers; SCTgNN; TgNN

Cite this article

K Thulasidharan , N Sinthuja , N Vishnu Priya , M Senthilvelan . On examining the predictive capabilities of two variants of the PINN in validating localized wave solutions in the generalized nonlinear Schrödinger equation[J]. Communications in Theoretical Physics, 2024 , 76(11) : 115801 . DOI: 10.1088/1572-9494/ad6854

1. Introduction

The 1D fundamental nonlinear Schrödinger (NLS) equation is renowned for its integrability, a property that has catalyzed significant advancements in the analysis of nonlinear phenomena across various disciplines, including optics, water waves, and Bose–Einstein condensates [1–3]. The availability of analytical solutions for the NLS equation has not only fueled theoretical developments but also inspired numerous experimental investigations in these domains. While soliton solutions marked initial milestones, recent progress has been marked by advancements in breather and rogue wave solutions, showcasing the versatility of the NLS equation [4].

However, the NLS equation stands as just one among several integrable nonlinear evolutionary equations in physics. Various extensions and deformations of the NLS equation have been explored, broadening its scope of application. These extensions not only enhance our understanding of fundamental physics but also provide insight into complex phenomena, such as wave blow-up and collapse at higher intensities [5], necessitating the inclusion of higher-order terms [6–8]. On other hand, in optics, in the femtosecond range, the third-order effect becomes vital [9], while the fourth-order effect's role in an anisotropic Heisenberg ferromagnetic spin has been examined in [10–12]. Notably, the quintic effect gains significance as the optical field's intensity escalates and pulse duration contracts to attosecond scales [13, 14]. The generalized NLS equation introduced in [15] includes all these higher-order effects.

In this study, we consider the NLS equation to fifth-order terms, constituting the NLS equation hierarchy. Our approach involves augmenting the NLS equation with additional terms possessing arbitrarily large coefficients. This extension results in a hierarchy of integrable equations up to the fifth order, each characterized by a set of real coefficients. The supplementary terms encompass higher-order dispersion of all orders and nonlinear terms, surpassing the simplicity of the NLS equation. The flexibility conferred by arbitrary coefficients enables us to explore nonlinear phenomena with unprecedented depth, shedding light on intricate physical processes beyond the scope of conventional NLS equation formulations. The generalized NLS equation also admits several kinds of localized solutions—solitons, rogue waves and breathers, to name a few [15]. These nonlinear waves categorize a wide range of diverse physical fields, including fluid dynamics, optics and plasma physics, making the generalized NLS equation a versatile tool for understanding and describing complex wave phenomena.

Solitons were first identified in the context of water waves and now they are found in almost all scientific fields. These waves maintain the same speed and shape while propagating. Solitons possess numerous applications in the real world, see for instance [16]. Rogue waves, originally coined to describe extreme ocean events, have drawn considerable interest through both experimental observations and theoretical predictions. These events manifest in nonlinear fiber optics, Bose–Einstein condensations, plasmas and even financial contexts [17–20]. These waves, with amplitudes often exceeding double the significant wave height, appear suddenly and vanish without a trace. Although their origin remains unclear, a consensus among most researchers suggests that these entities are associated with specific types of waves produced by mathematical equations. These special waves, known as ‘breathers', could be early signs of rogue waves because they develop when small disturbances grow into significant ones. Generally, two breather structures exist; the Akhmediev breather (AB) and the Kuznetsov–Ma breather (KMB) [15, 21]. The AB represents a space—periodic wave localized in time, while the KMB is localized in space and oscillates periodically in time. In specific conditions, both structures become the rogue wave solution of the NLS equation. These breathers serve as plausible models for comprehending the dynamics of rogue waves across diverse physical realms. In the literature, multiple methods have been employed, for example, inverse scattering transform [22, 23], Hirota bilinearization [24] and Darboux transformation [21, 25] in order to obtain these solutions.

In recent years, artificial intelligence and machine learning have gained widespread application in efficiently managing large data sets and have assumed increasingly significant roles across various domains [26, 27]. A recent development involves employing deep neural networks to explore data-oriented solutions and identify parameters within nonlinear physical models, including the fractional version of nonlinear systems [28–35]. The concept of physics-informed neural networks (PINNs) has emerged as a technique for investigating nonlinear partial differential equations [36, 37]. Utilizing the deep learning method PINN one can attain precise solutions with minimal data. Simultaneously, since fundamental physical constraints are typically expressed via differential equations, this approach also offers a more comprehensive physical rationale for the predicted solution [38–40]. The strongly-constrained physics-informed neural network (SCPINN) [41, 42], adaptive PINN [43, 44] and theory-guided neural network (TgNN) [45, 46] are variants of the PINN. In the following, we point out the differences between these three network structures.

PINN: The PINN approach integrates established physical equations or constraints into the neural network's learning process. This fusion of data-driven learning with domain-specific principles enhances predictions. It proves particularly useful for resolving challenges arising from limited or noisy data, as well as scenarios where the underlying physics is less understood. Its applications span diverse fields, enabling prediction of physical behavior, system simulations and solving differential equations [36, 37].

SCPINN: The SCPINN advances the PINN approach by employing more robust and stringent constraints based on derivative information from solutions. It introduces parallel subnets, adaptive weights and flexible learning rates to overcome limitations present in the standard PINN. The result is a remarkable improvement in prediction accuracy across broader computational domains, rendering it well-suited for intricate problems [41, 42].

TgNN: The TgNN encapsulates a broader notion, wherein neural networks are guided by domain-specific theories or principles, extending beyond physics. It incorporates established theories or constraints to amplify predictions beyond the scope of data-driven learning alone. Its application is wide-ranging, spanning domains, such as economics, biology, engineering, etc. This approach amalgamates existing knowledge with machine learning, fostering precise and insightful predictions [45, 46].

In summary, the PINN combines physics and neural networks for smart predictions. The SCPINN makes this mix even stronger. The TgNN is a broader approach that covers various smart networks, not just for physics. The methods we mentioned earlier, the TgNN and SCPINN are useful tools that help one to make computer predictions better in different areas. They do this by combining data-based learning with expert knowledge, leading to important improvements.

In our current study, we intend to combine the strong points of the SCPINN and TgNN to make a new method called the strongly constrained theory-guided neural network (SCTgNN). This new type of neural network helps us to understand complicated structures such as rogue waves and breathers in more detail, which in turn will help us to understand nonlinear systems more deeply. In [45], the authors used the TgNN model to predict rogue wave solutions. This TgNN model is more accurate compared to the convolution neural network model. However, the authors in [45] have studied only the prediction of rogue waves in the basic NLS equation. In contrast, our goal is to go beyond that and explore solitons, rogue waves and breathers for a more general equation (generalized NLS equation).

Through our investigation, we confirm that the incorporation of higher-order dispersion parameters into the standard NLS equation induces noteworthy alterations in the dimensions and alignment of solitons, rogue waves and breathers. Intriguingly, both the SCTgNN and TgNN models consistently demonstrate precise error predictions. By integrating the system parameters α₂, α₃, α₄ and α₅ into the initial conditions, we attain a significant advantage on the direct forecasting of solutions across various parameter values.

These findings underscore the effectiveness of the novel SCTgNN model in accurately approximating soliton, rogue wave and breather solutions within the broader generalized NLS equation. This holds true even under more intricate scenarios involving higher-order effects, showcasing the versatility and robustness of our approach.

The structure of our manuscript is outlined as follows. In section 2, we illustrate the TgNN and SCTgNN models, along with the methods employed. Section 3 is dedicated to showcasing the data-driven soliton, rogue wave and breather solutions for the generalized NLS equation and the predictions made. Our findings are summarized in section 4.

2. Model and method

We consider the generalized nonlinear Schrödinger equation in the form [15, 47]:

(1a)$\begin{eqnarray}\begin{array}{l}{\rm{i}}{\psi }_{x}+{\alpha }_{2}{{\rm{\Gamma }}}_{2}[\psi (x,t)]-{\rm{i}}{\alpha }_{3}{{\rm{\Gamma }}}_{3}[\psi (x,t)]\\ \,+\,{\alpha }_{4}{{\rm{\Gamma }}}_{4}[\psi (x,t)]-{\rm{i}}{\alpha }_{5}{{\rm{\Gamma }}}_{5}[\psi (x,t)]=0,\end{array}\end{eqnarray}$

with

(1b)$\begin{eqnarray}{{\rm{\Gamma }}}_{2}[\psi (x,t)]={\psi }_{{tt}}+2| \psi {| }^{2}\psi ,\end{eqnarray}$

(1c)$\begin{eqnarray}{{\rm{\Gamma }}}_{3}[\psi (x,t)]={\psi }_{{ttt}}+6| \psi {| }^{2}{\psi }_{t},\end{eqnarray}$

(1d)$\begin{eqnarray}\begin{array}{rcl}\,{{\rm{\Gamma }}}_{4}[\psi (x,t)] & = & {\psi }_{{tttt}}+8| \psi {| }^{2}{\psi }_{{tt}}+6| \psi {| }^{4}\psi \\ & & +4\psi | {\psi }_{t}{| }^{2}+6{\psi }_{t}^{2}\bar{\psi }+2{\bar{\psi }}_{{tt}}{\psi }^{2},\end{array}\end{eqnarray}$

(1e)$\begin{eqnarray}\begin{array}{lcl}{{\rm{\Gamma }}}_{5}[\psi (x,t)] & = & {\psi }_{{ttttt}}+10| \psi {| }^{2}{\psi }_{{ttt}}+30| \psi {| }^{4}{\psi }_{t}+10\psi {\psi }_{t}{\bar{\psi }}_{{tt}}\\ & & +10\psi {\bar{\psi }}_{t}{\psi }_{{tt}}+20\bar{\psi }{\psi }_{t}{\psi }_{{tt}}+10{\psi }_{t}^{2}{\bar{\psi }}_{t}.\end{array}\end{eqnarray}$

In equation (1a), x and t are the propagation and transverse variable, respectively. The function ψ(x, t) denotes the envelope of the waves. The coefficients α_i (i = 2, 3, 4, 5) are arbitrary real constants. This generalized form of equation (1a) augments four completely integrable systems separately. If we consider the first term and the lowest second-order terms Γ₂[ψ(x, t)], we obtain the fundamental NLS equation (the other parameters are α₃ = 0, α₄ = 0, α₅ = 0). The first term with Γ₃[ψ(x, t)], gives the Hirota equation (α₄ = 0, α₅ = 0) and if we choose the first term with Γ₄[ψ(x, t)], we obtain the fourth-order NLS equation (α₃ = 0, α₅ = 0). Similarly, with Γ₅[ψ(x, t)], we arrive at a fifth-order NLS equation (α₃ = 0, α₄ = 0).

The fifth-order NLS equation has been the subject of extensive research over the years. Chowdury et al presented the fifth-order NLS equation along with its Lax pair and constructed soliton solutions using the Darboux transformation method [48]. This equation is widely recognized as a model describing the 1D anisotropic Heisenberg ferromagnetic spin chain [49]. Subsequently, breathers and rogue wave solutions for the fifth-order NLS equation were obtained through the Darboux transformation [6]. The fifth-order equation also finds application in describing the propagation of ultrashort optical pulses in optical fibers [7]. Wang discussed the structure of higher-order rogue wave solutions and their interactions [7]. Several studies have focused on deriving analytic solutions for the fifth-order NLS equation. An infinite number of conservation laws were established based on the Lax pair, and analytic solutions, including one-, two- and three-soliton forms, were also obtained using the auxiliary function method, Hirota bilinear method and symbolic calculation methods [8]. Yang et al provided first- and second-order rogue wave solutions, as well as rational solitons [50], while dark one-, two- and three-soliton solutions were generated using the Hirota bilinear method [51]. The generalized Darboux transformation was utilized to derive first-, second- and third-order rogue wave solutions [52]. Jia investigated the interaction and propagation of three kinds of breather solutions and analyzed the modulation instability of generalized solitons [53]. Subsequently, various N-soliton solutions, bright N-soliton solutions, N-dark soliton solutions and higher-order rogue wave solutions were successively obtained [54]. Sinthuja et al studied the formation of rogue waves on a periodic background in the fifth-order NLS equation [13]. In the following, we move on to analyze how the generalized fifth-order NLS equation is modeled by the TgNN and SCTgNN.

As a first step, we split the system into its real and imaginary components. This involves separating the complex wave envelope, denoted as ψ(x, t), into two real functions, u(x, t) and v(x, t). This separation can be achieved by representing ψ(x, t) in the form ψ(x, t) = u(x, t) + iv(x, t), where u(x, t) and v(x, t) are the real components.

Once this separation is established, we can proceed with detailing the structure of the machine learning model. However, to set the stage, it is essential to start by substituting the aforementioned form of ψ(x, t) into the equation of interest, equation (1). This substitution allows us to explicitly derive the expressions for u(x, t) and v(x, t), which take the following forms:

(2a)$\begin{eqnarray}\begin{array}{rcl}{v}_{x} & = & \,{\alpha }_{2}({u}_{{tt}}+2{u}^{3}+2{{uv}}^{2})+{\alpha }_{3}({v}_{{ttt}}+6{u}^{2}{v}_{t}+6{v}^{2}{v}_{t})\\ & & +{\alpha }_{4}(6{u}^{5}+12{u}^{3}{v}^{2}+6{{uv}}^{4}\\ & & +10{{uu}}_{t}^{2}+12{{vu}}_{t}{v}_{t}-2{{uv}}_{t}^{2}+10{u}^{2}{u}_{{tt}}+6{v}^{2}{u}_{{tt}}\\ & & +4{{uvv}}_{{tt}}+{u}_{{tttt}})+{\alpha }_{5}(30{u}^{4}{v}_{t}+60\\ & & \times {u}^{2}{v}^{2}{v}_{t}+30{v}^{4}{v}_{t}+10{u}_{t}^{2}{v}_{t}+10{v}_{t}^{3}+20{{uv}}_{t}{u}_{{tt}}\\ & & +20{{uv}}_{{tt}}{u}_{t}+40{v}_{t}{{vv}}_{{tt}}+10{u}^{2}{v}_{{ttt}}\\ & & +10{v}^{2}{v}_{{ttt}}+{v}_{{ttttt}}),\end{array}\end{eqnarray}$

(2b)$\begin{eqnarray}\begin{array}{rcl}-{u}_{x} & = & \,{\alpha }_{2}({v}_{{tt}}+2{u}^{2}v+2{v}^{3})+{\alpha }_{3}(-6{u}^{2}{u}_{t}-6{v}^{2}{u}_{t}-{u}_{{ttt}})\\ & & +{\alpha }_{4}(6{u}^{4}v+12{u}^{2}{v}^{3}+6{v}^{5}\\ & & -2{{vu}}_{t}^{2}+12{{uu}}_{t}{v}_{t}+10{{vv}}_{t}+4{{uvu}}_{{tt}}+6{u}^{2}{v}_{{tt}}\\ & & +10{v}^{2}{v}_{{tt}}+{v}_{{tttt}})+{\alpha }_{5}(-30{u}^{4}{u}_{t}-60\\ & & \times {u}^{2}{v}^{2}{u}_{t}-30{v}^{4}{u}_{t}-10{u}_{t}^{3}-10{u}_{t}{v}_{t}^{2}-40{{uu}}_{t}{u}_{{tt}}\\ & & -20{{vv}}_{t}{u}_{{tt}}-20{{vu}}_{t}{v}_{{tt}}-10{u}^{2}{u}_{{ttt}}\\ & & -10{v}^{2}{u}_{{ttt}}-{u}_{{ttttt}}),\end{array}\end{eqnarray}$

and the solution ψ(x, t) is trained to satisfy the neural network equations (2a) and (2b).

The above equation (2) can be rewritten in the following form:

(3a)$\begin{eqnarray}\begin{array}{rcl}{f}_{u}(x,t) & = & -{v}_{x}^{{NN}}(.)+{\alpha }_{2}[{u}_{{tt}}^{{NN}}(.)+2u{{}^{3}}^{{NN}}(.)\\ & & +2{u}^{{NN}}(.)v{{}^{2}}^{{NN}}(.)]+{\alpha }_{3}[{v}_{{ttt}}^{{NN}}(.)+6u{{}^{2}}^{{NN}}(.)\\ & & \times {v}_{t}^{{NN}}(.)+6v{{}^{2}}^{{NN}}(.){v}_{t}^{{NN}}(.)]+{\alpha }_{4}[6u{{}^{5}}^{{NN}}(.)\\ & & +12u{{}^{3}}^{{NN}}(.)v{{}^{2}}^{{NN}}(.)+6{u}^{{NN}}(.)\\ & & \times v{{}^{4}}^{{NN}}(.)+10{u}^{{NN}}(.)u{{}^{2}}_{t}^{{NN}}(.)\\ & & +12{v}^{{NN}}(.){u}_{t}^{{NN}}(.){v}_{t}^{{NN}}(.)-2{u}^{{NN}}(.)v{{}^{2}}_{t}^{{NN}}(.)\\ & & +10u{{}^{2}}^{{NN}}(.){u}_{{tt}}^{{NN}}(.)\\ & & +6v{{}^{2}}^{{NN}}(.){u}_{{tt}}^{{NN}}(.)+4{u}^{{NN}}(.){v}^{{NN}}(.){v}_{{tt}}^{{NN}}(.)+{u}_{{tttt}}^{{NN}}(.)]\\ & & +{\alpha }_{5}[30u{{}^{4}}^{{NN}}(.){v}_{t}^{{NN}}(.)+60u{{}^{2}}^{{NN}}(.)v{{}^{2}}^{{NN}}(.){v}_{t}^{{NN}}(.)\\ & & +30v{{}^{4}}^{{NN}}(.){v}_{t}^{{NN}}(.)\\ & & +10u{{}^{2}}_{t}^{{NN}}(.){v}_{t}^{{NN}}(.)+10v{{}^{3}}_{t}^{{NN}}(.)\\ & & +20{u}^{{NN}}(.){v}_{t}^{{NN}}(.){u}_{{tt}}^{{NN}}(.)+20{u}^{{NN}}(.){v}_{{tt}}^{{NN}}(.)\\ & & \times {u}_{t}^{{NN}}(.)+40{v}_{t}^{{NN}}(.){v}^{{NN}}(.){v}_{{tt}}^{{NN}}(.)\\ & & +10u{{}^{2}}^{{NN}}(.){v}_{{ttt}}^{{NN}}(.)+10v{{}^{2}}^{{NN}}(.){v}_{{ttt}}^{{NN}}(.)\\ & & +{v}_{{ttttt}}^{{NN}}(.)],\end{array}\end{eqnarray}$

(3b)$\begin{eqnarray}\begin{array}{rcl}{f}_{v}(x,t) & = & {u}_{x}^{{NN}}(.)+{\alpha }_{2}[{v}_{{tt}}^{{NN}}(.)+2u{{}^{2}}^{{NN}}(.){v}^{{NN}}(.)\\ & & +2v{{}^{3}}^{{NN}}(.)]+{\alpha }_{3}[-6u{{}^{2}}^{{NN}}(.){u}_{t}^{{NN}}(.)\\ & & -6v{{}^{2}}^{{NN}}(.){u}_{t}^{{NN}}(.)-{u}_{{ttt}}^{{NN}}(.)]+{\alpha }_{4}[6u{{}^{4}}^{{NN}}(.){v}^{{NN}}(.)\\ & & +12u{{}^{2}}^{{NN}}(.)v{{}^{3}}^{{NN}}(.)\\ & & +6v{{}^{5}}^{{NN}}(.)-2{v}^{{NN}}(.)u{{}^{2}}_{t}^{{NN}}(.)\\ & & +12{u}^{{NN}}(.){u}_{t}^{{NN}}(.){v}_{t}^{{NN}}(.)+10{v}^{{NN}}(.){v}_{t}^{{NN}}(.)\\ & & +4{u}^{{NN}}(.){v}^{{NN}}(.){u}_{{tt}}^{{NN}}(.)+6u{{}^{2}}^{{NN}}(.){v}_{{tt}}^{{NN}}(.)\\ & & +10v{{}^{2}}^{{NN}}(.){v}_{{tt}}^{{NN}}(.)+{v}_{{tttt}}^{{NN}}(.)]\\ & & +{\alpha }_{5}[-30u{{}^{4}}^{{NN}}(.){u}_{t}^{{NN}}(.)-60u{{}^{2}}^{{NN}}(.)v{{}^{2}}^{{NN}}(.){u}_{t}^{{NN}}(.)\\ & & -30v{{}^{4}}^{{NN}}(.){u}_{t}^{{NN}}(.)\\ & & -10u{{}^{3}}_{t}^{{NN}}(.)-10{u}_{t}^{{NN}}(.)v{{}^{2}}_{t}^{{NN}}(.)\\ & & -40{u}^{{NN}}(.){u}_{t}^{{NN}}(.){u}_{{tt}}^{{NN}}(.)-20{v}^{{NN}}(.)\\ & & \times {v}_{t}^{{NN}}(.){u}_{{tt}}^{{NN}}(.)-20{v}^{{NN}}(.){u}_{t}^{{NN}}(.){v}_{{tt}}^{{NN}}(.)\\ & & -10u{{}^{2}}^{{NN}}(.){u}_{{ttt}}^{{NN}}(.)-10v{{}^{2}}^{{NN}}(.)\\ & & \times {u}_{{ttt}}^{{NN}}(.)-{u}_{{ttttt}}^{{NN}}(.)],\end{array}\end{eqnarray}$

where ^NN(. ) denotes the NN approximation of the real (u) and imaginary (v) parts of the solution ψ(x, t). Here, the partial derivatives can be easily computed by applying the chain rule for the network through automatic differentiation.

Figure 1 provides a detailed description of the structure of the TgNN. To predict the solution of the generalized NLS equation using the TgNN, we employ the hyperbolic tangent function (tanh) as an activation function, with six hidden layers, each comprising 60 neurons. In the TgNN, we incorporate system parameters as inputs to the network. This approach allows us to predict solutions for various values of the system parameters. The system parameters of the generalized NLS equation, such as α₃, α₄ and α₅, are included in the input layer. Consequently, we have a total of five input neurons. The system's domain is defined as follows: x ranges from −10 to 10, t ranges from −10 to 10, α₃ varies from 0.0 to 0.5, α₄ ranges from 0.0 to 0.5, α₅ spans from 0.0 to 0.5. For training data, we consider 1000 combinations of system parameters. From these combinations, we extract 2000 sample points from the initial value at t = 0, as well as 2000 sample points from the boundaries at x = − 10 and x = 10. In addition, our sample data set comprises 10 000 collocation points that cover the entire domain (x, t, α₃, α₄, α₅) and enforce the constraint stated in equation (2). The central coordinates of these regions are determined using a Latin hypercube sampling (LHS) strategy [41].

View original graphic|Download|PPT slide

Figure 1. Graphical representation of the TgNN showcases its intricate structure, which comprises an input layer, hidden layers and an output layer. Neurons are represented as circles, and the connections between them are depicted with arrows, signifying the functional mappings. Input variables include x, t, α₃, α₄ and α₅ while the output layer provides u and v, which correspond to the real and imaginary components of the wave envelope ψ(x, t). While the deep learning process of the TgNN is data-driven, it is equally influenced by the generalized NLS equation (PDE), along with the initial and boundary conditions.

In figure 2, we display the 1000 combinations of training system parameters. The red dots represent the training data points, while the green dots represent the predicted values. In this paper, we present a limited number of predicted plots. However, once the model is trained, it possesses the capability to predict solutions for different parameter combinations. The output layer of the model comprises two neurons that yield the real (u) and imaginary (v) parts of the solution. These solutions undergo automatic differentiation and are then substituted into the loss function. For optimization during back propagation, we employ both the Adam and Limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) methods. The loss function of the TgNN can be expressed in the following form:

(4a)$\begin{eqnarray}{{Loss}}_{\mathrm{TgNN}}={{Loss}}_{{\rm{I}}}+{{Loss}}_{{\rm{B}}}+{{Loss}}_{{\rm{D}}},\end{eqnarray}$

where,

(4b)$\begin{eqnarray}\begin{array}{rcl}{{Loss}}_{{\rm{I}}} & = & \displaystyle \frac{1}{{N}_{{\rm{I}}}}\displaystyle \sum _{j=1}^{{N}_{{\rm{I}}}}| {u}^{{NN}}({x}_{I}^{j},0,{\alpha }_{3}^{j},{\alpha }_{4}^{j},{\alpha }_{5}^{j})-{u}_{0}^{j}{| }^{2}\\ & & +| {v}^{{NN}}({x}_{I}^{j},0,{\alpha }_{3}^{j},{\alpha }_{4}^{j},{\alpha }_{5}^{j})-{v}_{0}^{j}{| }^{2},\end{array}\end{eqnarray}$

(4c)$\begin{eqnarray}\begin{array}{rcl}{{Loss}}_{{\rm{B}}} & = & \displaystyle \frac{1}{{N}_{{\rm{B}}}}\displaystyle \sum _{j=1}^{{N}_{{\rm{B}}}}| {u}^{{NN}}(\pm 10,{t}_{{\rm{B}}}^{j},{\alpha }_{3}^{j},{\alpha }_{4}^{j},{\alpha }_{5}^{j})-{u}_{{\rm{B}}}^{j}{| }^{2}\\ & & +| {v}^{{NN}}(\pm 10,{t}_{{\rm{B}}}^{j},{\alpha }_{3}^{j},{\alpha }_{4}^{j},{\alpha }_{5}^{j})-{v}_{{\rm{B}}}^{j}{| }^{2},\end{array}\end{eqnarray}$

(4d)$\begin{eqnarray}\begin{array}{rcl}{{Loss}}_{{\rm{D}}} & = & \displaystyle \frac{1}{{N}_{{\rm{D}}}}\displaystyle \sum _{j=1}^{{N}_{{\rm{D}}}}| {f}_{u}({x}_{{\rm{D}}}^{j},{t}_{{\rm{D}}}^{j},{\alpha }_{3}^{j},{\alpha }_{4}^{j},{\alpha }_{5}^{j}){| }^{2}\\ & & +| {f}_{v}({x}_{{\rm{D}}}^{j},{t}_{{\rm{D}}}^{j},{\alpha }_{3}^{j},{\alpha }_{4}^{j},{\alpha }_{5}^{j}){| }^{2}.\end{array}\end{eqnarray}$

View original graphic|Download|PPT slide

Figure 2. Red dots on the graph symbolize the data set obtained through the traditional real-time evolution method for training purposes. To assess and compare the proficiency of the successfully trained neural network, we employ green dots, which correspond to the situations depicted in figures 4, 6 and 8.

In the above equation, N_I represents the number of collocation points taken in the initial region, N_B represents the number of collocation points taken at the boundaries and N_D represents the number of collocation points taken in the domain. In addition, u^NN(x^j, t^j) and v^NN(x^j, t^j) are the outputs of the model, while u₀^j and v₀^j represent the exact values of the initial frame, respectively. In the second term of equation (4c), the notations u_B and v_B pertain to the boundary region. In the same way, the collocation points (domain values) for f_u(x, t) and f_v(x, t) are denoted as $\{{x}_{{\rm{D}}}^{i},{t}_{{\rm{D}}}^{i}\}$. These collocation points are sampled with the help of the classical LHS technique.

In equation (4a), the loss function is based on the initial and boundary value data as well as the residuals gathered from equations (2a) and (2b) at a finite set of collocation points sampled. In particular, in the right-hand side of equation (4a), the first two terms play a role in fitting the solution data, while the other term plays a major role in satisfying the residuals f_u and f_v.

To compare with the TgNN, we adopt the same activation function, number of hidden layers, and neurons in the SCTgNN model, which is shown in figure 3. The advantage of the SCTgNN model lies in its ability to predict the solution's derivatives. Differing from the TgNN, the SCTgNN model has four outputs that provide u, ${u}^{{\prime} }$, v and ${v}^{{\prime} }$ values, where the prime denotes differentiation with respect to t. By utilizing a mean squared loss function, the loss of the SCTgNN model can be formulated as follows:

(5a)$\begin{eqnarray}{{Loss}}_{\mathrm{SCTgNN}}={{Loss}}_{{\rm{I}}}+{{Loss}}_{{\rm{B}}}+{{Loss}}_{{\rm{D}}},\end{eqnarray}$

where,

(5b)$\begin{eqnarray}\begin{array}{l}{{Loss}}_{{\rm{D}}}=\displaystyle \frac{1}{{N}_{{\rm{D}}}}\displaystyle \sum _{i=1}^{{N}_{{\rm{D}}}}| {f}_{u}({x}_{{\rm{D}}}^{i},{t}_{{\rm{D}}}^{i},{\alpha }_{3}^{i},{\alpha }_{4}^{i},{\alpha }_{5}^{i};u,v){| }^{2}\\ \qquad \qquad +| {f}_{v}({x}_{{\rm{D}}}^{i},{t}_{{\rm{D}}}^{i},{\alpha }_{3}^{i},{\alpha }_{4}^{i},{\alpha }_{5}^{i};u,v){| }^{2}\\ \qquad \qquad +| {f}_{{u}^{^{\prime} }}(u,{u}^{^{\prime} },{u}_{t}){| }^{2}+| {f}_{{v}^{^{\prime} }}(v,{v}^{^{\prime} },{v}_{t}){| }^{2},\end{array}\end{eqnarray}$

in which,

(5c)$\begin{eqnarray}\begin{array}{rcl}{f}_{{u}^{{\prime} }} & = & {{u}^{{\prime} }}^{{NN}}(.)-{{u}_{t}}^{{NN}}(.),\\ {f}_{{v}^{{\prime} }} & = & {{v}^{{\prime} }}^{{NN}}(.)-{{v}_{t}}^{{NN}}(.).\end{array}\end{eqnarray}$

View original graphic|Download|PPT slide

Figure 3. Graphical representation of the SCTgNN showcases its intricate architecture, consisting of an input layer, hidden layers and an output layer. While the SCTgNN model shares similarities with the TgNN, it differs in that the TgNN's output layer yields both u and v, whereas the SCTgNN goes beyond this and produces ${u}^{{\prime} }$ and ${v}^{{\prime} }$.

In equation (5a), ${u}^{{\prime} }$ and ${v}^{{\prime} }$ correspond to the third and fourth output of the model, whereas u_t and v_t represent the differentiation with respect to t for the first and second outputs of the model. By incorporating these outputs, the introduced model achieves enhanced optimization compared to the TgNN. In the SCTgNN, the first two loss terms are similar to those in the TgNN, contributing to fitting the solution data. Meanwhile, the last term plays a crucial role in satisfying the residuals f_u, f_v, ${f}_{u}^{{\prime} }$ and ${f}_{v}^{{\prime} }$.

In simple terms, both models can make predictions. The SCTgNN stands out for its ability to predict derivatives. The loss function of these models help to improve predictions and meet requirements, ensuring accurate results.

3. Data-driven soliton, rogue wave and breather solutions for the generalized NLS equation

In this section, we explain three interesting localized solutions, namely solitons, rogue waves and breathers. We also show the results obtained through the SCTgNN and TgNN models, along with the exact solution, for the system described by equation (1).

3.1. Soliton solution

The first-order soliton solution for equation (1) is given by [15],

(6)$\begin{eqnarray}{\psi }_{s}=c\,{\rm{{\rm{sech}} }}\,({{xv}}_{s}+{ct}){{\rm{e}}}^{{\rm{i}}x{\phi }_{s}},\end{eqnarray}$

where the phase (v_s) and velocity (φ_s) having the following form:

(7)$\begin{eqnarray}{v}_{s}={c}^{3}({\alpha }_{3}+{c}^{2}{\alpha }_{5}),\quad {\phi }_{s}={c}^{2}({\alpha }_{2}+{c}^{2}{\alpha }_{4}).\end{eqnarray}$

The parameter c is an arbitrary constant. It is noted that the velocity (v_s) is described by third- and fifth-order coefficients, while the phase (φ_s) is given in terms of second- and fourth-order coefficients.

If we consider t = 0, the same solution (6) turns out to be the following:

(8)$\begin{eqnarray}{\psi }_{s}=c\,{\rm{{\rm{sech}} }}\,({{xv}}_{s}){{\rm{e}}}^{{\rm{i}}x{\phi }_{s}},\end{eqnarray}$

where v_s and φ_s are given in equation (7).

Figure 4 represents the comparison of a predicted soliton (using the SCTgNN and TgNN models) with an analytical soliton for the generalized NLS equation. The first row of figures (figures 4 (a)–(c)) is shown for the values, α₂ = 1, α₃ = α₄ = α₅ = 0 and c = 1, which is nothing but the solutions of the standard NLS equation. In addition, the error diagram showing the differences between the SCTgNN and TgNN is displayed in figures 4 (d) and (e). When we increase the value of α₃ = 0.2 and consider all other parametric values to be the same, then the orientation of the soliton changes slightly, which is shown in figures 4 (f)–(h). The corresponding error diagrams are shown in figures 4 (i) and (j). Furthermore, adding the value of α₄ as 0.1 and the remaining parameters as considered in the previous case (figures 4 (f)–(h)) then the width of the soliton changes (see figures 4 (k)–(m)). For error diagrams see figures 4(n) and (o). Similarly, when we include the fifth-order dispersion parameter (α₅) as 0.2 and consider all other parameters to remain the same as in the previous case, the orientation of the soliton changes, which is displayed in figures 4 (p)–(r). The corresponding error diagrams are displayed in figures 4(s) and (t). The star markers in figures 4 (c), (h), (m) and (r) denote the randomly chosen data points on the initial and boundary conditions. Here, we consider 2000 data points for initial and 2000 data points for boundary conditions including α₃, α₄ and α₅. From this figure, we observe that with the addition of higher-order dispersion parameters α₃, α₄ and α₅ to the standard NLS equation the width and orientation of the soliton changes. In addition, we note that both the SCTgNN and TgNN predicted the exact solutions well, with low mean square error (MSE). The MSEs are also displayed on the error prediction plots.

View original graphic|Download|PPT slide

Figure 4. Results of the soliton solution coming from the SCTgNN and TgNN, and their errors with α₂ = 1. First three columns represent the prediction of the SCTgNN, prediction of the TgNN and the exact result, respectively. Errors between the prediction of the SCTgNN and TgNN are shown in the last two columns. Parameters α₃, α₄ and α₅ are displayed in the left side margin.

Figure 5 specifically shows the results of the soliton solution of equation (1) for four different values of system parameters. The first two rows correspond to the Lakshmanan–Porsezian–Daniel (LPD) (figures 5 (a)–(e)) and fifth-order NLS (figures 5 (f)–(j)) equations, respectively, while the other two rows represent randomly chosen system parameters to better understand the behavior of the system. In the third row (figures 5 (k)–(o)), for α₃ = 0, α₄ = 0.1 and α₅ = 0.2, the orientation of the soliton is slightly changed from the original, whereas in the fourth row (figures 5 (p)–(t)), for different parameters α₃ = 0.2, α₄ = 0 and α₅ = 0.2, the orientation change is large.

View original graphic|Download|PPT slide

Figure 5. Results of the soliton solution coming from the SCTgNN and TgNN, and their errors with α₂ = 1 for different values of system parameters. First three columns represent the prediction of the SCTgNN, prediction of the TgNN and the exact result, respectively. Errors between the prediction of the SCTgNN and TgNN are shown in the last two columns. Parameters α₃, α₄ and α₅ are displayed in the left side margin.

It can be seen from figures 4 and 5 that the MSE for the soliton solution of the Hirota equation is significantly smaller compared to the results of Zhou et al [26]. In their work, the authors predicted soliton solutions with MSE in the range of 10⁻² to 10⁻³, while our results show an MSE in the range of 10⁻⁵ to 10⁻⁶. Zhang et al have proposed an improved PINN method to study numerical solutions of the Hirota equation. They predicted soliton solutions and compared their results with the PINN. However, they only achieved an error range between 10⁻³ and 10⁻⁴ [55]. In contrast, our results showed an error ranging from 10⁻⁵ to 10⁻⁶. Based on this comparison, it is evident that our SCTgNN model predicts more accurate results. Table 1 is based on MSE calculations for the soliton solution (the values of the first four rows are for figure 4 and the last four rows are for figure 5). It is evident that using data from parameter points in neural network training reveals a low MSE in both the SCTgNN and TgNN, as shown in table 1. In simpler terms, the SCTgNN and TgNN consistently exhibit smaller MSEs. These findings once again quantitatively demonstrate that the SCTgNN and TgNN display superior performance and reliability in investigating solitons in physical systems.

Table 1. MSE values for soliton solution using the SCTgNN and TgNN.

Parameters			MSE
α₃	α₄	α₅	SCTgNN	TgNN
0	0	0	2.73 × 10⁻⁶	2.6 × 10⁻⁶
0.2	0	0	4.48 × 10⁻⁶	5.12 × 10⁻⁶
0.2	0.1	0	4.27 × 10⁻⁶	5.09 × 10⁻⁶
0.2	0.2	0.2	1.77 × 10⁻⁵	2.232 × 10⁻⁵
0	0.1	0	2.05 × 10⁻⁶	2.55 × 10⁻⁶
0	0	0 .2	3.95 × 10⁻⁶	4.51 × 10⁻⁶
0	0.1	0.2	3.51 × 10⁻⁶	4.53 × 10⁻⁶
0.2	0	0.2	1.477 × 10⁻⁵	1.278 × 10⁻⁵

3.2. Rogue wave solution

The rogue wave solution for the generalized NLS equation (1) is [15]:

(9a)$\begin{eqnarray}{\psi }_{r}=c\left(4\displaystyle \frac{1+2{{\rm{i}}{B}}_{r}x}{D(x,t)}-1\right){{\rm{e}}}^{{\rm{i}}{\phi }_{r}x},\end{eqnarray}$

where the function D(x, t) is given by,

(9b)$\begin{eqnarray}D(x,t)=1+4{B}_{r}^{2}{x}^{2}+4{\left({ct}+{v}_{r}x\right)}^{2},\end{eqnarray}$

in which B_r, φ_r and v_r take the following form:

(9c)$\begin{eqnarray}\begin{array}{rcl}{B}_{r} & = & 2{c}^{2}({\alpha }_{2}+6{c}^{2}{\alpha }_{4}),\\ {\phi }_{r} & = & 2{c}^{2}({\alpha }_{2}+3{c}^{2}{\alpha }_{4}),\\ {v}_{r} & = & 2{c}^{3}(3{\alpha }_{3}+15{c}^{2}{\alpha }_{5}).\end{array}\end{eqnarray}$

If we consider t = 0, solution (9) turns out to be the same with D(x, t):

(10)$\begin{eqnarray}D(x,t)=1+4{B}_{r}^{2}{x}^{2}+4{v}_{r}^{2}{x}^{2}.\end{eqnarray}$

The prediction of rogue wave solutions and their error estimations obtained from the SCTgNN and TgNN are displayed in figure 6 for α₂ = 1 and c = 1. The first row of figures (figures 6 (a)–(c)) shows the solutions of the standard NLS equation with α₃ = α₄ = α₅ = 0. In addition, the error diagram showing the differences between the SCTgNN and TgNN is displayed in figures 6 (d) and (e). When we increase the value of α₃ to 0.2, while keeping all other parameters the same, we observe a slight change in the orientation of the rogue wave, as shown in figures 6 (f)–(h). The corresponding error diagrams are shown in figures 6 (i) and (j). Next, we add the value of α₄ = 0.1, while maintaining the previous parameter values (figures 6 (f)–(h)), and observe changes in the width of the rogue wave, as can be seen in figures 6 (k)–(m). The error diagrams are shown in figures 6(n) and (o). Similarly, when we include the fifth-order dispersion parameter (α₅ = 0.2), while keeping all the other parameters the same as in the previous case, we observe changes in the orientation of the rogue waves, as displayed in figures 6 (p)–(r). The corresponding error diagrams are displayed in figures 6(s) and (t). The star markers in the third column (figures 6 (c), (h), (m) and (r)) denote randomly chosen data points on the initial and boundary conditions. We considered 2000 data points for the initial conditions and 2000 data points for the boundary conditions, including α₃, α₄ and α₅. From this figure, we observe that with the addition of higher-order dispersion parameters α₃, α₄ and α₅ to the standard NLS equation, the width and orientation of the rogue wave change. In the case of rogue waves, the SCTgNN and TgNN predictions also match well with the exact solution, exhibiting low MSE. The MSE of the prediction is displayed at the bottom of the error prediction plots. These results indicate that the new SCTgNN and TgNN models perform well in predicting the rogue wave solution for the generalized NLS equation, with only a few small MSE errors. Figure 7 presents the results of the rogue wave solution for the generalized NLS equation (1) under four different sets of system parameters. The first two rows correspond to the LPD equations (figures 7 (a)–(e)) and the fifth-order NLS equations (figures 7 (f)–(j)), respectively. The remaining two rows illustrate the behavior of the system under randomly selected parameters. In the third row (figures 7 (k)–(o)), with parameters α₃ = 0, α₄ = 0.12 and α₅ = 0.25, the orientation of the rogue waves shows a slight deviation from the original. In the fourth row (figures 7(p)–(t) ), with different values of α₃ = 0.35, α₄ = 0 and α₅ = 0.25, the orientation change is more pronounced. In addition, these figures indicate that the width of the rogue waves also decreases.

View original graphic|Download|PPT slide

Figure 6. Results of the rogue wave solution coming from the SCTgNN and TgNN, and their errors with α₂ = 1. First three columns represent the prediction of the SCTgNN, prediction of the TgNN and the exact result, respectively. Errors between the prediction of the SCTgNN and TgNN are shown in the last two columns. Parameters α₃, α₄ and α₅ are displayed in the left side margin.

View original graphic|Download|PPT slide

Figure 7. Results of the rogue wave solution coming from the SCTgNN and TgNN, and their errors with α₂ = 1 for different values of system parameters. First three columns represent the prediction of the SCTgNN, prediction of the TgNN and the exact result, respectively. Errors between the prediction of the SCTgNN and TgNN are shown in the last two columns. Parameters α₃, α₄ and α₅ are displayed in the left side margin.

From figures 6 and 7, it is evident that the MSE for the rogue wave solution of the Hirota equation is significantly smaller in our results compared to those of Zhou et al [26]. In their work, the authors predicted rogue wave solutions with an MSE in the range of 10⁻², while our results show an MSE in the range of 10⁻⁵ to 10⁻⁶. Zhang et al also investigated rogue wave solutions of the Hirota equation, similar to their exploration of soliton solutions [55]. However, their error range was larger compared to our results. In [27], the MSE for the rogue wave solution for the LPD equation is obtained in the range of 10⁻³, whereas in figures 6 (k)–(o), we obtained the MSE in the range of 10⁻⁶ (a lower MSE indicates higher accuracy). This comparison clearly demonstrates that our SCTgNN model produces more accurate predictions. Table 2 presents the results of MSE calculations for the rogue wave solution (the values for the first four rows pertain to figure 6, while the values for the last four rows correspond to figure 7). It becomes apparent that when employing data from parameter points during neural network training, there is a noticeable lower MSE in both the SCTgNN and TgNN, as illustrated in table 2. More precisely, the SCTgNN and TgNN consistently display a lower MSE. These results, once more, provide quantitative evidence of the SCTgNN's and TgNN's better performance and their trustworthiness in the examination of rogue waves in physical systems.

Table 2. MSE values for the rogue wave solution using the SCTgNN and TgNN.

Parameters			MSE
α₃	α₄	α₅	SCTgNN	TgNN
0	0	0	9.045 × 10⁻⁵	7.979 × 10⁻⁵
0.35	0	0	2.183 × 10⁻⁵	2.207 × 10⁻⁵
0.35	0.12	0	4.91 × 10⁻⁶	8.11 × 10⁻⁶
0.35	0.12	0.25	7.5 × 10⁻⁶	8.37 × 10⁻⁶
0	0.12	0	3.5 × 10⁻⁵	3.09 × 10⁻⁵
0	0	0.25	2.878 × 10⁻⁵	2.546 × 10⁻⁵
0	0.12	0.25	7.45 × 10⁻⁶	7.18 × 10⁻⁶
0.35	0	0.25	3.286 × 10⁻⁵	3.404 × 10⁻⁵

3.3. Breather solution

The breather solution for equation (1) is given by [15],

(11a)$\begin{eqnarray}{\psi }_{b}=c\left(1+\displaystyle \frac{{\kappa }^{2}C(x)+{\rm{i}}\kappa \sqrt{4-{\kappa }^{2}}S(x)}{\sqrt{4-{\kappa }^{2}}\cos [\kappa ({ct}+{v}_{b}x)]-2C(x)}\right){{\rm{e}}}^{{\rm{i}}{\phi }_{b}x}.\end{eqnarray}$

In equation (11a), the terms C(x) and S(x) have the following form:

(11b)$\begin{eqnarray}\begin{array}{rcl}C(x) & = & \cosh \left({B}_{b}\kappa \sqrt{1-\displaystyle \frac{{\kappa }^{2}}{4}x}\right),\\ S(x) & = & \sinh \left({B}_{b}\kappa \sqrt{1-\displaystyle \frac{{\kappa }^{2}}{4}x}\right).\end{array}\end{eqnarray}$

The terms v_b, φ_b and B_b in equation (11a) are denoted by,

(11c)$\begin{eqnarray}\begin{array}{rcl}{v}_{b} & = & {\alpha }_{3}{c}^{3}(6-{\kappa }^{2})+{\alpha }_{5}{c}^{5}(30-10{\kappa }^{2}+{\kappa }^{4}),\\ {\phi }_{b} & = & 2[{\alpha }_{2}{c}^{2}+{\alpha }_{4}{c}^{4}(6-{\kappa }^{2})],\quad {B}_{b}=2({\alpha }_{2}{c}^{2}+3{\alpha }_{4}{c}^{4}).\end{array}\end{eqnarray}$

If x = 0, the breather solution takes the form:

(12)$\begin{eqnarray}{\psi }_{b}=c\left(1+\frac{{\kappa }^{2}}{\sqrt{4-{\kappa }^{2}}\cos [\kappa ({ct})]-2}\right),\end{eqnarray}$

where C(x) and S(x) are given in equation (11b).

In figure 8, we present the results of the dynamical prediction of breather solutions and their error estimations using the SCTgNN and TgNN, where α₂ = 1 and c = 1. Unlike solitons and rogue waves, breathers can be predicted using the following initial values when x = 0. Here, t ranges from −15 to 15 and x ranges from −5 to 5, while the other α_i's remain the same. The figures in the first row (figures 8 (a)–(c)) represent the solution of the standard NLS equation, corresponding to α₃ = α₄ = α₅ = 0. In addition, figures 8 (d) and (e) display the error diagram, highlighting the discrepancies between the SCTgNN and TgNN. Upon increasing the value of α₃ to 0.2, while maintaining the other parameters constant, we observe subtle changes in the orientation of the breathers, as depicted in figures 8 (f)–(h). The corresponding error diagrams are provided in figures 8 (i) and (j). Furthermore, when we set α₄ = 0.1 and keep the parameters consistent with the previous case (figures 8 (f)–(h)), the width of the breathers undergoes a change, as shown in figures 8 (k)–(m). The error diagrams for this situation can be observed in figures 8(n) and (o). Similarly, by introducing the fifth-order dispersion parameter (α₅ = 0.2) while maintaining the other parameters from the previous case, we again observe changes in the orientation of the breathers, as illustrated in figures 8 (p)–(r). The corresponding error diagrams are presented in figures 8(s) and (t). In figures 8 (c), (h), (m) and (r), the star markers indicate randomly chosen data points on the initial and boundary conditions. For our analysis, we used 2000 data points for each condition, including α₃, α₄ and α₅. Overall, the results demonstrate that the inclusion of higher-order dispersion parameters α₃, α₄ and α₅ in the standard NLS equation leads to changes in the width and orientation of the breathers. Similarly, the SCTgNN and TgNN models demonstrate superior error prediction capabilities, as illustrated in the MSE prediction figures. These findings indicate that the new SCTgNN and TgNN models excel at estimating the breather solution for the generalized NLS equation, with only a few minor errors. Figure 9 shows the results of breathers for the generalized NLS equation (1) for four different sets of system parameters. The first two rows depict the LPD equations (figures 9 (a)–(e)) and the fifth-order NLS equations (figures 9 (f)–(j)). The last two rows represent the system's behavior with randomly chosen parameters. In the third row (figures 9 (k)-(o)), with parameters α₃ = 0, α₄ = 0.12 and α₅ = 0.38, the orientation of the breathers shows a slight deviation from the original. In the fourth row (figures 9 (p)–(t)), with different values of α₃ = 0.35, α₄ = 0 and α₅ = 0.38, the orientation change is more pronounced.

View original graphic|Download|PPT slide

Figure 8. Results of the breather solution coming from the SCTgNN and TgNN, and their errors with α₂ = 1. First three columns represent the prediction of the SCTgNN, prediction of the TgNN and the exact result, respectively. Errors between the prediction of the SCTgNN and TgNN are shown in the last two columns. Parameters α₃, α₄ and α₅ are displayed in the left side margin.

View original graphic|Download|PPT slide

Figure 9. Results of the breather solution coming from the SCTgNN and TgNN, and their errors with α₂ = 1 for different values of system parameters. First three columns represent the prediction of the SCTgNN, prediction of the TgNN and the exact result, respectively. Errors between the prediction of the SCTgNN and TgNN are shown in the last two columns. Parameters α₃, α₄ and α₅ are displayed in the left side margin.

Similarly, as shown in figures 8 and 9, our results for the MSE of the breather solution of the Hirota equation are significantly smaller compared to those of Zhou et al [26]. While their work reported breather solutions with an MSE of 10⁻², our results exhibit an MSE ranging from 10⁻³ to 10⁻⁵. This indicates that our SCTgNN model provides much more accurate predictions.

Table 3 showcases the outcomes of MSE computations pertaining to the breather solution (the values of the initial four rows relate to figure 8, whereas the subsequent four rows pertain to figure 9). It becomes evident that when incorporating data from parameter points into the neural network training process, there is a conspicuously low MSE in the SCTgNN and TgNN, as depicted in table 3. In simpler terms, the SCTgNN and TgNN consistently demonstrate a lower MSE. These findings reaffirm that the SCTgNN and TgNN exhibit better performance and reliability in investigating breathers in physical systems, confirming their quantitative excellence once again.

Table 3. MSE values for the breather solution using the SCTgNN and TgNN.

Parameters			MSE
α₃	α₄	α₅	SCTgNN	TgNN
0	0	0	1.51 × 10⁻⁴	1.33 × 10⁻⁴
0.35	0	0	6.50 × 10⁻⁵	2.26 × 10⁻⁴
0.35	0.12	0	4.90 × 10⁻⁵	2.10 × 10⁻⁴
0.35	0.12	0.38	2.29 × 10⁻⁴	9.23 × 10⁻⁴
0	0.12	0	9.128 × 10⁻⁵	1.4182 × 10⁻⁴
0	0	0.38	7.194 × 10⁻⁵	4.3755 × 10⁻⁴
0	0.12	0.38	4.96 × 10⁻⁵	3.5827 × 10⁻⁴
0.35	0	0.38	3.41 × 10⁻⁴	1.11 × 10⁻³

4. Conclusion

In our study, we have explored the generalized NLS equation and its localized solutions using a novel network structure known as the SCTgNN, which incorporates the concepts of both the PINN and TgNN models. Under specific conditions, this generalized equation augments results of the NLS, Hirota, LPD and fifth-order equations. Hence, the detailed study on this equation benefits the physical community, in particular the optics and hydrodynamics community. By synergizing the strengths of the TgNN and SCPINN, our aim was to enrich our comprehension of intricate phenomena, particularly in scenarios where standardized patterns do not apply, such as in nonlinear systems. Furthermore, our scope extended beyond the exclusive prediction of rogue wave patterns, as observed in a previous study [45] employing the TgNN model. Instead, we broadened our investigation to encompass a spectrum of wave behavior, including solitons, rogue waves and breathers, within the broader framework of the generalized NLS equation. In addition, we successfully predicted solitons, rogue waves and breathers for the generalized NLS equation using our new SCTgNN model. To evaluate the efficacy of our innovative SCTgNN model and the established TgNN model, we conducted comprehensive MSE predictions. These MSE predictions are well-matched with exact values, yielding very low errors in both models. This approach enhances our capacity to predict and comprehend intricate systems characterized by complex behavior.

Our findings confirm that adding higher-order dispersion parameters to the standard NLS equation leads to changes in the width and orientation of solitons, rogue waves and breathers. Interestingly, both the SCTgNN and TgNN models consistently predict errors with high accuracy. We have integrated system parameters α₂, α₃, α₄ and α₅ into the initial conditions. This provides a significant advantage; the straightforward prediction of solutions for various parameter values. Furthermore, this approach extends beyond the specified range, enabling predictions for a large number of values. This expansion enhances both the applicability and value of our work. These results highlight the strength of the new SCTgNN model and established TgNN model in closely approximating soliton, rogue wave and breather solutions for the generalized NLS equation, even as we consider more complex scenarios with higher-order effects. Recently, Ablowitz and his collaborators introduced the fractional integrable system for the first time [56]. They derived the fractional nonlinear Schrödinger and fractional Korteweg–de Vries equation, and their corresponding one-soliton solutions, by employing the inverse scattering transform method. In the near future, we intend to extend our present analysis to fractional types of integrable equations. We also plan to study higher-order solutions for the considered system.

KT thanks the Science and Engineering Research Board, Government of India for Grant No. CRG/2021/002428. NS thanks DST-SERB, Government of India for providing the National Post-Doctoral Fellowship under Grant No. PDF/2023/000619. NVP thanks the Department of Science and Technology (DST), India, for the financial support under the Women Scientist Scheme-A. The work of MS was supported by the Science and Engineering Research Board, Government of India, under Grant No. CRG/2021/002428. MS also acknowledges MoE-RUSA 2.0 Physical Sciences, Government of India, for sponsoring this research work.

References

Publishing order | Descend order by publishing year | Descend order by cited within

1	Agrawal G P 2008 Applications of Nonlinear Fiber Optics Vol. 10 Academic

2	Le Méhauté B 2013 An Introduction to Hydrodynamics and Water Waves Springer

3	Liu W M, Kengne E 2019 Schrödinger Equations in Nonlinear Systems Springer

4	Akhmediev N N, Ankiewicz A 1997 Nonlinear Pulses and Beams Springer

5	Bergé L 1998 Wave collapse in physics: principles and applications to light and plasma waves Phys. Rep. 303 259 DOI

6	Sun W R, Tian B, Zhen H L, Sun Y 2015 Breathers and rogue waves of the fifth-order nonlinear Schrödinger equation in the Heisenberg ferromagnetic spin chain Nonlinear Dyn. 81 725 DOI

7	Wang Q M, Gao Y T, Su C Q, Shen Y J, Feng Y J, Xue L 2015 Higher-order rogue waves for a fifth-order dispersive nonlinear Schrödinger equation in an optical fibre Z. Naturf. A 70 365 DOI

8	Chai J, Tian B, Zhen H L, Sun W R 2015 Conservation laws, bilinear forms and solitons for a fifth-order nonlinear Schrödinger equation for the attosecond pulses in an optical fiber Ann. Physics 359 371 DOI

9	Sinthuja N, Manikandan K, Senthilvelan M 2021 Rogue waves on the double-periodic background in Hirota equation Eur. Phys. J. Plus. 136 1 DOI

10	Wang L, Zhang J H, Wang Z Q, Liu C, Li M, Qi F H, Guo R 2016 Breather-to-soliton transitions, nonlinear wave interactions, and modulational instability in a higher-order generalized nonlinear Schrödinger equation Phys. Rev. E 93 012214 DOI

11	Zhang H Q, Chen F 2021 Rogue waves for the fourth-order nonlinear Schrödinger equation on the periodic background Chaos 31 023129 DOI

12	Sinthuja N, Rajasekar S, Senthilvelan M 2023 Instability of single-and double-periodic waves in the fourth-order nonlinear Schrödinger equation Nonlinear Dyn. 111 16497 DOI

13	Sinthuja N, Manikandan K, Senthilvelan M 2021 Formation of rogue waves on the periodic background in a fifth-order nonlinear Schrödinger equation Phys. Lett. A 415 127640 DOI

14	Sinthuja N, Senthilvelan M 2024 Rogue wave solutions over double-periodic wave background of a fifth-order nonlinear Schrödinger equation with quintic terms Int. J. Mod. Phys. B 38 2450082

15	Ankiewicz A, Kedziora D J, Chowdury A, Bandelow U, Akhmediev N 2016 Infinite hierarchy of nonlinear Schrödinger equations and their solutions Phys. Rev. E 93 012206 DOI

16	Lakshmanan M S 2012 Introduction and Applications Springer

17	Kharif C, Pelinovsky E, Slunyaev A 2008 Rogue Waves in The Ocean Springer

18	Guo B, Tian L, Yan Z, Ling L, Wang Y F 2017 Rogue Waves: Mathematical Theory and Applications in Physics Walter de Gruyter GmbH

19	Pelinovsky E, Kharif C 2008 Extreme Ocean Waves Springer

20	Onorato M, Resitori S, Baronio F 2016 Rogue and Shock Waves in Nonlinear Dispersive Media Springer

21	Wang L H, Porsezian K, He J S 2013 Breather and rogue wave solutions of a generalized nonlinear Schrödinger equation Phys. Rev. E 87 053202 DOI

22	Randoux S, Suret P, El G 2016 Inverse scattering transform analysis of rogue waves using local periodization procedure Sci. Rep. 6 29238 DOI

23	Chen S, Yan Z 2019 The higher-order nonlinear Schrödinger equation with non-zero boundary conditions: robust inverse scattering transform, breathers, and rogons Phys. Lett. A 383 125906 DOI

24	Feng B F, Shi C, Zhang G, Wu C 2022 Higher-order rogue wave solutions of the Sasa-Satsuma equation J. Phys. A: Math. Theor. 55 235701 DOI

25	Guo B, Ling L, Liu Q P 2012 Nonlinear Schrödinger equation: generalized Darboux transformation and rogue wave solutions Phys. Rev. E 85 026607 DOI

26	Zhou Z, Yan Z 2021 Deep learning neural networks for the third-order nonlinear Schrödinger equation: bright solitons, breathers, and rogue waves Commun. Theor. Phys. 73 105006 DOI

27	Zhang Y, Wang L, Zhang P, Luo H, Shi W, Wang X 2022 The nonlinear wave solutions and parameters discovery of the Lakshmanan-Porsezian-Daniel based on deep learning Chaos, Solitons Fractals 159 112155 DOI

28	Zhong M, Yan Z 2022 Data-driven soliton mappings for integrable fractional nonlinear wave equations via deep learning with fourier neural operator Chaos, Solitons Fractals 165 112787 DOI

29	Samei M E, Zanganeh H, Aydogan S M 2021 Investigation of a class of the singular fractional integro-differential quantum equations with multi-step methods J. Math. Extension 15 1 54 DOI

30	Aydogan S M 2021 On a k-dimensional system of hybrid fractional differential equations with multi-point boundary conditions J. Math. Extension 10 1 18 DOI

31	Khan H, Alam K, Gulzar H, Etemad S, Rezapour S 2022 A case study of fractal-fractional tuberculosis model in China: Existence and stability theories along with numerical simulations Math. Comp. Sim. 198 455 DOI

32	Baleanu D, Etemad S, Mohammadi H, Rezapour S 2021 A novel modeling of boundary value problems on the glucose graph Commun. Nonlinear Sci. Num. Sim. 100 105844 DOI

33	Baleanu D, Jajarmi A, Mohammadi H, Rezapour S 2020 A new study on the mathematical modelling of human liver with Caputo-Fabrizio fractional derivative Chaos, Solitons Fractals 134 109705 DOI

34	Tuan N H, Mohammadi H, Rezapour S 2020 A mathematical model for COVID-19 transmission by using the caputo fractional derivative Chaos, Solitons Fractals 140 110107 DOI

35	Aydogan S M, Hussain A, Sakar F M 2021 On a nonlinear fractional order model of novel coronavirus (nCoV-2019) under AB-fractional derivative J. Math. Extension 11 1 31 DOI

36	Peng W Q, Pu J C, Chen Y 2022 PINN deep learning method for the Chen-Lee-Liu equation: Rogue wave on the periodic background Commun. Nonlinear Sci. Numer. Simul. 105 106067 DOI

37	Wu G Z, Fang Y, Wang Y Y, Wu G C, Dai C Q 2021 Predicting the dynamic process and model parameters of the vector optical solitons in birefringent fibers via the modified PINN Chaos, Solitons Fractals 152 111393 DOI

38	Fang Y, Wu G Z, Wang Y Y, Dai C Q 2021 Data-driven femtosecond optical soliton excitations and parameters discovery of the high-order nlse using the PINN Nonlinear Dyn. 105 603 DOI

39	Pu J, Peng W, Chen Y 2021 The data-driven localized wave solutions of the derivative nonlinear Schrödinger equation by using improved PINN approach Wave Motion 107 102823 DOI

40	Yin Y H, Lü X 2023 Dynamic analysis on optical pulses via modified PINNs: soliton solutions, rogue waves and parameter discovery of the CQ-NLSE Commun. Nonlinear Sci. Numer. Simul. 126 107441 DOI

41	Fang Y, Wang Y Y, Liu W, Dai C Q 2022 Data-driven prediction of soliton solutions of the higher-order NLSE via the strongly-constrained PINN method Comput. Math. Appl. 127 144 DOI

42	Fang Y, Bo W B, Wang R R, Wang Y Y, Dai C Q 2022 Predicting nonlinear dynamics of optical solitons in optical fiber via the SCPINN Chaos, Solitons Fractals 165 112908 DOI

43	Qin S M, Li M, Xu T, Dong S Q 2023 A-WPINN algorithm for the data-driven vector-soliton solutions and parameter discovery of general coupled nonlinear equations Physica D 443 133562 DOI

44	Qin S M, Li M, Xu T, Dong S Q 2023 AM-GPINN algorithm and its application in a variable-coefficient resonant nonlinear Schrödinger equation Phys. Scr. 98 025219 DOI

45	Bai X D, Zhang D 2022 Search for rogue waves in Bose-Einstein condensates via a theory-guided neural network Phys. Rev. E 106 025305 DOI

46	Wang N, Zhang D, Chang H, Li H 2020 Deep learning of subsurface flow via theory-guided neural network J. Hydrol. 584 124700 DOI

47	Crabb M, Akhmediev N 2019 Doubly periodic solutions of the class-I infinitely extended nonlinear Schrödinger equation Phys. Rev. E 99 052217 DOI

48	Chowdury A, Kedziora D J, Ankiewicz A, Akhmediev N 2014 Soliton solutions of an integrable nonlinear Schrödinger equation with quintic terms Phys. Rev. E 90 032922 DOI

49	Kang Z Z, Xia T, Ma W X 2021 Riemann-Hilbert method for multi-soliton solutions of a fifth-order nonlinear schrödinger equation Anal. Math. Phys. 11 14 DOI

50	Yang Y, Yan Z, Malomed B A 2015 Rogue waves, rational solitons, and modulational instability in an integrable fifth-order nonlinear Schrödinger equation Chaos 25 103112 DOI

51	Lan Z Z, Gao Y T, Zhao C, Yang J W, Su C Q 2016 Dark soliton interactions for a fifth-order nonlinear Schrödinger equation in a Heisenberg ferromagnetic spin chain Superlattice. Microst. 100 191 DOI

52	Song N 2017 Rogue wave solutions and generalized Darboux transformation for an inhomogeneous fifth-order nonlinear Schrödinger equation J. Funct. Space. 2017 910926 DOI

53	Jia H X, Shan D M 2017 Nonlinear stage of modulation instability for a fifth-order nonlinear Schrödinger equation Z. Naturf. A 72 1071 DOI

54	Yomba E, Zakeri G A 2017 Collision of N-solitons in a fifth-order nonlinear Schrödinger equation Wave Motion 72 101 DOI

55	Zhang R, Su J, Feng J 2023 Solution of the Hirota equation using a physics-informed neural network method with embedded conservation laws Nonlinear Dyn. 111 13399 DOI

56	Ablowitz M J, Been J B, Carr L D 2022 Fractional integrable nonlinear soliton equations Phys. Rev. Lett. 128 184101 DOI

Options

Outlines

模态框（Modal）标题

Abstract

Cite this article

1. Introduction

2. Model and method

3. Data-driven soliton, rogue wave and breather solutions for the generalized NLS equation

3.1. Soliton solution

Table 1. MSE values for soliton solution using the SCTgNN and TgNN.

3.2. Rogue wave solution

Table 2. MSE values for the rogue wave solution using the SCTgNN and TgNN.

3.3. Breather solution

Table 3. MSE values for the breather solution using the SCTgNN and TgNN.

4. Conclusion

References