METHOD FOR DETERMINING INFORMATIVE PARAMETERS IN NEURAL NETWORK MODELING OF CHEMICAL KINETICS
Kolesnikova O.P.(1,2), Malsagov M.Yu.(2), Mikhalchenko E.V.(2), Karandashev I.M.(2,3)
(1) National Research Center Kurchatov Institute - NIISI
117218, Moscow, Nakhimovsky ave, 36.1
(2) Moscow Institute of Physics and Technology
141700, Dolgoprudny, Institutsky lane, 9
(3) Peoples' Friendship University of Russia
117198, Moscow, Miklukho-Maklaya st., 6
The problem of approximating the temporal evolution of a hydrogen–oxygen mixture using a neural network is considered. A fully connected network with 5 layers and LeakyReLU activation was used, trained with Adam (lr = 0.01, 100 epochs) on MSE. Both the input and output of the neural network are vectors comprising the time step Δt, the system temperature, and the molar concentrations of 11 chemical species, with relevant parameter ranges: K, Pa, and s.
Analysis of the test trajectories revealed significant heterogeneity in the error. For the groups with minimal and maximal errors, the spectrum of the Jacobian of the original system was studied. It was found that the key difference is associated with the minimal eigenvalues: for well-approximated trajectories, is on the order of , while for poorly approximated ones, ranges from to . Small values are caused by slow reactions (small ), low concentrations, and quasi-steady-state balancing of forward and reverse reactions. Therefore, the model error is determined by the regimes of the predicted profiles.
Based on this identified relationship, a modification of the loss function is proposed. In addition to the standard MSE term, an additional term is introduced that accounts for the discrepancy between predicted and true values of the logarithms of reaction rate constants, computed using the Arrhenius formula from the predicted temperature.
Comparison of different training variants shows: baseline model: MSE = , STD = ; using all rate constants: MSE = , STD = ; using the selected subset of constants associated with the formation of the slow modes of the system: MSE = , STD = .
It is shown that the inclusion of a physically informed term in the loss function is effective only with a targeted selection of reactions. Using all constants leads to a deterioration of model quality, whereas considering the reactions that determine slow modes results in more than a twofold reduction in error compared to the baseline approach.
Thus, spectral analysis of the Jacobian can serve as a tool for identifying informative kinetic parameters in neural network modeling of chemical kinetics.
The work was carried out within the framework on the topics FNEF-2024-0001 and FNEF-2024-0002.