Introduction

The study on the correlation between impact sensitivity of energetic materials and molecular structures has recently been an ongoing area of research in explosive theory [1]. The so-called impact sensitivity h 50%is measured experimentally by drop weight impact test, where a height of 50% probability in causing an explosion is measured when hit by a hammer with a standard weight [2, 3]. Since storage, synthesis, and application of energetic materials are significantly affected by impact or shock sensitivity, the researches about the impact sensitivity is of primary importance. Once we get the ability to predict potential explosives using computational methods, it will provide significant cost savings, increase the safety of storage and testing of these compounds, and provide for greater control of the performance of these explosives [4]. Presently, the usage of quantum mechanical methods as a predictive tool for assessing relationships between molecular properties of various explosive molecules and their impact sensitivities has been suggested in several previous works. Correlations between impact sensitivity and chemical composition reported earlier by Kamlet and Adolph were probably the most widely applied in the area of energetic materials research [5]. These demonstrated that for families of high-energy molecules with similar decomposition mechanisms, there were approximately linear relationships between log h 50%and OB100, the latter was a measure of “oxygen balance”; MNDO/3 molecular orbital method was used by Owens [6] to calculate the energy for bond rupture for a number of energetic molecules and the results showed that the weakest bond was generally that between the NO2and the remainder of the molecule. In addition, his calculation results indicated a correlation between the magnitude of the energy barrier and the susceptibility to detonation; Politzer [7] investigated shock-sensitivity relationships for nitramines and nitroaliphatics, and showed that shock sensitivity was related to the strengths of all the N–NO2and C–NO2bonds, taken in conjunction with overall molecular size; Rice and co-workers [8, 9] pointed out that for nitroaromatic molecules there was a correlation between the bond dissociation energy of the weakest bond and sensitivity. Moreover, they investigated the relation between impact sensitivity and charge distribution in energetic molecules; Xiao and co-workers [1012] once proposed a principle of the smallest bond order to identify the relative magnitudes of impact sensitivity of most energetic compounds. For polynitroaromatic series, Jinshan and Gang [1] showed that the Mulliken bond order of C–NO2linkage correlated well with the observed values of h 50%; Zeman found a logarithmic relationship between the drop energy E drand heats of fusion for 33 polynitro compounds [13]. The correlation between impact sensitivity and approximated heats of detonation of several nitramines was examined by Edwards et al. [4]. Besides, many other properties have been found to correlate with impact sensitivities. These properties include molecular electronegativities [14, 15], vibrational states [16, 17], molecular weights and detonation gas concentrations [18], parameters related to oxidation numbers [19], partial atomic charges [15, 2024], heats of reaction [25, 26], heats of explosion [27], bond orders [28, 29], and activation energies [2731]. Although works listed above succeeded in finding a certain kind of correlations between some of the specific molecular properties and the impact sensitivity, many of the correlations almost existed in a certain type of explosive and the number of explosives in each type was usually small. In addition, most of the works simply examined the correlation between one or two molecular properties and impact sensitivity of explosive compounds. Consequently, in order to make a further study on the correlation between impact sensitivity and molecular properties, investigations should be carried out on how molecular properties affect impact sensitivity for the same type of energetic materials with similar molecular structure. In the present work, DFT method B3P86/6-31G**is adopted to calculate molecular properties of 33 nitramine explosives, and artificial neural networks method is utilized to research the correlation order between different molecular properties and impact sensitivity.

The neural network is a nonlinear function of many parameters that maps particular inputs (in our case, molecular properties) to an output (in our case, impact sensitivity). The network is made up of simple, yet highly connected, processing elements called nodes [32]. They are computer-based emulations of the brain’s information-processing capability [3]. Neural networks have been applied to various engineering problems, such as robotics, pattern recognition, speech, etc. [33]. In addition, the use of neural network methods has grown constantly in a variety of applications in chemistry and physics since their first utilization for the prediction of protein secondary structure. For example, neural networks have been used to calculate the ground-state eigenenergy of two-dimensional harmonic oscillators [34], to solve nonhomogenous ordinary and partial differential equations [35], and to obtain the electronic correlation energy for atoms and diatomic molecules [36]. But for the explosive engineering, there exist few reports about the impact sensitivity based on neural networks method, although Nefati [3] and Cho et al. [37] predicted the impact sensitivity of various types of explosive molecules via neural networks. However, they both provided better impact sensitivity values by employing dozens of compositional and topological descriptors, which is not very realistic for explosive synthesis engineering, and which property can correlate with the sensitivity best is still not clear. In the current work, we will choose five fundamental molecular descriptors in each input set so as to examine the sensitivities more conveniently. Since the fundamental molecular properties were easier to calculate, correlation study should be used to identify fundamental molecular properties that indicate sensitivity best. In addition, their works did not include the correlation order between impact sensitivity and molecular properties, which will be studied in the following part of this paper.

Energetic secondary nitramines are currently a subject of increasing interest in high-energy materials research. Of particular importance are efforts aimed at gaining insight into how molecular features of nitramines influence their impact sensitivities.

In this paper, we will utilize the back-propagation neural network (BP) to analyze the correlation between impact sensitivity of nitramines and molecular properties. In particular, the correlation order between various molecular properties and impact sensitivity will also be examined.

Theoretical methods

Calculation method

Thirty-three nitramine molecules as shown in Table 1were investigated in this paper. All the experimental impact sensitivity values were taken from [38]. In this work, by using Gaussian 98 hybrid DFT B3P86 method, the Becke-style three-parameter density functional theory [39] with the Perdew’s 86 (P86) [40] in combination with the basis set 6-31G**[41], the geometry was optimized and then the molecular structures were calculated. The calculated molecular properties included the sum of electronic and zero-point energy (E), zero-point vibration energy (ZPE), dipole moment (μ), electronic spatial extent (R 2), highest occupied molecular orbital (HOMO), and lowest unoccupied molecular orbital (LUMO). Furthermore, the atomization energy of various explosive molecules (ΔE) and the bond dissociation energy of the weakest X–NO2bond (BDE) were also calculated in this paper.

Table 1 Thirty-three nitriamine molecular formulas and our final calculated values (h 50%, cm) using set 6 and set 1 along with experimental values

One can expect that the properties listed above examined for correlations contain many “obvious” choices and may be already well described in previous works. That is true when only one or two molecular properties are examined. But until recently, the correlation order of different molecular properties with impact sensitivity has not been studied for the same type of energetic materials with similar molecular structure, and which property correlates with sensitivity best is still not known when different properties are considered. Here, we would like to present a detailed study on the correlation between different molecular properties and impact sensitivity for nitramines.

Back-propagation neural network

In the field of structure–activity or structure–property relationships, the back-propagation neural networks, particularly the three-layer networks, have gained wide acceptance [3]. So a fully connected three-layer architecture, which had an input layer, a hidden layer, and an output layer, was adopted in our neural network.

A back-propagation neural network, which was first developed by Rumelhart and McClelland [42], “learns” by repeatedly passing data through neurons and adjusting their weights and biases to minimize the mean-squared error (MSE), \(\sum {(t_i } - o_i )^2 /n\)(iis a training observation and nis the total number of input vectors), until the output (o i ) predicted by network matches the target (t i ) or given property point values.

Table 2 Twelve molecular descriptors and eight different input sets calculated for this study

The neural network can be trained to perform a particular task using a variety of learning algorithms. In this paper, we chose BP algorithm implementing Levenberg–Marquardt variation for the network, whose training speed was much faster than that of a gradient descent algorithm [43].

By using BP neural network, emphasis was primarily laid on finding out the molecular descriptor, which had the strongest correlation with impact sensitivity of nitramines. Thus, impact sensitivity h 50%was an obvious choice of the only output neuron. Similarly, selecting the proper molecular descriptors of nitramine molecules, which were to be used as the input neurons of the neural network, was also our focus. After a preliminary evolutionary multiple linear regression treatment, BDE/E, BDE*μ, HOMO−LUMO, μ, R 2, BDE, ΔE, (HOMO−LUMO)*BDE and the number of C, H, N, O atoms in each CHNO nitramine molecule, which indicated the value of bond dissociation energy divided by total energy, bond dissociation energy multiplied by dipole moment, energy gap, electronic spatial extent, bond dissociation energy, atomization energy, and energy gap multiplied by bond dissociation energy, respectively, were chosen as input descriptors. In order to find out the correlation order between molecular descriptors and impact sensitivity of nitramine molecules, the descriptors mentioned above were divided into eight different sets, each were combined with the number of C, H, N, O atoms comprising each CHNO nitramine molecule. In this case, eight different input sets, with the number of C, H, N, O atoms being the common descriptors in each set, were constructed, as summarized in Table 2.

Usually, the difficulty in finding optimum neural-network architecture resided in deciding on the number of hidden layers and the number of neurons in them. By trial and error, we found that only one hidden layer containing two neurons yielded better overall results than those with more neurons in hidden layers despite the fact that these latter networks could give slightly better fitting of the data set. Therefore, the 5-2-1 structure, which was 5 neurons in input layer, 2 neurons in hidden layer, and 1 neuron in output layer, was adopted for our neural network as depicted in Fig. 1.

Fig. 1
figure 1

5-2-1 BP network structure

During training and test process, we divided 33 nitramine molecules into two groups, a training group and a test group, respectively. The training group was used to help the net learn the relationship between the given input vectors and the target output vector. Meanwhile, the test set consisting of a few compounds, some of which may be dissimilar to the training group, were chosen to validate the network’s generation ability. In order to accelerate the convergent speed, input groups must be normalized. The normalized formula is

$$x_i ^\prime = (x_i - u)/s_{\rm d}$$
(1)

where \(x_i ^\prime\)is the data after normalization, x i is the original data, uand s dare respectively the average value and standard error of the original data [44]. Now, various types of threshold functions are used in neural networks [45], the most common being the sigmoid function, which is incorporated as a transfer function in the MATLAB Neural Network Toolbox 3.0, installed in a PC computer. In this paper, the log sigmoid transfer function, log(1/1+ez) was used to calculate the outputs of the neurons in hidden layer and output layer. The training parameters were generated randomly. The learning rate was optimized by trial and error and the best value obtained was 0.05. The reasonable output error was 0.001. The neural network was trained for 2000 epochs (using incremental updating or randomization of the training data order to avoid local minima), which were required to reach a maximum error of 0.001. In Fig. 2, the maximum error of the network could be observed as a function of the number of propagation cycles during the training process. As can be seen, the estimate of the network accuracy was asymptotically decreasing functions. After about 60 cycles, the network converged to a maximum error of 0.00099. Further training could not improve the performance of the network, and at this point training was stopped.

Fig. 2
figure 2

The curve of generation ability of the network

Fig. 3
figure 3

Comparison of mean-squared error for eight input sets

After training process, a few more nitramine molecules, which belonged or did not belong to the input groups, were chosen as test group to validate the generation ability of the network. The reasoning for this specific construction was to allow the test group to contain molecules which were dissimilar to the training group, so that the test and prediction results would exhibit distinct outliers. The test procedure was repeated five times starting from different initial random sets of weights, a technique known to weaken the detrimental effects of local minima on the error surface [46], and the mean-square errors were calculated, as shown in Fig. 3. The mean values, which were obtained by employing set 6, were used as the final values presented in Table 1. Meanwhile, the predicted impact sensitivity values of nitramines calculated by employing set 1 were also listed in Table 1for comparison. Two plots of predicted versus experimental h 50%values obtained by employing input set 6 and set 1 are depicted in Figs. 4and 5to illustrate our conclusion more intuitively.

Fig. 4
figure 4

Predicted vs. experimental h 50%values obtained by employing set 6 containing ΔE

Fig. 5
figure 5

Predicted vs. experimental h 50%values obtained by employing set 1 containing (HOMO−LUMO)*BDE

Results and discussion

As shown in Fig. 3, MSE values were in the range from 0.782 to 1.200. The smallest MSE value was from the set 6 with ΔE, and the biggest MSE value was from the set 1 including (HOMO−LUMO)*BDE. It indicated that with the same network architecture and training parameters, molecular descriptor ΔEhad the strongest correlation while (HOMO−LUMO)*BDE had the slightest correlation with impact sensitivity of the nitramines studied in this work. It is noteworthy that the correlation order studied here was in fact between input sets and impact sensitivity. Since corss-correlation of descriptors in each input set was very slight and each set had four common descriptors, the difference of MSE values in training and test process can be thought mainly coming from very different descriptor in each input set. Thus, we considered the correlation order was mainly decided by the very different descriptor in each set.

Two plots of predicted versus experimental h 50%values obtained by employing set 6 and set 1 are displayed in Figs. 4and 5, respectively. Solid lines indicate perfect agreement between experiment and prediction. As shown in Fig. 5, we could see that the predicted values in set 1, in which the molecular descriptor (HOMO−LUMO)*BDE was included, were mostly larger than the experimental data. Nevertheless, in Fig. 4, the predicted values in set 6, in which ΔEwas contained, were in excellent agreement with the experimental values. It clearly indicated that ΔEhad the strongest correlation with impact sensitivity while (HOMO−LUMO)*BDE had the slightest correlation with impact sensitivity. Predicted values in other six sets were all worse than set 6, but better than set 1.

In quantum chemistry, atomization energy (ΔE) was defined as the difference between molecular energy and the sum of energies of atoms comprising the molecule. For instance such as CH4N4O2,

$$\displaylines{ \Delta E({\rm CH}_4 {\rm N}_4 {\rm O}_2 ) = E({\rm C}) + 4E({\rm H}) + 4E({\rm N})\cr +\, 2E({\rm O}) - E({\rm CH}_4 {\rm N}_4 {\rm O}_2 )}$$
(2)

For nitramine explosives, whose structures were mainly made up of carbon chains, the molecular descriptor ΔEhad the strongest correlation with impact sensitivities. This conclusion was in agreement with Edward et al.’s work [4].

BDE here was defined as the difference between the zero-point-corrected energy of parent nitramine molecule and that of the products of the unimolecular dissociation in which an NO2group was removed. For example, for CH4N4O2, the BDE was

$$\displaylines{ {\rm BDE (CH}_{\rm 4} {\rm N}_{\rm 4} {\rm O}_{\rm 2} {\rm )} = {\rm [}E{\rm (CH}_{\rm 4} {\rm N}_{\rm 3} {\rm )} + E{\rm (NO}_{\rm 2} {\rm )]} \cr - E{\rm (CH}_{\rm 4} {\rm N}_{\rm 4} {\rm O}_{\rm 2} {\rm )}}$$
(3)

It was noteworthy that for nitramines studied in this work, the breaking of the R–NO2may be the rate-controlling step in the initiation of detonation, which again supported the Owens’ conclusion [6]. Based on chemical reaction kinetics, Arrhenius equation showed that the reaction rate constant depended on the ratio of activation barrier to temperature, so the activation energy may be a key factor in impact initiation and sensitivity of a material. As BDE/Ehad a certain relationship with activation energy, a strong correlation might exist between impact sensitivity and BDE/E, which indicated that bond dissociation energy in unit molecular energy. In this paper, descriptor BDE/Ehad the second strongest correlation with impact sensitivity of nitramines, with MSE value 0.833.

As depicted in Fig. 3, the correlation order after ΔEand BDE/Ewas as follows: R 2, BDE, μ, BDE*μ, HOMO−LUMO, and (HOMO−LUMO)*BDE. For the last one, the difference between LUMO and HOMO was defined as electronic transition level, which had influence on molecular properties. The lower the HOMO was, the larger was the molecular ionization energy. LUMO was related to molecular affinity, the smaller it was, the more the energy of system decreased when electrons came into the orbital. (HOMO−LUMO)*BDE and HOMO−LUMO had the weakest correlation with impact sensitivity, which indicated that the initial process of detonation correlated slightly with electronic excitation for nitramines we studied.

Conclusion

In this paper, by using DFT B3P86/6-31G**method, the electronic structures of 33 nitramine molecules were calculated. Based on the calculated molecular properties, we performed BP neural network to study on the correlation between impact sensitivity of nitramine explosives and molecular structures. Eight different input sets were constructed to train the network, and a few more molecules were chosen to test the generation ability of the network. Best results were obtained by employing set 6, which contained the molecular descriptor ΔE. The correlation order between impact sensitivity and the descriptors was obtained as follows: ΔE, BDE/E, R 2, BDE, μ, BDE*μ, HOMO−LUMO, and (HOMO−LUMO)*BDE. It demonstrates that the atomization energy ΔEis the best index of impact sensitivity for nitramine explosives, and BDE/Ethe second. Whether this conclusion is general for other chemical families needs to be further studied.