The capabilities of identifying trace organic compounds in complex mixtures by gas chromatography–mass spectrometry can be effectively implemented only by the joint use of electron ionization mass spectra and gas chromatographic retention indices (RIs). However, judging by the contents of modern databases (including the NIST Mass Spectral Library [1]), the number of compounds characterized by mass spectra is significantly lower than the number of compounds for which RIs on standard stationary phases are known. For example, the NIST database [1] (version 2020) contains 350 643 electron ionization mass spectra of 306 869 compounds, but RI values are available for only 139 498 compounds. Among them, both mass spectra and retention indices are known simultaneously for only a portion (111768) of test substances.

The limiting stage in the formation of databases of reference RI values is the availability of the test substances. Their target synthesis, if necessary, is the most time consuming step. For this reason, a rapid increase in the volume of reference data arrays is unlikely, and the improvement of methods for calculating retention indices based on molecular structures is of considerable current importance. There are a huge number of methods used for calculating RIs, the consideration of which is an independent task. Recently, algorithms that involve the use of various topological descriptors (QSRR) have become popular [24], but self-learning algorithms based on the principles of neural networks [58] and more complex ones (for example, gene expression programming [9]) are becoming more widespread. At the same time, it cannot be argued that the possibilities of well-known and widely used additive schemes have been exhausted [10, 11].

The advantages of modern computer methods for calculating RIs [59] manifested themselves in the processing of the largest possible data samples, which is not always reasonable. The point is that, as new data are accumulated, the known reference values of RIs can be refined or even corrected (this also applies to the NIST database [1]), and this leads to the need to process the array of information for all compounds again. In addition, the accuracy of the estimates of retention indices in such algorithms is actually averaged over all of the test substances. This makes it difficult to identify less reliable data, which often has to be done manually. Moreover, the need to solve more local problems of estimating the RIs of single compounds using simpler methods is of current interest in the practice of chromatographic analysis. At the same time, the accuracy characteristics of such estimates should relate to particular results rather than represent average values for the calculation method as a whole. In such cases, algorithms applicable to small taxonomic groups are effective, including the following:

(1) based on a correlation of RI with the position of a characterized compound in the corresponding group of homologs, congeners, or structural analogs, RI = f(nx);

(2) as a result of a correlation of the RI values of compounds from a selected taxonomic group with their physicochemical characteristics, RI = f(Ax);

(3) as a result of a correlation of the RIs of compounds from a taxonomic group (1) with known values of the RIs of compounds from another group (2) (structural analogs), RI1 = f(RI2).

Algorithms of the first group are an extended version of additive schemes; thus, their capabilities are comparable to each other. Methods of the second group provide the highest accuracy in estimating gas chromatographic retention indices, especially those involving the use of normal boiling points of homologs or congeners [12, 13]. The reason for this is that the number of compounds for which the values of Tb were determined with high accuracy significantly exceeds the number of compounds characterized by RIs. However, some complexity of these correlations lies in the fact that the relations used for calculations are nonlinear (linear logarithmic) [12, 13], which somewhat complicates the calculations:

$$\log {\text{RI}} = a\log {{T}_{{\text{b}}}} + bA + c,$$
(1)

where A is a parameter that characterizes the position of a particular compound in the corresponding taxonomic group (for example, the number of carbon atoms in a molecule, the number of rings, formal unsaturation, etc.); a, b, and c are coefficients calculated by the least squares method.The calculation of retention indices based on a correlation of their values for structural analogs of different groups (the third group; in the general case, such dependences are nonlinear) involves the use of similar linear logarithmic relationships [14]:

$$\log {\text{R}}{{{\text{I}}}^{1}} = a\log {\text{R}}{{{\text{I}}}^{2}} + bA + c,$$
(2)

where the meaning of the parameters A, a, b, and c is the same as in formula (1).

Considering the possibilities of correlation of gas chromatographic retention indices of structural analogs from different taxonomic groups, note a practically important special case. For some combinations of structural analogs from different classes, the function RI1 = f(RI2) can be approximated with acceptable accuracy by the simplest linear regression equation

$${\text{R}}{{{\text{I}}}^{1}} \approx aR{{I}^{2}} + b.$$
(3)

In such cases, the estimation of RIs is greatly simplified because it does not require the use of nonstandard calculation methods. However, no a priori rules for identifying groups of congeners to provide an acceptable accuracy of the results obtained by this variant have been formulated so far.

As an example, Fig. 1a illustrates the dependence of the retention indices of bromine-substituted methanes CH4 – xBrx with the numbers of bromine atoms from one to four (1 ≤ х ≤ 4) on the values of RIs for structurally similar chlorine-substituted methanes CH4 – xClx (1 ≤ х ≤ 4). It would seem that one should not expect large differences in the properties of these halogen derivatives; they are characterized by similar permittivities and dipole moments, and the atomic radii of bromine and chlorine are close (1.00 and 1.15 angstrom units, respectively). However, the function RI(CH4 – xBrx) = f[RI(CH4 – xClx)] is nonlinear, and calculations under these conditions can be performed only using Eq. (2). At the same time, a similar relationship between the retention indices of iodomethanes and bromomethanes (Fig. 1b) is nearly linear (the correlation coefficient R = 0.9989), although the difference in the atomic radii of iodine and bromine is greater (1.40 and 1.15 angstrom units, respectively). Thus, from this example, it follows that the linearity or nonlinearity of the correlations RI1 = f(RI2) is determined by some reasons other than the physicochemical characteristics of compared structural analogs.

Fig. 1.
figure 1

Illustration of typical nonlinear and almost linear dependences of the retention indices of compounds from different taxonomic groups: (a) bromomethanes and chloromethanes, 1 ≤ х ≤ 4; (b) iodomethanes and bromomethanes, 1 ≤ х ≤ 4. (a) The nonlinear dependence can be linearized using Eq. (2). (b) Linear regression parameters: a = 1.49 ± 0.05, b = –101 ± 39, R = 0.9989, and S0 = 23.

Commenting on Fig. 1b, it should be noted that the RI of tetraiodomethane (1862) is disproportionately overestimated compared to those of other iodine derivatives: 516 ± 7 (CH3I), 899 ± 9 (CH2I2), and 1209 ± 24 (CHI3). It was included in the NIST database [1] based on two publications [15, 16], but, most likely, it was erroneous. The fact is that CI4 melts at 168–171°С with decomposition; therefore, its gas chromatographic analysis is impossible due to degradation in the chromatographic injector. Therefore, the RI value belonging to another compound (a decomposition product of CI4) was replaced by an extrapolated theoretically expected estimate of 1446 calculated from linear logarithmic relation (2) based on data for the other three iodomethanes. As noted previously [17], checking the reliability of initial data is a necessary step of property–property correlations.

The purpose of this work was to identify criteria for the applicability of the simplest linear relations (3) to the estimation of gas chromatographic retention indices of congeners from various taxonomic groups based on the index values for congeners from other groups. The problems to be solved included the evaluation of unknown RI values, the verification of their correctness, and the identification of analytes in some cases.

INITIAL DATA ACQUISITION AND PROCESSING (EXPERIMENTAL)

The NIST database [1] was a source of gas chromatographic retention indices of compounds under consideration on standard nonpolar polydimethylsiloxane stationary phases. The RI values of ethoxysilanols (C2H5O)4 – xSi(OH)x (1 ≤ х ≤ 3) reported by Zenkevich and Baranov [18] were used (the conditions of gas chromatographic analysis were also specified).

Excel (Microsoft 2007) and Origin (versions 8.1 and 4.1) were used for least squares calculations. As criteria for the correctness of the linear relations RI1aRI2 + b, we selected (a) the correlation coefficients R > 0.99 and (b) the results of comparing the minimum RI differences of homologs from a characterized group with the generalized variances S0. The values of S0 in the first approximation characterize the reliability of values calculated using a linear regression equation (for more accurate estimates, so-called error corridors are used [19]). The results of the approximation of RIs by relations (3) are significant if the minimum differences between the retention indices of neighboring structural analogs exceed twice the values of S0:

$${\text{min}}\left| {{\text{R}}{{{\text{I}}}_{i}}-{\text{R}}{{{\text{I}}}_{i}}_{{ \pm 1}}} \right| > 2{{S}_{0}}.$$
(4)

The failure of inequality (4) corresponds to unreliable estimates of RIs, which are inappropriate to use for solving analytical problems.

RESULTS AND DISCUSSION

Taxonomic groups of structural analogs. Features of linear correlations between the gas chromatographic retention indices of structural analogs were revealed using compounds from the following taxonomic types as examples:

(1) Substituted methanes: CH4 → CH3X → CH2X2 → CHX3 → CX4. Because the first member (CH4, RI = 100) in these groups is the same, it was excluded from consideration. As a result, only four compounds remained in them; this fact decreased the practical value of such RI correlations but simplified the identification of their regularities and features.

(2) Substituted ethanes: C2H6 → C2H5X → CH3CHX2 → XCH2CH2X → XCH2CHX2 → XCH2CX3 → X2CHCHX2 → X2CHCX3 → X3C–CX3. After the exclusion of unsubstituted ethane (RI = 200), eight compounds remained in these groups, and some of them (depending on X) were not characterized by RI values; because of this, the solution of such problems is a problem of greater current interest than that with compounds of the first type.

(3) Substituted benzenes: C6H6 → C6H5X → C6H4X2 (3 isomers) → С6Н3Х3 (3 isomers) → С6Н2Х4 (3 isomers) → С6НХ5 → С6Х6. Without unsubstituted benzene (RI = 654 ± 7 [1]), the number of compounds in these groups was 13. For many of them, the estimation of RIs is of considerable current interest due to the lack of experimental data.

(4) Substituted naphthalenes: С10Н8 → С10Н7Х (2 isomers) → С10Н6Х2 (10 isomers). These groups most often were limited to disubstituted compounds because reliable values of RIs are unknown for many isomers with even three the simplest substituents (methyl groups) [1].

Correlations of gas chromatographic retention indices in the group of substituted methanes and related compoundsFootnote 1. Checking the mutual correlation of the RI values of substituted methanes at different Х (Х = Cl, Br, I, OCH3, OC2H5, CH3, C6H5, etc.) makes it possible to identify examples of both nonlinear relations RI1 = f(RI2) (Fig. 1a) and (more rarely) linear ones (as in Fig. 1b). To characterize the approach under consideration, it is necessary to elucidate the reasons for such differences, which, as noted above, are not related to the chemical nature of fragments X.

A necessary but not sufficient condition for the mutual linear correlation of the retention indices of structural analogs is their symbatic changes in both compared groups. If the values of RI monotonically increase or decrease in one of the groups and have an extremum is in the second group, the application of relations (3) cannot provide the necessary accuracy of estimates. For example, the extremums (maximums) of retention indices were observed in the groups of methoxymethanes CH3OCH3 → … → C(OCH3)4, fluoromethanes CH3F → … → CF4, and methylsilanes CH3SiH3 → … → Si(CH3)4.

To obtain more definite conclusions on variations in the values of RIs in different groups, it is necessary to compare not just the retention indices but their first numerical differences \(\Delta _{{{\text{RI}}}}^{1}\) = RIi + 1 – RIi (Table 1). Thus, for the above example of bromo- and chloromethanes (Fig. 1), the retention indices monotonically increased in either of the groups. However, the differences \(\Delta _{{{\text{RI}}}}^{1}\) in the series CH3Cl → CCl4 monotonically decreased from 201 to 48, while the last value of \(\Delta _{{{\text{RI}}}}^{1}\) (186) in the series CH3Br → CBr4 was somewhat larger than the previous one (181); that is, the set of the values of \(\Delta _{{{\text{RI}}}}^{1}\) was characterized by an insignificant extremum (minimum). In the mathematical sense, the first numerical differences \(\Delta _{{{\text{RI}}}}^{1}\) are equivalent to the first derivative of the function RI = f(nX) with respect to the number of variable fragments in the structure; however, it is a difficult task to identify the causes of variations in its sign (the appearance of extremums) even at a phenomenological level. Steric interactions of fragments X in molecules, similar to the effects found previously [20] using methylbenzenes as an example, seem most likely.

Table 1. Retention indices of bromo- and chloromethanes [1] and their differences

Thus, a correlation of RI values cannot provide an acceptable accuracy of their estimates if the first differences of retention indices \(\Delta _{{{\text{RI}}}}^{1}\) change monotonically in one the compared taxonomic groups and are characterized by extremums in the second one. These examples include combinations of the data set of CH3Cl → CCl4 (the values of \(\Delta _{{{\text{RI}}}}^{1}\) decreased monotonically) with the data sets of CH3OCH3 → C(OCH3)4, CCl3Br → CBr4, and (CH3)3SiCl → SiCl4 (in all cases, the values of \(\Delta _{{{\text{RI}}}}^{1}\) exhibited maximums).

On the other hand, if both of the compared sets of RI values are characterized by extremums of \(\Delta _{{{\text{RI}}}}^{1}\), it can be assumed that their correlation is acceptable in terms of the accuracy of retention index estimates. Table 2 compares the retention indices of the series of methoxymethanes CH3OCH3 → C(OCH3)4 (data for which are not sufficiently reliable [1] and deserve clarification), considered as a function of the retention indices of chlorobromomethanes CCl3Br → CBr4. In both of the groups, the maximums of \(\Delta _{{{\text{RI}}}}^{1}\) were detected; however, the correlation of the RI values of these sets of structural analogs illustrated in Fig. 2 is suitable for obtaining practically useful index estimates. All values of \(\Delta _{{{\text{RI}}}}^{1}\) exceeded twice the generalized variance S0 = 23. Table 2 summarizes the parameters of the linear regression equation and the calculated values of RIs; the average absolute deviation of calculated and reference values was only 15 ± 5 index units.

Table 2. Retention indices of chlorobromo- and methoxymethanes [1] and their differences
Fig. 2.
figure 2

Correlation of the retention indices of methoxymethanes and chlorobromomethanes (1 ≤ х ≤ 4). Linear regression parameters: a = 1.42 ± 0.10, b = –667 ± 95, R = 0.995, and S0 = 23.

The values of \(\Delta _{{{\text{RI}}}}^{1}\) in two taxonomic groups to be compared should either increase or decrease simultaneously. Therefore, antibatic changes in \(\Delta _{{{\text{RI}}}}^{1}\), which are equivalent to different signs of the first derivative dRI/dnX, is another reason for the absence of the linear correlation RI1 = f(RI2). A correlation of the retention indices of fluorochloromethanes CF4 – xClx and chloromethanes CH4 – xClx (1 ≤ х ≤ 4), the results of which are illustrated in Figs. 3a, 3b, can be given as an example. If both groups of structural analogs were formed by increasing the number of chlorine atoms in the molecules, that is, CF3Cl → … → CCl4 and CH3Cl → … → CCl4, the function RI1 = f(RI2) was nonlinear (Fig. 3a). The reason is that the values of \(\Delta _{{{\text{RI}}}}^{1}\) increased in the former series and decreased in the latter series. To convert the data into a linear relationship, the sequence of structural analogs in the former (target) group should be reversed, that is, ranked by decreasing the values of \(\Delta _{{{\text{RI}}}}^{1}\): CCl4 → … → CF3Cl. However, as a result of this data transformation, a point (shown in Fig. 3b at the top left on an enlarged scale) corresponding to the pair of compounds CH3Cl–CCl4, which do not contain fluorine and, therefore, do not correspond to the other fluorine derivatives in chemical terms, appeared in the plot. Therefore, it is reasonable that it fell out of the general linear regression; the correlation coefficient for the three remaining pairs of compounds was R = 0.9995.

Fig. 3.
figure 3

Correlation of the retention indices of fluorochloromethanes and chloromethanes (1 ≤ х ≤ 4): (a) nonlinear dependence, both groups of structural analogs are ranked by increasing number of chlorine atoms in the molecules, CF3Cl → CCl4 and CH3Cl → CCl4; (b) groups of analogs are ranked by increasing values of \(\Delta _{{{\text{RI}}}}^{1}\), CF3Cl → CCl4 , CCl4 → CF3Cl, and CH3Cl → CCl4. After excluding the CH3Cl–CCl4 point (upper left), the linear regression parameters for the three remaining pairs are a = –2.00 ± 0.06, b = 1536 ± 38, R = 0.9995, and S0 = 6.

As an example of the application of the discussed correlation of RI values in the group of substituted methanes and their analogs (including silicon-containing ones), is interesting to consider the gas chromatographic identification of unusual products of the partial hydrolysis of tetraethoxysilane (C2H5O)4Si (RI, 978 ± 10). Among them, triethoxysilanol (C2H5O)3SiOH (RI, 727 ± 3), diethoxysilanediol (C2H5O)2Si(OH)2 (RI, 608 ± 1), and ethoxysilanetriol (C2H5O)Si(OH)3 (RI, 537 ± 2) were detected [18]. If we compared the RI values of these ethoxysilanols naturally ranked by increasing the number of ethoxy groups in the molecules (which corresponds to an increase in the values of \(\Delta _{{{\text{RI}}}}^{1}\)) with the data for chloromethanes CH3Cl → CCl4 (which correspond to a decrease in \(\Delta _{{{\text{RI}}}}^{1}\)), the function RI1 = f(RI2) was found nonlinear (Fig. 4a). To linearize it, one of these sequences should be replaced by a reverse one, for example, a set of ethoxysilanols should be considered based on an increase in the number of OH groups in the molecules: (C2H5O)4Si → … → (C2H5O)Si(OH)3. As a result, we obtain an almost ideal linear regression (the correlation coefficient R is as high as 0.9995), the graph of which is shown in Fig. 4b. Note that, of course, the scales of axes in Figs. 4a and 4b are the same and special comments are required in order to distinguish between the two variants under consideration.

Fig. 4.
figure 4

Correlation of the retention indices of ethoxysilanols (C2H5O)4 – xSi(OH)x and chloromethanes (1 ≤ х ≤ 4): (a) nonlinear dependence, the RI values of ethoxysilanols are ranked by increasing molecular weights (or, otherwise, the number of ethoxy groups in the molecules), which corresponds to an increase in \(\Delta _{{{\text{RI}}}}^{1}\); (b) both groups are ranked by decreasing values of \(\Delta _{{{\text{RI}}}}^{1}\), which leads to the reversal of the sequence of alkoxysilanols (C2H5O)4Si → (C2H5O)Si(OH)3. (b) Regression parameters: a = 1.30 ± 0.03, b = 1408 ± 16, R = –0.9995, and S0 = 8.

To additionally confirm the fact that the partial hydrolysis products of tetraethoxysilane formed a set of congeners, we can consider another correlation of their RI values with data for some other set of structural analogs, for example, the iodoalkanes CI4 (RI, 1446) → CHI3 (1209 ± 24) → CH2I2 (899 ± 9) → CH3I (516 ± 7). Bromoalkanes cannot be used for this purpose because of their inherent small maximum values of \(\Delta _{{{\text{RI}}}}^{1}\) (see Table 1). Figure 5 shows a plot of the linear function RI1aRI2 + b. After eliminating a point corresponding to the pair of compounds CH3I–(C2H5O)4Si, the value of R for the remaining three pairs of structural analogs was –0.998 at S0 = 9.

Fig. 5.
figure 5

Correlation between the retention indices of ethoxysilanols (C2H5O)4 – xSi(OH)x and iodomethanes (2 ≤ х ≤ 4). Both groups of structural analogs are ranked in descending order of \(\Delta _{{{\text{RI}}}}^{1}\). Linear regression parameters: a = –0.35 ± 0.02, b = 1038 ± 29, R = –0.998, and S0 = 9.

The considered correlations RI1aRI2 + b are recommended to be used not only for evaluation but also for checking the reliability of reference retention indices of the least characterized “exotic” compounds. In this case, it is advisable to choose compounds with well-known and reproducible values of RIs as reference materials (used for comparison), for example, well-characterized hydrocarbons. The following example is of this type: it is necessary to check the known RI values of the methylsilanes CH3SiH3 → Si(CH3)4. There are no data for the simplest monomethylsilane in the NIST database [1], and the next two homologs were characterized in a single work by Szekely et al. [21]; therefore, they have to be supplemented with calculated values based on the boiling points [1214]. The RI values of methylsilanes can be compared with the data for isostructural C2–C5 alkanes, as illustrated in Table 3.

Table 3. Retention indices of methylsilanes and С2–С5 alkanes [1]

In this case, the graph of the function RI1 = f(RI2) was very close to a linear one; thus, it is not shown here and only the following linear regression parameters are given: a = 1.01 ± 0.03, b = 39 ± 9, R = 0.9992, and S0 = 4.5. Thus, it is confirmed that the known values of the RIs of methylsilanes are sufficiently reliable.

With an appropriate control of the variations \(\Delta _{{{\text{RI}}}}^{1}\) in the compared groups of structural analogs, the considered approach has no fundamental limitations, and it is characterized by exceptional simplicity.

Correlation of gas chromatographic retention indices in other taxonomic groups. This correlation (with a larger volume) significantly expands the capabilities of the approach under consideration. On the other hand, one can expect an unpredictable complication of the functions RI1 = f(RI2), for example, due to the manifestation of difficult to formalize steric effects, which are noticeable only for individual compounds within these groups. As an example, it is expedient to consider a group of substituted ethanes (9 congeners), whose RI values are compared with the data for isostructural alkanes (Table 4). From their graphical representation (Fig. 6), it follows that the entire set of points is subdivided into two subgroups corresponding to two linear sections: (I) for compound nos. 1–6 (containing from one to four chlorine atoms) and (II) for compound nos. 6–9 (from four to six chlorine atoms). The caption to Fig. 6 specifies the parameters of two linear regression equations corresponding to each of these sections. At the moment, it seems difficult to explain the reasons for the appearance of two linear sections, and this fact should be treated as a phenomenologically revealed regularity. Similarly, two linear portions of the function RI1 = f(RI2) were observed when the RI values of chloro- and bromoethanes were correlated.

Table 4. Retention indices of chloroethanes С2Н6 – xСlx and С3–С8 alkanes [1]
Fig. 6.
figure 6

Correlation between the retention indices of chloroethanes C2H6 – xClx (1 ≤ х ≤ 6) and C3–C8 alkanes. Linear regression parameters: a = 1.88 ± 0.08, b = 128 ± 32, R = 0.997, and S0 = 14 for section (I); a = 1.00 ± 0.07, b = 339 ± 43, R = 0.995, and S0 = 10 for section (II).

The average accuracy of the retention indices of chloroethanes estimated based on the data for isostructural hydrocarbons is 9 ± 5 index units, which is sufficient for practical purposes and not inferior to the capabilities of much more complicated methods. Because chlorine atoms can be located at two carbon atoms, this fact complicates the use of the first numerical differences of the retention indices \(\Delta _{{{\text{RI}}}}^{1}\) (the array of \(\Delta _{{{\text{RI}}}}^{1}\) becomes two-dimensional).

It is advisable to start testing the capabilities of the considered approach in the groups of substituted benzenes by comparing the retention indices of bromo- and chlorobenzenes. Table 5 summarizes reference data; for some bromobenzenes, the RI values are unknown, and their assessment is of considerable current interest. Because the correlation coefficient R reached 0.9989, the plot of the function RI1 = f(RI2) (Fig. 7) confirms its linearity.

Table 5. Retention indices of bromobenzenes С6Н6 – xBrx and chlorobenzenes С6Н6 – xClx [1]
Fig. 7.
figure 7

Correlation between the retention indices of bromobenzenes C6H6 – xBrx (1 ≤ х ≤ 6) and the corresponding chlorobenzenes. Linear regression parameters: a = 1.68 ± 0.03, b = –489 ± 40, R = 0.9989, and S0 = 24.

An objective limitation of this method for assessing retention indices is the inability to distinguish between some position isomers characterized by relatively small differences in RIs, for example, ortho-, meta-, and para-dibromobenzenes with indices of 1216, 1190, and 1193, respectively. Their pairwise differences are lower than twice the value of S0 = 24 (see Eq. (4)). It is noteworthy that there are no such coincidences for isomeric tribromobenzenes: 1530 (1,2,3-isomer), 1464 (1,2,4-isomer), and 1390 (1,3,5-isomer).

Similar estimates of RIs based on their mutual correlations for compounds from different taxonomic groups were obtained for the Br- and Cl-benzenes C6H6 – xHalx (1 ≤ x ≤ 6) as a result of their comparison with the data for methylbenzenes. It was effective to compare the retention indices of the iodobenzenes C6H6 – xIx (1 ≤ x ≤ 3) and the methyl esters of benzenecarboxylic acids C6H6 – x(CO2CH3)x (1 ≤ x ≤ 4) with the data for corresponding chlorobenzenes, the retention indices of the chloronaphthalenes C10H8 – xClx (1 ≤ x ≤ 4) with the data for methylnaphthalenes, etc.

CONCLUSIONS

Thus, an analysis of the capabilities and features of the simplest linear correlation of gas chromatographic retention indices of compounds from different taxonomic groups of structural analogs, RI1aRI2 + b, made it possible to establish the following:

— If each compound in a particular group is uniquely defined by a molecular formula; that is, it has no isomers (this is possible for the simplest substituted methanes, CH3X → … → CX4), symbatic variations of the first numerical differences of the retention indices, \(\Delta _{{{\text{RI}}}}^{1}\) = RIn + 1 – RIn, are responsible for a linear correlation of retention indices in these groups. The presence of extrema of \(\Delta _{{{\text{RI}}}}^{1}\) in one of the groups with a monotonic change in the other is an unambiguous sign of the absence of the linear correlation RI1aRI2 + b. If the values of \(\Delta _{{{\text{RI}}}}^{1}\) increase in one of the groups and decrease in the other, the ranking order of compounds should be reversed in any of the groups.

— It was shown that retention index correlations are possible in groups of structural analogs, such as substituted ethanes, benzenes, and naphthalenes. In all cases, the use of the simplest relation RI1aRI2 + b makes it possible not only to estimate the values of RIs for compounds not yet characterized but also to refine unreliable reference data.