Prediction of Henry's law constants of CO2 in imidazole ionic liquids using machine learning methods based on empirical descriptors

Wu, Ting; Li, Wan-Li; Chen, Meng-Yao; Zhou, Yan-Mei; Zhang, Qing-You

doi:10.1007/s11696-020-01415-8

Prediction of Henry's law constants of CO₂ in imidazole ionic liquids using machine learning methods based on empirical descriptors

Original Paper
Published: 19 November 2020

Volume 75, pages 1619–1628, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Chemical Papers Aims and scope Submit manuscript

Prediction of Henry's law constants of CO₂ in imidazole ionic liquids using machine learning methods based on empirical descriptors

Download PDF

Ting Wu¹,
Wan-Li Li¹,
Meng-Yao Chen¹,
Yan-Mei Zhou¹ &
…
Qing-You Zhang ORCID: orcid.org/0000-0002-8923-0843¹

590 Accesses
14 Citations
Explore all metrics

Abstract

In this study, a total of 160 experimental data points of Henry's law constant of CO₂ in 32 imidazole ionic liquids (ILs) were collected, with the temperatures range from 283 to 350 K. Herein intuitive and explanatory descriptors related to Henry's law constant (HLC) were suggested from the 2D structural features of the ILs according to experimental experience and laws. Temperature was used as another variable due to its significant effect on Henry's law constant. Three machine learning methods were used to construct models to fast predict the HLC based on suggested descriptors. Multi-layer Perceptrowas mainly used to build the model and compared with the results of Random forest and Multiple Linear Regression after investigating the outliers and variable selection. In addition, if only one data point was left at a similar temperature and the reduced dataset was also used to build models in the same procedure, the results were not as good as those of the full dataset but still satisfactory.

ILPC: simple chemometric tool supporting the design of ionic liquids

Article Open access 19 August 2016

An Analytical Predictive Model for the Heat Capacity of Imidazolium-Type Ionic Liquids Derived Directly from Artificial Neural Network Modeling

Article 09 March 2023

Atomic-Level Topological Indices for Prediction of the Infinite Dilution Activity Coefficients of Oxo Compounds in Water

Article 19 February 2020

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Carbon dioxide, whose increase in concentration is the main cause of the greenhouse effect, is one of the main greenhouse gases. As people pay more attention to environmental issues, the technologies of carbon dioxide capture arousing more and more interests (Orhan et al. 2016). There are already some methods for capturing CO₂ (such as adsorption, absorption, cryogenic methods, separation with membranes, etc.), and the amine-based solvents method is currently the most commonly used method (Zhang et al. 2013). However, due to its volatility and corrosivity, this method is potentially environmentally threatening and its operation is unstable (Bahadur et al. 2015; Dutcher et al. 2015).

Ionic liquids, as a green alternative to traditional volatile solvents, have excellent properties, such as low vapor pressures, recyclable and adjustable characteristics, good thermal stability and so on (Welton 1999; Rogers and Seddon 2003; Hallett and Welton, 2011). Thus, ILs have the potential to become a new option for capturing CO₂ (Zhang et al. 2013). Since the first discovery that CO₂ was soluble in ILs was made by Blanchard et al. (1999), then some experimental and theoretical studies on the dissolution of CO₂ in different ILs have been performed (Marcus 2018; Baghban et al. 2015; Aghaie et al. 2019; Deng et al. 2019). Among them, imidazole ILs are favored by most researchers because they are easy to synthesize and have high adjustability.

ILs as a promising solvent for capturing CO₂ in the future, it is important to measure the Henry’s law constant of CO₂ in it. Henry's law states that the amount of dissolved gas is proportional to its partial pressure in the gas phase and the proportionality factor is called the Henry's law constant (Sander 2015). It is a physical constant describing the distribution ability of a compound in the gas–liquid two-phase and it can affect the direction and velocity of migration in two phases of gas and liquids (Bhangare et al. 2018). As a ubiquitous metric, it is particularly important in chemical processing and environmental science (Wang et al. 2017; Kapelner et al. 2016). The theoretical methods for obtaining the HLCs include currently thermodynamic method, kinetic method, quantum chemistry calculations and quantitative structure–property relationship (QSPR) research (Abraham et al. 2019; Odabas and Adali 2016; Chao et al. 2017; Gorji et al. 2017; Zhao and Rafiqul 2017).

Some studies have been done about HLCs of CO₂ in ILs by some researchers. However, traditional experiments are expensive and time-consuming due to the large variety of ILs and thermodynamic or kinetic methods are difficult to deal with large amounts of property data. Therefore, the QSPR research is another better choice. Some properties of ILs have been studied in this way, such as toxicity, melting point, viscosity, density, molecular orbitals and so on (Barycki et al. 2018; Cerecedo Cordoba et al., 2019; Koi et al. 2019; Chen et al. 2019; Pereira et al. 2017). QSPR models have also been applied to the prediction of HLCs of CO₂ in ILs. Diako Ghaslani et al. developed two models with Multiple Linear Regression (MLR) and Least Squares Support Vector Machines (LS-SVM) to estimate the HLC of CO₂ in 32 ILs at 298 K (including imidazolium, pyridinium, pyrrolidinium and thiouronium families) (Ghaslani et al. 2017). Descriptors in most of these studies were calculated by Dragon software, and then were selected by the genetic algorithm and lacks intrinsic meaning to target property and is not intuitive enough.

In this study, a dataset consisting of 160 H’s law constants of CO₂ in 32 imidazole ILs at different temperatures was collected. It is worth mentioning that intuitive and meaningful descriptors were manually extracted from the structures of ILs, which were similar as the descriptors in our prior research to predict the toxicity of ILs (Wu et al. 2020). The ILs in the test set were not included in the training set, i.e., the constructed models were assessed by the ILs not be used to train models. Three machine learning methods, Multi-layer Perceptron (MLP), Random forest (RF), and MLR, were used to build model, respectively, and the model established by MLP has obtained best results. The importance of variables was also studied, and some linear models were developed with MLR using just temperature variable, cation and anion descriptors, respectively, to investigate their contribution to HLC.

Materials and methods

Dataset

The dataset about HLC was collected from the ILs Database-IL Thermo v2.0 (updated on June 11, 2019) and related literature (Anderson et al. 2007), including the HLCs of CO₂ in 32 imidazole ILs, for the full names, see the Electronic Supplementary Material (ESM) S1 at different temperatures, with a total of 160 data points, temperature range from 283 to 350 K (see the Table 1). The main structures of cations and anions in this dataset are shown in Fig. 1. To assess the capability of model, the dataset is divided into training set and test set. A total of 25 data points of 6 ILs were randomly selected as the test set (indicated by* in Table 1), and a total of 135 data points of the remaining 26 ILs were used as the training set.

Table 1 Henry’s law constant of CO₂ in 32 ILs of 160 data set

Full size table

However, in this dataset, some HLCs of CO₂ in the same IL were obtained at similar temperatures. If the difference between the two temperatures (or more than two) is less than 1.5 degrees, they were regarded as similar temperature here and the two data points were considered as similar data points. The reduced dataset was obtained if one data point was kept for similar data points. For the HLC of the same ionic liquid at similar temperatures, the processing methods are as follows: (1) If there are two HLCs at similar temperatures, one is arbitrarily retained when the difference between the two is less than 1 MPa; (2) For the case where there are more than two HLCs at similar temperatures, if there is one data point with a difference greater than 1 MPa from the average value, the data point is discarded. Then, the median value is retained when the number of remaining data is odd; the data closest to the average value are retained when the number of remaining data is even.

After deleting similar data, a reduced dataset consisting of 98 data points of the HLCs of CO₂ in 32 ILs was formed (see the ESM S2). Similarly, 21 data points of the above 6 ILs were used as the test set, and 77 data points of the remaining 26 ILs were used as the training set.

Descriptors

The extraction of descriptors is an essential step to establish the relationship between the structures and its properties.

Anions are the main factors affecting the solubility of CO₂ in ILs as well as the HLCs (Kelley et al. 2017; Yim and Lim 2013; Cadena et al. 2004; Aki et al. 2004). The descriptors derived from anions are described as follows. For simple anions, it was recommended to use the entire structure as a descriptor. Specifically, the PF₆, BF₄ and PF₃R₃ groups were used as a descriptor (a₁), and the Tf₂N group was suggested as another descriptor (a₂). For larger anions, some features of structures were suggested as descriptors. The presence of a cyanide containing anion was observed to have an effect on the HLCs. This may be due to the interaction of Lewis base and Lewis acid, which makes cyanide have an important effect on the solubility of CO₂ in IL. Therefore, a cyano group was suggested as a descriptor, represented by ^aC#N(a₃). At the same time, because the presence of phospholipid and thioester groups has a positive effect on the HLCs, they were also suggested as two descriptors, which are denoted as ^aSO₄R (a₄) and ^aPO₄R₂ (a₅), respectively. In addition, it is clear from this dataset that the higher the degree of anion fluorination, the smaller the value of the HLCs. A large number of related studies have shown that the degree of anion fluorination has a great effect on the solubility of carbon dioxide in ILs (Yim and Lim 2013; Aki et al. 2004). Therefore, the amount of fluorine was suggested here as a descriptor, denote as ^anF (a₆). The length of the alkyl chain in the anion is also noteworthy. It is observed that the HLC decreases with a significant increase of the length of the anionic chain. Then, another descriptor was extracted, the reciprocal of length (a₇).

The cation plays a minor role in the effect of the HLCs (Cadena et al. 2004). Studies have shown that the presence of cationic fluorine substituents will also slightly increase the solubility of CO₂ in ILs. Therefore, the number of cationic fluorine atoms was also suggested as a descriptor (c₁). At the same time, the reciprocal of length of cationic alkyl side chains (Liu et al.2016), the number of substituents on the imidazolium ring (Aki et al. 2004), and the existence of cyano group (Kim et al. 2014) will also slightly affect the solubility of CO₂, in other words, it also affects the HLCs. Therefore, another three descriptors were suggested about cations, represented by c₂, c₃ and c₄, respectively.

As mentioned above, eleven descriptors were suggested from the IL anion and cation structures. And because Henry's law constant changes significantly with temperature, temperature was also considered another variable denoted by T. In summary, a total of 12 descriptors related to the HLCs of CO₂ were suggested.

Methods

Multi-layer perceptron (MLP)

The main machine learning method used in this study is Multi-layer Perception, which is an artificial neural network with a forward structure. It implements a multilayer network with hidden layers (one hidden layer was used here). In this study, the MLP Regressor package from Weka (version 3.8.3) was used (Hall et al. 2009). To avoid overfitting, according to the number of variables and the number of objects in the training set, the number of nodes in the hidden layer needs to satisfy the following formula (Andrea and Kalayeh 1991) and other parameters are default settings.

$$ 1.8 < \left( {p = \frac{N}{M}} \right) < 2.2, M = \left( {I + 1} \right)H + \left( {H + 1} \right)O $$

(1)

where N is the number of samples, I is the number of nodes of input layers, H is the number of nodes of hidden layers, and O is the number of nodes of output layers.

Random forest (RF)

Random forest can be regarded as a set of classification regression trees (but without pruning), it generates a large number of trees in a random manner, and builds a model from the set of trees (Breiman 2001). Random Forests are grown with the Random Forest library in the R Program (version 2.10.1) (Ihaka and Gentleman 1996). In the training process, the training set was randomly divided into two parts for each tree, about 2/3 of the data are used for training and the remaining part of the data is used to evaluate the trained model. The process of cross-validation is called Out-of-bag (OOB). The method quantifies the importance of a variable by the decrease in a node’s impurity every time the variable is used for splitting. The trained model was used to predict the test set that are not involved in model construction. The number of trees in the random forest is set to 1000, and default parameters were used.

Multiple linear regression (MLR)

Here, a linear model was established to explore the relationship between the suggested descriptors and Henry's law constants of CO₂ in ILs. The MLR algorithm used in this study is the MLR software package under Weka (version 3.8.3). The remaining parameters are default values. Select variables could be performed by M5 method. The M5 method using the Akaike criterion for model selection iteratively removes the descriptor with the smallest standardized coefficient until no improvement is observed.

Results and discussion

Model construction with MLP and investigation of outliers

MLP method was used to study the relationship between the structures of imidazole ILs and HLCs. The dataset of 160 data points was divided into a training set and a test set. The result of cross-validation of the training set was R (the correlation coefficient) = 0.9819, and the result of the test set was R = 0.9758. Then, all 160 data points were considered as a whole to verify the model. To avoid overfitting, the node of hidden layer H was set to 6. The results of tenfold cross-validation of the whole dataset obtained were R = 0.9798, MAE (mean absolute error) = 0.4935 MPa, RMSE (root mean squared error) = 0.7233 MPa.

It is known from experience and related studies that the HLCs would gradually increase as the temperature increases, which is consistent with the results of observational experimental data, and the predictions also confirm this. However, by analyzing the data, it is found that there are unreasonable experimental values in the dataset. Taking the ionic liquid 1-Hexyl-3-methylimidazolium bis(trifluoromethylsulfonyl)imide ([Hmim][Tf₂N]) as an example, the results are shown in Table 2. The experimental HLC at 293.15 K should be between the corresponding HLC of 283 K and 298.1 K (HLC is 2.42 MPa and 3.44 MPa, respectively), while the experimental value was actually 1.244 MPa. This HLC is obviously too low and leading to a large error for the predicted value. The HLC at 333.15 K has a similar situation. It can be seen that the data collected in the database is not always reasonable, and the established model (the high error of these two data points) also implies that such experimental data are possibly abnormal, the two data with high error are in bold in the Table 2. Since most of the data are obtained from experiments, the existence of errors is difficult to avoid. The purity and water content of the ILs purchased by the experimenters, as well as the accuracy of the experimental methods and experimental instruments and so on, these all will affect the experimental values. This is also a problem of collecting dataset from the database.

Table 2 Prediction results for the ionic liquid [Hmim][Tf₂N]

Full size table

In this case, a total of 5 outliers (abnormal data points) found in the whole dataset were deleted and then the remaining 155 data points were used to construct model with MLP. As a result, the correlation coefficient of the tenfold cross-validation of the training set was 0.9844, the correlation coefficient of the test set reaches 0.9897, and MAE = 0.4224 MPa, RMSE = 0.5095 MPa. It can be clearly seen that the results have improved. The correlation between the prediction values and the experimental values of the test set before and after removing the outliers are show in Fig. 2. And the results are show in Table 3.

Table 3 The results of 160 data points data set with MLP

Full size table

In a word, the results show that the predicted value obtained from the model established with the descriptors directly extracted from the structures has a good correlation with the experimental values. It proves once again the influence of the cation and cation structures of ILs on the HLCs, and of course the temperature cannot be ignored.

Investigation of variables selection

The importance of all the 12 suggested descriptors was calculated by RF to analyze the contribution of different descriptors on HLC, and the results are listed in Table 4. It can be seen that temperature and the number of anionic fluorine atoms have a great contribution to the model to make prediction of HLC, and the cationic chain length and the structure-T₂N also have a relatively great contribution. On the contrary, the contribution of the PO₄R₂ group (a₅) and the amount of fluorine of cations (c₁) are relatively small.

Table 4 The variables ranked decreasingly on the basis of variable importance of RF

Full size table

Based on the results, variables were selected according to the importance of the variables to show the influence of the number of descriptors on the model. 4, 6, 8, and 10 variables were selected, respectively, and four models were built by MLP to compare the results with all 12 variables. The results are shown in Table 5. It can be seen that the cross-validation result of the training set based on 10 descriptors is the best. Then, another method, M5 method in MLR was used to do the variable selection. The linear formula after variable selection is as follows. From the formula, it can be found that descriptors a₂, a₅, c₁ were removed in the variable selection process, combined with the results of RF, 10 descriptors (without a₅, c₁) were finally used to describe the structures and build the model. The values of descriptors of the entire dataset obtained from the above rules are shown in the ESM S3.

$$ {\text{HLC }} = { 3}.{1653}*a_{1} + {5}.{7563}*a_{3} + {3}.{278}*a_{4} + \, - 0.{2231}*a_{6} + {3}.{7325}*a_{7} + {4}.{6724}*c_{2} + 0.{59}0{3}*c_{3} + {1}.{34}*c_{4} + 0.{1}0{61}*T + - {3}0.{1}0{29} $$

(2)

Table 5 Comparison of modeling results with different numbers of variables

Full size table

In addition, in the formula obtained by MLR, the coefficient of each descriptor is positive or negative reflects the contribution of the corresponding descriptor to HLC is positive or negative. For example, the coefficient of a₆ is − 0.2231, which means a₆, the amount of fluorine in the anion has a negative contribution to HLC, that is, the more fluorine in the anion, the smaller the HLC. On the contrary, the coefficients of the remaining descriptors are all positive, that is, their corresponding structures all contribute positively to the HLC. This again shows that the descriptor in this study is not only intuitive, but also interpretable.

The effects of temperature, cation and anion descriptors, respectively, on HLC

Some linear models were developed with MLR using just temperature variable, cation and anion descriptors, respectively, to discuss their contribution to HLC based on the modeling results with the entire dataset (as the training set). The results are shown in Table 6. It can be seen that the correlation coefficient between just a temperature variable and HLC is more than 0.5, indicating that temperature does have a great impact. But the coefficient is not higher probably because for different compounds, the influence of temperature is different, which cannot be reflected by a simple linear relationship, and its influence is more reflected in a single compound. In addition, it is obvious from the results that the influence of anions on HLC is indeed better than that of cations, which is consistent with our analysis and the conclusions in the literature. What is more, this also shows that the best results can be obtained by combining these three types of descriptors.

Table 6 Comparison of the linear models using three types of descriptors

Full size table

Comparison of different machine learning methods

So as to compare the modelling results of different machine learning methods, the above 10 descriptors were used to build models through RF and MLR without variable selection, respectively. All of the results obtained by different methods are listed in Table 7. It can be seen that these results of RF and MLR were not as good as those of MLP, and the MLR results are slightly better than the RF results. In conclusion, the prediction results of this MLP model are closer to the experimental values.

Table 7 Comparison of the three methods constructed by MLP, RF and MLR

Full size table

The validation of the model

Then, for the purpose of verifying the validity of this model, a zero model was tried to establish as follows. The median value of the experimental value was used as the predicted value, and the average absolute error and the root mean square error between it and the experimental values were calculated. The results obtained were MAE = 2.5738 MPa and RMSE = 3.5125 MPa. It can be seen that compared with the results of zero model, the model developed here using 10 descriptors has more impressive results.

Leave-one-compound-out (LOCO) verification was also performed for the dataset of 155 data points to verify the model. That is, each time, one ionic liquid (including all its data points) is used as the test set, and the remaining ILs were used as the training set for modelling to predict the test set. For the dataset of the 155 data points, the results for the LOCO verification of 32 ILs were R = 1.2129, MAE = 1.6694 MPa.

Prediction of HLC at specific constant temperature

To investigate the capability of predicting HLC at a specific temperature, a total 19 ILs were selected at 298.15 K in the dataset. Since a₂ was removed in the variable selection process by M5 method in MLR and descriptors a₃, a₅, c₄ are all zero according to their structures, 3 descriptors of anion, a₁ + a₄, a₆, a₇ (a₁ and a₄ were combined) and 3 descriptors of cation, c₁, c₂, c₃ were finally used. Three machine learning methods were built using the 6 descriptors, separately. For MLP, the tenfold cross-validation results of the whole set were R = 0.9333, MAE = 0.5949 MPa, RMSE = 1.0019 MPa. And the tenfold cross-validation results of the whole set modelling with MLR were R = 0.9855, MAE = 0.3717 MPa, RMSE = 0.5126 MPa. Then, the OOB cross-validation with RF was performed, and the results were R = 0.9581, MAE = 1.1199 MPa, RMSE = 1.6383 MPa. From the results, it can be seen that the capability to predict HLC with the descriptors suggested here at a specific temperature is also satisfactory.

Investigation of reduced dataset

The research of the reduced dataset was also performed then. The existing outliers in the reduced dataset were removed, that is, the remaining 94 data points were mainly used to study (their descriptors values are shown in the Electronic Supplementary Materials S3 and marked with #). When modelling with MLP, the node of hidden layer H is set to 3. The result of tenfold cross-validation of the entire dataset is R = 0.9801. The results of cross-validation of 75 data points in the training set were R = 0.9788, MAE = 0.4919 MPa, RMSE = 0.7985 MPa. For the test set, the results were R = 0.9814, MAE = 0.6407 MPa, RMSE = 0.8392 MPa. After that, RF and MLR models were also built to compare with MLP, and all the results are shown in Table 8.

Table 8 Results of modeling 98 data points with MLP

Full size table

Compared with the results in Table 7, it can be seen that the results become slightly worse after removing similar data points. This may be because the existence of similar data points makes the experimental values more reliable (the researchers had more chances to correct their experimental data by comparing similar data points) and the results better than those of only one experimental value. Therefore, it can be considered that if the data of HLCs of CO₂ at different temperatures of ILs in this dataset are more abundant, it will be more conducive to the establishment of the model, and the results will be further improved.

Conclusion

Intuitive, simple and interpretable descriptors including 2D structure features and temperature were generated and used to construct the models for prediction of the Henry's law constant of CO₂ in imidazole ILs. Further, the built models were used to predict the ILs of the test set that never appeared in the training set, and satisfactory results were obtained. The different machine learning methods were attempted, and the results indicate that the models developed by MLP are better than those by the MLR and RF. If the experimental data points are more reliable, the model has the potential to be further improved. One advantage of these interpretable descriptors is that it has potential to assist in the mechanism research of targets properties by constructing models using the related structure features.

References

Abraham MH, Acree WE, Hoekman D (2019) A new method for the determination of Henry’s law constants (air-water-partition coefficients). Fluid Phase Equilib 502:112300. https://doi.org/10.1016/j.fluid.2019.112300
Article CAS Google Scholar
Aghaie M, Rezaei N, Zendehboudi S (2019) Assessment of carbon dioxide solubility in ionic liquid/toluene/water systems by extended PR and PC-SAFT EOSs: carbon capture implication. J Mol Liq 275:323–337. https://doi.org/10.1016/j.molliq.2018.11.038
Article CAS Google Scholar
Aki SNVK, Mellein BR, Saurer EM (2004) High-pressure phase behavior of carbon dioxide with imidazolium-based ionic liquids. J Phys Chem B 108:20355–20365. https://doi.org/10.1021/jp046895
Article CAS Google Scholar
Anderson JL, Dixon JK, Brennecke JF (2007) Solubility of CO₂, CH₄, C₂H₆, C₂H₄, O₂, and N₂ in 1-Hexyl-3-methylpyridinium Bis(trifluoromethylsulfonyl)imide: comparison to other ionic liquids. Acc Chem Res 40:1208–1216. https://doi.org/10.1021/ar7001649
Article CAS PubMed Google Scholar
Andrea TA, Kalayeh H (1991) Applications of neural networks in quantitative structure-activity relationships of dihydrofolate reductase inhibitors. J Med Chem 34:2824–2836. https://doi.org/10.1021/jm00113a022
Article CAS PubMed Google Scholar
Baghban A, Ahmadi MA, Shahraki BH (2015) Prediction carbon dioxide solubility in presence of various ionic liquids using computational intelligence approaches. J Supercrit Fluids 98:50–64. https://doi.org/10.1016/j.supflu.2015.01.002
Article CAS Google Scholar
Bahadur I, Osman K, Coquelet C (2015) Solubilities of carbon dioxide and oxygen in the ionic liquids methyl trioctyl ammonium Bis(trifluoromethylsulfonyl)imide, 1-butyl-3-methyl imidazolium Bis(trifluoromethylsulfonyl)imide, and 1-butyl-3-methyl imidazolium methyl sulfate. J Phys Chem B 119:1503–1514. https://doi.org/10.1021/jp5061057
Article CAS PubMed Google Scholar
Barycki M, Sosnowska A, Jagiello K, Puzyn T (2018) Multi-objective genetic algorithm (MOGA) as a feature selecting strategy in ionic liquids’ quantitative toxicity-toxicity relationship models’ development. J Chem Inf Model. https://doi.org/10.1021/acs.jcim.8b00378
Article PubMed Google Scholar
Bhangare RC, Ajmal PY, Rathod TD (2018) Experimental and theoretical determination of henry’s law constant for polychlorinated biphenyls: its dependence on solubility and degree of chlorination. Arch Environ Contam Toxicol 76:142–152. https://doi.org/10.1007/s00244-018-0577-z
Article CAS PubMed Google Scholar
Blanchard LA, Hancu D, Beckman EJ (1999) Green processing using ionic liquids and CO₂. Nature 399:28–29. https://doi.org/10.1038/19887
Article Google Scholar
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Article Google Scholar
Cadena C, Anthony JL, Shah JK (2004) Why is CO₂ so soluble in imidazolium-based ionic liquids? J Am Chem Soc 126:5300–5308. https://doi.org/10.1021/ja039615x
Article CAS PubMed Google Scholar
Cerecedo Cordoba JA, González Barbosa JJ, Frausto Solís J (2019) Melting temperature estimation of imidazole ionic liquids with clustering methods. J Chem Inf Model 59:7. https://doi.org/10.1021/acs.jcim.9b00203
Article CAS Google Scholar
Chen MY, Wu T, Zhao TF, Zhou YM, Zhang QY, Aires-de-Sousa J (2019) Machine learning to predict the specific optical rotations of chiral fluorinated molecules. Spectrochim Acta Part A 223:117289. https://doi.org/10.1016/j.saa.2019.117289
Article CAS Google Scholar
Deng T, Liu F, Jia G (2019) Prediction carbon dioxide solubility in ionic liquids based on deep learning. Mol Phys 11:1–8. https://doi.org/10.1080/00268976.2019.1652367
Article CAS Google Scholar
Dutcher B, Fan M, Russell AG (2015) Amine-based CO₂ capture technology development from the beginning of 2013-a review. ACS Appl Mater Interfaces 7:2137–2148. https://doi.org/10.1021/am507465f
Article CAS PubMed Google Scholar
Ghaslani D, Gorji ZE, Gorji AE (2017) Descriptive and predictive models for Henry’s law constant of CO2, in ionic liquids: a QSPR study. Chem Eng Res Des 120:15–25. https://doi.org/10.1016/j.cherd.2016.12.020
Article CAS Google Scholar
Gorji AE, Gorji ZE, Riahi S (2017) Quantitative structure-property relationship (QSPR) for prediction of CO₂ Henry’s law constant in some physical solvents with consideration of temperature effects. Korean J Chem Eng 34:405–1415. https://doi.org/10.1007/s11814-017-0018-0
Article CAS Google Scholar
Hall M, Frank E, Holmes G (2009) The WEKA data mining software: an update. SIGKDD Explor 11:10–18. https://doi.org/10.1145/1656274.1656278
Article Google Scholar
Hallett JP, Welton T (2011) Room-temperature ionic liquids: solvents for synthesis and catalysis. 2. Chem Rev 111:3508–3576. https://doi.org/10.1021/cr1003248
Article CAS PubMed Google Scholar
Ihaka R, Gentleman R (1996) R: a language for data analysis and graphics. J Comput Graph Stat 5:299–314. https://doi.org/10.2307/1390807
Article Google Scholar
Kapelner A, Krieger A, Blanford WJ (2016) Optimal experimental designs for estimating Henry’s law constants via the method of phase ratio variation. J Chromatogr 1468:183–191. https://doi.org/10.1016/j.chroma.2016.08.049
Article CAS Google Scholar
Kelley SP, Flores LA, Shannon MS (2017) Understanding carbon dioxide solubility in ionic liquids by exploring the link with liquid clathrate formation. Chem Eur J 23:14332–14337. https://doi.org/10.1002/chem.201703117
Article CAS PubMed Google Scholar
Kim JE, Kim HJ, Lim LS (2014) Solubility of CO2 in ionic liquids containing cyanide anions: [c2mim][SCN], [c2mim][N(CN)2], [c2mim][C(CN)3. Fluid Phase Equilib 367:151–158. https://doi.org/10.1016/j.fluid.2014.01.042
Article CAS Google Scholar
Koi ZK, Yahya WZN, Talip RAA (2019) Prediction of the viscosity of imidazolium-based ionic liquids at different temperatures using the quantitative structure property relationship approach. New J Chem 43:16207. https://doi.org/10.1039/C9NJ03436F
Article CAS Google Scholar
Liu XY, He MJ, Lv N (2016) Selective absorption of CO₂ from H₂, O₂ and N₂ by 1-hexyl-3-methylimidazolium tris(pentafluoroethyl)trifluorophosphate. J Chem Thermodyn 97:48–54. https://doi.org/10.1016/j.jct.2016.01.013
Article CAS Google Scholar
Marcus Y (2018) The solubility parameter of carbon dioxide and its solubility in ionic liquids. J Solut Chem 48:1025–1034. https://doi.org/10.1007/s10953-018-0816-y
Article CAS Google Scholar
Odabas M, Adali M (2016) Determination of temperature dependent Henry’s law constants of polychlorinated naphthalenes: application to air-sea exchange in Izmir Bay, Turkey. Atmos Environ 147:200–208. https://doi.org/10.1016/j.atmosenv.2016.10.009
Article CAS Google Scholar
Orhan OY, Ume CS, Alper E (2016) The absorption kinetics of CO₂ into ionic liquid—CO₂ binding organic liquid and hybrid solvents. Energy Eff Solvents CO2 Capture Gas-Liq Absorpt. https://doi.org/10.1007/978-3-319-47262-1_11
Article Google Scholar
Pereira F, Xiao KX, Latino DARS, Wu CC, Zhang QY, Aires-de-Sousa J (2017) Machine learning methods to predict density functional theory B3LYP energies of HOMO and LUMO orbitals. J Chem Inf Model 57:11–21. https://doi.org/10.1021/acs.jcim.6b00340
Article CAS PubMed Google Scholar
Rogers RD, Seddon KR (2003) Ionic liquids-solvents of the future? Science 302:792–793. https://doi.org/10.1126/science.1090313
Article PubMed Google Scholar
Sander R (2015) Compilation of Henry’s law constants (version 4.0) for water as solvent. Atmos Chem Phys 15:4399–4981. https://doi.org/10.5194/acp-15-4399-2015
Article CAS Google Scholar
Wang C, Yuan T, Wood SA (2017) Uncertain Henry’s law constants compromise equilibrium partitioning calculations of atmospheric oxidation products. Atmos Chem Phys. https://doi.org/10.5194/acp-2017-92
Article PubMed PubMed Central Google Scholar
Welton T (1999) Room-temperature ionic liquids. Solvents for synthesis and catalysis. Chem Rev 99(8):2071. https://doi.org/10.1021/cr980032t
Article CAS PubMed Google Scholar
Wu T, Li W, Chen M, Zhou Y, Zhang Q (2020) Estimation of ionic liquids toxicity against leukemia rat cell line IPC-81 based on the empirical-like models using intuitive and explainable fingerprint descriptors. Mol Inf 39:2000102. https://doi.org/10.1002/minf.202000102
Yim JH, Lim JS (2013) CO2 solubility measurement in 1-hexyl-3-methylimidazolium ([HMIM]) cation based ionic liquids. Fluid Phase Equilib 352:67–74. https://doi.org/10.1016/j.fluid.2013.05.005
Article CAS Google Scholar
Zhang L, Chen J, Lv JX (2013) Progress and Development of Capture for CO2 by Ionic Liquids. Asian J Chem 25:2355–2358. https://doi.org/10.14233/ajchem.2013.13552
Article CAS Google Scholar
Zhao YS, Rafiqul I (2017) Ionic liquids for absorption and separation of gases: an eSxtensive database and a systematic screening method. AIChE J 63:1353–1367. https://doi.org/10.1002/aic.15618
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China [No. 21576071; 21776061]; the Foundation of International Science and Technology Cooperation of Henan Province [No. 162102410012]; the Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry [No. 20091001] and the program for Science & Technology Innovation Team in Universities of Henan Province [No.19IRTSTHN029]

Author information

Authors and Affiliations

Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, Henan, China
Ting Wu, Wan-Li Li, Meng-Yao Chen, Yan-Mei Zhou & Qing-You Zhang

Authors

Ting Wu
View author publications
You can also search for this author in PubMed Google Scholar
Wan-Li Li
View author publications
You can also search for this author in PubMed Google Scholar
Meng-Yao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yan-Mei Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Qing-You Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qing-You Zhang.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 384 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, T., Li, WL., Chen, MY. et al. Prediction of Henry's law constants of CO₂ in imidazole ionic liquids using machine learning methods based on empirical descriptors. Chem. Pap. 75, 1619–1628 (2021). https://doi.org/10.1007/s11696-020-01415-8

Download citation

Received: 29 June 2020
Accepted: 03 November 2020
Published: 19 November 2020
Issue Date: April 2021
DOI: https://doi.org/10.1007/s11696-020-01415-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Prediction of Henry's law constants of CO₂ in imidazole ionic liquids using machine learning methods based on empirical descriptors

Abstract

Similar content being viewed by others

ILPC: simple chemometric tool supporting the design of ionic liquids

An Analytical Predictive Model for the Heat Capacity of Imidazolium-Type Ionic Liquids Derived Directly from Artificial Neural Network Modeling

Atomic-Level Topological Indices for Prediction of the Infinite Dilution Activity Coefficients of Oxo Compounds in Water

Introduction

Materials and methods

Dataset

Descriptors

Methods

Multi-layer perceptron (MLP)

Random forest (RF)

Multiple linear regression (MLR)

Results and discussion

Model construction with MLP and investigation of outliers

Investigation of variables selection

The effects of temperature, cation and anion descriptors, respectively, on HLC

Comparison of different machine learning methods

The validation of the model

Prediction of HLC at specific constant temperature

Investigation of reduced dataset

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Electronic supplementary material

Supplementary material 1 (PDF 384 kb)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Prediction of Henry's law constants of CO2 in imidazole ionic liquids using machine learning methods based on empirical descriptors

Abstract

Similar content being viewed by others

ILPC: simple chemometric tool supporting the design of ionic liquids

An Analytical Predictive Model for the Heat Capacity of Imidazolium-Type Ionic Liquids Derived Directly from Artificial Neural Network Modeling

Atomic-Level Topological Indices for Prediction of the Infinite Dilution Activity Coefficients of Oxo Compounds in Water

Explore related subjects

Introduction

Materials and methods

Dataset

Descriptors

Methods

Multi-layer perceptron (MLP)

Random forest (RF)

Multiple linear regression (MLR)

Results and discussion

Model construction with MLP and investigation of outliers

Investigation of variables selection

The effects of temperature, cation and anion descriptors, respectively, on HLC

Comparison of different machine learning methods

The validation of the model

Prediction of HLC at specific constant temperature

Investigation of reduced dataset

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Electronic supplementary material

Supplementary material 1 (PDF 384 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Prediction of Henry's law constants of CO₂ in imidazole ionic liquids using machine learning methods based on empirical descriptors