Abstract
A series of difficulties make inaccessible precise experimental determinations of solubilities in standard conditions for monosaccharides; in water, the monosaccharides may switch from the acyclic to cyclic form and also the cyclic forms can undergo mutarotation. There are many ways to express the structural information as structure descriptors, but the alternatives become fewer when looking for invariants with good identification abilities, and the characteristic polynomial is one of them. The disadvantage of the characteristic polynomial resides in the fact that is defined with disregard of the chemical information coming from the type of the element and the type of the bond. Here, an extension of the characteristic polynomial was used accounting for the chemical information. In water, monosaccharides exist in all forms, but only one is an invariant for all the acyclic form. If it is something in structure which associates the structure information with the solubility, then it is present in all its form including the acyclic one, and therefore, the acyclic forms can be used to derive structure–property relationships. A search for linear relationships expressing the solubility as a function of the structure of the acyclic forms of monosaccharides was conducted by using the extension of the characteristic polynomial. The search used the experimental data available to select the models that are able to estimate the solubility, with each different to the other in terms of the effects considered. Considering the obtained results, the extended characteristic polynomial provides a very good estimation capability for the solubilities of monosaccharides.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Sugars are short-chain carbohydrates, with their molecule consisting of carbon (C), hydrogen (H) and oxygen (O) atoms with the general formula Cm(H2O)n where 2 ≤ m (and usually 3 ≤ m ≤ 7) and n ≤ m (and usually n = m or n = m − 1).
Monosaccharides are the simplest carbohydrates, with general formula (CH2O)n, where n ranges from 2 (diose, H–(C=O)–(CH2)–OH) to usually 7 (when n = 3 are trioses, when n = 4 are tetroses, when n = 5 pentoses, when n = 6 hexoses and when n = 7 heptoses). There are 24 monosaccharides (see Table 1) from diose (n = 2) to hexoses (n = 6). The monosaccharides with lower number of atoms (e.g., n = 3 and n = 4) may cyclize by dimerization leading to cyclic monosaccharides with n = 6 and n = 8, respectively, and also, the monosaccharides can join together to form disaccharides. A disaccharide is formed whenever two monosaccharides (identical or not) joined. Two identical monosaccharides can form up to eleven different disaccharides (Paulus and Klockow-Beck 1999), and the numbers increase even more abruptly when different monosaccharides are connected [Schmidt (1986) counted 720 trisaccharides, 34,560 tetrasaccharides and 2,144,640 pentasaccharides] and the consequence is an enormous diversity and complexity in carbohydrate structure and chemistry.
In Table 1, ‘n = ’ stands for the n from the general formula (CH2O)n of the monosaccharides, while the split into aldoses and ketoses is based on the position of the double-bonded oxygen in their structure.
The solubilities of monosaccharides are reported in both experimental and theoretical studies (Banerjee 1996; Briciu et al. 2010; Sârbu and Briciu 2010; Kot-Wasik et al. 2014), but unfortunately are at different experimental conditions (18 °C or 20 °C (ChemicalBook 2017) or other temperatures, 10 mmHg, 18 mmHg or other different barometric pressures (ChemSpider 2017) in place of IUPAC SATP of 25 °C and 1 bar (McNaught and Wilkinson 1997), or different solvents or solvent mixtures (van Putten et al. 2014)).
Actually, there are very few experimental data on solubilities of monosaccharides reported at IUPAC SATP conditions. Gray et al. (2003) reported solubilities for five monosaccharides, while (Teles et al. 2016) have the same five ones.
In addition, the recent literature is abundant of studies in connection with the solubility of monosaccharides and of a growing interest is not only their solubility in water, but also in water-based solvents and solvent mixtures. Thus, Ye et al. (2017) reported solubility data for butanol–water mixtures, while others reported for ionic aqueous solvent mixtures including water + NaCl (Hernández-Luis et al. 2003; Ghalami-Choobar et al. 2015), water + NaBr (Zhuo et al. 2005), water + NaI (Zhuo et al. 2008), water + LiCl, water + KH2PO4 and water + NaC6H11O7 (Banipal et al. 2014a, b, 2015), water + 3-hydroxypropylammonium acetate (Singh et al. 2015), water + 1-hexyl-3-methyl imidazolium chloride (Zafarani-Moattar et al. 2017) and even in a series of ionic aqueous solvent mixtures (Carneiro et al. 2013). Also, the solubility of other compounds in mixtures of water + monosaccharides is of interest as well—Nain (2016) reporting data for water + d-mannose solvent mixtures. Solubility-related recent studies include solid–liquid and vapor–liquid phases equilibrium data for some monosaccharides as well as some disaccharides (Jónsdóttir et al. 2002) and solid–liquid phase equilibrium data for binary and ternary systems of certain monosaccharides and water (Guo et al. 2017).
The main problem in conducting studies relating the experimental measurements on carbohydrates is the scarcity of structural information from combined factors (difficulties to crystallize and the limitations in NMR analysis (Zwahlen and Vincent 2002)). Another challenge is the fact that usually the researchers conducting the structural determinations are not the same with the ones conducting the property measurements, and by this way, the reliability of the data sources is reduced, since very easily during the experimental treatment, the monosaccharides may switch from the acyclic to cyclic form and the cyclic forms can undergo mutarotation.
Other data which may be of interest reported for monosaccharides include equilibrium constants of their complexes, but also here the information available is scarce; Hacket et al. (1997) reports the equilibrium constants of complexes between β-cyclodextrin and three out of the 24 monosaccharides listed in Table 1, obviously not enough paired data to do an analysis in the series.
The increasing interest of series-based data including the monosaccharides is confirmed by a recent study of Buttersack (2017) which reports data not only for monosaccharides (most of pentoses and all hexoses) which estimated hydrophobicity by direct measurement of the hydrophobic interaction of carbohydrates and other hydroxy compounds with a C18-modified silica gel column.
The solubilities in standard conditions for 8 out of the 24 monosaccharides listed in Table 1 were involved in this study to obtain relations expressing the solubility as a function of the structure of monosaccharides in their acyclic form.
Material and method
The available data about solubilities of 8 monosaccharides (listed in Table 2) were collected from the literature. For one (fructose) was necessary an extrapolation, while for other two (allose and psicose) a conversion of the units was made.
In water exists all forms but only one is ‘unique’—e.g., does not have different conformation states—the acyclic form, and this is the reason for which it was used. If it is something in structure which explains its behavior in water, then it is present in all its forms including the acyclic one. The advantage of using the acyclic form is given by its uniqueness, which allows to do the desired inference in the whole set of structures.
The structural information as 3D geometries for the D-type isomers of acyclic forms was taken from PubChem database (CID numbers of the files given in Table 2). For one monosaccharide (CID 111123 corresponding to D-idose), the 3D geometry were built from its 2D geometry. On the 24 files containing different geometries of monosaccharides were calculated properties using Spartan’14 software in the following configuration: energy calculation with Hartree–Fock (HF) method, 6-31G* dual basis (Steele and Head-Gordon 2010); the infrared (IR) parameters (Pople et al. 1989) were computed too and thermodynamic entities were derived (CV—molar heat capacity at constant volume H—enthalpy, S—entropy, G—free enthalpy, C 0 V , H0, S0, G0 at 298.15 K and S0K, C 0K V , ZPE—zero point energy—all at 0 K).
On the CID files containing the structures listed in Table 2 were calculated the extended characteristic polynomials. Since the procedure of calculation for the extended polynomial is detailed elsewhere (Jäntschi and Bolboacă, 2016; Joiţa and Jäntschi, 2017), here it is given only in brief. The extended characteristic polynomial (ChPE) is calculated on the chemical structure containing only the heavy elements (without hydrogen atoms). The extended characteristic polynomial (ChPE) on a molecule (Mol) on the evaluation point λ is calculated depending on the choice of the atomic property (AP) and of the metric operator (MO) by using the identity (I) and the connectivity (C) matrices as the determinant (|∙|):
As any other polynomial formula, the evaluation point λ (or the argument of the polynomial) is to be used to evaluate the polynomial in a certain point of the domain of the values; it may take any real value. Equation (1) resembles the classical definition of the characteristic polynomial (ChP = |λ∙[Id] − [Ad]|, in which the identity [Id] and adjacency [Ad] matrices of the graph of the Mol molecule are here changed into [I(AP, Mol)] and [C(MO, Mol)], as it is detailed in Joiţa and Jäntschi (2017).
For a given molecule (Mol), each choice of the atomic property (AP ∈ {‘A,’ ‘B,’ ‘C,’ ‘D,’ ‘E,’ ‘F,’ ‘G,’ ‘H’}) and of the metric operator (MO ∈ {‘c,’ ‘C,’ ‘G,’ ‘G,’ ‘t,’ ‘T’}) provides a polynomial formula for the molecule.
The encodings for the atomic property (AP) provide specifications for the type of the element when the undistinguishable values of 1 from the main diagonal of the identity matrix [Id] are replaced with distinguishable values for each element as follows:
-
‘A’ provides relative (to the last element of period 7, Uuo, A(Uuo) = 294) atomic mass;
-
‘B’ provides the connection with the classical characteristic polynomial—always 1;
-
‘C’—(atomic partial) charges, as available in the PubChem files;
-
‘E’—electronegativity relative to Fluorine (4.00) on the Pauling (Pauling 1932) scale;
-
‘F’—first ionization potential, relative to the potential of ionization for Hydrogen (1312 kJ/mol);
-
‘G’—melting point temperature relative to diamond’s allotrope of Carbon (3820 K);
-
‘H’—number of attached hydrogen atoms relative to the same for CH4 (4).
The encodings for the metric operator provides specifications for the bonds when undistinguishable values of 1 from the adjacency matrix ([Ad]) for the {‘c’, ‘g’, ‘t’} alternatives and from the distance matrix ([Di]) for the {‘C’, ‘G’, ‘T’} alternatives are replaced with distinguishable values as follows:
-
Inverse (in Å−1) of the geometrical distances (in Å) replaces 1′s in [Ad] (when MO = ‘g’) and in [Di] (when MO = ‘G’);
-
Inverse of the bond order replaces 1′s in [Ad] (when MO = ‘c’) and in [Di] (when MO = ‘C’);
-
Provides the connection with the classical characteristic polynomial (on [Ad], when MO = ‘t’) and on its extension on the distance matrix (on [Di], using its inversed values, when MO = ‘T’).
Thus, when MO ∈T{‘c,’ ‘g,’ ‘t’}, Eq. 1 can be rewritten as ChPE(AP, MO, λ, Mol) = |λ∙Id(AP, Mol) − Ad(MO, Mol)| and when MO ∈ {‘C’, ‘G’, ‘T’}, then ChPE(AP, MO, λ, Mol) = |λ∙Id(AP, Mol) − Di(MO, Mol)|, where I(AP, Mol), Ad(MO, Mol) and Di(MO, Mol) are functions which replace the values of 1 from identity ([Id]), adjacency ([Ad]) and distance ([Di]) matrices depending on the selected alternatives of AP and MO.
The extended characteristic polynomials can be computed on any value of the argument (λ), but like for the replacements of the values of 1, only the values from [− 1, 1] range provide contraction mappings (Jäntschi et al. 2016). The evaluation of the polynomial was made in 2001 equally spaced points from [− 1, 1], including thus − 1, 0 and 1 in this series of evaluation points. The name of the evaluated extended characteristic polynomial was given with eight characters, L1L2L3L4d1d2d3d4, where d1, d2, d3, and d4 are the digits of the representation of the number ranging from 0 to 1000 used to provide the equally spaced points of evaluation. The letters L1 to L4 have the following assignment: L4 is ‘N’ for negative λ arguments (from − 1.000 to − 0.001) and ‘P’ for nonnegative λ arguments (from 0.000 to 1.000), L3 encodes the connectivity (MO) alternatives, L2 encodes the identity (AP) alternatives, while L1 encodes a micro-linearization to macro-linearization alternative (‘I’ leaves the values unchanged, f(x) = x, ‘R’ provides reciprocal values, f(x) = 1/x, while ‘L’ provides the logarithm of the absolute values, f(x) = ln(|x|)). A total number of 288,144 (2001∙8∙6∙3) possible value-based representations of the structure result by this way. For a set of molecules, each valid (non-null) representation provides a structure descriptor.
Two alternatives were considered: to obtain a relationship expressing the solubility from calculated properties (the ones calculated with the Spartan’14 software) and to obtain a relationship expressing the solubility from the evaluations of the extended characteristic polynomial.
Linear regressions were involved, when the search was conducted with the following alternative models [Eq. 2 with one (x) descriptor, Eqs. 3 to 5 with two (x1 and x2) descriptors].
Equation (2) is simple linear regression, Eq. (4) is multiple (bi-varied) linear regression with only additive effects among descriptors, and Eq. (3) is multiple (bi-varied) linear regression with only multiplicative effects among descriptors, while Eq. (5) is multiple (bi-varied) linear regression quantifying for both additive and multiplicative effects among descriptors.
Adjusted determination coefficients allow selection of the best explanatory models, and their Fisher Z transformations (Fisher 1915, 1921) allow comparison among them.
The condition to use Eqs. (2)–(5) is that the dependent variable reconstitute a normal distribution. Otherwise, a series of transformations of the data are required (Bolboacă and Jäntschi 2013).
Results and discussion
On the 24 files containing different geometries of monosaccharides, the series of calculated properties are listed in Table 3 (Spartan’14 software, energy calculation with HF 6-31G* dual basis), and their associated thermodynamic quantities are listed in Table 4 (Spartan’14 software, computing IR parameters).
Unfortunately, the solubility listed in Table 2 is a little departed from normality. A histogram of the values easily proves this (see Fig. 1).
The logarithm of the solubilities reconstitute the normality much better (see Fig. 2).
Because of very few data (only 8 values), it is difficult to do a test for normality. The alternatives are Anderson–Darling (AD) and Kolmogorov–Smirnov (KS) (Bolboacă and Jäntschi 2009). The KS test provides a value of 0.24358 (p = 0.644) when solubility is tested for normality and a value of 0.14593 (p = 0.985) when ln(solubility) is tested for normality. The AD test provides a value of 0.56634 (p = 0.656) when solubility is tested for normality and a value of 0.2579 (p = 0.949) when ln(solubility) is tested for normality. The p values show that the ln(solubility) is much closer to normality than the solubility. Therefore, the logarithm transformation was applied to the solubility.
Searching for regressions of type Eqs. (2)–(5) with ln(solubility) (as well as with solubility) as dependent variable and whole pool of properties listed in Tables 3 and 4 as independent variables with potential explanatory power was unsuccessful. Only a very poor association between ln(solubility) and SolvE (solvation energy) was identified (r = 0.32, r2 = 0.10, \(r_{\text{adj}}^{2}= 1-(1-r^2)(8-1)/(8-2)\) = − 0.05), definitely insufficient to be taken into account. Only its intercept is statistically significantly different from zero. A possible explanation is the fact that the solubility (and its logarithm) is almost orthogonal on the other properties (average of the correlation of ln(solubility) with the properties listed in Tables 3 and 4 is 0.05).
Therefore, seeking relationships estimating solubility from structure information provided by the extended characteristic polynomial seems appropriate and is definitely helpful. By applying Eqs. (2)–(5) were selected the best explanatory structure-based descriptors as the ones providing the best association defined by the models of the equations; their names and values for the compounds with known solubility are given in Table 5.
In Table 5, Selection from column indicates which alternative (one of the equations: 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b) identified the descriptor as able to explain the ln(solubility).
As can be seen in Table 5, all selected descriptors came from geometry-based approach (‘G’ letter in the third position, when the adjacency matrix ([Ad]) is replaced with the distance matrix ([Di]) in the former definition of the characteristic polynomial (|λ∙Id(AP, Mol) − Di(MO, Mol)|). It may seem surprising, but is not, since the distance matrix brings further knowledge than the adjacency (let us remember that the 1′s are in the same position in the distance matrix as are in the adjacency; the change is on 0′s, and some of 0′s from adjacency are replaced with nonzero values based on the distances between the atoms). Also this selection of the geometry as the predominant metric explains why the properties derived from energetic calculations given in Tables 3 and 4 fail to estimate the solubility, and there are many geometrical arrangements sharing same energy. Regarding the atomic property, two of them were not selected in the best explanatory descriptors: number of the hydrogen atoms (‘H’) and relative atomic mass (‘A’), and it seems by this way that those have a little influence on the solubility. It may seem not quite logical, when we think about the dissociation process accompanying the dissolving, but if it is taken into account that each hydrogen is differently involved in this dissociation process, then it may seem reasonable that their number does not play an important role while the partial charges (‘C’) do.
Melting point temperature of the elements (‘G’) provides the first level of approximation for the association between the chemical structure and solubility (RGGN0477 = 1/GG(− 0.477) evaluated polynomial selected by Eq. 2a model in Table 5).
Splitting effects models [Eq. (3), additive, and Eq. (4), multiplicative] select both the same two atomic properties explaining the best association topology of the molecule (‘B’) and the first ionization potential (‘F’), and while taking into consideration both additive and multiplicative effects [in Eq. (5)], the pair of explanatory atomic properties is changed to electronegativity (‘E’) and partial charges (‘C’). Because the same pair of atomic properties was selected for both additive and multiplicative effects models, it seems reasonable to account for both effects, and therefore, the model selected by Eqs. (5) should be used for predictions [see the models listed in Table 6, where UV(λ) is the value of the extended characteristic polynomial UV in λ, when U encodes the atomic property (AP) and V encodes the distance metric (MO)].
In Table 6, Equation indicates the type of the equation and it is one of the alternatives given as Eqs. (2)–(5), and Equation for ln(Solubility) gives the selected polynomials (GG selected by the equation of type 2a, BG and FG selected by Eq. 3a, and so on), their evaluation points (λ) which provides the best explanatory power for the model (λ = − 0.477 for GG polynomial in Eq. 2a, λ = 0.987 for BG and λ = 0.285 for FG polynomial for the selections made using Eq. 3a, and so on), as well as the meaning of the encodings for the names of the polynomials (GG polynomial was obtained using ‘G’ alternative for atomic properties—melting point temperature relative to diamond’s allotrope of Carbon, ‘G’ alternative for metric operator—calculations made using the Cartesian coordinates from geometric 3D model of the molecule) as well as the selected alternative for the connectivity ([C] = [Di], the distance matrix in the case of the model obtained using the equation of type 2a. Also Equation for ln(Solubility) gives the coefficients of the equations, as it is \(- 2.35_{ \pm 0.24}\) for Eq. (2a), where the value of − 2.35 is the intercept for the model of type 2a, given along with their 95% confidence intervals and at 5% risk being in error, the value of − 2.35 is subject to change with ± 0.24. The Statistics column gives the determination coefficient (r2) and adjusted (due to the size of the sample) one (\(r_{\text{adj}}^{2}\)).
The equation listed as entry 5a in Table 6 was used for estimation (on the compounds with known solubility) and for prediction (for the rest of the compounds). The results are given in Table 7. As can be seen in Table 7, the estimated values are close to the measured values for the solubility. The greatest departure is at CID 90008 (corresponding to d-psicose), and it is below 2%, which is safely in the range of the experimental error of the measurements on relative scales (5%).
In Table 7, ln(Solubility) column contains the values of the ln(Solubility) as were estimated (for the first 8 monosaccharides) and, respectively, predicted (for the rest of them) by the equation listed as entry 5a in Table 6, while Solubility column contains the exponent of those values (exp(ln(solubility) = solubility) ready to be used.
The estimated values show a solubility of d-idose (CID 111123) and d-ribose (CID 5311110) similar to the solubility of d-psicose (CID 90008), while d-dihydroxyacetone (CID 670) seems to have the highest solubility (about 0.41 mol/mol, see Table 7). According to the literature, at 20 °C (not at 25 °C), d-dihydroxyacetone solubility is greater than 930 g/l (SCCS SCCS 2010; more than 10 mol/l) sustaining thus this estimation of its solubility for 25 °C. The second solubility among the selected monosaccharides seems to have d-altrose (about 0.35 mol/mol, see Table 7), but unfortunately no experimental data are available for comparison.
For glycolaldehyde (CID 756), d-erythrose (CID 94176), d-glyceraldehyde (CID 751), d-erythrulose (CID 5460177), d-threose (CID 439665), d-lyxose (CID 65550), d-ribulose (CID 151261) and d-xylulose (CID 5289590), the estimation provides a solubility similar to the solubility of d-xylose (CID 644160), about 0.13 mol/mol, which is likely since in water the monosaccharides suffer a complex process of mutarotation, leading to a series of different forms, as were shown in (Curtius et al. 1968).
Please note that in the absence of experimental data available, the results provided as predicted solubilities for monosaccharides (the last 16 entries in Table 7) are not validated. In order to be validated, further measurements on solubilities of monosaccharides must be conducted.
Conclusions
The experimental measurements of the solubilities of monosaccharides are scarce and rarely reported at standard conditions. Since solubility is of great importance for the biological role of the monosaccharides and the search for property–property relationships expressing the solubility was unsuccessful, a search for a structure–property relationship was conducted.
By using the extended characteristic polynomial, a structure–property relationship was found with great capacity of estimation of the solubility for the monosaccharides (\(r_{\text{adj}}^{2}\) = 0.9996). The relation suggests that the solubility of the monosaccharides is strongly dependent on the geometry and the atomic partial charges and the electronegativities of the elements play the main role in its expression. The relation was used to predict the solubility for 16 monosaccharides, when plausible solubilities were obtained.
Abbreviations
- IUPAC SATP:
-
International Union of Pure and Applied Chemistry Standard Ambient Temperature and Pressure
- NMR:
-
Nuclear Magnetic Resonance
- CID number:
-
PubChem Compound Identifier
- eqs.:
-
equations (when are more than one)
References
Banerjee S (1996) Estimating water solubilities of organics as a function of temperature. Water Res 30(9):2222–2225. https://doi.org/10.1016/0043-1354(96)00306-5
Banipal PK, Aggarwal N, Banipal TS (2014a) Study on interactions of saccharides and their derivatives with potassium phosphate monobasic (1:1 electrolyte) in aqueous solutions at different temperatures. J Mol Liq 196:291–299
Banipal PK, Hundal AK, Aggarwal N, Banipal TS (2014b) Studies on the interactions of saccharides and methyl glycosides with lithium chloride in aqueous solutions at (288.15 to 318.15) K. J Chem Eng Data 59(8):2437–2455. https://doi.org/10.1021/je5001523
Banipal PK, Singh V, Aggarwal N, Banipal TS (2015) Hydration behaviour of some mono-, di-, and tri-saccharides in aqueous sodium gluconate solutions at (288.15, 298.15, 308.15 and 318.15) K: volumetric and rheological approach. Food Chem 168:142–150. https://doi.org/10.1016/j.foodchem.2014.06.104
Bolboacă S-D, Jäntschi L (2009) Distribution fitting 3. Analysis under normality assumption. BUASVMCN Hortic 66(2):698–705
Bolboacă S-D, Jäntschi L (2013) Quantitative structure–activity relationships: linear regression modelling and validation strategies by example. Biomath 2(1):a1309089. https://doi.org/10.11145/j.biomath.2013.09.089
Briciu RD, Kot-Wasik A, Wasik A, Namiesnik J, Sarbu C (2010) The lipophilicity of artificial and natural sweeteners estimated by reversed-phase thin-layer chromatography and computed by various methods. J Chromatogr A 1217:3702–3706. https://doi.org/10.1016/j.chroma.2010.03.057
Buttersack C (2017) Hydrophobicity of carbohydrates and related hydroxy compounds. Carbohyd Res 446–447:101–112. https://doi.org/10.1016/j.carres.2017.04.019
Carneiro AP, Held C, Rodríguez O, Sadowski G, Macedo EA (2013) Solubility of sugars and sugar alcohols in ionic liquids: measurement and PC-SAFT modelling. J Phys Chem B 117(34):9980–9995. https://doi.org/10.1021/jp404864c
Chambers CC, Hawkins CD, Cramer CJ, Truhlar DG (1996) Model for aqueous solvation based on class IV atomic charges and first solvation shell effects. J Phys Chem 100(40):16385–16398. https://doi.org/10.1021/jp9610776
ChemicalBook (2017) D(-)-Fructose. Accessed January 12, 2017. http://chemicalbook.com/chemicalproductproperty_en_cb6139083.htm
ChemSpider (2017) D-(-)-Lyxose. Accessed January 12, 2017. http://chemspider.com/Chemical-Structure.58993.html
Curtius H-C, Völlmin JA, Müller M (1968) Determination of the mutarotation of monosaccharides by gas chromatography - Elucidation of different forms of fructose and sorbose by gas chromatography, infrared and mass spectroscopy. Fresenius’ Zeitschrift für Analytische Chemie 243(1):341–349. https://doi.org/10.1007/BF00530708
Fisher RA (1915) Frequency distribution of the values of the correlation coefficient in samples of an indefinitely large population. Biometrika 10(4):507–521. https://doi.org/10.2307/2331838
Fisher RA (1921) On the ‘probable error’ of a coefficient of correlation deduced from a small sample. Metron 1:3–32
Flood AE, Addai-Mensah J, Johns MR, White ET (1996) Refractive index, viscosity, density, and solubility in the system Fructose + Ethanol + Water at 30, 40, and 50 C. J Chem Eng Data 41(3):418–421. https://doi.org/10.1021/je950188f
Fukada K, Ishii T, Tanaka K, Yamaji M, Yamaoka Y, Kobashi K-I, Izumori K (2010) Crystal structure, solubility, and mutarotation of the rare monosaccharide D-Psicose. Bull Chem Soc Jpn 83(10):1193–1197. https://doi.org/10.1246/bcsj.20110107
Ghalami-Choobar B, Shafaghat-Lonbar M, Mossayyebzadeh-Shalkoohi P (2015) Activity coefficients determination and thermodynamic modeling of (NaCl + Na2HCit + glucose + H2O) system at T = (298.2 and 308.2) K. J Mol Liq 212:922–929. https://doi.org/10.1016/j.molliq.2015.10.051
Gray MC, Converse AO, Wyman CE (2003) Sugar monomer and oligomer solubility. data and predictions for application to biomass hydrolysis. Appl Biochem Biotechnol 105–108:179–193. https://doi.org/10.1385/ABAB:105:1-3:179
Guo L, Wu L, Zhang W, Liang C, Hu Y (2017) Experimental measurement and thermodynamic modeling of binary and ternary solid–liquid phase equilibrium for the systems formed by L-arabinose, D-xylose and water. Chin J Chem Eng 25(10):1467–1472. https://doi.org/10.1016/j.cjche.2017.02.005
Hacket F, Coteron J-M, Schneider H-J, Kazachenko VP (1997) The complexation of glucose and other monosaccharides with cyclodextrins. Can J Chem 75(1):52–55. https://doi.org/10.1139/v97-007
Hernández-Luis F, Amado-González E, Esteso MA (2003) Activity coefficients of NaCl in trehalose-water and maltose-water mixtures at 298.15 K. Carbohydr Res 338(13):1415–1424. https://doi.org/10.1016/s0008-6215(03)00177-0
Jäntschi L, Bolboacă S-D (2016) Extending characteristic polynomial from graphs to molecules. In: International conference on sciences, University of Oradea, Faculty of Sciences, May 13–14, 2016, Oral presentation, Mathematics section, (Friday) May 13, 2016, pp 1500–1520
Jäntschi L, Bálint D, Bolboacă S-D (2016) Multiple linear regressions by maximizing the likelihood under assumption of generalized Gauss-Laplace distribution of the error. Comput Math Methods Med 2016:8578156. https://doi.org/10.1155/2016/8578156
Joiţa DM, Jäntschi L (2017) Extending the characteristic polynomial for characterization of C20 fullerene congeners. Mathematics 5(4):84. https://doi.org/10.3390/math5040084
Jónsdóttir SÓ, Cooke SA, Macedo EA (2002) Modeling and measurements of solid-liquid and vapor-liquid equilibria of polyols and carbohydrates in aqueous solution. Carbohydr Res 337(17):1563–1571. https://doi.org/10.1016/S0008-6215(02)00213-6
Kot-Wasik A, Wasik A, Namiesńik J, Sârbu C, Naşcu-Briciu RD (2014) Retention modeling of some saccharides separated on an amino column. J Liq Chromatogr Relat Technol 37(10):1383–1396. https://doi.org/10.1080/10826076.2013.789806
Kozakai T, Fukada K, Kuwatori R, Ishii T, Senoo T, Izumori K (2015) Aqueous phase behavior of the rare monosaccharide D-Allose and X-ray crystallographic analysis of D-Allose dihydrate. Bull Chem Soc Jpn 88(3):465–470. https://doi.org/10.1246/bcsj.20140337
McNaught AD, Wilkinson A (1997) IUPAC. Compendium of chemical terminology, 2nd edn. Blackwell Scientific Publications, Oxford
Nain AK (2016) Physicochemical study of solute-solute and solute-solvent interactions of glycine, l-alanine, l-valine and l-isoleucine in aqueous-d-mannose solutions at temperatures from 293.15 K to 318.15 K. J Chem Thermodyn 98:338–352. https://doi.org/10.1016/j.jct.2016.03.012
Pauling L (1932) The nature of the chemical bond. IV. The energy of single bonds and the relative electronegativity of atoms. J Am Chem Soc 54(9):3570–3582. https://doi.org/10.1021/ja01348a011
Paulus A, Klockow-Beck A (1999) Structures and properties of carbohydrates. In: Paulus A, Klockow-Beck A (eds) Analysis of carbohydrates by capillary electrophoresis. Vieweg Teubner, Wiesbaden, pp 28–48
Pople JA, Head-Gordon M, Fox DJ (1989) Gaussian-1 theory: a general procedure for prediction of molecular energies. J Chem Phys 90(10):5622–5629. https://doi.org/10.1063/1.456415
Sârbu C, Briciu RD (2010) Lipophilicity of natural sweeteners estimated on various oils and fats impregnated thin-layer chromatography plates. J Liq Chromatogr Relat Technol 33(7–8):903–921. https://doi.org/10.1080/10826071003766021
SCCS (2010) SCCS/1347/10. Scientific committee on consumer safety, opinion on dihydroxyacetone, European commission, directorate-general for health & consumers. SCCS, Cambridge
Schmidt RR (1986) New methods for the synthesis of glycosides and oligosaccharides—are there alternatives to the Koenigs–Knorr method? Angew Chem, Int Ed Engl 25:212–235. https://doi.org/10.1002/anie.198602121
Singh V, Chhotaray PK, Gardas RL (2015) Effect of protic ionic liquid on the volumetric properties of ribose in aqueous solutions. Thermochim Acta 610:69–77. https://doi.org/10.1016/j.tca.2015.04.023
Steele RP, Head-Gordon M (2010) Dual-basis self-consistent field methods: 6-31G* calculations with a minimal 6-4G primary basis. Mol Phys 105(19–22):2455–2473. https://doi.org/10.1080/00268970701519754
Teles ARR, Dinis TBV, Capela EV, Santos LMNBF, Pinho SP, Freire MG, Coutinho JAP (2016) Solubility and solvation of monosaccharides in ionic liquids. Phys Chem Chem Phys 18(29):19722–19730. https://doi.org/10.1039/C6CP03495K
van Putten RJ, Winkelman JGM, Keihan F, van der Waal JC, de Jong E, Heeres HJ (2014) Experimental and modeling studies on the solubility of d-Arabinose, d-Fructose, d-Glucose, d-Mannose, Sucrose and d-Xylose in methanol and methanol-water mixtures. Ind Eng Chem Res 53(19):8285–8290. https://doi.org/10.1021/ie500576q
Ye T, Qu H, Gong X (2017) Measurement and correlation of liquid-liquid equilibria for the ternary systems of Water + d -Fructose + 1-Butanol, Water + d -Glucose + 1-Butanol, and Water + d -Galactose + 1-Butanol at (288.2, 303.2 and 318.2) K. J Chem Eng Data 62(8):2392–2399. https://doi.org/10.1021/acs.jced.7b00302
Zafarani-Moattar MT, Shekaari H, Mazaher Haji Agha E (2017) Thermodynamic studies on the phase equilibria of ternary ionic liquid, 1-hexyl-3-methyl imidazolium chloride + d-fructose or sucrose + water systems at 298.15 K. Fluid Ph Equilibria 436:38–46. https://doi.org/10.1016/j.fluid.2016.12.024
Zhuo K, Zhang H, Wang Y, Liu Q, Wang J (2005) Activity coefficients and volumetric properties for the NaBr + maltose + water system at 298.15 K. J Chem Eng Data 50(5):1589–1595. https://doi.org/10.1021/je050064v
Zhuo K, Liu H, Zhang H, Liu Y, Wang J (2008) Activity coefficients and volumetric properties for the NaI + maltose + water system at 298.15 K. J Chem Eng Data 53(1):57–62. https://doi.org/10.1021/je700366w
Zwahlen C, Vincent SJF (2002) Determination of 1H homonuclear scalar couplings in unlabeled carbohydrates. J Am Chem Soc 124(24):7235–7239. https://doi.org/10.1021/ja017358v
Acknowledgements
I address thanks to the anonymous reviewers of which helpful comments were very useful and enriched the present paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Jäntschi, L. Structure–property relationships for solubility of monosaccharides. Appl Water Sci 9, 38 (2019). https://doi.org/10.1007/s13201-019-0912-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13201-019-0912-1