Introduction

Tara polyphenols comprise of gallic acid derivatives [1] composed of a quinic acid esterified to gallic acid moieties (galloylquinic acids (GQAs)) (Fig. 1), which may also further form aryl ester(s) (depsides) with one or more additional gallic acid residues to form gallotannins [2]. A large number of GQA regioisomers which differ in the position(s) and nature of the respective acyl moieties exist in natural products. These compounds are structurally related to chlorogenic acids (caffeoylquinic acids (CQAs)), where the quinic acid core is esterified to caffeic acid units [1]. An important distinguishing feature of GQAs is the depsidic bond formed between gallic acid units in tara gallotannins [1, 3]. GQAs containing depsidic bonds show a characteristic shoulder at 300 nm in their UV spectra, which increases in intensity with the number of depsidic bonds [4].

Fig. 1
figure 1

Structures of the galloylquinic acid derivatives tentatively identified in the tara extract. Peak labels correspond to Table 1 and Fig. 1; numbers specify the number of galloyl units for each compound and superscript letter distinguish between isomeric species. Compound marked with # are tentatively identified for the first time in tara. For compounds labelled *, representative structures are shown, since regioisomers could not be identified based on the available data. Furthermore, for compounds labelled with a dagger, only representative structures are presented, as m- and p-isomers could not be distinguished

Tara tannins are mainly used for the tanning of animal hides used in car interiors due to their properties conferring grain cracking resistance [5]. Other potential uses for tara tannins include their application as resins [6, 7], wood adhesives [8], as protection against steel oxidation [9], as rust removers [10] and as natural antibacterials [11].

Due to the large number of regioisomers comprising tara tannins, their analytical characterisation is challenging. Solvent extraction is commonly performed using water and methanol for low molecular weight (MW), and acetone, ethyl acetate or ethanol for high MW gallic acid derivatives [12,13,14]. For the purification of gallic acid species, fractionation of solvent extracts by size exclusion chromatography (SEC) on Sephadex LH-20 followed by semi-preparative C18 reversed-phase (RP) isolation is typically used [12, 15, 16]. Normal-phase liquid chromatography (NP-LC) fractionation followed by preparative isolation on a C18 column has also been used for the eventual NMR characterisation of gallic acid derivatives [17]. In terms of analytical separation, NP-LC has been used in the past [18, 19], but reversed-phase liquid chromatography (RP-LC) is nowadays the preferred mode of HPLC for hydrolysable tannins in general [13].

Gallic acid species show a typical UV absorbance band at 280 nm, with a bathochromic shift of 10–12 nm depending on the degree of galloylation. For mass spectrometric identification, negative electrospray ionisation is generally employed [13], with typical fragmentation involving successive neutral losses of galloyl groups (− 152 amu), and both quinic acid (m/z 191) and gallic acid (m/z 169) moieties are detected as fragment ions [20]. Clifford and co-workers [21, 22] reported a fragmentation scheme to discriminate between chlorogenic acid regioisomers, which was subsequently also applied to the identification of GQAs in tara [20]. The same relative RP-LC elution order was noted for both mono- and diacylquinic acids, regardless of the chemical nature of the acyl unit [20, 21, 23, 24].

Due to its ability to produce mainly singly charged ions, matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF-MS) has been used to determine the molecular weight distribution of protein-(tara)tannin complexes [25]. This work showed that in tara, quinic acid can be substituted with up to eight galloyl units. Pizzi et al. [26] hypothesised that tara tannins could include structures comprising an ellagic acid core esterified to gallic acid with an additional CO2 unit, although the limited resolution of the TOF instrument used made distinction between these species and GQAs difficult.

More recently, ion mobility MS (IM-MS) has also found application in the analysis of ellagitannins [27, 28] and chlorogenic acids [29,30,31]. Ion mobility proved useful for the separation of chlorogenic acid regioisomers as well as prototropic isomers differing their deprotonation sites [30]. Zheng et al. [31] recently reported the use of drift tube ion mobility and a newly developed ultra-high-resolution IM system (structures for lossless ion manipulations (SLIM)) to successfully differentiate between cis/trans-dicaffeoylquinic acid isomers.

In this study, we report an analytical approach based on hydrophilic interaction chromatography (HILIC) and RP-LC methods hyphenated to diode array detection and travelling wave ion mobility (TWIM) HR-MS for the detailed analysis of tara tannins. We have recently reported the application of a similar approach for the analysis of ellagitannins in chestnut [28]. In the present work, the particular emphasis is on the use of this methodology to distinguish between regioisomeric GQA species found in tara.

Experimental

Materials and reagents

The tara tannin sample, an acetone-water extract of Peruvian tara (Caesalpinia spinosa) pods, was obtained from Silvateam (San Michele Mondovì, Italy). HPLC grade acetonitrile, methanol (MeOH) and formic acid were purchased from Sigma-Aldrich (Johannesburg, South Africa), and deionised water was obtained using a Milli-Q water purification system (Millipore, Milford, MA, USA). Poly-dl-alanine used for IM calibration was purchased from Sigma-Aldrich.

Instrumentation and chromatographic conditions

Analyses were performed on an Acquity UPLC system hyphenated to a photodiode array (PDA) detector (500 nL flow cell, 10 mm path length) and a Synapt G2 quadrupole time-of-flight (Q-TOF) mass spectrometer equipped with an ESI source operated in negative ionisation mode (Waters, Milford, MA, USA). UV spectra were recorded from 230 to 500 nm at an acquisition rate of 20 Hz. The MS scan range was 100–2000 amu at a scan time of 0.2 s, with a capillary voltage of − 2.5 kV, a cone voltage of 15 V, a source temperature of 120 °C and an extraction cone voltage of 4.0 V. Nitrogen was used as the desolvation gas at a flow rate of 650 L/h and a temperature of 275 °C. A collision energy ramp of 10–30 V was used to obtain high-energy LC-MSE data. A sodium formate solution and leucine enkephalin (m/z = 554.2615) were used for accurate mass calibration and as lock mass calibrants, respectively.

IM measurements were performed using N2 as the drift gas at 90 mL/min and mobility T wave velocity and wave height of 448 m/s and 37.1 V, respectively. Helium and N2 flows of 180 mL/min and 90 mL/min were used for the helium and IM cells, respectively.

Travelling wave collisional cross sections (TWCCSN2s, Ω) were calculated according to [32] using poly-dl-alanine as a calibrant [33]. Briefly, corrected arrival times (td) were calculated for each ion by correcting for mass-to-charge ratio (m/z), charge (z) and reduced mass (μ) and the exponential factor X (determined from a plot of td vs. Ω′). These values were plotted against reported Ω values for poly-dl-aniline ions of m/z 230–1100 [33], and this calibration curve was used to calculate Ω values of the identified hydrolysable tannins.

For both RP-LC and HILIC separations, acidified water (0.1% formic acid) was used as mobile phase A and acetonitrile as mobile phase B. The tannin sample was prepared by dissolving 12 mg/mL in MeOH. Two microliters was injected in full loop mode using acetonitrile and MeOH/H2O (50/50, v/v) as weak and strong needle washes, respectively.

HILIC analyses

HILIC separations were performed on an XBridge Amide (150 mm × 4.6 mm i.d., 2.5 μm dp) column (Waters) by using a linear gradient of 10–12.5% A (0–20 min) at a flow rate of 1 mL/min. The column effluent was split into 1:3 between the PDA and MS detectors.

RP-LC analyses

Separations were performed on a Kinetex (100 mm × 2.1 mm i.d., 1.7 μm dp) superficially porous column (Phenomenex, Torrance, USA) using the following linear gradient: 2–20% B (0.0–40 min) and 20% B (40–50 min). The flow rate was 0.4 mL/min, with the total flow directed to the MS source.

Data processing

Data acquisition and processing were performed using MassLynx (v. 4.1) and DriftScope (v. 2.1) software (Waters). Reported mass spectra were filtered as a function of arrival time for both low- and high-collision energy data using DriftScope.

Result and discussion

Chromatographic separation

Base peak ion (BPI) chromatograms obtained for the RP-LC and HILIC-IM-MS analyses of the tara tannin extract are presented in Fig. 2a and b, respectively, where the numbers specify the number of galloyl units and the superscript letters distinguish isomeric species. For the gallic acid derivatives and gallotannins present in tara, HILIC provided relatively poor chromatographic performance compared to RP-LC. Furthermore, for the GQA derivatives, the general HILIC retention order (i.e. increasing retention with an increasing degree of galloylation) was similar to that obtained by RP-LC, since a higher number of galloyl moieties increase both the hydrophobicity (due to the aromatic groups) and polarity (due to hydroxyl groups). However, for regioisomers of the same degree of galloylation, the relative elution order often differs between the separation modes (compare the elution orders of compounds with the same number according to superscripts).

Fig. 2
figure 2

Base peak ion chromatograms obtained for the (a) RP-LC and (b) HILIC-IM-Q-TOF-MS analyses of tara hydrolysable tannins. Peak labels correspond to Table 1 and Fig. 1, numbers specify the number of galloyl units for each compound and superscript letters distinguish between isomeric species. * indicates isomers which could not be matched between RP-LC and HILIC. For experimental conditions, refer to the section “Instrumentation and chromatographic conditions

Identification of tara hydrolysable tannins

For the tentative identification of compounds 1a–c, 2a–f, 3a, 4a and 5b in tara (Table 1 and Fig. 1), RP-LC elution orders, UV spectral information and high-resolution, high-energy MS data were compared to previous literature reports [20, 21, 23, 24, 34,35,36]. UV spectra are useful in the differentiation of GQA structural isomers of the same degree of galloylation, but differing in the number of depsidic bonds, based on the relative absorbance at 300 nm due to the depsidic bond [4]. Concerning RP-LC retention, previous studies have shown that the structurally related monoacyl chlorogenic acids (CQAs) elute in the sequence 3-, 5- and 4-O-caffeoylquinic acids [34, 35]. The same elution order has been reported for the positional isomers of monoacyl GQAs in RP-LC [20, 21]. Also for diacylated CQAs and GQAs, an identical RP-LC elution order is observed in the sequence 3,4-, 3,5- and 4,5-di-O-acylquinic acids, irrespective of the nature of the acyl group [20, 21, 23, 24].

Table 1 Galloylquinic acids and gallic acids tentatively identified in tara by RP-LC and HILIC-IM-HR-MS

The substitution position of the caffeoyl moiety on the quinic acid core in monoacylated CQAs can also be identified by MSn [21, 36]. High-collision energy fragmentation spectra show a distinctive dehydrated quinic acid fragment ion at m/z 173 ([quinic acid-H2O-H]) when the acyl unit is located on the 4-position (4-O-CQA), whereas 3-O-CQA is distinguished from the 5-O-CQA positional isomer by a comparatively more intense hydroxycinnamic acid (m/z 179 [caffeic acid-H]1−) ion relative to the m/z 191 ([quinic acid-H]1−) base peak ion. On this basis, Clifford et al. [21] derived a protocol to differentiate diacyl CQAs based on MSn data. 3,5-di-O-CQA is easily identified as this compound does not show a dehydrated quinic acid ion at m/z 173, whereas 3,4-di-O-CQA is differentiated from 3,5-di-O-CQA by the relative intensity of the caffeic acid ion in the MS3 spectrum. The likelihood of intact hydroxycinnamic acid ions being formed during fragmentation decreases in the sequence 1 ≈ 5 > 3 > 4 for substituted CQAs [21, 22], with the facile loss of the C5 acyl unit rationalised [22] as being dependant on the site of the negative charge on the precursor ion. Significantly, this behaviour of acylated quinic acid derivatives is independent of the nature of the acyl moiety [20], such that the same protocol could be used to identify GCA derivatives in the present work.

Based on the criteria outlined above, a total of 45 GQAs comprising mono- to pentagalloylated quinic acid positional and structural isomers were identified in tara (Table 1 and Fig. 1), as discussed in more detail in the following sections.

Three monogalloylquinic acid species were identified based on their elution order and MSE spectra under RP-LC conditions [20]. The last eluting isomer at 2.80 min (1c) was identified as 4-O-galloylquinic acid, based on the characteristic fragment ion at m/z 173 in its MSE spectrum and its relative hydrophobicity. The two remaining isomers eluting at 1.15 min and 2.28 min were identified as 3-O-galloylquinic acid (1a) and 5-O-galloylquinic acid (1b), respectively, taking into account their elution order as well as the low-intensity acyl fragment (m/z 169) for 1b. A clean MSE spectrum could not be obtained for 3-O-galloylquinic acid under either RP-LC or HILIC conditions due to co-elution, and this compound was therefore solely identified based on RP-LC elution order. To assign the monogalloylquinic acids (1a,b,c) in HILIC, the relative peak areas between the two separations were used, which revealed the expected reversed elution order.

Assigning monogalloylquinic acid species between RP-LC and HILIC based on MSE data revealed that the protocol originally developed by Clifford and co-workers [21] for the assignment of CQAs under RP conditions is not suitable as a basis to identify the position of the galloyl unit under HILIC conditions. Comparison of MSE spectra for the two major monogalloylquinic acid isomers obtained under RP-LC (see Electronic Supplementary Material (ESM) Fig. S1) and HILIC (ESM Fig. S2) conditions shows that in the latter case, similar fragment ion intensities are observed, while the fragment ion at m/z 173, which is diagnostic for all 4-substituted quinic acid derivatives under RP-LC conditions, is also observed for both species. Clifford et al. [22] proposed that the pathways followed by regioisomeric caffeoylquinic acids depend on the position of the negative charge under RP-LC conditions. If this is correct, the apparent loss in mass spectrometric specificity for the monogalloylquinic acid species in HILIC may be a consequence of the negative charge being located on the same position for all three regioisomers, possibly due to a lower polarity [37] or higher effective pH [38, 39] of the organic-rich HILIC mobile phase.

A total of six digalloylquinic acid derivatives were detected and identified based on accurate mass information. Isomers eluting at 8.29 min, 10.12 min and 11.74 min in RP-LC are distinguished by a shoulder at 300 nm in their UV spectra, which points to these compounds containing a depsidic bond and therefore only one site of galloylation. These three regioisomers were characterised by multiple arrival times at 3.10–3.30 ms and 3.60–3.70 ms (refer to Fig. 6 and the following discussion). The last eluting isomer (2f, 11.74 min in RP-LC) shows the characteristic dehydrated quinic acid fragment ion at m/z 173, identifying this compound as 4-di-O-digalloylquinic acid (ESM Fig. S3c). The other two isomeric species eluting at 8.29 min and 10.12 min were therefore identified as 3-O-galloylquinic acid (2c, ESM Fig. S3a) and 5-di-O-galloylquinic acid (2d, ESM Fig. S3b), respectively, taking into account the elution order of their respective monogalloyl derivatives. Two of the three monodepsidic digalloylquinic acid isomers co-eluted in HILIC, and fragmentation spectra revealed no differences for the peaks at retention times of 7.71 min and 9.01 min (data not shown), so that these isomers could not be distinguished in this separation mode.

For the digalloylquinic acid isomers not containing a depsidic bond, the isomer eluting at 7.12 min in RP-LC was identified as 3,5-di-O-galloylquinic acid (2b) based on the absence of the dehydrated quinic acid fragment ion (m/z 173) in its MSE spectrum (ESM Fig. S4b). The two remaining isomers could not be differentiated based on the available MS data. However, taking into account the relative elution order for these isomers reported on a phenyl-hexyl RP column [21], the isomers eluting at 6.10 min and 10.97 min in RP-LC were tentatively identified as 3,4-di-O-galloylquinic acid (2a, ESM Fig. S4a) and 4,5-di-O-galloylquinic acid (2e, ESM Fig. S4c), respectively.

Eight trigalloylquinic acid derivatives were identified based on HR-MS data. The first isomer to elute under RP-LC conditions showed no shoulder at 300 nm in its UV spectrum, identifying this component as 3,4,5-tri-O-galloylquinic acid (3a, ESM Fig. S5f). The later eluting isomers showed an increase in arrival time in IM from 4.00 to 4.35 ms and a shoulder at 300 nm in their UV spectra (ESM Fig. S5g). Comparing the HILIC × IM (ESM Fig. S5a) and RP-LC × IM (ESM Fig. S5b) contour plots allowed for straightforward identification of 3,4,5-tri-O-galloylquinic acid in HILIC due to its unique arrival time at 4.00 ms. The MSE spectra (ESM Fig. S6) of this isomer in both HILIC and RP-LC are unique due to the presence of an extra fragment ion at m/z 477 ([M-galloyl-H2O-H]1−), which is also evident in the high-energy spectrum of 1,3,4,5-tetra-O-galloylquinic acid (4a, see further) [20]. The remaining trigalloyl species (3b–h) all produced the same fragmentation spectra in RP-LC (ESM Fig. S7a). Since seven of these isomers were resolved by RP-LC compared to only three in HILIC and identical arrival times were measured for these, assignment of individual species between HILIC and RP-LC was not possible.

The MSE spectra obtained for the GQAs in tara generally showed more extensive fragmentation under HILIC compared to RP-LC conditions, which may prove beneficial in some cases. For example, different from the case for monogalloylquinic acid species, the diagnostic intensities of ions at m/z 169, 173 and 191 for the trigalloylquinic acid isomer eluting at 9.30 min in HILIC allowed for identification of the galloylation positions as 3 and 5 due to the absence of the m/z 173 peak, although the position of the depsidic bond could not be determined (ESM Fig. S7c). This observation suggests that when comparable MS data between RP-LC and HILIC are required, such as for library searching for example, higher collision energies should be used in RP-LC.

A total of ten tetragalloylquinic acid derivatives were detected in the tara extract. The isomer eluting first in RP-LC was easy to assign as 1,3,4,5-tetra-O-galloylquinic acid (4a) by the absence of a shoulder at 300 nm in its UV spectrum and the fact that this compound showed the shortest arrival time of 4.76 ms (refer to Fig. 3). This is further supported by the unique fragmentation behaviour of this isomer, which exhibits an ion at m/z 601 ([M-galloyl-CO2H-H]1−, 0.5 ppm) under MSE conditions [20] (ESM Fig. S8).

Fig. 3
figure 3

Extracted ion HILIC × ion mobility (a) and RP-LC × ion mobility (b) contour plots for tetragalloylquinic acid derivatives (m/z 799) in tara. Panel (c) shows the corresponding extracted ion arrival time plot (identical for HILIC and RP-LC), panels (d) and (e) the corresponding extracted ion chromatograms for HILIC and RP-LC and inserts (f)–(h) the representative UV spectra of each of the positional isomers containing 1, 2 or 3 depsidic bonds, respectively. Compound numbers correspond to Table 1

IM provided valuable complementary information to RP-LC and HILIC separations regarding the number of depsidic bonds, which, together with HR-MSE data, greatly facilitated the tentative identification of positional isomers. As an example, the extracted ion HILIC and RP-LC × ion mobility contour plots for tetragalloylquinic acid species (m/z 799, [M−H]) are shown in Fig. 3. 1,3,4,5-Tetra-O-galloylquinic acid (4a, Fig. 3f), identified as discussed above, showed the shortest arrival time of all tetra-O-galloylquinic acid derivatives at 4.76 ms. Two additional arrival time clusters where recorded at 5.04–5.24 ms and 5.66 ms, respectively (Fig. 3c), both coinciding with an increasingly evident shoulder at 300 nm in the UV spectra of the respective peaks (Fig. 3g, h). (A similar increase in arrival time and relative absorbance at 300 nm as a function of number of depsidic bonds was observed for tri- and penta-O-galloylquinic acid species; Fig. S5 (see ESM) and Fig. 5, respectively). This indicates, as expected, that the effective TWCCSN2 values of GQA derivatives increase with the number of depsidic bonds (Table 1). The complementary information provided by UV and IM data therefore provides a simple means of distinguishing between positional isomers containing different numbers of depsidic bonds. Furthermore, the characteristic arrival times for particular positional isomers facilitated assignment of individual isomeric species of di-, tri-, tetra- and pentagalloylquinic acids between HILIC and RP-LC (Figs. 6, S5 (see ESM), 3 and 5, respectively), which generally showed an inverse elution order. For example, minor differences in arrival time were measured for regioisomeric tetragalloylquinic acid species (Fig. 3) which contain one depsidic bond (4b, 4c and 4d), indicating that 4c and 4d retain the same elution order in HILIC and RP-LC. Although no further information regarding the substitution patterns could be obtained, IM offered partial differentiation between these isomeric species, which was not possible based on MS data alone (ESM Fig. S9).

Based on the criteria outlined above, tetragalloylquinic acid derivatives 4b–d could be assigned as containing a single depsidic bond, while compounds 4e–j contained two depsidic bonds. Unlike 4b–d, however, the isomers containing two depsidic bonds showed nearly identical arrival times (5.66 ms) and fragmentation behaviour, and, especially in HILIC, extensive co-elution. As a consequence, assignment of individual isomers was not possible for these isomers.

Fifteen pentagalloylquinic acid derivatives were identified by HR-MS (m/z 951, C42H31O26). The pentagalloylquinic acid isomers eluting at 24.28 min (5a) and 25.19 min (5b) in RP-LC and 17.06 min (5b) and 18.31 min (5a) in HILIC produce a characteristic ion at m/z 601 ([M-2galloyl-CO2H-H]1−, 0.5 ppm) under MSE conditions (Fig. 4), indicating that an O-digalloyl group containing one depsidic bond is attached to quinic acid at position 1 for both isomers [20]. The formation of fragment ions m/z 647 ([M-2galloyl-H]1−), m/z 601 ([M-2galloyl-CO2H-H]1−) and m/z 629 ([M-2galloyl-H2O-H]1−) can be explained by an acyl transfer mechanism (ESM Fig. S10, adapted from Scheme B in [22]). However, no further information regarding the nature of the remainder of acyl groups could be obtained by either MS or UV data. The key to distinguishing between these isomers is the longer arrival time of 5a (6.28 ms vs. 5.80 ms for 5b), which indicates that this compound contains an additional depsidic bond, although its position could not be ascertained (Fig. 5). Based on this information, 5b was tentatively identified as 1-di-O-galloyl-3,4,5-tri-O-galloylquinic acid, while 5a contains a second depsidic chain at position 3, 4 or 5 on the quinic acid core.

Fig. 4
figure 4

MSE spectra obtained for pentagalloylquinic acid isomers eluting at 17.06 min (a, 5b) and 18.31 min (b, 5a) under HILIC conditions

Fig. 5
figure 5

Extracted ion HILIC × ion mobility (a) and RP-LC × ion mobility (b) contour plots for pentagalloylquinic acid derivatives in tara. Panel (c) shows the corresponding extracted ion arrival time plot (identical for HILIC and RP-LC), panels (d) and (e) the corresponding extracted ion chromatograms for HILIC and RP-LC and inserts (f)–(h) the representative UV spectra of positional isomers containing 1, 2 or 3 depsidic bonds, respectively. Compound numbers correspond to Table 1

The fragmentation spectra for the remaining pentagalloylquinic acid species at arrival times of 6.28 ms and 7.04 ms, representing isomers containing 2 and 3 depsidic bonds, respectively, were very similar (ESM Fig. S11). The only noticeable difference is the higher abundance of the fragment ion at m/z 495 [M-3galloyl-H] in the MSE spectra of the species containing 3 depsidic bonds (td = 7.04 ms), indicating that a cleavage of depsidic chains occurs more readily than single O-galloyl groups.

Multiple arrival times (3.10~3.30 ms and 3.60~3.70 ms) were observed for each of the digalloylquinic acid isomers containing a single depsidic bond (2c, 2d and 2f), as shown in Fig. 6. Since the positions of the depsidic bonds for each of these compounds have been ascertained based on MS data, the two species resolved for each compound by IM do not correspond to positional isomers. Rather, the species resolved by IM likely correspond to either prototropic isomers differing in the position of their negative charges [30, 37] or to structural isomers containing m- and p-depsidic bonds, respectively [18]. MSE spectra for each of the IM-resolved species filtered according to arrival time are presented in Fig. S12 (see ESM). These indicate no discernible differences between the species separated by IM, with the exception of the higher prevalence of sodium and potassium adducts for the 4-O-digalloylquinic acid species and especially the 5-O-digalloylquinic acid species with the later arrival times (Fig. S12, b1 and c1).

Fig. 6
figure 6

Extracted ion HILIC × ion mobility (a) and RP-LC × ion mobility (b) contour plots for digalloylquinic acid (diGQA) derivatives in tara. Panel (c) shows the corresponding extracted ion arrival time plot (identical for HILIC and RP-LC), panels (d) and (e) the corresponding extracted ion chromatograms for HILIC and RP-LC and inserts (f) and (g) the representative UV spectra of positional isomers containing one and no depsidic bonds, respectively. Compound numbers correspond to Table 1

Considering that the m- and p-isomers of digallic acid are quite well separated in both HILIC and RP-LC (Fig. 2, IIa,b) and that IM did not provide differentiation of these species (see in the following), the fact that no chromatographic separation of the IM-resolved digalloylquinic acid species is evident in either separation mode implies that these more likely correspond to prototropic isomers, although this could not be conclusively proven based on the available data.

In general, the results reported in the present work for galloylquinic acids compare well with a previous study of structurally related chlorogenic acid regioisomers [31] in terms of relative IM behaviour. 4,5-Di-O-caffeoylquinic acid and the structurally related 4,5-di-O-galloylquinic acid (2e) in tara both showed shorter arrival times relative to the respective 3,4- and 3,5-di-O-acylquinic acid species, which made it possible to assign individual non-depsidic digalloylquinic acid species between HILIC and RP-LC (Fig. 6). It was straightforward to assign the depsidic digalloylquinic acid isomers in HILIC compared to RP-LC due to the multiple arrival times observed for the proposed prototropic isomers, which subsequently indicated all three depsidic isomers (2c, 2d and 2f) co-eluted in HILIC (Fig. 6).

For several of the di-, tetra- and penta-O-galloylquinic acid derivatives (2c,d, 4b,c,d, 5a,b), evidence of on-column conversion between species was observed in RP-LC, as reflected by a characteristic-raised baseline (peak–plateau–peak) between species in the relevant extracted ion chromatograms (ESM Fig. S13). This phenomenon is well known for chiral compounds [40, 41] and has been observed previously, although not addressed in detail, for tara polyphenols [20]. Although not clearly evident in HILIC, this might be due to the relatively poor chromatographic performance of the technique. These interconversion reactions likely correspond to on-column conversions between the relevant positional isomers (e.g. between 3- and 5-di-O-galloylquinic acid (2c and 2d) and between tetragalloylquinic acid species with one depsidic bond (4b,c,d)). The pentagalloylquinic acid derivatives 5a and 5b with unique arrival times at 6.28 ms and 5.80 ms also show this characteristic raised baseline due to on-column conversion (ESM Fig. S13c). If 5b (td = 5.80 ms) is indeed 1-di-O-digalloyl-3,4,5-tri-O-galloylquinic acid, which converts to 5a (td = 6.28 ms) containing two depsidic bonds, this provides evidence that galloyl units are capable of migrating not only to different positions on the quinic acid core but also to phenolic groups on (presumably adjacent) galloyl units. In IM, the interconverting species representing the raised baseline not surprisingly showed the same arrival time as the corresponding isomers (Fig. 6).

Finally, several gallic acid derivatives were also detected in tara, including gallic acid (I), digallic acid (II), trigallic acid (III) and tetragallic acid (IV) (Fig. 1). The presence of these compounds is likely a consequence of the hydrolysis of GQAs. The di- and trigallic acid species showed the expected shoulder at 300 nm in their UV spectra due to the depsidic bond(s) (it was not possible to obtain a clean UV spectrum for tetragallic acid due to co-elution). These compounds were characterised by simple fragmentation patterns, with MSE spectra showing losses of galloyl groups and both deprotonated (m/z 169) and decarboxylated (m/z 125) gallic acid ions.

Two chromatographic peaks were observed for both digallic acid (II) and trigallic acid (III) in both HILIC and RP-LC. Identical arrival times were measured for these species (Table 1). For digallic acid, the two peaks correspond to m- and p-depsidic isomers, with p-digallic acid eluting before m-digallic acid in RP-LC [42]. It has been shown that m-digallic acid is converted to p-digallic acid in methanol [18]. Since the tara sample was dissolved in methanol, such a conversion may be responsible for the detection of p-digallic acid in this work. Furthermore, oxidation of m-digallic acid has been shown to results in the formation of ellagic acid (V) [43], which was identified based on HR-MSE and UV spectral data [4, 28, 44]. For trigallic acid, three of the four possible isomers (m-p, p-m, m-m and p-p) were observed in HILIC and only two in RP-LC. These two peaks could be assigned between the separation modes based on their relative peak areas, since both showed identical arrival times. The third trigallic acid species, detected only in HILIC, showed a slightly shorter arrival time. Only a single peak of very low intensity was observed to tetragallic acid (IV) in both HILIC and RP-LC.

Conclusions

Comprehensive analysis of a commercial tara extract by RP-LC and HILIC coupled to diode array detection and IM-HR-MS allowed for the tentative identification of 45 isomeric galloylquinic acid species in addition to six gallic acid derivatives and ellagic acid. RP-LC provided superior separation for tara tannins compared to HILIC. The complimentary information provided by the combination of UV spectral data and IM separation provided a facile methodology to identify the number of depsidic bonds and thereby to differentiate between positional isomers of the same degree of galloylation. Characteristic arrival time differences allowed in most cases for the assignment of individual isomeric species between HILIC and RP-LC. The proposed presence of prototropic isomers explains the loss in mass spectrometric specificity for the monogalloylquinic acid species as well as multiple IM arrival times for the di-O-galloylquinic acid isomers. Evidence of on-column conversion between species was observed in RP-LC, where the combination of IM and chromatographic data suggests the migration of galloyl groups to different positions on the quinic acid core as well as to galloyl phenolic groups. The results of the present study for tara tannins confirm the utility of an analytical methodology based on the combination of RP-LC and HILIC separation with IM and HR-MS for the comprehensive analysis of hydrolysable tannins. Future work should include the use of commercial reference standards, where available, to confirm the tentative assignments reported in the present work.