Introduction

The mammalian integument is an important source of novel natural products (Nicolaides 1974). Comparative chemical studies have indicated the occurrence of sterols, wax esters, hydroxyacids, ceramides, glycolipids, and other compounds in mammalian skins (Birkby et al. 1982; Lindholm et al. 1981). Some of these chemicals have also been shown to occur in the domestic sheep (Itô et al. 1971), horse (Colton and Downing 1982), and guar (Albone et al. 1986; Ishii et al. 2004). In ruminants, skin metabolites, in particular those from specialized scent glands (Albone et al. 1986), are thought to function as chemical cues in various contexts (Müller-Schwarze 1971).

Deer urinate on their tarsal glands (rub-urinate) as a means to release skin metabolites that arouse a marking or agonistic behavior (Müller-Schwarze 1971). The tarsal organ consists of a scent gland and a hair tuft that carries the scent and plays a primary role in mutual recognition and other aspects of social behavior (Müller-Schwarze 1971). The primary component of tarsal scent of the male black-tailed deer, Odocoileus columbianus, was identified as cis-4-hydroxydodec-6-enoic acid lactone (Brownlee et al. 1969), which possesses s lactone motif compatible with it functioning as a semiochemical (Schulz and Hötling 2015). However, the isolated compound and similar synthetic analogues did not duplicate the original tarsal scent in inducing expected behaviors and are currently considered inactive (Müller-Schwarze 1969). Bioactivity driven fractionation experiments further demonstrated that ketones and aldehydes may not contribute significantly to the total activity of tarsal scent (Müller-Schwarze 1971).

We report chromatographic analysis of the tarsal gland extracts of whitetail deer, Odocoileus virginianus. Metabolite differences between male and female glands were investigated using HPLC/HR-MS analysis. The predominant components of male extracts were isolated and identified, as well as the known compounds cholesterol 3-sulfate (Hoye et al. 2007) and 3-(3-hydroxyphenyl)propanoic acid (Pouchert and Campbell 1974). Structures were determined by HR-MS and spectroscopic methods, including 1H, 13C, DFQ-COSY, HSQC, HMBC, and TOCSY. Relative configurations were supported with NOESY correlations, and absolute configurations deduced by comparing these spectroscopic data with those of identical substructures.

Methods and Materials

Equipment and Reagents

Mass spectra were recorded on a TQ-S TOF LC mass spectrometer (Waters Corporation, Milford, Massachusetts, USA). Silica gel (70–230 and 230–400 mesh, Merck, Darmstadt, Germany), RP-18 reverse-phase silica gel (Merck), and Sephadex LH-20 (Merck) were used for open column chromatography. Thin layer chromatography (TLC) was conducted on glass plates pre-coated with GF254 silica gel (Merck). Spots were first visualized under UV light at 254 nm and then stained with sprays of an acidic methanol solution of 5% anisaldehyde (Sigma-Aldrich, St. Louis, Missouri, USA). The standard N-octanoylglycine was purchased from Sigma-Aldrich, St. Louis, Missouri, USA.

Collection and Extraction of Tarsal Glands

Tarsal gland scents were extracted with ethanol from the hair tuft of tarsal glands excised from female and male whitetail deer harvested during the firearm hunting season in Richland County, Wisconsin during November 2015. Tarsal glands were excised immediately after harvest and preserved in 95% ethanol at −20 °C. All deer were harvested legally by licensed hunters.

Metabolite Analysis

Metabolites were fractionated using an Acquity ultra-high performance liquid chromatography (UPLC) system (Waters), and detected at high resolution using a Xevo G2-S QToF mass spectrometer equipped with an electrospray ionization (ESI) source (Waters). An ACQUITY C18 BEH UPLC column (2.1 × 100 mm, 1.7 μm particle size; Waters) was used to profile polar metabolites, and the column temperature was set at 30 °C. The mobile phase consisted of water (A) and acetonitrile (B) with a gradient program at a flow rate of 250 μL min−1 for 10 min: 80% A for 1 min; 0% A from 1 to 9 min; and 80% A from 9 to 10 min for column equilibrium. To avoid cross-contamination of samples during the analysis, the needle was washed twice with 80% methanol after each injection. Analyte residues were also reduced by injecting 10 μL methanol as a ‘rinsing solution’ on the column after each sample injection using the elution gradient program described above.

High-resolution mass spectrometry was performed in negative electrospray ionization mode. Full-scan spectra were recorded in the range of m/z 100–1000. Nitrogen gas was used as both desolvation gas (600 L h−1) and cone gas (50 L h−1). Argon was used as the collision gas at a pressure of 5.3 × 10−5 Torr. Source and desolvation temperatures were 100 and 400 °C. The cone voltage and capillary voltage were set to 30 V and 2.8 kV. For MS spectrum and MS/MS spectrum, the collision energies for collision-induced dissociation were 5 and 40 eV. Scan time was set at 0.2 s, with an interscan delay of 0.5 s. The LockSpray™ dual electrospray ion source with internal references used for these experiments was leucine enkephalin at a concentration of 100 ng mL−1. Lock-mass calibration data at m/z 554.2615 in negative ion mode were acquired for 1 s at every 10 s interval and the flow rate was set at 5 μL min−1. Data treatment, alignment, peak picking, normalization, deconvolution, and multivariate analysis were performed using Massylynx 4.1 (Waters).

NMR Spectra

1D and 2D NMR spectra were recorded on a Bruker Avance III 900 NMR (Bruker; Billerica, Massachusetts, USA) and Varian 500 NMR spectrometer equipped with 5-mm probe (Agilent Technologies, Santa Clara, CA 95051) in CD3OD and DMSO at 298 K. Chemical shifts (δ) in ppm were referenced to tetramethylsilane (TMS) at 0.00 ppm for 1H and 13C. Coupling constants (J) were given in Hertz. The pulse conditions were: for 1H, spectrometer frequency (SF) = 899.01 MHz, spectral width (SWH) =18,518.5 Hz, pulse 90° width (P1) = 9.35 μs, acquisition time (AQ) = 1.99 s, relaxation delay (D1) = 1.00 s, number of dummy scans (DS) = 4; for 13C, SF = 226.08 MHz, SWH = 54,200.25 Hz, P1 = 25.00 μs, AQ = 0.99 s, D1 = 1.5 s, DS = 36,000; for 1H–1H COSY, SF = 899.00 and 899.00 Hz, SWH = 10,776 and 10,788 Hz, AQ = 0.26 s, D1 = 3.0 s, DS = 16; for HSQC, SF = 899.00 (1H), 226.07 Hz (13C), SWH = 14,367.8, 37,450.2 Hz, AQ = 0.035 s, D1 = 1.5 s, DS = 32; for HMBC, SF = 899.00 (1H), 226.07 Hz (13C), SWH = 10,775.9, 54,152.9 Hz, AQ = 0.19 s, D1 = 1.5 s, DS = 32; for NOESY, SF = 499.91, 499.91 Hz, SWH = 2490, 2490 Hz, AQ = 0.15 s, D1 = 1.0 s, DS = 16, mixing time (D8) = 0.86 s.

Purification of Extracts

The organic solvents were removed under reduced pressure at 40 °C by a rotary evaporator (Buchi Co., New Castle, Delaware, USA). This residue was suspended in methanol and filtered through 2 μm filter paper. The filtrate was concentrated again under reduced pressure at 40 °C to yield 0.5 g of a dark residue.

The residue was subjected to liquid chromatography over silica gel (50 g; gradient elution from 95% CHCl3/MeOH to 100% MeOH, 0.5 L total volume). TLC analysis stained by spraying with an acidic methanol solution of 5% anisaldehyde was used to guide the pooling of the eluents into 6 fractions. Fraction 4 was concentrated and left a residue (12 mg) that was further purified using Sephadex LH-20, first on a CHCl3-MeOH (1:1) column and then on a MeOH (100%) column.

Results and Discussion

Four major components were detected in the extracts of tarsal organs of male deer using UPLC/MS-MS (Fig. 1). These were termed cervidin A, B, C and D, in reference to the family Cervidae to which the whitetail deer belongs. The mass difference between each compound were m/z 14, indicating a series of analogues that differed by a single methylene group.

Fig. 1
figure 1

UHPLC/HR-MS chromatogram of component analysis of extract of tarsal organ of male deer showing four major components A-D

Purification by liquid chromatography provided samples of cervidin A (1, 0.23 mg), cervidin B (2, 0.15 mg), cervidin C (3, 0.25 mg), cervidin D (4, 0.10 mg), cholesterol 3-sulfate (5, 0.86 mg), and 3-(3-hydroxyphenyl)-propanoic acid (6, 0.25 mg). The structures of these compounds are shown in Fig. 2.

Fig. 2
figure 2

Chemical structures of compounds 16

Cervidin B (2) was obtained as a pale oil. The molecular formula was determined to be C19H34NO5 ([M − H], calcd 356.2437) by HR-ESI-MS, suggesting three degrees of unsaturation (Supplementary Information: Table S1 and Figs. S1-S4). The 1D NMR and HMBC data (Table 1 and Fig. 3a) were used to establish the following features: two carbonyl (δC 176.4, 175.5), three O-methines (δH 3.37, δC 75.1; δH 3.93, δC 80.9; δH 3.82, δC 83.4), one methyl (δH 0.90, δC 14.6), and 13 other methylene groups. The connectivity between CH-CH-CH2-CH2-CH was supported by correlations in the COSY spectrum. The resonances in 1H NMR spectrum at 3.37 ppm (H-10), 3.82 ppm (H-9), and 3.93 ppm (H-6) and long-range correlations from H-6 to C-9 and from H-9 to C-6 in the HMBC spectra suggested an 2,5-disubstituted tetrahydrofuranoid ring in which one of the substituent chains carries an adjacent sec-hydroxyl group, leading the construction of a scaffold in 2. Furthermore, the COSY correlations between H2–5 and H2–4, H2–4 and H2–3, H2–3 and H2–2, respectively, extended the side chain of core moiety of tetrahydrofuranoid ring. The long-range correlations from H2–3 to C-1 (δC 176.4) in HMBC spectrum further extended the side chain with an acid/amide group. The resonances and multiplicity of methylene [δH 3.80 (s), δC 43.6] and correlations from H2–1′ to both carbonyl carbons in HMBC spectrum indicated a connectivity of acetic group. These NMR spectra and experiments are shown in Supplementary Material, Figs. S5-S10).

Table 1 1H and 13C NMR Data for natural cervidin B (2) recorded in MeOH-d4 at 900 MHz for 1H and 225 MHz for 13C
Fig. 3
figure 3

(a) Key HMBC (blue arrows), COSY (red bold lines) and (b) NOESY correlations for cervidin B (2)

As a reference compound, N-octanoylglycine, which possesses similar connectivity sequences indicated for 2, was used to record proton and carbon resonances in the same deuterated solvents and conditions specified by Pouchert and Campbell (1974). The glycine conjugated moiety in 2 was similar to that in N-octanoylglycine. The chemical shifts of methylene-1′ and carbonyl further confirmed the HOOC-CH2-NH-CO- moiety and excluded the possibility of HOOC-NH-CH2-CO- group. The remaining methylene and methyl groups accounted for a C-7 alkyl chain on the other side of the tetrahydrofuranoid ring moiety.

The planar structure of 2 was identical to the 16-bovidic acid from the wool fat of domestic sheep, Ovis aries (Itô et al. 1971), and to the 18-bovidic acid from pelage extracts of banteng, Bos javanicus (Ishii et al. 2004). The COSY and HMBC correlations fully supported the skeleton of 2 within a hydroxyl adjacent to the tetrahydrofuranoid ring. A 2D TOCSY experiment was performed and the correlations supported the connectivity of hydroxylated tetrahydrofuranoid core moiety (Fig. 3a). Therefore, the planar structure of 2 was determined as glycine conjugated 10-hydroxy-6,9-oxidooctadecanoic acid (Ishii et al. 2004).

The stereochemistry of the hydroxylated 2,5-tetrahydrofuranoid ring has been widely studied due to its specific physiological function, which provides analogs for cervidin compounds. The relative configurations of 2 was established through analyses of the NOESY correlations, as well as comparisons of its NMR spectra with those of previously reported compounds. A detailed analysis of the NOESY spectrum of 2 revealed correlations between H-6/H-7b, H-6/H-8b, and H-8b/H-10 (Fig. 3b), placing these protons on the same face of the tetrahydrofuran ring. A complementary NOESY correlation was observed between H-9/H-7a, indicating their cis relationship to one another.

An identical sub-structural analogue of compound 2 was the hydroxyfuranoid moiety in bovidic acid (Ishii et al. 2004), which was isolated from pelage extracts of banteng. Evans et al. (2004) synthesized the hydroxyfuranoid moiety with 6S, 9R, 10R configuration, and their spectroscopic data matched those of 2. The comparison of the carbon and proton chemical shifts between 2 and each of the analogues (Supplementary Material Table S2) with possible configurations clearly suggested that 2 possesses a 6S, 9R, 10R absolute configuration. We combined the elemental composition information from mass spectra and NMR spectra to assign the remaining compounds as cervidins A, C, and D (Fig. 2). The similarity of NOESY correlations and proton and carbon resonances in cervidins A, C, and D indicated the same absolute configurations 6S, 9R, 10R as 2.

The biological function of an organic compound is determined by its properties, such as chain length, functional groups, and stereochemistry (Wyatt 2003). More specifically, chiral compounds have fixed spatial arrangements of atoms that contain no mirror plane, or centre of inversion, and thus occur in two non-superimposable mirror-image forms. Stereoisomers are characterized by differences in physicochemical properties and may trigger distinct biological responses in insects (Mori 1998) and fish (Li et al. 2018). Each cervidin compound possesses three chiral centers, potentially supporting eight stereoisomers. In addition to these chiral centers, the chain length on tetrahydrofuran ring further enriches the complexity of the cervidin mixture from male tarsal glands. The cervidins represent a new addition to the molecular diversity of deer metabolites.