1 Introduction

Liquid chromatography-mass spectrometry (LC-MS) is an integral part of drug discovery and development in the pharmaceutical industry, routinely used to reveal and characterize trace-level impurities, by-products, and degradation products, which are common occurrences in synthetic chemistry reactions [1]. The need to identify all components present alongside a synthetically produced target compound is often hampered by the fact that these undesirable molecules can be structurally similar to the target compound, yet present at much lower concentrations. LC ESI MS/MS has the sensitivity with which to detect molecules at low concentrations and the experimental flexibility to extract information that can lead to proposed empirical formulae and molecular structure [26]. There are many MS/MS techniques available, the most common one used in the pharmaceutical industry being collision induced dissociation (CID). CID involves collisions between precursor ions and neutral gaseous atoms or molecules, which results in the conversion of a precursor ion’s kinetic energy into internal energy. This increase in internal energy causes bond dissociation within the precursor ion, typically favoring the lowest energy fragmentation pathways [79]. Problems often arise upon the CID analyses of analogous compounds as product ions observed from the cleavage of the weakest bonds may not allude to the location of, or nature of, such small structural molecular changes, demanding that other analytical approaches be explored. A more recent addition to the tandem MS toolbox is electron capture dissociation (ECD), an electron-based fragmentation technique primarily used to fragment and characterize multiply charged peptide and protein cations [1012]. It involves the interaction of low energy electrons (1–5 eV) with multiply protonated species, resulting in charge reduction and bond cleavage [13, 14]. ECD of peptides/proteins has been shown to produce a simple but regular distribution of product ions that complement the CID distribution, but the ability of ECD to preserve labile covalently bound modifications gives this technique a distinct advantage over CID [1518]. In an attempt to gain yet further information regarding peptide structure, Zubarev et al. increased the ECD electron energy beyond what is common for multiply charged peptides (>10 eV), generating hotter electrons [13, 19]. Applying these higher energy electrons to multiply charged peptides is known as hot electron capture dissociation (hECD) and has been shown to produce a significant increase in side chain fragmentation producing an even greater depth of information [20, 21]. More recently, the interaction between hotter electrons and singly charged cations has been shown to induce fragmentation and is referred to as electron induced dissociation (EID) [19, 2229]. In a very recent study with pharmaceutical type molecules, EID produced extensive product ion data that largely complemented CID, was comparable to electron ionization (EI), and was more tolerant to a wider range of charge carrying species than CID [29].

ECD is somewhat notorious for its lack of efficiency compared with other techniques such as CID [10, 15]. Historically, in order to achieve usable spectra, samples of high concentration were preferred and S/N allowed to build-up by accumulating many spectra. Significant inroads have been made in optimizing methodologies to facilitate ECD of peptides on the LC time-scale [3, 4, 30]. For EID to be widely pertinent to real-world applications, it too must be coupled to chromatographic techniques, producing spectra of sufficient quality that information can be gained for compounds of a wide dynamic concentration range from the limited number of spectra achievable across a chromatographic peak. This development will show that LC-EID is achievable, plus it provides unique information not afforded by LC-CID. The power to generate multiple datasets of complementary information on one instrument in the very short LC time-scale facilitates greater in-depth analysis of complex mixtures of small molecules, which are typical of the pharmaceutical industry.

2 Experimental

2.1 Sample Preparation and Sample Introduction

The sample of cediranib was supplied by AstraZeneca, Macclesfield, UK. All solvents were purchased from Sigma-Aldrich, Dorset, UK. For the chromatography, a solution of cediranib was made to 2 mg mL–1 in methanol; 10 μL aliquots were injected on to either a Luna 3 μm C18 (2), 100 Å, 150 × 2 mm column (Phenomenex) with a mobile phase flow rate of 200 μL min–1 for a 12 min separation or a 4.5 min separation using a Zorbax SB-C18 1.8 μm 4.6 × 50 mm column (Agilent Technologies) with a mobile phase flow rate of 1 mL min–1 and a 1:4 split prior to the mass spectrometer. Gradient conditions started at 95% (H2O + 0.1% formic acid)/5% (CH3CN + 0.1 % formic acid) and finished at 5 % (H2O + 0.1% formic acid)/95% (CH3CN + 0.1 % formic acid). A 5 μg mL–1 solution of cediranib in methanol was used for occasional experiments requiring direct infusion. In such instances a syringe pump delivered the sample solution at a flow rate of 5 μL min–1.

2.2 Mass Spectrometry

Unless stated otherwise, measurements were made using a ThermoFinnigan LTQFT mass spectrometer fitted with a 7 Tesla magnet and equipped with an electrospray ion source (Bremen, Germany). The following ion source parameters were optimized for the most stable ion signal: the nitrogen sheath gas was kept between 8 and 10 arbitrary units, the auxiliary gas and sweep gas were set between 2 and 4 arbitrary units, as per the manufacturer’s software, the capillary was heated to 250 °C, and the spray voltage was held at 4–4.5 kV. The tube lens voltage was varied to deliver the optimal ion intensity.

Isolation of the precursor ions was carried out in the LTQ ion trap with a fixed isolation window of 4 m/z in order to include the full isotopic profile of the compound of interest. All LC-MS/MS experiments were performed by alternating MS and MS/MS scans throughout the chromatographic run. CID experiments were performed entirely within the LTQ ion trap using helium as a collision gas and optimized at a normalised collision energy level of 25 as per the manufacturer’s software, unless otherwise stated. EID was performed entirely within the FTICR cell using an indirectly heated dispenser cathode. To generate ‘hot’ electrons required for EID and hECD, the electron energy was set at 20 arbitrary units, as per the manufacturer’s software. The cathode offset was 2.5 V for LC-MS/MS experiments and 2.1 V for the direct infusion experiments, resulting in approximate electron energies of 17.5 and 17.9 eV, respectively. For ECD, the electron energy was set at 5 arbitrary units, as per the manufacturer’s software, and the cathode offset was 2.1 V resulting in an approximate electron energy of 2.9 eV. The electron irradiation time was fixed at 70 ms. Data was recorded using the acquisition software Xcalibur ver. 2.0.7 (Thermo Fisher Scientific Inc.) and processed using the embedded program Qual Browser. Comparative direct infusion ESI QTOF MS measurements were made on a Waters QTOF Premier (Manchester, UK). QTOF-CID experiments were performed using ion source parameters that optimized the precursor ion peak intensity, argon as a collision gas and a collision energy of 25 eV.

3 Results ad Discussion

3.1 LC-MS/MS of Cediranib

The process for manufacturing cediranib, as with all pharmaceuticals, is rigorously developed and controlled to ensure that any impurities present are well below internationally recognized safety limits. The inherent sensitivity of mass spectrometric methods allows detection of trace level impurities that are below these permitted safe levels. A process development sample of cediranib containing higher levels of impurities than the marketed compound was analyzed by LC FTICR MS, which separated and detected eleven different compounds of varying ion abundance. Those compounds are labeled 1 to 11 on the total ion chromatogram shown in Figure 1. The most intense peak, 3, relates to a precursor ion at 451.21370 m/z, which corresponds to the empirical formula C25H28N4O3F with an accuracy of 0.7 ppm and is taken to be protonated cediranib, as expected. Accurate mass measurements for 1, 2, and 4 to 11 suggested molecular formulae in each case, as given in Table 1, however molecular structures remained unknown.

Figure 1
figure 1

Total Ion Chromatogram (TIC) from LC FTICR MS analysis of cediranib. Peaks labeled 1 to 11 indicate the presence of components proven to be from the cediranib sample. A comparison of sample and control is given in supplementary information Figure S1

Table 1 Summary of product ions following LC-CID and LC-EID of cediranib labeled LC peak number 3 and analogues labeled LC peak number 1, 2, and 411

Initial CID experiments to probe the structure of cediranib (Figure 2) were carried out by direct infusion of sample into a QTOF MS. The protonated cediranib molecule fragmented to give only one product ion peak at 112 m/z (S/N > 30,000/1, data not shown), corresponding to the cleavage of the C(1)–O bond of the propylpyrrolidine arm. This provides very little useful information about the molecular structure, highlighting the need for additional analytical techniques. Identification of the unknown species by tandem mass spectrometry was hampered by their low ion abundances, particularly in the presence of such a hugely abundant target compound. Coupling MS with chromatography minimized in-source ion suppression, improving precursor ion S/N in preparation for MS/MS. In this instance, both CID and EID have been employed to target and identify the unknown species present in the cediranib sample. Figure 3a shows the spectrum obtained from the LC-CID of cediranib performed and detected in the LTQ of the LTQFT. The single product ion at 112 m/z that was observed from the QTOF CID is not observed due to the one-third cut-off, which affects MS/MS carried out in quadrupole ion traps [31]. This results in the majority of CID product ions being present in the higher m/z region, including 340 m/z, which corresponds to the neutral loss of the propylpyrrolidine moiety. By comparison, the results from LC-EID shown in Figure 3b display the opposite trend for product ion distribution, providing unique information at the lower m/z range. The peak at 112.11208 m/z corresponds to the empirical formula C7H14N with an accuracy of 0.04 ppm, confirming the identity of the propylpyrrolidine arm of the molecule. Comparison of LC-CID and LC-EID product ion spectra really highlights the complementary nature of these two techniques, supporting previous studies [29], but also demonstrates that both are achievable in the LC timescale.

Figure 2
figure 2

Structure of cediranib with the propylpyrrolidine subunit atoms labeled C(1) to C(7) and N(8)

Figure 3
figure 3

Comparison of tandem MS for protonated cediranib, eluting at 6.55 min (peak 3 in Figure 1), by (a) LC-CID performed on an LTQ ion trap MS (average of 274 scans) and (b) LC-EID performed on an FTICR MS (average of 21 scans). # Indicates external interference in the FTICR MS and should be discounted. Product ions in bold represent fragmentation relating to the propylpyrrolidine arm

3.2 LC-MS/MS of Unknown Species

The first of the unknown species to be studied was the first to elute from the column, peak 1 with a retention time of 5.69 min (see Figure 1). This relates to a mass peak 562.31897 m/z, which corresponds to the empirical formula C32H41N5O3F with an accuracy of 0.3 ppm. The LC FTICR MS/MS of this precursor ion is given in Figure 4 for (a) LC-CID and (b) LC-EID. Comparison shows the LC-EID data to be the most immediately useful data, despite being a summation of 13 times less spectra than for LC-CID. Accurate mass measurement on the product ion at 451 m/z proposed a molecular formula C25H28N4O3F with an accuracy of 0.1 ppm. This is the same empirical formula as protonated cediranib, so at a glance the likely relationship between these two molecules is established. The neutral loss from 562 to 451 m/z during EID is consistent with a fragment having the empirical formula C7H13N and is considered to be most likely due to the addition of a second propylpyrrolidine arm onto the parent molecule during synthesis. Further indication into the structural similarities can be gained by comparing the LC-EID for cediranib shown in Figure 3b directly with the LC-EID of 562 m/z in Figure 4b. LC-EID of cediranib and the unknown species at 562 m/z resulted in the product ions 70, 84, and 112 m/z and accurate mass measurements allowed the empirical formulae C4H8N, C5H10N and C7H14N to be assigned, respectively. These three fragments are all from the propylpyrrolidine arm, a conclusion supported by visually comparing Figure 4b against EI mass spectral library records for a number of molecules with alkylpyrrolidine functional groups. Also in common is a product ion at 340 m/z, confirmed by accurate mass measurement despite lower S/N in Figure 4b, which corresponds to the protonated indolphenol structure given in Figure 5a and represents cediranib minus the propylpyrrolidine arm that is such an intense product ion in itself. In total, nine product ions observed in the LC-EID of the unknown compound at 562 m/z can be explained by comparison with the LC-EID of protonated cediranib, lending credence to the proposal that both molecules are closely structurally related. Now that the unknown molecule with [M + H]+ = 562 m/z is considered to be related to cediranib, the difference being an additional propylpyrrolidine moiety, the question remains as to the location of the second propylpyrrolidine. There are eight peaks observed in Figure 4b for the LC-EID spectrum of 562 m/z, which are not observed from the LC-EID of protonated cediranib, two of which provide insight to the location of the extra functional group. First, the peak at 223 m/z has been determined to have a molecular formula of C14H27N2 (0.7 ppm error), consistent with two propylpyrrolidine arms. As the C(1)–O is the main cleavage site within this molecule, this suggests that the addition of the second propylpyrrolidine is situated on one of seven carbon atoms labeled C(1) to C(7) or the nitrogen atom N(8) as described in Figure 2. In fact, evidence for this at very low abundance is also present in the LC-CID spectrum in Figure 4a. Second, the peak at 360 m/z has been identified as a fragment with the molecular formula C21H18N3O3 (1.4 ppm error), which is proposed to correspond to the loss of one intact propylpyrrolidine moiety plus the pyrrolidine ring of the second arm and hydrogen fluoride (C7H14N1 + C4H8N1 + HF). This further implies that the addition is located on the pyrrolidine ring C(4)–C(7) or N(8). NMR analysis of the extracted impurity corroborates the MS findings and further identifies the location of the second propylpyrrolidine as the N(8) atom, giving a quaternary ion as shown in Figure 5b.

Figure 4
figure 4

(a) LC-CID (average of 157 scans) and (b) LC-EID (average of 12 scans) of 562 m/z eluting at 5.69 min, labeled peak 1 in Figure 1. Product ions labeled in bold represent fragmentation relating to the propylpyrrolidine arms, including 223 and 360 m/z, which confirm the location of the modification. # Indicates external interference in the FTICR MS and should be discounted

Figure 5
figure 5

Structures of (a) indolphenol and (b) quaternary ion

The remaining unknown compounds labeled 2, 411 in Figure 1 have been investigated using the same LC-CID and LC-EID methodology and results are summarized in Table 1. Here, the total number of product ions has been counted for each technique (LC-CID and LC-EID). It can clearly be seen that EID provides a usable number of product ions for all the precursor ions studied, and that the number of EID product ions is comparable or often greater than to CID. The most interesting fact here, though, comes from comparing the observed product ion m/z values for each technique. There is very little overlap between the observed product ions, highlighting the complementary nature of these two techniques. Delving further shows that when this complementary data is considered as a whole, the number of peaks that each unknown compound has in common with cediranib is sufficiently high to suggest a strong likelihood that each unknown species is chemically related to the cediranib. As with the compound at LC peak 1 (Figure 1), further analytical techniques may be ultimately required to confirm the full molecular structure, however in this case the unknown species labeled 2, 411 remain proprietary information of the pharmaceutical company and cannot be discussed further.

3.3 Tandem MS of [M + 2H]2+ for Cediranib

Cediranib was observed in two charge states: singly and doubly protonated. Doubly charged ions were not observed for any of the weakly abundant, co-sprayed cediranib analogues. The doubly protonated adduct of cediranib, despite being considerably less abundant, would tend to be the precursor ion of choice as multiply charged species are generally accepted to provide the superior product ion spectrum [32, 33]. Figure 6a shows CID for 226 m/z, [M + 2H]2+ for cediranib performed with a normalized collision energy of 18. As could be expected, this spectrum represents a huge improvement over the CID of the singly protonated precursor (Figure 3a) in terms of number of product ions observed and provides comparable data to EID of the singly protonated species (Figure 3b).

Figure 6
figure 6

Tandem mass spectrometry of doubly protonated cediranib by direct infusion. Comparison of (a) CID (normalized collision energy of 18) with (b) ECD (electron energy = 2.9 eV) and (c) hECD (electron energy = 17.9 eV)

As mentioned previously, the interaction between low energy electrons and multiply charged cations is considered to result in the capture of an electron and subsequent bond dissociation, a technique known as electron capture dissociation (ECD). When the ECD approach was employed to study the [M + 2H]2+ for cediranib (Figure 6b), irradiation with low energy electrons (electron energy = 2.9 eV) gave very limited information. There is weak evidence of deprotonation from [M + 2H]2+ to give [M + H]+ but the main dissociation point is cleavage of the C(1)–O bond (Figure 2) to create product ions corresponding to protonated indolphenol and the propylpyrrolidine arm. In an experiment akin to hECD, using identical instrumental conditions to the EID experiments, the impact of hotter electrons (electron energy = 17.9 eV) was able to induce a much greater degree of dissociation as shown in Figure 6c. This spectrum closely resembles EID of [M + H]+ (Figure 3b) and CID of [M + 2H]2+ (Figure 6a) in terms of relating product ions to structure. Overall, ECD was of no benefit when characterising this small molecule but hECD of the doubly charged species and EID of the singly charged species are both informative suggesting that the charge state of the precursor ion is not as crucial for the electron based fragmentation techniques.

4 Conclusion

The electron energy required to generate the optimum mass spectrum in terms of number of product ions and the S/N ratio of the peaks is of the order of 18 eV, regardless of whether the molecule is singly or doubly protonated, although if this choice is presented, the doubly protonated precursor ion would be more desirable. Using this value for electron energy, rapid EID has been achieved on the LC timescale and for compounds of a wide dynamic abundance. Perhaps most importantly, LC-EID has proven to be a very powerful tool in that it provided crucial information about the identity and location of a molecular modification to an impurity that would otherwise require preparative isolation and NMR, a time-consuming and expensive alternative. This solution facilitates identification and characterization at a much earlier stage of drug development where sample may be limited.