Introduction

Postmortem redistribution (PMR) is a well-recognized phenomenon in forensic toxicology, which can complicate interpretation of drug concentrations and conclusions on the cause of death in medico-legal investigations. The PMR phenomenon combines concentration changes after death due to degradation, neo-formation, or diffusion processes (e.g., from high drug concentrations in the gastrointestinal tract or the lungs) and seems to be strongly dependent on analyte characteristics and the individual case circumstances such as the distribution of the analyte at time of death [1]. Furthermore, the time between death and sampling time point should be of major importance. Blood from peripheral sites—usually obtained from the femoral vein—is mostly agreed to be the matrix of choice for quantitation of xenobiotics because of its distance from central organs and the gastrointestinal tract [2]. However, even peripheral blood was shown to exhibit massive time-dependent PMR with drug concentrations decreasing or increasing up to 300 % [3]. Extensive blood clotting, collapse of the veins, putrefaction processes, or exsanguination can lead to unavailability of blood for analysis. If no blood at all is available, alternative matrices such as liver, kidney, muscle, lung, etc., have to be considered for toxicological analysis [4]. However, the interpretation of quantitation results in alternative postmortem matrices provides even more pitfalls due to lacking reference values or unpredictable PMR processes. PMR within solid tissues can only be assumed based on site-dependent concentration differences in samples taken at autopsy. Nevertheless, information about time-dependent redistribution in solid tissues is needed as these matrices are used for forensic investigations. Collection of solid tissue samples at more than one time point is only possible in the time window between admission of the body at the mortuary and autopsy with biopsy sampling before autopsy representing the only practicable strategy. Quantitation in biopsies is not a common practice in forensic toxicology. Only a few papers focused on quantitation in biopsy samples, e.g., in renal biopsies from rats and humans for the immunosuppressant drugs mycophenolic acid and tacrolimus. Tissue samples were homogenized either mechanically or enzymatically and then underwent liquid-liquid extraction (LLE). Sensitive instrumentation such as liquid chromatography (LC) coupled to mass spectrometry (MS) is required as only minimal sample amount of about 20 mg/biopsy is usually available [5, 6].

In postmortem toxicology, substances of different pharmacological classes such as opiates, benzodiazepines, antidepressants, neuroleptics, or drugs of abuse are of interest with often limited knowledge on consumed drugs prior to analysis. As ideal sample preparation and analytical instrumentation may vary between matrices and substance classes, often different analytical methods are used for quantitation [7]. However, sufficient sample amount is necessary which is often difficult for postmortem samples especially with biopsy samples. Multi-analyte procedures aiming at covering a multitude of relevant compounds seems to represent an interesting alternative. Unspecific extraction methods such as protein precipitation (PP) allow simultaneous extraction of various substance classes and are fast and cheap. However, extract purity after PP is low especially with postmortem matrices and tissue extractions which in turn can lead to severe matrix effects [8]. These problems are reduced using more sophisticated LLE or solid-phase extractions (SPE) [9]. On the other hand, LLE and SPE are more specific and may not work for simultaneous extraction of various substance classes. Several methods covering a large amount of analytes have been published in recent years and are summarized in [10].

Another major challenge in quantifying that many compounds with different therapeutic and/or toxic concentration ranges in different matrices in only one analytical run is to find calibration models which can handle the huge dynamic range without source or detector saturation of the MS. Different strategies to overcome these problems have been applied including the use of offset voltage parameters, less-sensitive product ions, polarity switching, or 13C product ions [11, 12]. The naturally occurring 13C isotope is usually about 1.1 % of all carbon atoms producing a much lower MS signal compared with 12C. Using the 13C isotope as precursor ion can help avoid saturation effects. The 13C isotope is stable, and its distribution underlies geological differences [13]. To the best of our knowledge, only Bergeron et al. tested the use of a 13C isotope as precursor for the simultaneous quantitation of mebeverine and its metabolite as both show large differences in their therapeutic plasma concentration ranges. However, in their case, application of 13C precursors did not give the desired result and the strategy of polarity switching was used instead for final quantitation [11].

Although forensic toxicologists are aware of the PMR effect, it is still under-explored. Especially systematic studies on time-dependent postmortem changes are limited and are difficult to conduct due to practical and even more important due to ethical reasons. To provide better insight into time-dependent PMR processes in blood as well as in different postmortem tissues, a novel, sensitive method should be developed and validated using LC-MS/MS allowing simultaneous quantitation in low amount of biopsy samples of different postmortem tissues for a multitude of over 80 forensically relevant compounds. This method should create the prerequisites for further studies on time-dependent PMR.

Materials and methods

Chemicals and reagents

Methanolic or acetonitrilic solutions (1 mg/mL) of 6-monoacetylmorphine (MAM), agomelatine, alprazolam, amisulpride, amitriptyline, amphetamine, nortriptyline, aripiprazole, benzoylecgonine (BEC), bromazepam, bupropion, hydroxybupropion (OH-bupropion), chlorprothixene, clonazepam, 7-aminoclonazepam, citalopram, clomipramine, norclomipramine, clozapine, norclozapine, cocaine, codeine, diazepam, dihydrocodeine (DHC), doxepine, nordoxepine, duloxetine, ecgonine methyl ester (EME), fentanyl, fluoxetine, norfluoxetine, flupentixol, flunitrazepam, 7-aminoflunitrazepam, flurazepam, N-desalkylflurazepam, hydroxyethylflurazepam (OH-flurazepam), haloperidol, hydrocodone (HC), hydromorphone (HM), imipramine, ketamine, levomepromazine, lorazepam, meta-chlorophenylpiperazine (mCPP), methadone, 1,5-dimethyl-3,3-diphenylpyrrolidine (EDDP), methamphetamine, 3,4-methylenedioxyamphetamine (MDA), 3,4-methylenedioxy-N-ethylamphetamine (MDEA), 3,4-methylendioxymethamphetamine (MDMA), methylphenidate (MPH), midazolam, α-hydroxymidazolam (OH-midazolam), mirtazapine, morphine, nitrazepam, nordazepam, olanzapine, opipramol, oxazepam, oxycodone, oxymorphone, paroxetine, perazine, pethidine, pipamperone, promazine, quetiapine, risperidone, 9-hydroxyrisperidone (OH-risperidone), sertraline, trazodone, temazepam, tramadol, O-desmethyltramadol (ODMT), triazolam, trimipramine, venlafaxine, O-desmethylvenlafaxine (ODMV), zolpidem and zopiclone and methanolic solutions of the deuterated internal standards (IS; 0.1 mg/mL), MAM-d3, alprazolam-d5, amisulpride-d5, amitriptyline-d3, amphetamine-d6, aripiprazole-d8, citalopram-d6, clomipramine-d3, clonazepam-d4, 7-aminoclonazepam-d4, clozapine-d3, cocaine-d3, BEC-d3, EME-d3, codeine-d3, diazepam-d5, duloxetine-d3, fentanyl-d5, flunitrazepam-d3, 7-aminoflunitrazepam-d3, fluoxetine-d6, norfluoxetine-d6, N-desalkylflurazepam-d4, flurazepam-OH-d4, haloperidol-d4, ketamine-d4, lorazepam-d4, mCPP-d8, MDMA-d5, methadone-d9, methamphetamine-d9, methylphenidate-d9, midazolam-d4, midazolam-OH-d4, morphine-d3, nordazepam-d5, oxazepam-d5, paroxetine-d6, pethidine-d4, pipamperone-d10, quetiapine-d8, risperidone-d4, temazepam-d5, tramadol-d3, trazodone-d6, triazolam-d4, trimipramine-d3, venlafaxine-d6, zolpidem-d6 and zopiclone-d4 were obtained from AdipoGen (Liestal, Switzerland), Lipomed (Arlesheim, Switzerland), LGC (Wesel, Germany), or Cerilliant (delivered by Sigma-Aldrich, Buchs, Switzerland). Water was purified with a Purelab Ultra Millipore filtration unit (Labtech, Villmergen, Switzerland) and acetonitrile of HPLC grade was obtained from Fluka (Buchs, Switzerland). All other chemicals used were from Merck (Zug, Switzerland) and of the highest grade available.

Biosamples

Human blank whole blood samples from the femoral veins (femoral blood (FB)) and the heart (heart blood (HB)), cerebrospinal fluid (CSF) from the lumbar spine, urine, muscle, liver, kidney, lung, spleen, brain tissue, and adipose tissue (AT) were used for method development and validation. They were obtained from postmortem cases that were submitted to the authors’ laboratory and autopsied by order of the state attorney. Ethical approval was obtained by the Cantonal Ethics Committee of Zurich (approval number 42.2005).

Sample preparation

Body fluid and tissue samples were extracted by a two-step LLE; 20 mg body tissue (muscle, liver, kidney, lung, spleen, adipose, or brain tissue) were weighed into 2 mL plastic tubes containing metal beads and homogenized using a Fast Prep®-24 Instrument (MP Biomedicals, Illkirch, France). Twenty microliters of body fluid (blood, CSF, or urine) were extracted in a 2-mL Eppendorf tube (Schoenenbuch, Switzerland). To each sample (homogenized tissues or body fluids), 50 μL IS mixture (concentrations are given in the Electronic supplementary material (ESM) Table S1) and 10 μL spiking solution were added. The first LLE was performed by adding 250 μL of 0.2 M phosphate buffer (pH 7.4) and 0.8 mL of butyl acetate/ethyl acetate (BuAc/EtAc; 1:1, v/v). The mixture was shaken for 10 min and centrifuged (5 min, 10,000 rpm), and 750 μL supernatant were transferred into an autosampler vial. To the remaining sample, 100 μL of 0.1 M NaOH was added (pH 13.5) and the second LLE was performed by adding another 0.8 mL of BuAc/EtAc (1:1, v/v), shaking, and centrifugation as described above. Eight hundred microliters of supernatant was added to the first extract, and after addition of 25 μL formic acid (20 %, v/v), evaporated to dryness under a gentle stream of nitrogen at 40 °C. The sample was reconstituted in 60 μL mobile phase (eluent A/B 90:10 (v/v), see below) and analyzed by LC-MS/MS.

LC-MS/MS settings

The analysis was performed using a Thermo Fisher Ultimate 3000 UHPLC system (Thermo Fisher, San Jose, California, USA) coupled to an Sciex 5500 QTrap linear ion-trap quadrupole mass spectrometer (Sciex, Darmstadt/Germany). The LC settings were as follows: Phenomenex (Aschaffenburg, Germany) Synergy Polar RP column (100 × 2.0 mm, 2.5 μm), gradient elution with 10 mM ammonium formate buffer in water containing 0.1 % (v/v) formic acid (pH 3.5, eluent A), and acetonitrile containing 0.1 % (v/v) formic acid (eluent B). Start conditions were 10 % B for 1 min with a flow rate of 0.5 mL/min, 1–10 min to 25 % B and 0.6 mL/min, 10–17 min concave upward to 95 % B, hold at 95 % B for 0.5 min, at 17.5–18.5 min re-equilibrating to 10 % B and 0.5 mL/min. Injection volume was 10 μL.

The Turbo V ion source equipped with a stainless steel electrode (100 μm internal diameter) was operated in positive electrospray ionization (ESI) mode with the following MS conditions: gas 1, nitrogen (50 psi); gas 2, nitrogen (60 psi); ion spray voltage, 5500 V; ion-source temperature, 450 °C; curtain gas, nitrogen (30 psi); and collision gas, medium. The MS was operated in the advanced scheduled multiple reaction monitoring (MRM) mode using three transitions for each analyte. The third transition corresponded to the 13C isotope of the first transition. The mass spectrometer settings for MRM 3 were identical to those of MRM 1, except the Q1 and Q3 masses were set 1 Da higher. For each analyte transition, an intensity threshold was defined. For the ISs, 1 MRM transition was used. The MS settings for each analyte are given in ESM Table S2. The MS was controlled by Analyst® 1.6.2 software (Sciex, Darmstadt/Germany). Quantitation was performed with MultiQuant® 2.1.1 software (Sciex, Darmstadt/Germany).

Method validation

Preparation of calibration samples

Methanolic or acetonitrilic solutions (1 mg/mL) for each analyte were used as stock solutions for calibration spiking solutions. Working solutions (0.1 and 0.01 mg/mL) were prepared by dilution from each stock solution in methanol or acetonitrile, depending on the solvent delivered in. For antidepressants and neuroleptics (ADN), two spiking solutions in methanol (d0-mix 1 ADN MeOH and d0-mix 2 ADN MeOH) and two spiking solutions in acetonitrile (d0-mix 1 ADN ACN and d0-mix 2 ADN ACN) for calibration standards were prepared by mixing appropriate amounts of the corresponding stock or working solutions as described previously [14]. For all other analytes, two spiking solutions in methanol (d0-mix 1 rest MeOH and d0-mix 2 rest MeOH) were prepared.

Different amounts of the d0-mixes 1 or 2 were diluted to a total volume of 5 mL methanol or acetonitrile to give the final calibration spiking solutions (Cal 1-13 ADN in methanol and Cal 1-13 ADN in acetonitrile and Cal 1-13 rest in methanol) in a final concentration two times higher than the aimed target blood or tissue concentration.

The final calibration samples were prepared without matrix; 10 μL of each calibration spiking solution was mixed with IS and 25 μL formic acid (20 %, v/v) in an autosampler vial, dried at 40 °C under nitrogen and reconstituted in 60 μL eluent A and B (90:10, v/v). The final calibration ranges are given in ESM Table S3 for each analyte.

Preparation of quality control samples

Stock, working, and spiking solutions were prepared as described above from different stock solutions. Different amounts of either methanolic or acetonitrilic stock or working solutions were mixed to the quality control (QC) spiking solution (low, med, high, 13C in methanol or acetonitrile), in a final concentration two times higher than the corresponding blood/tissue concentrations. The concentration of QC low was defined between calibrators 1 and 2, QC med between calibrators 5 and 6, QC high between calibrators 9 and 10, and QC 13C between calibrators 12 and 13. QC samples were prepared from 20 μL analyte-free body fluid or 20 mg analyte-free body tissue and 10 μL of the corresponding fortifying solution in methanol and 10 μL of the corresponding fortifying solution in acetonitrile. The final QC concentrations are given in ESM Table S3 for each analyte.

Selectivity and crosstalk

Five blank matrix samples from different sources were analyzed for peaks interfering with the detection of analytes or IS for FB, HB, CSF, urine, muscle, liver, kidney, lung, spleen, brain, and AT. A methanolic solution of the IS mixture was analyzed to check for appropriate IS purity and presence of native analytes. A methanolic solution of each analyte was injected separately at concentrations of 1 or 10 μg/mL and checked for interfering peaks. Analytes which showed interfering peaks were then tested at their QC high and QC 13C concentration.

Calibration model

Calibrators 1–13 were divided into two partly overlapping calibrations, conventional 12C calibration with calibrators 1–10 and isotope (13C) calibration with calibrators 8–13. In the 12C calibration, MRM 1 (12C isotope) was used as quantifier; in the 13C calibration, MRM 3 (13C isotope of MRM 1) was used as quantifier.

Calibration curves (single measurement per level at four validation days) were prepared with each batch of validation samples. The regression lines for both calibrations 12C and 13C were calculated using a simple linear model with 1/x weighting. Requirements were back-calculated calibrator concentration of ±30 % of target and sufficient accuracy (±30 % as described below for bias and precision) of the analyzed QC samples. Only for analytes that did not fulfill these criteria, additionally linear 1/x 2, quadratic or semi-logarithmic quadratic weighted 1/x, and 1/x 2 least-square regression model were tested. From those, a final calibration model was chosen, which yielded the lowest bias and precision data over the four validation days.

Matrix effects and extraction efficiency

FB, HB, CSF, urine, muscle, liver, kidney, lung, spleen, AT, and brain tissue from five different postmortem cases were used for the matrix-effect (ME) and extraction-efficiency (EE) experiments at QC med concentration. The experiments were performed according to Matuszewski et al. [15]. Briefly, three sets of samples (X, Y, Z) were prepared. Group X was spiked with IS and QC med spiking solution before extraction, while group Y was spiked after extraction. Group Z consisted of five QC med samples without matrix. EE and ME were calculated using Eqs. 1 and 2, respectively.

$$ \%\mathrm{E}\mathrm{E}=\frac{X}{Y}*100 $$
(1)
$$ \%\mathrm{ME}=\frac{Y}{Z}*100 $$
(2)

Where X = analyte peak area in group X; Y = analyte peak area in group Y; and Z = analyte peak area in group Z.

Accuracy and precision

QC samples (low, high, 13C) were analyzed according to the procedures described above in duplicates on each of 4 days. Bias was calculated as the percent deviation of the mean calculated concentration at each concentration level from the corresponding theoretical concentration. Intra-day (RSDR) and inter-day (RSDT) precisions were calculated as relative standard deviation (RSD) according to [16].

Limits

The lower limits of quantitation (LOQ) were defined as the concentrations of the first calibrator fulfilling the criteria of signal to noise 1:10 determined via peak heights. The limits of detection (LOD) were not systematically investigated.

Comparison of accuracy between 12C and 13C calibration

QC low was only calculated using 12C calibration, QC 13C was only calculated using 13C calibration. QC high accuracy was calculated in both calibrations and was compared by Bland-Altman plots. For that, the average concentration of QC high calculated via 12C and 13C calibration over 4 days (see “Accuracy and precision”) was plotted versus the difference percentage between 12C and 13C calibrations.

Results and discussion

Although forensic toxicologists are aware of the PMR effect, it is still under-explored. To provide better insight into time-dependent PMR processes in blood and especially in different postmortem tissues, biopsy samples should be collected at admission of the corpse at the mortuary and again during autopsy and compared for a novel study. Reliable conclusion of results can only be drawn if the applied analytical methodologies can be shown to be precise enough to be able to address concentration changes as actual postmortem changes compared with analytical precision. Therefore, the aim of the present study was the development and validation of a rather simple and applicable LC-MS/MS method in 11 different postmortem specimens, with special focus on acceptable intra-day precision.

Sample preparation

Collection of solid tissue samples at more than one time point without opening of the corpse is only possible using biopsy sampling. Quantitation of drugs in postmortem biopsies has not been published in the field of forensic toxicology yet. Generally, only few publications exist about quantitation of small molecules in tissue biopsies. In these studies, not only was sample preparation started with mechanical homogenization in most of the cases [6, 17] but also enzymatic and chemical digestion was described [5, 18]. As digestion techniques were rather time consuming and may lead to stability problems for some analytes, mechanical homogenization was chosen for the present study. Due to small sample sizes of maximum 20 mg body tissue collected during biopsy or 20 μL body fluid, only one extract could be generated from one sample. Therefore, a rather unspecific sample preparation was required allowing simultaneous extraction of a large number of analytes. In forensic toxicology, sample preparation of postmortem matrices consists either of a PP, LLE, or SPE step in most of the cases [1921]. LLE or SPE are more time consuming compared with the faster and rather unspecific PP but can reduce ME more efficiently [9, 22]. Therefore, a compromise had to be found allowing simultaneous extraction of a multitude of forensically relevant compounds with a high variety in physical-chemical properties but with still acceptable ME in postmortem matrices. In preliminary experiments, PP and different SPE and LLE procedures were compared in terms of EE and ME (data not shown). LLE with BuAc/EtAc (1:1, v v) at physiological and alkaline pH turned out to be the best compromise. Due to the high sensitivity of the 5500 QTrap instrument, 20 mg of body tissue or 20 μL of body fluid were sufficient to quantitate low therapeutic concentrations for the majority of the analytes even if LLE was not the optimal extraction procedure as, e.g., for morphine.

Calibration optimization

Usually, it is recommended to perform calibration in the matrix type of the analyzed sample. Due to the high number of investigated matrices in one case and the fact that blank postmortem tissue samples are difficult to obtain in sufficient quantity, it was impossible to generate calibration curves for all 11 matrices. For practical reasons, a simple solvent calibration was chosen for all matrices. It has to be mentioned that the major focus of the presented method should be the precise comparison between different time points in one sample matrix instead of high accuracy between different matrices. Therefore, although being aware of the fact that a solvent calibration may lead to accuracy problems, it was a reasonable approach to compare concentration changes within one type of matrix.

Another major challenge in quantifying so many compounds with different therapeutic and/or toxic concentration ranges in different matrices in only one analytical run is to find calibration models which can handle the huge dynamic range without source or detector saturation of the MS. Some neuroleptics such as flupentixol have therapeutic blood concentration ranges in the low microgram-per-liter range while, e.g., trazodone has therapeutic blood levels up to 2500 μg/L. Starting the analysis, it is usually unknown whether therapeutic or even toxic levels several times higher than expected from recommended therapeutic blood concentrations are present. Furthermore, large concentration differences can occur between the different matrices. CSF contained mostly very low concentrations, whereas, e.g., liver and lung usually contained concentrations up to 100 to 200 times higher than those in the blood [2, 20, 23]. As biopsy samples could be analyzed only once, the calibration range had to cover an extreme dynamic range. Trying to calibrate in such a wide range led to MS saturation problems. In a first step, alternative MRM transitions with lower intensities were chosen as quantifiers. Additionally, the declustering potentials were modified to be in a less optimal range, to further lower the signal intensities. However, although the use of alternative MRM transitions and modification of the declustering potentials lead to less saturation in higher concentrations, low therapeutic concentrations could not be analyzed anymore. In a second approach, a third MRM transition was introduced corresponding to the 13C isotope of MRM 1. As 13C makes up only about 1.1 % of natural carbon, this signal was expected to be less intense and less prone to saturation allowing a linear calibration even in highest concentrations. As exemplarily shown for risperidone in Fig. 1, the sole use of 12C calibration showed strong saturation effects at higher concentrations (A), while dividing the whole calibration range in MRM 1 (12C) for lower (B) and MRM 3 (13C) for higher concentrations (C) resulted in linear calibration models over the whole respective range. In mass spectrometry, 13C isotopes have various applications. They are used to label target molecules for example in metabolomics or proteomics [24] or are used to produce labeled internal standards [25]. Naturally and unnaturally occurring 12C/13C ratios are determined in environmental, food, medicinal, or forensic chemistry using isotope ratio mass spectrometry [26]. The general validity of this calibration approach was further investigated and proven during the final validation process as described below.

Fig. 1
figure 1

Risperidone calibration over the whole concentration range showing strong saturation effects in higher concentrations (a). Division of the total calibration curve in MRM 1 (12C) for lower (b) and MRM 3 (13C) for higher concentrations (c) resulted in linear calibration models over the whole concentration range. QC low was only calculated using 12C calibration, QC 13C only using 13C calibration. QC high could be calculated with both calibrations

LC-MS/MS optimization

The final MRM chromatogram of the 83 analytes (MRM 1) is given in Fig. 2. The presence of the 83 analytes was successfully screened for by mass spectrometry with selected MRM for each analyte using two MRM transitions for 12C and an additional MRM 3 for 13C. Only norfluoxetine showed only one fragment in MRM 1 and 3. Generally, guidelines recommend the use of two qualifying ion fragments in addition to the quantifier. In the presented method, MRM 3 was exclusively a quantifier. The addition of another qualifier ion would have led to increased cycle times and consequently to a loss of data points. In real cases, analyte identity is already confirmed prior to quantitation, as heart blood is screened using a different LC-MS/MS screening method including enhanced product ion scans and comparison of the mass spectra with an in-house library. Therefore, monitoring of one qualifier should be sufficient in the quantitation method. Even monitoring of over 80 compounds with three MRM transitions each was only possible using a new, advanced scheduled MRM mode in Analyst® 1.6.2 software. After introducing a certain intensity threshold for each analyte, MRM 2 and 3 were only measured as long as MRM 1 stayed above the defined intensity threshold. Additionally, the scheduled MRM windows were narrowed for the later part of the chromatogram where many analytes were co-eluting. Combining intensity thresholds for each analyte and variable scheduled MRM windows, a minimum of 8 to 10 data points per peak was achieved for the majority of the analytes. As expected, baseline separation of over 80 analytes is impossible within reasonable runtimes. Especially in the last 2 min, several analytes are still co-eluting but could be sufficiently differentiated by mass spectrometry with exception of fluoxetine/methadone and 13C-clomipramine/chlorprothixene. Some analyte pairs like codeine-d3/13C-dihydrocodeine and venlafaxine-d6/7-aminoflunitrazepam were overlapping only at high concentrations.

Fig. 2
figure 2

LC-MS/MS chromatogram of the 83 analytes (MRM 1)

Besides interference in MRM transitions possibly leading to wrong identification, co-elution of two or more compounds might result in ion suppression or ion enhancement between various analytes, especially when using ESI. Such effects, when present in the analyte mixture of calibration samples, but not in authentic samples would result in falsely increased or decreased quantitation results [27]. However, as comparison of analyte concentrations at different time points in the very same case would still be possible in a relative manner, ion suppression and enhancement of co-eluting analytes should play a minor role and was therefore not further investigated.

Method validation

Validation was performed based on national and international guidelines [16, 28] with some modifications as described in the respective sections. Validation of a huge number of 83 analytes in 11 different matrices required these modifications and since special focus for a future PMR study was on an acceptable intra-day precision, some enlargements of acceptance limits for other parameters were acceptable. Even then, it is clear that criteria could not be met for all analytes in every respective matrix. Still, overall results of validation were surprisingly good.

Selectivity and crosstalk

No interfering peaks were detected in the blank matrices of FB, HB, and CSF. Peaks at the retention time of diazepam (MRM 1–3), EME (MRM 1), MPH (MRM 1), tramadol (MRM 1), and venlafaxine (MRM 1) were detected in some of the blank tissue matrices but in negligible amounts below the lowest calibrator. The IS mixture showed small peaks for clomipramine, norclomipramine, EDDP, HC, MPH, mirtazapine and pethidine, which all had areas far below calibrator 1. However, it has to be mentioned, that all matrices used for this experiment were from authentic cases and had been assigned as “blank” samples based on a routinely performed blood/urine screening. Very low concentrations of some analytes irrelevant for the particular case might have been present and might be the reason for the observed interferences.

Due to the extremely broad calibration range, injection of some analytes at their QC 13C concentrations lead to peak broadening and deterioration of separation from adjacent compounds. For example, DHC and morphine influenced codeine-d3, oxymorphone, and HM. However, the interference peak area of codeine-d3 made up only 2 % of its regular IS area and was therefore considered as irrelevant. Oxymorphone and HM were only partly influenced and therefore insignificant for routine analysis. 7-aminoflunitrazepam was not baseline separated from venlafaxine-d6 anymore, but comparing the IS areas of the low and high calibrators, the area of venlafaxine-d6 was not influenced to a significant extent. Clomipramine lead to interfering peaks with the quantifier and qualifier of chlorprothixene and norclomipramine at a concentration between calibrators 2 and 3. Injection of clomipramine at its QC high concentration did show the same effect, but concentrations were below the lowest calibrator. On the other hand, chlorprothixene or norclomipramine did not influence clomipramine. As the interferences only appeared in the highest calibrators and the resulting change in peak areas was only small, the accuracy of the calibrators should not be affected to a relevant extent. Problems would arise, if a sample contained high clomipramine concentrations together with low chlorprothixene or norclomipramine concentrations. In such a case, the chlorprothixene and norclomipramine concentrations would have to be interpreted with caution or a separate calibration for each analyte had been necessary. Taking into consideration that such a combination of analytes should be very seldom, results for selectivity and crosstalk were vastly satisfying and in addition, crosstalk was not observed for all other compounds, even in the highest concentrations tested.

Calibration model

If possible, the simplest linear regression model with 1/x or 1/x 2 weighting was used. Only analytes that did not fulfill the required criteria using linear regression models were additionally tested for quadratic regression models. The final regression models, weighting factors, and the respective ISs are given in ESM Table S3. QC low was only calculated using 12C calibration, while QC 13C was only calculated using 13C calibration. QC high accuracy was calculated using both calibrations. From the 83 analytes, in the 12C calibration only ten analytes and in the 13C calibration only five analytes did not fulfill the criteria for linear calibration and were therefore fitted to a second-order model to finally meet the criteria. In total, five analytes did not fulfill a back-calculated calibrator concentration of ±30 % in the 12C calibration and the 13C calibration in all regression models, what may have been caused by the lack of an appropriate IS.

Matrix effects and extraction efficiency

Based on international validation guidelines, EE and ME studies are usually performed at two concentration levels, low and high [16, 28]. Due to the high number of analytes and different postmortem matrices, the ME and EE experiments were only performed at QC med. ME data and RSD of all analytes in all matrices are summarized in ESM Table S4. Briefly, an ion suppression greater than 25 % in all matrices was observed for EME, norfluoxetine, flupentixol, hydromorphone, oxymorphone, and sertraline, while agomelatine, amitriptyline, nortryptiline, aripiprazole, chlorprothixene, clomipramine, norclomipramine, clonazepam, norclozapine, diazepam, nordazepam, duloxetine, flunitrazepam, fluoxetine, desalkylflurazepam, haloperidol, nitrazepam, paroxetine, and trimipramine exhibited ion suppression only in tissues but not in liquid matrices. Thirty-eight analytes (46 %) were affected by ion suppression exclusively in brain and/or AT. Overall, matrices did not cause ion suppression or enhancement in the same extent. Ion suppression could be observed for only 6 analytes in CSF, for 8 analytes each in FB and HB, and for 12 analytes in urine. CSF was therefore the matrix causing the least ME, followed by blood and urine. Muscle, lung, spleen, and kidney were similarly affected with 22 to 27 suppressed analytes. Worse results could be observed in liver with 36 suppressed analytes and brain and AT with 74 and 80 suppressed analytes, respectively. Brain and AT are the matrices with the highest lipid content, which might be the cause of the extensive ion suppression. The fact that the body fluids showed less ME compared with the solid tissues supports the assumption that lipophilic matrix components are the main cause of ion suppression [29, 30]. Additionally, many analytes which only elute at a higher organic solvent percentage (RT >13 min; where lipophilic matrix is known to elute) were affected by ion suppression in the solid tissues. The ME of EME, morphine, oxymorphone, HM, and amphetamine may be explained by the early elution at an RT less than 2 min that probably falls together with salts from the matrices. In these cases, body fluids and solid tissues were affected to the same extent. Ion enhancement greater than 25 % was observed for 7-aminoclonazepam, 7-aminoflunitrazepam, OH-flurazepam, levomepromazine, olanzapine, perazine, pipamperone, promazine, risperidone, and temazepam in different matrices (ESM Table S4). Olanzapine is known to be unstable and susceptible to oxidation in biological matrices [31, 32]. Levomepromazine, perazine, promazine, and promethazine are phenothiazines known to be easily oxidized to sulfoxides [33]. This could be the reason, why these compounds did not pass the specifications. The thioxanthenes chlorprothixene and flupentixol are structurally related to the phenothiazines and might also be affected by oxidation. Interestingly, thioxanthenes were prone to strong ion suppression in contrast to phenothiazines, which suffered from strong ion enhancement. As the thioxanthenes eluted closely together at high retention times, the ion suppression may have been caused by lipophilic matrix components such as fatty acids. As it was expected from postmortem tissues, many compounds exhibited ion suppression or enhancement, but for most analytes these effects were reproducible (RSD <15 %) and therefore tolerable, as long as the method was sensitive enough. As given in ESM Table S4, only four analytes in body fluids, eight analytes in muscle, and only one analyte in AT did not pass these criteria. In contrast, 23 to 45 analytes in spleen, lung, kidney, liver, and brain were outside these criteria. Use of an IS—usually applied in routine analysis—could further compensate these deviations in all matrices as shown in summary for all analytes and tissues in Table 1. Briefly, in body fluids only olanzapine was still above the set criteria of 15 % and in body tissues only bupropion, flupentixol, EDDP, olanzapine, opipramol, oxymorphone, perazine, sertraline, ODMT, and ODMV still exceeded the criteria. The mentioned analytes that did not benefit from IS compensation, were analytes without an appropriate IS with comparable structure or RT.

Table 1 Matrix effects (ME) and relative standard deviation (RSD) of 83 analytes in 11 different matrices (n = 5 each) calculated as analyte/IS ratio

Determination of EE was done comparing blank tissue matrix spiked before and after extraction. Of course, this may not reflect the true recovery of substances from authentic tissue samples, where the substances may not be dissolved to the same extent as after spiking them onto the matrix. However, recovery from authentic samples could not be determined for over 80 analytes in several tissues due to unavailability of sufficient reference material and for practical reasons. Therefore, the approach in the present experiment seemed to be the most appropriate. A summary of the EE for all analytes in all matrices is given in ESM Table S5. Generally, EE was between 70 and 90 % for the majority of the analytes in the different matrices. EE below 50 % in all matrices was only observed for BEC and morphine. In blood, CSF, urine, kidney, muscle, lung, and AT, 3 to 10 analytes, and in liver, brain, and spleen, 16 to 24 analytes had EEs below 50 %. Especially, considering the fact that a solvent calibration was used, low EE from matrix should result in bad accuracy. However, as already described for ME, usage of IS may have been compensated for bad EE as shown in ESM Table S6. With IS compensation, only nortryptiline, bupropione, norclozapine, flupentixol, levomepromazine, EDDP, nitrazepam, olanzapine, opipramol, oxymorphone, perazine, promazine, promethazine, and sertraline had recoveries below 80 %. Standard deviation above 15 % indicating unreproducible EE in more than one matrix was only seen for nortryptiline, bromazepam, bupropion, norclozapine, nordoxepine, flupentixol, OH-flurazepam, HC, HM, levomepromazine, mCPP, EDDP, olanzapine, opipramol, oxymorphone, perazine, promazine, promethazine, sertraline, ODMT, and ODMV. Overall, ME and EE were comparable with typical multi-analyte methods.

Accuracy and precision

In forensic toxicology, national and international guidelines exist, which recommend various validation procedures [16, 28]. If these full validation recommendations exceed the possibilities of a laboratory in terms of time and cost, simplified partial validations can be performed [28]. Due to the high number of matrices, accuracy and precision experiments were performed in a modified validation procedure generating data over 4 instead of 8 days. FB and HB were handled as one matrix named blood in general for accuracy and precision experiments. Results of the accuracy and precision experiments are given in Table 2. As the aim of the future study is to assess concentration differences between 2 and 3 time points—intra-day precision was considered the most important point to focus on in the method validation process. Therefore, bias, intra-day, and inter-day precision will be discussed separately in the following.

Table 2 Intra-day (RSDR) precision, inter-day (RSDT) precision, and bias data of 83 analytes in the different matrices

As a solvent calibration was used, bias was expected to some extent, e.g., for those analytes with low recoveries and/or not having an appropriate IS. However, even if the accuracy did not fulfill the acceptance criteria, concentration differences can still be assessed relatively. Nevertheless, 38 analytes fulfilled the criteria of ±30 % in all matrices. The best results were obtained in body fluids, with 75–81 % of the analytes passing the accuracy requirements. The body tissues comparatively caused bias in more analytes with only 66–71 % of the analytes passing the criteria. Compounds with bad accuracy results (detected in more than two matrices) were nortryptiline, bupropion, OH-bupropion, chlorprothixene, clomipramine, norclozapine, cocaine, BEC, EME, codeine, DHC, nordoxepine, flupentixol, haloperidol, HC, HM, levomepromazine, mCPP, MDEA, EDDP, MPH, morphine, olanzapine, oxycodone, oxymorphone, perazine, pipamperone, promazine, promethazine, quetiapine, sertraline, ODMT, and ODMV. Out of the 51 analytes with an appropriate deuterated IS, 78 % showed good accuracy results. Of the 32 analytes without appropriate IS, only 22 % had acceptable accuracy results.

The intra-day precision criteria were fulfilled by the majority of the analytes (Table 2). There was no apparent difference in precision between the matrices observed. In all matrices, 83–93 % of the analytes had an intra-day precision below 20 % for all QC samples, with exception of brain where only 75 % of the analytes passed the criteria. As not always the same analytes failed, only 49 analytes (59 %) passed the criteria of the intra-day experiment in all ten matrices. Analytes failing in more than two matrices were bupropion, chlorprothixene, norclomipramine, flupentixol, flurazepam, HC, levomepromazine, mCPP, EDDP, olanzapine, opipramol, perazine, pipamperone, promazine, promethazine, sertraline, and ODMV. All those analytes except mCPP and pipamperone did not have an IS with similar structure or retention time. In case of mCPP and pipamperone, the RTs of the IS mCPP-d8 and pipamperone-d10 shifted due to the rather high deuteration grade, which might have been responsible for the insufficient compensation. PMR for substances with low precision in a certain matrix will have to be interpreted with care—if reasonable interpretation will be possible at all—considering high analytical variations.

The inter-day precision seemed to be more matrix dependent than intra-day precision. In liver, brain, and AT, 60–64 % of the analytes showed a precision below 20 %. In the other matrices, 76–83 % of the analytes passed the criteria (Table 2). Also in this experiment, the intersection was smaller with only 43 % of the analytes passing the criteria in all ten matrices. Additionally to the above-mentioned analytes, nortryptiline, OH-bupropion, clomipramine, norclozapine, nordoxepine, OH-flurazepam, lorazepam, MDA, nitrazepam, oxymorphone, quetiapine, ODMT, and trazodone failed the criteria in more than two matrices. Again, mainly analytes with no appropriate IS were affected.

In summary, all accuracy and precision parameters were fulfilled for 51 compounds in blood, 59 analytes in urine, 58 in CSF, 54 in muscle, 52 in kidney and spleen, 43 in liver, 42 in AT, and 41 in brain tissue. Twenty-eight analytes fulfilled all parameters in all ten matrices. Taking into consideration that so many analytes had been assessed in so many different matrices using a calibration in solvent instead of matrix and a calibration extension by 13C isotopes, the overall performance in terms of accuracy and precision was perfectly acceptable.

Limits

The LOQs are listed in ESM Table S3. Although sample amount was a lot smaller compared with other quantitation methods used in forensic toxicology, LODs were in a comparable range [14, 34]. As the detection of lower analyte concentrations was not of interest for the future application of the method, determination of LOD was resigned.

Comparison of 12C and 13C calibrations

In order to achieve a sufficient dynamic calibration range, two calibrations were combined: first, a conventional calibration with MRM 1 (12C) for low to moderate concentrations, and second, an additional isotope calibration with a MRM 3 (13C) for higher concentrations (Fig. 1). Applicability of this approach was tested by comparing accuracy of QC high determined using both calibrations. As exemplified for four matrices in Fig. 3, a difference in the calculated concentrations of QC high between the conventional and the isotope calibration greater than 20 % was observed for five to seven analytes in each matrix. From the mentioned exceptions, levomepromazine, perazine, and promazine were above 20 % in all and clomipramine, flupentixol, olanzapine, oxymorphone, and ODMV in more than two matrices. Although it seems as if the same analytes have similar errors in the different matrices (Fig. 3), no systematic findings were observed. For example, ODMV showed an error of −30 % in blood, −20 % in CSF, and +25 % in liver but was within ±20 % in all other matrices. Only levomepromazine, promazine, and oxymorphone showed a deviation in a similar range, so that changes in the calibration models might have improved the closeness between 12C and 13C calibrations. However, this might have worsened the bias for QC low or 13C. Generally, these analytes also showed problems in the other validation experiments, which may explain the deviations. Especially for these analytes, all samples of one case should be calculated either within 12C or 13C calibration. In general, combining 12C and 13C calibration successfully expanded the dynamic range without observing saturation effects.

Fig. 3
figure 3

Differences (%) of the average QC high concentration calculated with 12C calibration and 13C calibration is depicted as Bland-Altman Plot of all analytes exemplarily in blood (a), muscle (b), CSF (c), and liver (d). Dotted lines represent deviation of ±20 %

Conclusions

An LC-MS/MS multi-analyte method for the simultaneous quantitation of 83 analytes in 11 postmortem matrices extracted from low milligram biopsy samples and small volumes of body fluids was developed with the aim to investigate concentration changes in postmortem samples taken at different time points. An extremely wide calibration range was achieved using a combination of a conventional (12C) calibration for low to moderate concentrations and a 13C isotope calibration of the same MRM for high concentrations. Calibration was successfully performed with external calibrators in solvent without matrix. The method was validated in terms of selectivity, crosstalk, ME, EE, accuracy, precision, and calibration model. Intra-day precision was the most important point in the validation process, considering the aim of concentration comparison between different time points in one matrix. Depending on the matrix, 75–93 % of the analytes fulfilled the criteria for a future study on time-dependent PMR, which is surprisingly good for such a universal method.