Introduction

Intravitreal therapy is used to treat ocular diseases in the posterior segments of the eye, for example, age-related macular degeneration and diabetic macular edema. Intravitreal drug administration must be used, because topical eye drop treatment does not lead to adequate drug concentrations in the posterior target sites (retina, choroid, vitreous). The drug is injected directly into the vitreous, which is a hydrophilic gel consisting of a network of well separated hyaluronic acid and collagen type II fibers, allowing diffusion of drug molecules, even macromolecules, to the retina and choroid. One advantage of the intravitreal injections is that it generally leads to minimal systemic exposure of the drug. However, due to the invasive nature of the injections, other ways of delivering drugs to the posterior eye segment have been attempted, but the topical, systemic and subconjunctival delivery routes generally fail in reaching effective concentrations in the vitreous body (1).

After an injection into the vitreous there are two routes by which the molecule can be eliminated: 1) anteriorly via flows of aqueous humor and uveal blood circulation; 2) posteriorly through the blood-retinal barrier to retinal and choroidal blood circulation (2). The blood-retinal barrier is composed of the retinal pigment epithelium and the tight walls of retinal capillary. The cells in the blood-retinal barrier express transporters, which in principle may affect drug elimination from the vitreous, but the effect of transporters on drug elimination in vivo is still unclear (3). The posterior route presents a large surface area surrounding the vitreous, but the cells in the blood-retinal barrier form a tight layer, allowing efficient elimination of lipophilic compounds. The anterior route, in contrast, is limited by the small area that is available for diffusion from the vitreous into the posterior and anterior chambers. In general, elimination through the posterior route takes place rapidly, whereas drug elimination via the anterior route is slower (2).

Intravitreal drugs can be administered as injections or implants that release drug over prolonged times. In all cases, drug concentrations in vitreous are dependent on the rate of drug elimination from the vitreous. Slow intravitreal drug elimination is desirable, since it will prolong the duration of drug action after intravitreal injections and minimizes the required drug loading in the controlled release formulations. The intravitreal half-life of a compound must be determined in vivo by injecting the compound into the vitreous of an animal, usually rabbit, and measuring the concentration of the compound in the vitreous at certain time-points after the injection. Pharmacokinetic evaluation is typically carried out using many time points (often 5–10) and several replicates of each time point is needed (more than 5). Thus, for plotting a reliable time-concentration curve at least 20 animals are needed, which makes this method unacceptable for drug screening. There is a need for an alternative method that would enable screening of molecules before in vivo studies and also reduce and refine animal experiments related to development of ocular drugs and intravitreal drug delivery systems.

A computational method like quantitative structure-property relationships (QSPR) modeling is optimal for rapid virtual evaluation and pre-selection of compounds. Even though QSPR models are widely used to study the relationship between physicochemical properties and the pharmacokinetic properties of molecules, surprisingly few attempts have been made to predict the intravitreal half-life of injected drugs. Some prior studies have established that compounds with a high molecular weight are preferably eliminated by the anterior route (4), and that molecular weight (MW) and lipophilicity do have an influence on the intravitreal half-life (57). However, some studies (5,6) have been done using small sets of structurally and chemically similar compounds and, therefore, those models are not broadly applicable to diverse compounds. Durairaj and co-workers used a large set of molecules, including small molecules and macromolecules, with a MW range from 32 to 149 000 Da, but explored only a limited set of variables, including only molecular weight, lipophilicity and solubility variables to derive QSPR models by multiple regression (7).

We aimed to build a computational model for virtual prediction of drug elimination from the vitreous. The models are focused on compounds with molecular weight below 1500 Da, thus excluding macromolecules. The use of a lower, narrower MW range will make the model more applicable for prediction of small molecules. We have made a careful literature search and collected experimental data on the intravitreal half-lives of 47 compounds in albino and pigmented rabbits. The data set contains molecules with diverse structural and chemical features. The relationship between the intravitreal half-life and 33 physicochemical descriptors was determined by multivariate analysis. Simple mathematical equations were derived, enabling reliable prediction of the intravitreal half-life for a compound without any experimental data.

Materials and Methods

Compound Data Set

A data set of 47 compounds (Table I) was collected from an an extensive literature search for intravitreal injections (5,849). The half-lives of the compounds included in the dataset have been measured in either albino or pigmented rabbits. A subset was constructed from this dataset, containing 39 compounds with experimental data collected from only albino rabbits (Table I). An average value was calculated for compounds with more than one reference. The model is focused on molecules with molecular weight less than 1500 Da. In addition, we only included data from experiments with compounds that were dissolved in water or buffer solution and injected into normal, healthy eyes. When the intravitreal half-life had not been reported for a compound, the half-life was calculated from the reported concentrations. Compounds for which the intravitreal concentrations had been measured during a time-span that was less than two half-lives, were excluded from the data set. Data that was deemed of insufficient quality due to for instance large standard deviations or an irregular time-concentration curve was also excluded. For two-compartment models, the half-life of the terminal phase was used, except for moxifloxacin and fluorouracil, where the alpha phase was dominant.

Table I Calculated Descriptors Used in the Final Models and Experimentally Determined Intravitreal Half-Life of Compounds in the Data Sets

Molecular Descriptors

The chemical structure of the compounds in the data set were retrieved from ACD/Dictionary (50), or the the PubChem database (http://pubchem.ncbi.nlm.nih.gov/). The ACDLabs software package version 12 was used to calculate the molecular descriptors for these compounds (50). A total of 33 descriptors were chosen for this study; pKa, LogD at pH 2, 5.5, 6.5, 7, 7.4 and 10, LogP, MW, PSA (polar surface area), FRB (freely rotatable bonds), HD (hydrogen bond donors), HA (hydrogen bond acceptors), Htot (HD + HA), molar refractivity, molar volume, parachor, index of refraction, surface tension, density, polarizability, C ratio, N ratio, NO ratio, hetero ratio, halogen ratio, number of rings and number of aromatic, 3-, 4-, 5- and 6-membered rings.

Multivariate Data Analysis

Before data analysis, an external data set of seven compounds was randomly extracted from the collected data set of 47 compounds from pigmented and albino rabbits (mixed set), while six compounds were randomly extracted from the albino data set of 39 compounds (Table I). The compounds in the external data sets were not used for model building or training. A logarithmic transformation was performed for those variables that had a broad range or were not equally distributed over the range. A principal component analysis (PCA) including all molecular descriptors was calculated to analyze the diversity of the data set. The logarithm of intravitreal half-life (logt1/2) was correlated to the molecular descriptors by Partial Least Squares (PLS) analysis using Simca-P (version 10.5) (51). Based on the distribution of the compounds in the PCA plot training sets of 30 and 25 compounds and internal test sets of ten compounds and eight compounds were generated, respectively, for the mixed and albino data set. Only the compounds in the training set were used to calculate the models. Both cross-validation and prediction of the intravitreal half-life of the compounds in the internal tests sets were used to evaluate the predictive capability of the models. For cross-validation, the training set is divided into seven groups and the intravitreal half-life for the compounds in each group is predicted using the data in the other groups. The sum of squared errors between the actual and predicted data is calculated and converted into the Q2 value (cross validated R2). The half-life for the compounds in the internal test sets were predicted for further evaluation of the models. Ultimately, the final models were assessed by predicting the half-life of the external data sets that were extracted before data analysis (Table I).

The statistical significance of the models was assessed by Y-scrambling, i.e. the model is fitted to scrambled half-life values and R2Y (equivalent to the R2 coefficient) and Q2 are calculated. After repeating the process 50 times, the results are plotted with the R2Y values on the Y-axis and the Q2 values on the X-axis and regression lines are fitted to the points. For statistically valid models, the intercepts should be below 0.3 and 0.05 for R2Y and Q2, respectively.

Results

Literature Search

Initially, we found 83 reported intravitreal half-lives for compounds by literature searches. Twenty of these half-lives were excluded from the data set, since they did not fulfill the criteria as described in the Materials and Methods section. Among the compounds that were left out from the data set is triamcinolone acetonide, which is a well-studied and widely used drug in ocular therapy. However, it was excluded from the study as it is poorly soluble and precipitates in the vitreous after injection, thus, its long half-life is due to the slow dissolution of the drug particles in the vitreous. The remaining 63 reported half-lives were used to build the mixed data set of 47 compounds, containing data from both pigmented and albino rabbits, as well as the subset of 39 compounds from albino rabbits (Table I). An average value was calculated for those compounds with more than one study that reported a half-life. The terminal half-life, in the case of two-compartment models, was used as the elimination half-life in all cases except two: moxifloxacin and fluorouracil. For moxifloxacin the half-life of the first phase was used, since it is the dominant phase, while an effective half-life was calculated for fluorouracil, taking into account both the first phase (alpha) and the terminal phase (beta), (t1/2, eff = (AUCalpha/AUCtot)x t1/2alpha + (AUCbeta/AUCtot)x t1/2beta)(52). For fluorouracil, the first phase accounted for approximately one third of the total elimination based on the AUC of the alpha phase. Therefore, the effective half-life deviates from terminal half-life and is a more representative indicator for drug concentration decay in the vitreous. The unusual elimination profile of moxifloxacin might be due to its ability to strongly bind to melanin (53).

Molecular Diversity of the Compounds

The compounds in the mixed data set cover a broad range of structurally and chemically diverse molecules, with an intravitreal half-life from 0.9 to 62.3 h. The molecular weights of the compounds varied from 32 to 1449 Da and the LogD7.4 from -10.60 to 2.68. All calculated descriptors were used to perform PCA for both mixed and albino dataset (Fig. 1). The mixed data set resulted in a PCA model with five principal components that explained 81% of the variance in the data set, while the PCA model for the albino data set contained four principal components explaining 80% of the variance. The PCA score plots of the two first principal components, explaining 44% and 18%, respectively, of the variance in the mixed data set and 42% and 18%, respectively, of the variance in the albino data set, are shown in Fig. 1. Methanol lies outside the elliptic 95% tolerance volume of both the mixed and albino PCA models, but it was not excluded from model building, since 5% of the data set is allowed outside the tolerance volume. The compounds in the data sets were divided into training sets (30 and 25 compounds in the mixed and albino training sets, respectively) and internal test sets (10 and 8 compounds in the mixed and albino test sets, respectively) based on their distribution in the PCA score plot (Fig. 1, Table I).

Fig. 1
figure 1

PCA score plot of (a) the 47 compounds in the mixed data set and (b) the 39 compounds in the albino data set. The black triangles represent compounds in the training set and the unfilled triangles the compounds in the internal test set. The ellipse depicts the 95% tolerance volume based on hotelling T2 (0.05).

PLS Analysis of Training Set Compounds and Evaluation of Derived Models with Internal Test Set Compounds

The PLS model using all the 33 calculated descriptors and the 30 compounds of the mixed training set had a R2Y = 0.65 and Q2 = 0.59, and the PLS model with all descriptors for the albino set had similar values, R2Y = 0.69 and Q2 = 0.55 (Table II). In order to construct a simpler model, we excluded those descriptors that according to SIMCAs variable influence on projection (VIP) function were deemed the least influential (Fig. 2). A VIP value above 1 specifies the descriptor as more influential than average in explaining the modeled response. A threshold of 1.2 and 1.1 was chosen for the mixed and the albino data set, respectively, leading to models with 14 descriptors for both data sets. The models had improved statistics with higher R2 and Q2 values than the initial models (Table II). The descriptors were related to lipophilicity (LogD at pH 5.5, 6.5, 7 and 7.4), hydrogen bonding (Htot, HD, HA and PSA) and mass (MW, MV, polarizability, molar refractivity and parachor), as well as the amount of rotatable bonds, defined with the variable FRB. Subsequently, we excluded the descriptors with high correlation coefficients (> 0.9) with Htot, LogD7.4 or MW. The Htot variable was chosen since it combined the information from the HD and HA descriptors, which it is highly correlated to. The MW descriptor was selected since it had the highest VIP value of the correlated coefficients, while the LogD7.4 descriptor was chosen for its physiological relevance. This resulted in a four-variable model for both data sets, including the descriptors Htot, MW, LogD7.4 and FRB. Both models had good statistics, with R2Y = 0.64, Q2 = 0.61 for the mixed model and R2Y = 0.73, Q2 = 0.71 in the albino model. The predictability of these models was evaluated on the internal test set, with a Q2int value of 0.61 and 0.67 for the mixed and albino model, respectively. Exclusion of the FRB descriptor in the mixed model improved the statistics and the prediction of the internal test set (Table II), while creating a three-variable model by removing any of the other descriptors led to markedly decreased predictability of the internal test set (data not shown). Similarly, exclusion of the MW descriptor in the albino data set improved the model statistics and predictability of the internal test set, while creating a three-variable model by exclusion of any of the other variables did not have a beneficial impact. Furthermore, a two-variable model with good statistics and better predictability for the internal data set was obtained for the mixed data sets using only the Htot and LogD7.4 descriptors (Table II). A two-variable model for the albino data using the same descriptors resulted in a model with a lower Q2 value, but good predictability of the internal test set.

Table II PLS Models Obtained from the Mixed Data Set Compounds and the Albino Dataset Compounds
Fig. 2
figure 2

VIP plots, with confidence intervals, from PLS analysis using all variables and the training set compounds for (a) the mixed data set and (b) the albino data set. A horizontal line has been drawn in (b) to indicate the 1.1 VIP threshold.

The two-variable model was chosen for the mixed data set, while the three-variable model was chosen for the albino data set for further evaluation. The final models are described by the following equations:

$$ \begin{array}{*{20}{c}} {\log {{{\text{t}}}_{{{{1} \left/ {2} \right.},{\text{mixed}}}}} = - 0.046 - 0.051\left( {\log {{{\text{D}}}_{{7.4}}}} \right) + 0.640\left( {{\text{Log}}{{{\text{H}}}_{{{\text{tot}}}}}} \right)} \\ {\log {{{\text{t}}}_{{{{1} \left/ {2} \right.},{\text{albino}}}}} = - 0.164 - 0.032\left( {\log {{{\text{D}}}_{{7.4}}}} \right) + 0.435\left( {{\text{Log}}{{{\text{H}}}_{{{\text{tot}}}}}} \right) + 0.461\left( {{\text{Log}}\left( {{\text{FRB}} + 1} \right)} \right)} \\ \end{array} $$

The statistical significance of the predictive capability of these final models was evaluated with the validate function in Simca-P. The R2Y and Q2 Y-intercepts were -0.06 and -0.16 for the mixed model and -0.02 and -0.18 for the albino model, respectively. Both are well below the upper limits of a statistically valid model.

Evaluation of the Final Models on an External Test Set

The final models were used to predict the intravitreal half-life of the compounds in the external data sets that were removed prior to model building (Table I). Both models predicted the compounds in the external data sets with good accuracy, with an external Q2 value of 0.81 and 0.88, respectively, for the mixed and albino models.

Discussion

Two models for the prediction of intravitreal half-life were obtained, one specifically for albino rabbits and one that is suitable both for albino and pigmented rabbits. Due to the limited amount of data available for pigmented rabbits (only 13 compounds), we did not attempt to build a specific model for pigmented rabbits. Both of the final models had good statistical values, with the R2Y and Q2 of the albino higher (both values 0.75) compared to those of the mixed set model (0.66 and 0.64, respectively). The ability of the models to predict the intravitreal half-life was verified both on internal test sets as well as on randomly selected external test sets (Fig. 3). The models performed well, as the difference in observed and predicted half-life for the most poorly predicted compound in the mixed test sets was 2.3-fold (cidofovir in the external test set) and all in all, only four other compounds had more than a two-fold difference to the experimental value. In the albino test sets, clarithromycin was predicted to have a 3.4-fold longer half-life than observed experimentally, but on the other hand, there was only one other compound, kanamycin, with a two-fold error in its predicted half-life, while all the other compounds had less than two-fold error-prediction.

Fig. 3
figure 3

Predicted versus observed log t1/2 values based on the final models. Prediction of the mixed data set compounds in the internal test set (a) and external test set (b) and the albino data set compounds in the internal test set (c) and external test set (d). A diagonal line has been drawn in R2Y =1 to facilitate interpretation and the dashed lines represent a 3-fold prediction range.

The final models contained only two or three variables, Htot and LogD7.4 in the model for the mixed data set, and Htot, LogD7.4 and FRB for the albino set. Both models had better R2Y and Q2 values than the models using all 33 calculated variables and also slightly improved values compared to the four-variable models (Table II). Furthermore, the variables that are included in the final models are easily interpretable. A higher LogD7.4 value leads to a shorter half-life, which is likely due to increased permeability of the cell membranes in the blood-retinal barrier, while increasing the amount of hydrogen bond donors and acceptors results in longer half-lives, as polar compounds have more difficulties to permeate membranes. According to the albino model, flexible compounds that have a high amount of rotatable bonds have longer half-lives.

Previous studies have linked intravitreal half-life to MW and lipophilicity (47). In our study, MW was also an influential descriptor with a relatively high VIP value, but still its exclusion improved the predictivity of the models. One possible explanation is that our model focuses on low MW compounds and, thus, does not highlight the effect MW on the half-life, and a more clear effect might only be observed when significantly larger compounds, like macromolecules with MW of several thousand Da are included. Macromolecules, like proteins, have a slower rate of diffusion in the vitreous and they are predominantly eliminated from the vitreous through the anterior route. These compounds have significantly longer half-lives than molecules that can permeate the blood-retinal barrier. However, we have only included compounds that have MW <1500 Da in our study, and therefore this size effect may not be evident in our data set. Another likely explanation could be that the number of rotatable bonds as well as hydrogen bonds generally tends to increase with the size of molecules and, therefore, the successful use of MW in describing membrane permeability is due to its correlation with increased polarity and flexibility (54), which in our model is described by the Htot and FRB variables. However, Veber and co-workers found that the number of FRB influences membrane permeability of compounds independently of MW (54).

Some compounds in the data set are known substrates for transporters in the retinal pigment epithelium cells. For example, carbenicillin and quinidine have been identified as substrates for P-glycoprotein (P-gp) (15,28,45). Interestingly, these compounds are not outliers in our model, which might be due to the relatively small increase (1.5–2.5-fold) in intravitreal half-life of these compounds when a P-gp inhibitor is coadministered. For the same reason the model is quite accurate in predicting the half-lives of these transporter substrates, as the effect of active transport apparently falls within the 3-fold prediction error of the model for the compounds in this study. The lack of influence of active transport on the estimation of half-lives suggests that active transport is not highly significant for intravitreal elimination for the compounds included in this study.

The two models that were generated in this study are quite similar, which either can point to similar pharmacokinetics in the eye of albino and pigmented rabbits, or to a bias of albino rabbits due to the majority of data acquired from albino rabbits. However, based on a comparison of the half-lives of six compounds for which the half-life had been reported both in albino and pigmented rabbits (Table III), there is no clear trend for longer elimination times in either albino or pigmented, suggesting similar rate of vitreal elimination. It is well known that many drugs are able to bind to the pigmentation (e.g. ganciclovir (10,3032), gentamicin (33,34) and grepafloxacin (35) in our data set) and this may affect drug distribution in the tissues, such as retinal pigment epithelium. This is not reflected in the elimination rate constants, because even though some fraction of the intravitreally injected drug would bind to the melanin granules in the retinal pigment epithelium, it will mostly enter plasma that acts as a sink rather than distributing back to the vitreous.

Table III Comparison of Half-Lives Obtained in Both Albino and Pigmented Rabbits

The QSPR model presented is expected to be a useful tool in ocular drug research. The model will provide an early virtual estimate of the drug elimination rate. Thus, it will be easy to estimate drug concentration profiles after intravitreal injections at different dosing levels. With straightforward modeling approaches concentration predictions can be extended to administration of suspensions and drug delivery systems. In that case, drug dissolution or release rate can be used as input rate and the QSPR based elimination as output rate. There are some limitations in the use of the QSPR model. Firstly, we recommend that the models are used to predict the intravitreal half-life for compounds in the same chemical space as the test compounds, i.e. the descriptor values should be in the same range as the descriptor values in the model. Secondly, the half-lives in the human eye may not be the same as in the rabbit eye. The dimensions of the human eye are slightly bigger than in the rabbit eye, but the physiological factors of drug elimination (blood retina barrier, aqueous humor flow, blood aqueous humor barrier) are similar. Geometrical scaling methods can be used to obtain estimates for elimination in human eyes.

Conclusions

We have built a QSPR model for the prediction of intravitreal half-life of drug-like compounds. The model encompasses a broad chemical space and 33 in silico descriptors were used to build an optimal model. Overall, the QSPR model will be a useful tool in ocular drug discovery and development as it will help to reduce and refine animal experiments.