INTRODUCTION

Food exerts complicated effect on pharmacokinetic and/or pharmacodynamic profiles of a drug. In this study, the key physicochemical parameters that contribute to the food effect were identified by statistical analysis of the effect of food on the extent of drug absorption. A prediction model was also established using logistic regression.

The effect of food on oral absorption may be attributed to specific mechanism for an individual compound. For example, food may interfere with specific transporters that are involved in absorption of a specific compound (1). Food may increase the splanchnic blood flow rate and increase the bioavailability of compounds that undergo extensive first pass effect (2). Certain compounds, e.g., tetracycline and digoxin, can also chelate to specific components of food leading to reduced bioavailability (3,4). These specific food effects may be difficult to predict based on physicochemical descriptors of compounds.

However, food also exerts general physiological changes and its effect on drug absorption may be statistically predicted for compounds with similar physicochemical properties. With food intake, the gastric pH increases initially to about pH 6, followed by decrease in pH value to 2 in approximately 1 h because of increased acid secretion (5,6). The bile secretion also increases with food intake, which may enhance the solubility of lipophilic compounds (6). It was suggested that the permeability may be reduced in general for poorly permeable compound because food impedes the diffusion of the compound to the mucosal surface (7). Larger volume after food intake, which leads to lower concentration of dissolved compound, also reduces the amount absorbed in a definite period of time. Additionally, general binding between drug and food components and incorporation of drug in the micelles of food may also impede the access of the drug to the epithelium surface and hence absorption. It was also well known that food causes delayed gastric emptying leading to delayed tmax and lower Cmax (8,9), although these parameters were not analyzed in this study.

Fleisher et al. summarized the general trend of food effect on drug absorption based on BCS classification (7). BCS class I compounds are likely to have no food effect; BCS class II compounds are likely to have positive effect (increased absorption); BCS class III compounds are likely to have negative effect (decreased absorption) and there is no clear trend for BCS class IV compounds (7). For the 92 entries used in this study, the relationship between BCS classification and the effect of food on the extent of absorption (area under the curve of the time–plasma concentration curve, AUC) was summarized (Table I). If using the criteria that BCS class 1 shows no food effect, BCS classes 2 and 4 show positive food effect and BCS class 3 shows negative food effect as suggested by Fleisher et. al, 67% of entries were predicted into the right category. This result suggests that it is possible to build a statistical model to predict the effect of food on extent of drug absorption based on physicochemical properties. In the current study, more descriptors are used to statistically predict the food effect to improve accuracy for prediction.

Table I Relationship Between Food Effect on the Extent of Absorption (AUC) and BCS Classification of Compounds

MATERIALS AND METHODS

Clinical data of food effect of 90 marketed compounds (92 total entries including hydrochlrothiazide at two different doses with different food effect and free base and mesylate salt of saquinavir) were collected from the literature and Physician Desk Reference (10) (Tables II and III). Hydrochlrothiazide at lower dose is listed in Table II as the dose is less than the maximum absorbable dose (MAD) while the higher dose entry is listed in Table III with the dose > MAD. The food effect on the extent of absorption was separated into three categories: positive food effect (statistically significant increase in AUC with food, 31 entries), negative food effect (statistically significant decrease in AUC with food, 24 entries) and no food effect (no statistically significant difference in AUC, 37 entries).

Table II List of Compounds with Maximum Absorbable Dose (MAD) Greater than Clinical Dose and Their Physicochemical Properties
Table III List of Compounds with Maximum Absorbable Dose (MAD) Less than Clinical Dose and Their Physicochemical Properties

The following physicochemical parameters were collected for these compounds: clinical dose used in the published food effect study, solubility at pH 7, calculated log D at pH 7 using ACD software (11), total surface area (calculated using ACD software), polar surface area (calculated using ACD software), number of hydrogen bond donor (calculated using ACD software), number of hydrogen bond acceptor (calculated using ACD software). The solubility values were collected from the literature (10,1215). For compounds with solubility reported based on USP definition but without specific value, the lower value of the range defined in the USP was chosen (12). From these parameters, additional parameters were derived. The dose number was calculated as ratio of dose to the solubility at pH 7 (16), which was further assigned into three categories (dose number category): denoted as 1 if dose number <50, 2 if 50 ≤ dose number ≤ 250, and 3 if dose number >250. The Log D was also categorized into three categories (LogD category): denoted as 1 if Log D < −1, 2 if −1 ≤ Log D ≤ 1 and 3 if log D > 1. The maximum absorbable dose (MAD) was calculated using the following equation (17):

$$ {\text{MAD}} = {\text{Solubility}}{\left( {{{\text{mg}}} \mathord{\left/ {\vphantom {{{\text{mg}}} {{\text{mL}}}}} \right. \kern-\nulldelimiterspace} {{\text{mL}}}\;{\text{at}}\;{\text{pH}}\;7} \right)} \times 250{\left( {{\text{mL}}} \right)} \times 180{\left( {\min } \right)} \times {\text{absorption}}\;{\text{rate}}\;{\text{constant}}{\left( {\min ^{{ - 1}} } \right)} $$
(1)

where the absorption rate constants of 0.0006, 0.003 and 0.0134 were used for poorly permeable (Caco-2 permeability < 20 nm/sec), moderately permeable (Caco-2 permeability between 20 and 100 nm/sec) and highly permeable (Caco-2 permeability > 100 nm/sec) compounds, respectively (18). The absorption rate constants were estimated based on the correlation between Caco-2 and human permeability of 20 compounds (18). For compounds without reported Caco-2 permeability value, the MAD value was determined from the reported value of percent absorbed in human pharmacokinetics studies. For example, tamsulosin hydrochloride was reported to be 100% absorbed at the clinical dose, the MAD to dose ratio is therefore assigned as 1 (10). This estimation may not be accurate in terms of absolute value of MAD but it is accurate in terms of the relationship between MAD and dose. Since compounds are categorized in the model based on whether the MAD is ≥ or < the dose, the relationship between MAD and dose is important for the model rather than the absolute value of MAD. Since Caco-2 permeability data were not available for all compounds in the study, Caco-2 permeability was not used as an individual parameter for prediction. In the above equation, we used 180 min as the small intestine residence time in human instead of 270 min used in the original paper (5).

All compounds were first separated into two categories: MAD < clinical dose and MAD ≥ clinical dose. The clinical dose is the dose studied for food effect in the published references. For compounds with MAD ≥ clinical dose, complete absorption is achievable. Therefore, theoretically these compounds may not show positive food effect and were fitted into negative food effect and no food effect categories, using logistic regression. Compounds with MAD < clinical dose were fitted into positive food effect, negative food effect and no food effect categories by logistic regression with cross validation using SAS™ 8.2 (19).

Logistic regression is often used to investigate the relationship between binary response (negative and no food effect in the case of MAD ≥ clinical dose) or ordinal responses (negative, no food effect and positive food effect in the case of MAD < clinical dose) and a set of explanatory variables (physicochemical variables in this study).

For the binary response model, the linear logistic model has the following form for the probability (p) of a response given the observed x:

$$ \log it{\left( p \right)} \equiv \log {\left( {\frac{p} {{1 - p}}} \right)} = \alpha + \beta \prime x $$
(2)

Where α is the intercept parameter and β is the vector of slope parameters, which gives the probability of negative food effect as:

$$ p = \frac{1} {{1 + \exp {\left( { - {\left( {\alpha + \beta x} \right)}} \right)}}} $$
(3)

and the probability of no food effect is 1 − p.

For the ordinal response model, the three possible responses of Y were denoted by 1, 2, and 3 (Y = 1 if negative food effect, Y = 2 if no food effect, and Y = 3 if positive food effect) and x was the vector of explanatory variable. The linear logistic regression model with common slopes was fitted using LOGISTIC procedures in SAS™ 8.2 as the following:

$$ \log it{\left( p \right)} \equiv \log {\left( {\frac{p} {{1 - p}}} \right)} = \alpha _{i} + \beta \prime x,i = 1,2 $$
(4)

where \( p = \Pr {\left( {Y \leqslant i\left| x \right.} \right)} \)is the cumulative probability, which means when i = 2, \( p = \Pr {\left( {Y \leqslant 2\left| x \right.} \right)} = \Pr {\left( {Y = 1\left| x \right.} \right)} + \Pr {\left( {Y = 2\left| x \right.} \right)} \); α1, α2 are two intercept parameters and β is the vector of slope parameters.

This model gives the probabilities of Y being 1 (p 1), 2 (p 2), and 3 (p 3) of given x as:

$$ p_{1} = \frac{1} {{1 + \exp {\left( { - {\left( {\alpha _{1} + \beta x} \right)}} \right)}}}, $$
(5)
$$ p_{2} = \frac{1} {{1 + \exp {\left( { - {\left( {\alpha _{2} + \beta x} \right)}} \right)}}} - p_{1} , $$
(6)

and

$$ p_{3} = 1 - p_{1} - p_{2} $$
(7)

The score chi-square statistics was used to ensure the adequacy of common slope assumption in the ordinal response model.

During model-building, four variable selection methods: forward selection, backward elimination, stepwise selection, and best subset selection were used to select the explanatory variables based on the 0.05 significance level.

RESULTS

From logistic regression, compounds with MAD < clinical dose, the probabilities of positive, negative and no food effect were determined using the following equations:

Probability of negative food effect (p 1):

$$ p_{1} = \frac{1} {{1 + \exp {\left\{ { - {\left[ { - 1.28 - 1.04 * {\left( {{\text{dose number category}}} \right)}} \right]}} \right\}}}} $$
(8)

Probability of positive food effect (p 2):

$$ p_{2} = \frac{1} {{1 + \exp {\left\{ { - {\left[ {1.53 - 1.04 * {\left( {{\text{dose number category}}} \right)}} \right]}} \right\}}}} - p_{1} $$
(9)

Probability of no food effect (p 3):

$$ p_{3} = 1 - p_{1} - p_{2} $$
(10)

In the above model, the score chi-square test has a p-value of 0.20, indicating adequacy of common slope assumption for the cumulative probability in the logistic model.

For compounds with MAD ≥ clinical dose, the probabilities of negative and no food effect were determined by following equations:

Probability of negative food effect (p 1):

$$ p_{1} = \frac{1} {{1 + \exp {\left\{ { - {\left[ {6.66 - 2.97 * {\left( {{\text{dose number category}}} \right)} - 1.56 * {\left( {{\text{Log D category}}} \right)}} \right]}} \right\}}}} $$
(11)

Probability of no food effect (p 2):

$$ p_{2} = 1 - p_{1} $$
(12)

The predicted food effect of each compound was assigned based on the highest probability of food effect category. The prediction results are provided in Tables IV and V and are summarized in Table VI. Overall, 74 out of 92 entries (80%) were predicted into the correct category. Compounds with positive food effect can be distinguished more easily as 97% of compounds with positive effect (30 out of 31 entries) were predicted into correct category while 79% (19 out of 24 entries) and 68% (25 out of 37 entries) compounds with negative and no food effect were predicted into correct category, respectively (Table VI). For a given prediction, the probability of correct prediction is 83, 73 and 83% for positive, negative and no food effect, respectively (Table VII). This statistical method provides more accurate prediction than the estimation based on BCS classification with 67% of the entries being predicted correctly (Table I). It should be mentioned that the accuracy of the current model is based on cross-validation, which may not hold true for an external dataset. The parameters used in present prediction are similar to those used to categorize BCS classes. The success of prediction using the current model further validates the importance of parameters used in BCS classification for oral absorption. It’s interesting to note that parameters, such as polar surface area, total surface area, percent polar surface area, number of hydrogen bond donors and acceptors, do not contribute significantly to the food effect based on the statistical analysis. The model also indicates that food effect is sensitive to the category of the properties rather than the individual value, suggesting that compounds in the same category possess similar properties for food effect.

Table IV Prediction Results of Compounds with Maximum Absorbable Dose (MAD) Greater than Clinical Dose
Table V Prediction Results of Compounds with Maximum Absorbable Dose (MAD) Less than Clinical Dose
Table VI Summary of Prediction of Food Effect Results Using Present Model
Table VII Probability of Correct Prediction of Food Effect Using Present Model

The current model was able to predict all compounds with true positive food effect in the database of this study into the correct category except ribavarin. The model also predicted some compounds with no food effect into positive food effect category. It was more difficult to discriminate compounds with no food effect from compounds with negative food effect. The limitation of the model is described in the Discussion section.

DISCUSSION

In the statistical model, when MAD was greater than clinical dose (complete absorption, Table III), compounds with dose number category 1 (dose number <50) and log D category 1 or 2 (log D <1) were predicted to have negative food effect. Compounds with dose number category 1 and log D category 3 (log D > 1) were predicted to show no food effect. Compounds with dose number category 2 or 3 (dose number >50) and any log D category were also predicted to show no food effect. These results suggest that among compounds that can be completely absorbed, only compounds that are highly soluble (dose number <50) and hydrophilic (Log D <1) are statistically prone to show negative food effect. It has been suggested that food may serve as a physical barrier for drug absorption, which may reduce absorption of compounds with narrow window of absorption (7). Those compounds that are highly soluble and hydrophilic may belong to the class of compounds with narrow window of absorption and therefore show negative food effect.

However, the model failed to predict correctly the compounds with narrow window of absorption that are not highly soluble and hydrophilic. Based on the model, indinavir (log D = 2.66) was predicted to show positive food effect because of its low solubility and high log D value. However, it showed negative food effect in the clinical study and was attributed to its narrow window of absorption (20), which was not statistically predicted based on descriptors used in the current model. On the other hand, some highly soluble and hydrophilic compounds may not exhibit narrow window of absorption, e.g., lamivudine. This compound was predicted to have negative food effect but showed no food effect clinically (71). The root cause of these incorrect predictions is the inaccurate correlation between log D and intestinal permeability. Fexofenadine (negative food effect) was predicted to have no food effect because of high log D value. However, it has poor intestinal permeability (21). In contrast, zidovudine showed no food effect because of moderate permeability despite low log D value (22). It was reported that the correct classification of human permeability based on Log D is 87% for 16 drugs (23). A better prediction in the present model may be achieved if human intestinal permeability data are available.

For compounds with MAD less than clinical dose (incomplete absorption, Table III) with dose number category 2 or 3 (dose number >50), the model predicted to have positive food effect. Therefore, for compounds that cannot be completely absorbed because of limited solubility or dissolution rate, food enhanced their oral absorption statistically. This solubilization effect of food can be explained physiologically. Food, especially high fat meal which is often used in the food effect study, introduces higher concentration of lipids and bile salts, leading to higher solubility and dissolution rate of lipophilic compounds, e.g., halofantrine (24). It should also be noted that, the dose number, in addition being an indicator of solubility dose relationship, may also reflect other properties of a compound that are susceptable to food effect. It was reported that food may in general inhibit both influx and efflux transporters (25). For compounds with high dose number, they are more likely to be substrates of efflux transporters and therefore more likely to show positive food effect. These compounds are also more likely to be absorbed through lymphatic uptake and the postprandial increase in lymphatic flow will cause the positive food effect (26). On the other hand, compounds with low dose numbers are more likely to be the substrate of influx transporters causing negative food effect.

For compounds with incomplete absorption, if the dose number was less than 50, it was predicted to have no food effect. This result suggests that if incomplete absorption is caused by factors other than solubility, food generally exerts no effect on absorption. It is interesting to note that the food effect category is separated by dose number 50 rather than the value 250 used in BCS classification (27). The dose number 50 (the dose that can be solubilized with 50 mL of water at pH 7) was chosen arbitrary in this study to separate compounds that are highly soluble with respect to dose (dose number <50) from those that are moderately soluble (50 ≤ dose number ≤ 250). Indinavir is the only compound with MAD less than clinical dose in the entry showed negative food effect.

It is noted that all poorly soluble weak bases except indinavir in this database showed positive food effect, which suggests that initial higher gastric pH under fed condition does not impede the overall amount of weak bases dissolved as secretion of HCl with food reduces the gastric pH to around 2 within 60 min (5,6). Furthermore, the residence time of the drug in the stomach is longer with food, which may enhance the total amount dissolved. Slower entry to the intestine (delayed gastric emptying), lower duodenal pH and higher concentration of lipid and bile salts under fed conditions may also reduce the precipitation of weak bases in intestine leading to higher bioavailability.

The present model heavily relies on the accuracy of estimated MAD, which is determined by the accuracy of solubility and permeability. However, discrepancy may exist between in vitro estimated solubility and permeability and in vivo absorption. For some lipophilic compounds, the solubility in the intestinal fluid may be significantly higher than the aqueous solubility used in the current calculation, leading to underestimation of MAD values (28). The absorption rate used to calculate MAD is the average value. Some compounds may show higher absorption rate than the average value, which also leads to underestimation of MAD values. The absorption window of 3 h was used to calculate MAD, which may underestimate the MAD value of highly permeable compound with colonic absorption. Imiquimod and rofecoxib were predicted to have positive food effect but showed no food effect clinically. The estimated MADs of these two compounds based on the highest absorption rate suggested in the literature are much lower than the clinical dose because of very high dose number (18). However, it is reported that rofecoxib (29) and imiquimod (30) showed complete oral absorption in clinical study. The Cmax of these two compounds was also unaltered in the presence of food, suggesting lack of food effect on dissolution. The underestimation of MAD values of these 2 compounds lead to incorrect prediction of the food effect. The incorrect prediction for bicalutamide (predicted to have positive food effect but showed no effect clinically) is also caused by underestimated MAD value. Postpandial dosing of bicalutamide resulted in significant increase in Cmax although without effect on AUC (31). The higher Cmax may be caused by increased solubility with food. However due to high permeability, complete absorption was achieved leading to no food effect on AUC (31). It is also worth mentioning that the solubility and permeability values are collected from various sources and therefore may be associated with inaccuracy. We also only used solubility value at pH 7 and the pH effect on solubility was not considered in this study. However, since the model used categorized parameters instead of absolute values, the variability in the source data may be somewhat mitigated.

The prediction accuracy of current model is limited by the physicochemical parameters used in the model, and may not represent all factors contributing to the effect of food on AUC, such as effect on metabolism and specific chelation between food and drug. The current model also cannot predict food effect on the activity of a specific transporter responsible for the absorption of a particular compound as the current understanding of food effect on transporter is limited. Ribavirin is the only entry that showed positive food effect clinically but was not predicted correctly. It is reported that intestinal absorption of ribavirin is mediated by the concentrative Na+ nucleoside purine (CN1) transporter and can be saturated with increasing dose (1). Although the mechanism of food effect on ribavirin is unknown, the current model will not be able to predict its food effect if the activity of nucleoside purine (N1) transporter is altered by food.

The current model considers only the intrinsic properties of a compound and their relationship to the food effect. Formulations may significantly change the intrinsic property of a compound and therefore its food effect. Danazol, when dosed as conventional capsule showed significant positive food effect as predicted (32). However, no food effect was observed when danazol was dosed as an emulsion (32). Itraconazole is another example that the conventional capsule formulation showed positive food effect as predicted from the molecular properties (33) but the hydroxypropyl-β-cyclodextrin solution formulation showed negative food effect as the formulation changed the intrinsic property (solubility) of the compound (33).

Despite the limitation of the model, it successfully predicts the food effect of a compound with reasonable accuracy based on physicochemical properties. The model may be improved if additional parameters could be included when better understanding of effect of food on permeability and metabolism is achieved.

CONCLUSIONS

A statistical model is established to predict the food effect on extent of absorption based on solubility, permeability and dose of a compound. It was found that critical parameters for food effect prediction include maximum absorbable dose (MAD) category (a parameter that combines information of dose, solubility and permeability), category of dose number (a parameter that combines dose and solubility) and category of Log D (an indicator of permeability). For overall 92 entries, 80% were predicted into the correct category. 97, 79 and 68% of entries with positive, negative and no food effect, were predicted into respective correct categories. Given a compound, the probability of correct prediction is 83, 73 and 83% if it is predicted to show positive, negative or no food effect, respectively. Since all parameters used in the model can be estimated during the discovery stage, the model may be used to predict the possible food effect in early discovery.

The statistical analysis revealed that positive food effect on absorption is primarily caused by solubilization effect of food. Statistically, food causes negative effect on absorption because it interferes with absorption of compounds that are highly hydrophilic and probably with narrow window of absorption.