Introduction

Long-term reliability of implantable medical devices requires testing to assure low corrosion and metal ion release rates. Corrosion can reduce the structural integrity of the device and the release of metal ions has been associated with adverse immunological response (Ref 1-3). ASTM F 2129 is a test method used to assess the corrosion susceptibility of small metallic implants using cyclic potentiodynamic polarization (Ref 4). The United States Food and Drug Administration (FDA) typically requires medical-device manufacturers to provide data that will demonstrate that their device will have sufficient corrosion resistance during the expected in vivo service life which could be greater than 10 years. Frequently, medical-device manufacturers use ASTM F 2129 to satisfy this FDA requirement.

However, neither the ASTM F 2129 test method nor the FDA provides any guidance as to what constitutes an acceptance criterion for the polarization results or how many samples should be tested. ASTM F 2129 suggests comparison with a device already approved and on the market in the United States that is without any postmarket approval incidents of corrosion failures. This approach can, however, be quite expensive as it requires testing twice the number of experimental units (i.e., number of candidate devices and predicate devices); moreover, it presumes the existence of a suitable predicate device, i.e. a device that is implanted in a similar location in the body and manufactured from similar material and that the medical-device company has accurate knowledge regarding the predicate device’s in vivo corrosion resistance.

As discussed in our earlier paper on nitinol corrosion, NiTi appears to have very good corrosion resistance based on many patient-years of in vivo experience (Ref 5). Nevertheless, the development of both a statistically valid acceptance criterion for corrosion resistance and a method of evaluation would be of significant benefit to the medical-device community. In addition, there is a need for a protocol by which a medical-device manufacturer can periodically check the corrosion resistance of an approved medical-device to assure that the manufacturing process is still within specification and devices are being manufactured with adequate corrosion resistance, i.e. a need for a manufacturing lot acceptance criterion. This study is an attempt to establish such a methodology and is a continuation of work that was first reported at SMST 2007 in Japan (Ref 5).

Three general methodologies exist as possibilities for an acceptance criterion. The first is that the pitting or breakdown potential (Eb), a measure of resistance to localized corrosion in a specific environment, of an implantable device should have similar or better corrosion resistance as determined by ASTM F 2129 testing than approved devices currently on the market with no known corrosion problems. The electrochemical parameters discussed in this paper are shown in Fig. 1. These exemplar devices should ideally be manufactured from similar materials and be used in the same general location in the body as proposed for the current device to assure a similar in vivo exposure environment, i.e. exposure to blood, bile, urine, saliva, etc. As mentioned above, it is sometimes difficult, and expensive, to find such exemplars and there is little guidance on how many devices should be tested to confirm equivalent or better corrosion resistance. The second methodology that has been proposed is that the breakdown potential (Eb) of the device should be greater than some threshold value, independent of the material used for the implant (Ref 6-8). These implant alloys can have drastically different pitting resistance and it is somewhat arbitrary that a minimum Eb should be specified independent of the material and environment. It is generally agreed that if a NiTi medical-device can be made reliably with an Eb of greater than 0.600 V with respect to a saturated calomel electrode (SCE), then there is little doubt that these materials will be resistant to corrosion in vivo unless other conditions like fretting, galvanic effects, or crevice corrosion take place in vivo. The reality, however, is that many medical devices will not be able to achieve such high values due to material selection, material finish, design, and manufacturing limitations. The third methodology, which we further develop in this paper, is to use the difference between the breakdown potential (Eb) and the open-circuit or rest potential (Er). This gap (Eb − Er) is considered to be a better parameter to evaluate medical-device margins against pitting in vivo as has previously been discussed by Pound (Ref 9). The basic principle behind this approach is that if the in vivo Er is much lower than the pitting potential the device will have a large margin of safety against pitting or correspondingly, if Eb − Er is nearly zero or negative then the device will most likely pit when exposed to that environment. This is illustrated in Fig. 2. This Eb − Er parameter has been used frequently in the past to minimize the risk of pitting in many industries (Ref 9-13).

Fig. 1
figure 1

Definition of electrochemical parameters Eb, Er, and Eb − Er

Fig. 2
figure 2

Schematic illustrating the margin of safety against pitting

One problem with the Eb − Er criteria, as discussed by Rosenbloom and Corbett, is that there is little information regarding Er for in vivo conditions (Ref 8). Another issue is how Eb, Er, and ultimately Eb − Er vary during long-term in vivo exposures. The current ASTM F 2129 Standard states that devices are to be tested in their finished state. However, it is unlikely that the device, after exposure to in vivo conditions for several months will have the same resistance to pitting corrosion or even the same rest potential as a result of the surface film modification. Finally, there seems to be a wide variation in Eb values for NiTi depending on surface finish. Some NiTi:

  1. 1.

    will always pit below the minimum potential of 800 mV specified in ASTM F 2129;

  2. 2.

    will never undergo pitting below 800 mV; and

  3. 3.

    will infrequently breakdown.

Normal quality-control methodologies can be used for samples that always pit or never pit, but case 3, samples that only infrequently breakdown (datasets that contain censored data), are more difficult to treat statistically.

In an attempt to answer some of these questions, and to investigate whether long-term in vivo exposure might affect the margin of safety against localized corrosion, we report how long-term immersion in aerated and nonaerated phosphate buffered saline (PBS) will affect Eb, Er, and ultimately Eb − Er on a statistically significant number of samples with two different surface conditions: mechanically polished (MP) and electropolished (EP).

Experimental data were collected on the breakdown potential (Eb) of replicate surrogate devices (i.e., wire samples) and on the variation in rest potential (Er) with immersion time for individual units. Using these data, we conducted simulation studies to investigate the implications for device performance—specifically, the probability that the rest potential would exceed breakdown potential during the life of the device. The simulation results provided statistical support in defining a threshold value that would represent an acceptable margin between breakdown and rest potential (Eb − Er), as well as a correspondingly low probability of corrosion-induced failure during the lifetime of the device.

We then implemented a tolerance interval approach for determining, from standard testing of a candidate device, whether there is sufficient confidence that a high proportion of replicate units will attain the specified threshold value. Finally, we developed guidance for medical-device manufacturers’ quality-assurance programs in addressing the key practical question of how to determine the number of replicate units required for testing in order to establish an acceptable lower bound on the value of Eb − Er for manufactured devices. Developing this lower bound can be quite challenging for NiTi materials with good surface treatments since the vast majority of samples do not exhibit breakdown. We suggest a methodology to treat this type of highly censored data, i.e. few breakdowns with a large number of censored data, i.e. values greater than 800 mV SCE.

Materials and Methods

This investigation evaluated the pitting resistance of NiTi wire in two as-received surface conditions: MP and EP. The surface preparation of the material is proprietary to the material supplier and was not provided. The EP and MP NiTi used in the experiments came from different material lots.

The NiTi properties and surface characterization for the MP specimens that were purchased were described 55.9%Ni content superelastic wire with an Af temperature in the 5-15 °C temperature range with a bright finish described as mechanical polishing that was followed by an argon anneal. The EP specimens that were purchased were described as superelastic straight wire that conforms to ASTM F 2063 Specifications (Ref 14). The EP materials had a nominal composition of 55.8 wt.% Ni. This material had an Af temperature in the 5-18 °C range. Both the MP and EP wire had a diameter of 1.45 mm (0.057 in).

Auger analysis revealed surface oxide thicknesses of 28.6 ± 3.7 and 5.72 ± 0.79 nm for MP and EP wire, respectively (Ref 15). These film thicknesses were based on four measurements from two different locations on two different MP and EP NiTi wires. The thickness measurement was based on the equivalent sputter rate of SiO2 under identical conditions used for the NiTi. Oxide thickness measurements were calculated as the thickness where the oxygen intensity dropped to half of its maximum within the profile. Both oxide films on the MP and EP had a 2 wt.% or less Ni concentration at the surface (Ref 15). All electrochemical testing was done with 2.5-cm long wire. Both cut ends of the wire were coated with silicone sealant which resulted in a surface area of 1.15 cm2.

Figure 3 shows a representation of the size distribution of inclusions from longitudinal and transverse cross sections of the wire, and is similar for both materials used. The longitudinal sections indicate that the inclusions are present as stringers that broke up during the drawing process and are relatively strong and brittle compared to the surrounding matrix, as indicated by the fracturing and voids formed around these inclusions. We were unable to classify the inclusions with energy dispersive spectroscopy (EDS) but were able to observe that they are titanium rich. Infrequently, these inclusions intersect the wire surface as is shown in Fig. 4 for an EP specimen. Locating surface inclusions on the MP wire was much more difficult due to the rough surface and thick oxide layer.

Fig. 3
figure 3

Inclusions from metallurgical cross sections. Longitudinal (upper left) and transverse (upper right) of EP wire. The corresponding longitudinal and transverse metallographic sections from the MP wire are shown at the bottom left and right

Fig. 4
figure 4

Inclusions intersecting the surface. EP (left) and MP (right) in as-received condition

All pre-test immersions and electrochemical testing were performed in PBS (8.0 g L−1 NaCl, 0.2 g L−1 KCl, 1.15 g L−1 Na2H2PO4, and 0.2 g L−1 KH2PO4) at 37 ± 1 °C. The pH of the solutions was 7.4 ± 0.1 for all immersions and electrochemical testing. When samples were pre-exposed to aerated PBS, these solutions were aerated with laboratory air (79% N2, 21% O2, and 0.038% CO2). For deaerated tests, the PBS was sparged with ultrahigh-purity nitrogen for 30 mins prior to testing and then continuously throughout the test. The use of deaerated and aerated solutions was chosen to represent the widest range of oxidizing conditions to which an implantable medical-device might be subjected. Obviously, the deaerated PBS has a lower redox potential than would be expected to be found in vivo, and the aerated PBS is expected to provide higher redox potentials than conditions in vivo, except for perhaps implants in the airways or devices that are subjected to peroxides from an immunological response.

All tests were performed using Gamry computer-controlled potentiostats and Gamry DC-105 software. Long-term potential monitoring was performed in accordance with ISO 16429:2004 and the cyclic polarization tests were performed in accordance with ASTM F 2129-06 (Ref 4, 16). All potentials were measured with respect to a SCE. The potentiodynamic scan rate used was 0.167 mV/s and the potential increased until breakdown occurred or 1.1 V (SCE) was reached. All Er values reported are the corrosion potentials observed at the end of the pre-test immersions, i.e. in aerated or deaerated solutions, not the Er measured in the deaerated PBS used to determine Eb.

Results

We tested a total of 420 nitinol test specimens in order to get sufficient information regarding the probability of pitting in 130 mechanical polished and 290 EP NiTi samples. The pre-test exposure times to the deaerated and aerated environments were 1, 12, 60, 156, and 720 h for the MP surface finish and 1, 12, and 60 h for the EP samples. Then, 3-99 replicate specimens were tested at each pre-test immersion time. The results from our testing are summarized in Fig. 5-7 and Table 1. The mean and standard deviation for Eb for EP samples can only be calculated for the samples that pitted and therefore underestimate the true resistance of the material to pitting. As can be seen from Table 1, there are very clear differences between the corrosion resistance of the MP and EP samples. All of the MP samples exhibited pitting below 0.8 V (SCE) whereas only 13% of the EP samples pitted. Figure 5 illustrates that increasing oxygen concentration in the PBS increases Er and that Er also tends to increase with increasing pre-test exposure time in both deaerated and aerated PBS solutions. Er increases less rapidly with increasing pre-test exposure time for the EP surface condition. Eb for either the MP or EP surface condition is about the same and unaffected by oxygen concentration in the PBS as shown in Fig. 6. However, Eb does increase with increasing pre-test exposure time. The safety margin against pitting corrosion, Eb − Er, is lower in aerated environments primarily from the effect of oxygen on raising Er (Fig. 7).

Fig. 5
figure 5

Effect of pre-test immersion on rest potential Er

Table 1 Summary of test results
Fig. 6
figure 6

Effect of pre-test immersion on break down potential Eb

Fig. 7
figure 7

Effect of pre-test immersion on Eb − Er

Discussion

All the MP samples pitted, whereas only about 13% of the EP samples pitted indicating a significant effect of surface treatment. The dissolved oxygen concentration also has a very large influence, as expected on the open-circuit potential (Er). The MP wire had an oxide film about five times as thick as the EP samples and had significantly lower corrosion resistance as seen in Table 1. Clarke et al. showed that NiTi with a thicker oxide layer has a lower Eb value and a greater nickel release in both long-term and short-term experiments (Ref 1). Clarke et al. also found that after long-term immersion tests breakdown potentials increased for samples that exhibited lower values initially (Ref 1).

The effect of long-term pre-exposures of MP NiTi to 37 °C PBS clearly increases Eb; however, Eb increases faster for pre-test exposures to aerated solutions compared to deaerated solutions. This can be seen by comparing Eb values for various pre-test exposure immersion times for MP NiTi in Table 1 and Fig. 6. Clarke et al. found a similar trend in Eb for NiTi with a thick oxide (340 nm) exposed to 0.9 wt.% saline for up to 4 months although they do not report how Er values were affected (Ref 1). Our data show that both Er and Eb increased with increasing pre-immersion duration in deaerated PBS (Fig. 5 and 6), but the trends were not equitable as Eb − Er decreased with increasing immersion duration because Er increased faster than Eb.

The pre-test exposure of MP samples to aerated PBS had a more dramatic effect than the longest exposure to deaerated PBS with the Eb − Er values being about 300 mV lower which is primarily due to the very large increase of Er when exposed to aerated PBS.

The statistical distribution of Eb for stainless steels has previously been studied by Shibata and others (Ref 17-19). Shibata has shown that a normal distribution adequately describes the distribution of Eb for AISI 304 stainless steel tested in 3.5 wt.% NaCl solutions at 30 °C (Ref 17, 18). Nevertheless, Kowaka notes that pitting potentials are actually the lowest breakdown potential that is found only at a small area on a sample under test, and as such, a “weakest link” statistical distribution such as a Gumbel or Weibull distribution is generally expected to fit such data (Ref 18). Kowaka showed that the Weibull distribution also provided a good fit to Shibata’s stainless steel Eb data while at the same time being a “weakest link” type of statistical distribution. The Eb data from the MP and EP specimens exposed to deaerated PBS are shown in Fig. 8. Both datasets are well described by a three-parameter Weibull distribution.

Fig. 8
figure 8

Three-parameter Weibull distribution of Eb for MP (left) and EP (right) in deaerated PBS for 1 h

MP Modeling

Regression analysis of Eb − Er for MP NiTi data was performed to quantify the effects of aeration and immersion time. The effect of pre-test immersion exposure time was modeled as linear in the logarithm scale and separate regression analyses were performed on the aerated and deaerated data because of the greater variability of the deaerated data. The fitted regression equation for the deaerated data is as follows:

$$ E_{\text{b}} - E_{\text{r}} ({\text{V}}/{\text{SCE}}) = 0.604 - 0.0184\ln [t] $$

Ninety-one data points were used in this regression and the ANOVA results are reported in Table 2.

Table 2 Regression analysis results of Eb − Er for MP NiTi

Similarly for the aerated data the fitted regression equation is as follows:

$$ E_{\text{b}} - E_{\text{r}} ({\text{V}}/{\text{SCE}}) = 0.308 - 0.0053\ln [t] $$

Thirty-six data points were used in this regression and the ANOVA results are reported in Table 2.

Pre-test exposure to aerated environments has a large effect on the corrosion margin of safety primarily due to the large effect of aeration on Er as can be seen by the constant term dropping from 604 to 308 mV. Table 2 shows that the ln(t) factor is statistically significant for deaerated samples but not for aerated samples. However, the low R2 value for the deaerated samples indicates the presence of large unexplained variability. We suspect that this large variation is a result of the inherent randomness of the pitting process. The average Eb − Er value for the longest duration pre-test immersion (720 h) appears to violate this decreasing trend, but the average is calculated from only three observations and does not alter the statistical significance of the trend.

For deaerated samples, the baseline for Eb − Er is defined as material subjected to pre-test exposure for 1 h. The regression equation for deaerated samples can therefore be expressed as:

$$ E_{\text{b}} - E_{\text{r}} ({\text{V}}/{\text{SCE}}) = {\text{baseline}} + \hbox{pre-test immersion duration effect} $$

where baseline = 604 mV (SCE); this is the Eb − Er from the model for NiTi in the as-received condition (i.e., pre-test exposure to deaerated PBS for 1 h).

Pre-test immersion duration term = −0.0184ln[t]; the model predicts a decrease in Eb − Er as the pre-exposure immersion time increases:

  • 12 h = −30 mV

  • 60 h = −96 mV

  • 156 h = −119 mV

  • 720 h = −121 mV

  • 10-years = −209 mV

Sparging with air provides dissolved oxygen levels that are higher than those generally found in vivo and therefore the aeration effect here is exaggerated over that which would be expected when NiTi is exposed to arterial blood. Nevertheless, an aerated PBS solution better represents the in vivo dissolved oxygen concentration than a deaerated solution. The value provided above of −209 mV for the expected decrease in Eb − Er after 10-years exposure to deaerated solutions is an extreme extrapolation of the data and therefore may not accurately predict actual in vivo performance. However, if testing is done as we suggest, i.e. pre-exposure to aerated solutions, the inherent variability in Eb − Er is decreased and there is no need to account for the time the implant will be in service.

The probability that an MP sample will have a negative Eb − Er can be calculated using the probability distribution statistics shown in Table 3 where we have combined both the 1 and 12 hFootnote 1 and 60 and 156 hFootnote 2 deaerated data since there was no statistically significant difference between the 1 and 12 h or the 60 and 156 h data; however, the difference between the 1 and 12 h and the 60 and 156 h data was statistically significant.Footnote 3 We also calculated the probability that Eb − Er will be negative for MP NiTi for the combined 60 and 156 h immersions in deaerated PBS and the probability of negative Eb − Er for MP NiTi exposed to aerated PBS for all our aerated data (1-156 h data combined) since Eb − Er versus pre-test exposure time is relative insensitive to exposure duration in aerated environments.

Table 3 Estimated probabilities that Eb − Er < 0 for MP NiTi

Table 3 shows the results using several different types of distributions. A normal distribution is used since it has been previously used to describe Eb data for AISI 304 stainless steel in 3.5 wt.% saline (Ref 17-19). A Gumbel extreme value distribution has also been used since it theoretically should model the distribution of smallest values for multiple measurements taken in different locations of constant area (Ref 18). Among a set of candidate models that included the two- and three-parameter Weibull and lognormal distributions, the Gumbel extreme value distribution was found to be the best fit to the Eb − Er data for MP samples pre-exposed to deaerated PBS for 1 and 12 h (Fig. 9). The superiority of the Gumbel versus the normal model is reflected in the larger likelihood value of 65.9 in Table 3. This finding is also reflected in the more reasonable (higher) pitting probability predicted using the Gumbel distribution (6.43 × 10−5 versus 5.14 × 10−12). A three-parameter Weibull distribution was found to best characterize the longer-term deaerated immersion data (60-156 h) with an estimated pitting probability of zero versus a 6.66 × 10−4 probability if a normal distribution is assumed.

Fig. 9
figure 9

Distribution of Ep − Er with confidence bands for MP samples immersed in deaerated PBS for 1 and 12 h as Gumbel (left) and immersed in deaerated PBS for 60-156 h as three-parameter Weibull (right)

The three-parameter Weibull distribution was also found to best characterize the aerated Eb − Er data. This is not surprising since the two- and three-parameter Weibull distributions are typically among the probability models providing the best fit to minimum extreme value distributions such as Eb. The three-parameter Weibull model gives an estimated probability of pitting in vivo of zero whenever the threshold parameter—i.e. the minimum observable value of Eb − Er—is estimated to be positive. For example, the Eb − Er threshold is estimated at 149 mV (SCE) for the samples exposed to aerated solution and at 224 mV (SCE) for the deaerated exposures. Although both distributions would predict zero probability of pitting in vivo, the threshold values suggest that the specimens exposed to aerated PBS have a lower margin of safety against pitting. The fit of three-parameter Weibull distributions to Eb − Er data for short-term and long-term exposure to deaerated PBS are shown in Fig. 9. Similarly, Fig. 10 shows the three-parameter Weibull fit to the aerated Eb − Er data (all exposure times included).

Fig. 10
figure 10

Distribution of Ep − Er with confidence bands for MP samples immersed in aerated PBS for 1-156 h (three-parameter Weibull)

The entries in Table 3 clearly demonstrate that estimated failure probabilities depend very sensitively on the type of probability distribution used to model the observed variation in Eb − Er. When a typical amount of testing is performed, several models may fit the available data and lead to greatly differing inferences regarding extreme values. The normal distribution fits the observed test data reasonably well but, in all cases studied, is not the best distribution to describe the distribution of Eb − Er data. Specifically, Table 3 shows that, if the normal model is used to calculate the probability of pitting in vivo, it leads the counter intuitive conclusion that pitting is less likely in aerated versus deaerated solutions. This result can be explained by the short tails of the normal distribution and the fact that, although Eb − Er, on average, is greater under deaerated conditions, the associated variability about the mean is also much greater than that under aerated conditions. The Weibull model provides a marginally better fit to the data but effectively precludes the possibility that Er could exceed Eb. Use of the Gumbel extreme value distribution can be justified by physical considerations and produces conservative estimates of lower-tail probabilities, but the fit of the model to observed data is poor in two of the three cases studied.

EP Modeling

A total of 290 EP samples were tested (169 in deaerated PBS and 121 in aerated PBS). Analysis of variance of the EP rest potential data (see Table 4) indicates that Er is significantly affected by both the amount of dissolved oxygen in the PBS and the length of time of immersion. The effect of dissolved oxygen on Er is the combined effect of the dissolved on solution redox potential and the effect of long-term exposure on the passive film (morphology, composition, and thickness).

Table 4 Analysis of variance of Er data from EP samples

Of the 290 EP samples tested, only 39 (13%) pitted below 0.800 V (SCE) the minimum voltage specified in ASTM F 2129 at which the cyclic polarization test can be stopped. Table 5 shows how probability of pitting of EP samples varies with pre-test exposure time and oxygen concentration in the PBS. The results from Fisher’s exact test (Table 5) and binary logistic regression analysis using the censored data versus the oxygen and time variable (Table 6) showed that the differences observed were not significant indicating that neither oxygen concentration or duration of immersion in PBS significantly affect the percentage of EP samples that pit below 1.1 V (SCE).

Table 5 EP NiTi—Fisher’s exact test indicates probability of breakdown does not vary significantly with oxygen concentration or duration of immersion time in PBS
Table 6 EP NiTi—binary logistic regression confirms probability of breakdown does not vary significantly with oxygen concentration or duration of immersion time in PBS

Modeling of the EP test data is more challenging than for the MP test data as most of the data are censored, i.e. most of the samples did not break down. One way to treat such data is through the use of nonparametric statistical methods. However such methods require a large number of samples. For instance, to specify a 99% reliability value with 95% confidence, 299 samples need to be tested. The number of specimens required for a nonparametric approach is much greater than a parametric approach (Ref 20).

One parametric approach is to fit the censored Eb − Er values to a standard statistical distribution using a methodology that can handle the censoring. We have done this for the aerated and deaerated EP specimens as shown in Fig. 11. These data were best fit by a three-parameter Weibull distribution for both the aerated and deaerated exposures. The threshold parameter (gamma), below which the probability of pitting is zero, is estimated at 512 mV for the deaerated and 286 mV for the aerated exposures. These results suggest that aerated environments are more prone to produce pitting than deaerated solutions, but neither solution is likely to result in breakdown in vivo since the estimated probability of failure is zero for both.

Fig. 11
figure 11

Censored treatment of Eb − Er data from EP samples immersed in deaerated (left-two-parameter exponential) and aerated (censored three-parameter Weibull distribution) PBS for 1, 12, and 60 h

The prediction of a zero failure probability from a fitted three-parameter Weibull or LogNormal distribution, for instance, does not seem physically reasonable, since there is presumably some probability of pitting no matter how small that actual probability. Further, we believe that single-population models lack physical basis in this context, and our investigation has shown that other probability distributions provide an adequate fit to the observed Eb − Er data.

To overcome some of the limitations of the parametric approach we have previously proposed (Ref 5), we derive a physically based, mixture model, using a Poisson distribution to describe the fraction of samples that will likely break down. This was done by modeling the occurrence of surface flaws, and then using a normal distribution to model the distribution of breakdown potentials for flawed specimens. The Poisson distribution has been used previously to characterize the number of pits expected (Ref 18). The Poisson probability function \( (p(k) = \exp^{ - \uplambda } \uplambda^{k} /k!) \) actually gives the probability of finding a specimen with k flaws that will pit before 0.8 V is reached. In computing this probability, \( \uplambda = \uptheta a \) is proportional to the area (a) of the surface being evaluated (1.15 cm2) and θ is the expected number of occurrences per unit area.

Our proposed approach to an acceptance criterion is to estimate the probability of corrosion (Π) based on the probability that the sample will pit (Π1) and, if the specimen will pit, the probability (Π2) that Eb − Er is negative:

$$ \Uppi = \Uppi_{1} \times \Uppi_{2} , $$

where \( \Uppi_{1} = 1 - \exp ( - \uplambda ) \) is the probability that any individual sample will possess one or more flaws, and will therefore pit, and Π2 = F(0), where F(·) is the cumulative distribution function that describes the variation in Eb − Er. We consider alternative parametric choices for F(·) by considering their suitability for and goodness-of-fit to the values of Eb − Er observed for the pitted specimens. Using the normal distribution for F(·),

$$ \Uppi_{2} = \left( {\frac{1}{2}} \right)\left[ {1 + {\text{Erf}}\left( {\frac{\updelta - \upmu }{\upsigma \sqrt 2 }} \right)} \right] $$

is the probability that Eb − Er < 0 if the specimen pits. Here, δ is an arbitrary extra margin of safety above Eb − Er which for this analysis we have assumed to be zero.

Other choices for F(·) can be considered; for example, using the three-parameter Weibull distribution,

$$ \Uppi_{2} = 1 - \exp [ - ( - \upgamma /\upalpha)^{\upbeta } ], $$

where α and β are the (positive) scale and shape parameters, respectively, and γ is the (negative) threshold parameter. Alternatively, one could develop probability models for Eb and Er separately. Although we do not pursue such an approach in this paper, some justification can be found in the observed lack of correlation (r = 0.024, P = 0.882) between the Eb and Er values for pitted specimens.

The probability (Π) that Eb − Er is negative can be approximated by:

$$ \Uppi \approx P_{\text{E}} = P_{1} \times P_{2} $$

where P1 is the observed proportion of breakdowns and P2 is computed by substituting data-derived estimates of μ and σ in the expression for Π2, as shown in Fig. 12 for the deaerated exposures and Fig. 13 for the aerated exposures. The next step is then to determine the upper bound PU for Π. The lot of material can then be accepted or rejected if PU is less than a specified limit on the probability for pitting and subsequent breakdown.

Fig. 12
figure 12

EP mixed probability model for deaerated PBS

Fig. 13
figure 13

EP mixed probability model for aerated PBS

For our data, 39 out of 290 specimens broke down. With \( \uplambda \) representing the expected number of flaws per 1.15 cm2 of surface area, this rate can be estimated by

$$ \uplambda = - \ln (1 - P_{1} ) = - \ln (1 - 39/290) = 0.144 $$

or about one flaw per seven specimens.

Restricting attention to the 39 pitted specimens, we can use analysis of variance to see if Eb is affected by oxygen concentration or duration of exposure to PBS. The result of this analysis is shown in Table 7 and confirms that Eb is not significantly affected by oxygen concentration but is influenced by the duration of exposure. Similarly, an analysis of variance of Eb − Er (see Table 8) indicates that the margin of safety against pitting Eb − Er is not significantly affected by duration of exposure but is significantly affected by the dissolved oxygen concentration in the PBS. Thus, from this analysis we see that for EP samples:

Table 7 Analysis of variance of Eb data from EP samples
Table 8 Analysis of variance of Eb − Er data from EP samples
  1. 1.

    Er is greater for aerated solutions

  2. 2.

    Eb is similar for aerated and deaerated solutions

  3. 3.

    Both Eb and Er increase with exposure time

  4. 4.

    Eb − Er is lower in aerated solutions

  5. 5.

    Eb − Er does not vary significantly with immersion time for aerated or deaerated solutions

Because aeration significantly affected Eb − Er and immersion time did not, estimates of P2 are developed by fitting probability distributions to the 21 pitted specimens tested under deaerated conditions and to the 18 pitted specimens tested under aerated conditions. Compared to a single-population approach, analysis using this physically based mixture model gives a more realistic result of 1 in 1600 for the probability of pitting for EP exposed to aerated PBS versus 1 in 730,000 for specimens exposed to deaerated solutions. This can be compared to the probability for pitting of MP samples of 1 in 1500 in deaerated PBS (60-156 h) and 1 in 1,800,000 for MP exposed to aerated PBS (1-156 h).

It appears that the MP samples have a very high surface density of pit-initiation sites and that the breakdown potentials for these sites are reasonably well characterized by a normal distribution of voltages with a small standard deviation. The breakdown is most likely associated with defects in the oxide coating from mechanical polishing. The EP samples, however, appear to have a relatively low surface-defect density that results in pitting only once in 10 specimens. For the samples that pit, the distribution of the breakdown potentials are slightly higher than those observed for the MP samples. It is possible that once the defective oxide coating has been removed by electropolishing, the only defects that remain are where inclusions and cracks surrounding the inclusions intersect the surface. Shabalovskaya et al. investigated typical inclusions in NiTi and found that the inclusions are generally titanium rich as we observed (Ref 21, 22). They indicate that these titanium rich inclusions are generally Ti2NiO, TiO2, TiC, or oxidized TiC. The probability of inclusions, associated crevices, and voids intersecting the surface is, hopefully, a rare event and may explain why only one in seven of the EP samples failed. The intersection of inclusions and voids and cracks surrounding such inclusions has been used in the past to characterize the probability of fatigue failure (Ref 23) and we believe a similar process can be used to explain the pitting probabilities for NiTi once the more damaging/frequent oxide defects have been removed by electropolishing.

Conclusion

  1. 1.

    Eb − Er as determined from the current version of ASTM F 2129 which does not include pre-test exposures to aerated solutions and can over estimate the margin of safety of NiTi to pitting corrosion in vivo. This indicates that current acceptance testing of medical device implants (such as ASTM F 2129) should be modified or extended to include at least several days pre-test exposure of implant material to aerated physiological solutions prior to cyclic polarization testing using the Er values at the end of the pre-test immersion and the Eb from the testing in deaerated PBS.

  2. 2.

    Acceptance criterion should be based on a calculation of the probability that the device will pit in vivo is sufficiently small. This can be done by cyclic polarization testing a sufficient number of devices that have been pre-exposed to aerated PBS for sufficient times. This data should then be used to calculate the probability of pitting in vivo, i.e. the probability that Eb − Er is negative.

  3. 3.

    The margin of safety (Eb − Er) of MP NiTi decreased significantly after exposure to deaerated PBS at 37 °C. Modeling showed that with a pre-test exposure of 1 h Eb − Er decreased approximately 296 mV when exposed to aerated rather than deaerated solutions. This change is primarily a result of the increase in Er rather than from a change in Eb. If testing is done with pre-exposure to aerated solutions, the inherent variability in Eb − Er is decreased and there is no need to account for the time the implant will be in service.

  4. 4.

    To achieve a high degree of confidence that the probability of in vivo pitting is acceptably small, the amount of testing required will depend on the population variation in breakdown and rest potentials. Small-scale pilot testing should be performed when this information cannot be obtained from other sources.

  5. 5.

    The pitting resistance (Eb − Er) of EP NiTi is higher than MP NiTi. Only five of the EP NiTi samples broke down whereas all of the MP samples broke down. Using only the samples that broke down, the average Eb − Er was 0.774 V SCE for the EP NiTi and 0.586 V SCE for the MP NiTi. This, however, does not take into account the fact that over 90% of the EP samples did not break down.

  6. 6.

    MP NiTi breakdown potentials can be treated with a normal quality-assurance type of statistics and sampling theory. Eb − Er distributions can be used to predict the probability that Eb − Er from the sample will be negative or some pre-determined threshold value. The use of a threshold value can be used to account for the effect of drift in Eb and Er with long-term immersion, or testing can be done on samples subjected to long-term immersion in aerated solutions to verify that Eb − Er has stabilized.

  7. 7.

    Quality-assurance testing and statistical analysis of EP NiTi, and other medical-device materials, with breakdown potentials frequently greater than 0.8 V SCE, is much more challenging. Here we propose a combined model in which the censored data (no break down data) is used to model the probability that any given sample may, or may not break down as a Poisson process, then model the breakdown potentials of the samples that have failed as a normal distribution.

  8. 8.

    The probability that NiTi will pit in vivo (Eb − Er < 0) was estimated as:

    1. (a)

      1 in 15,500 for MP exposed to deaerated PBS for 1 and 12 h (Gumbel)

    2. (b)

      0 for MP exposed to aerated for 1-156 h (three-parameter Weibull)

    3. (c)

      1 in 730,000 for EP exposed to deaerated for 1 h (mixture model)

    4. (d)

      1 in 1,600 for EP exposed to aerated solutions for 1 h (mixture model)

  9. 9.

    The estimated probability of pitting in vivo is higher for EP samples pre-exposed to aerated PBS than for deaerated PBS primarily as a result of the increase in dissolved oxygen raising Er.