1. Introduction

The development and introduction of second-generation antipsychotics (SGAs) has led to an expectation for better pharmacological treatment of schizophrenia. In addition to this, the range of outcomes for identifying success with antipsychotic therapy has expanded and has begun to include consideration of negative and cognitive symptoms. Initial studies involving SGAs concentrated on the essentials (safety and efficacy). Trials were short term (up to 8 weeks), randomized and placebo controlled to fulfil regulatory requirements. These short-term trials have now given way to longer, larger clinical effectiveness trials. The aim of these larger trials essentially is to examine to what extent therapeutic efficacy is translated into clinical effectiveness and to answer the question: what is the most effective antipsychotic to treat schizophrenia?

The four clinical effectiveness studies on SGAs included in this article are CATIE (Clinical Antipsychotic Trials of Intervention Effectiveness), CUtLASS (Cost Utility of the Latest Antipsychotic Drugs in Schizophrenia Study), SOHO (Schizophrenia Outpatient Health Outcomes) and ZODIAC (Ziprasidone Observational Study of Cardiac Outcomes). The main objectives, description and outcomes of each trial are summarized below.

2. CATIE

The main objective of CATIE was to determine the comparative effectiveness of a representative conventional antipsychotic (perphenazine) and different atypical antipsychotics in chronic schizophrenic patients as measured by time to treatment discontinuation.[1] Table I summarizes the main description of CATIE and the main outcomes.

Table I
figure Tab1

Description and main outcomes of the CATIE trial (US) [not industry sponsored]

In addition to the main body of CATIE trials listed above, several other studies and analyses were carried out within the population.

2.1 Extrapyramidal Side Effects Trial

The CATIE EPSE (Extrapyramidal Side Effects) trial[7] used a variety of measures of dystonia, parkinsonism, akathisia and tardive dyskinesia (TD) and analysed incidence rates and continuous rating scales. They found no substantial and significant differences between the SGA and perphenazine or between any pair of the SGAs in the incidence of treatment-emergent extrapyramidal symptoms (EPS) and change in EPS ratings.[7] There was a trend towards a higher proportion of patients in the risperidone and perphenazine group to have medications added to treat akathisia compared with patients in the olanzapine, quetiapine and ziprasidone groups.[7] The dose range of perphenazine used in the CATIE trials was chosen to minimize the potential for EPS. Despite this, the original report from CATIE showed significantly more patients (8% vs 2–4%, p = 0.002) discontinued perphenazine because of EPS.[4]

The reason for this discrepancy could be because the former trial looked at the incidence of EPS separately, excluding from the analysis those who had a specific EPS at baseline. This method may be useful for determining incidence of emergent side effects but does not allow for pooled incidence of all types of EPS. The rating scales used in the CATIE EPSE trial were more sensitive compared with the original trial because they were not simply looking at reasons for discontinuation.

2.2 Change in Metabolic Syndrome Parameters Trial

The Change in Metabolic Syndrome Parameters Trial[8] trial used the National Cholesterol Education Programme (NCEP) derived diagnostic criteria for the metabolic syndrome (summarized in table II).

Table II
figure Tab2

National Cholesterol Education Programme derived diagnostic criteria for the metabolic syndrome (≥3 criteria must be present to establish diagnosis)

Among the 933 subjects who had baseline and 3-month data, metabolic syndrome could be classified in 660 patients. In this all-classifiable group, within 3 months, the proportion of patients meeting criteria for metabolic syndrome was highest for olanzapine (+9.1%) and this difference was significant compared with ziprasidone (−7.8%) [p = 0.001]. This study also indicated that quetiapine when used at higher dosages had a significant effect on increasing central adiposity and high-density lipoprotein (HDL) [in White subjects] despite not having an effect on glucose homeostasis.

2.3 Staying versus Switching Trial

Phase 1 of the CATIE trial allowed for analysis of patients who continued with their original antipsychotic medication before entry into CATIE compared with those who were randomized to switch their antipsychotic medication. The first analysis compared patients who were randomly assigned to olanzapine or risperidone and had been taking the same drug prior to CATIE (‘stayers’, n = 139), and patients who were assigned to olanzapine and risperidone as a new medication (‘switchers’, n = 496).[9] This analysis found that switchers discontinued the study drug more rapidly than those who were assigned to stay on the medication they had been taking previously.[9] An additional analysis compared outcome measures that included symptoms, neurocognition, quality of life, neurological side effects, weight and heath costs.[10] These analyses concluded that there was no advantage in switching to a new medication in any of the outcome measures and found that patients who stayed on olanzapine gained more weight.[10]

2.4 Cost-Effectiveness Trial

An analysis of cost effectiveness[11] was performed through the CATIE trial and involved 1424 patients. This analysis demonstrated significantly lower total health costs for the perphenazine group. The difference was attributed to the lower drug costs associated with perphenazine in the context of no significant differences in healthcare costs.

3. CUtLASS

The main objective of CUtLASS 1 was to test the hypothesis that SGAs other than clozapine are associated with improved quality of life across 1 year compared with first-generation antipsychotics (FGAs) in people with schizophrenia who required a change in treatment.[12] Additionally, a cost-effectiveness analysis was carried out.[13] CUtLASS 2 was designed to test the hypothesis that the use of clozapine (compared with other SGAs) would be associated with improvement in quality of life over 1 year and that this improvement would be associated with fewer symptoms, improved patient satisfaction and possibly lower total health costs.[14,15]

Table III summarizes the main description of CUtLASS and the main outcomes.

Table III
figure Tab3

Description and main outcomes of the CUtLASS trial[12,13] (UK) [not industry sponsored]

4. SOHO

The main objective of SOHO was to compare outcomes of patients initiating olanzapine with those patients initiating other antipsychotic medication. Table IV summarizes the main description of SOHO and the main outcomes.

Table IV
figure Tab4figure Tab4figure Tab4figure Tab4

Description and main outcomes of the SOHO triala

In addition to the main body of the SOHO trials listed above, several other studies and analyses were carried out within the population.

4.1 Tardive Dyskinesia: 6-Month Evaluation and Early Extrapyramidal Side Effects Analysis Trials (Pan European)

The Tardive Dyskinesia – 6-Month Evaluation compared the incidence and persistence of TD between patients with schizophrenia who were treated with SGA and FGA at baseline, 3 months and 6 months. Of 10 972 patients who were enrolled, 9912 were considered in the analysis for TD at baseline and 912 (9%) were diagnosed with existing TD. By 6 months, only 8632 were analysed due to missing data and eligibility. The rate of emerging TD was higher in the FGA group than in the SGA group (3.8% vs 0.9% [odds ratio {OR} = 0.29; 95% CI 0.18, 0.46]). In order to remove any possibility of industry-sponsored bias, olanzapine was excluded for a second comparison and the rate of emerging TD was 3.8% for FGA versus 1.4% for SGA (OR = 0.43; 95% CI 0.25, 0.72). Persistence of TD for FGA was 60.8% versus 46.7% for SGA (OR = 0.60; 95% CI 0.33, 1.11).[29]

Another analysis examined whether EPS predicted onset of TD at 1 year. 8036 patients were analysed as these patients had data on all four occasions: baseline, 3, 6 and 12 months after the study began. In order to take into account the waxing and waning course of TD, a sensitivity analysis approach was used with two risk sets: a broad risk set included all individuals with no TD at baseline; a narrow risk set included all individuals with no TD at baseline and 3 months. The total incidence of TD was 3% (95% CI 2.6, 3.4) in the broad risk set and 1.6% (95% CI 1.4, 1.9) in the narrow risk set. The sensitivity of baseline EPS as a test for TD (the percentage of patients who developed TD who had EPS at baseline) was 50% (broad risk = 53%; narrow risk = 46%). The authors concluded that the clinical implications of this study were that strategies aimed at reducing risk factors for EPS in the whole population are more likely to reduce TD morbidity, and that the sensitivity and specificity of EPS as a baseline test for TD were too low to justify a high-risk prevention strategy.[30]

4.2 36-Month Tolerability Trial (Pan European)

The incidence of EPS and TD, anticholinergic use, loss of libido or impotence, amenorrhoea, galactorrhoea or gynaecomastia and weight change was assessed in 4939 patients who started antipsychotic monotherapy.[31] Assessments occurred at baseline, 3, 6, 12, 18, 24, 30 and 36 months. Of the 4939 patients included in the analysis, 2922 (59.2%) were classed as completers.

EPS were present at baseline in 31.6–44.4% of patients across treatment cohorts and were related to the medications the patients were taking before the baseline visit. Decreases in the percentage of patients with EPS after 36 months occurred in all patient cohorts; the largest improvement occurred in the clozapine and olanzapine cohort. TD was present at baseline in 9% of patients ranging from 7.9% to 12.3%. After 36 months, the incidence of TD varied from 3.4% in the olanzapine group to 8.6% in the depot group. Among the patients who did not have TD at baseline, 2.3% had TD at 36 months. The results for each drug cohort at 36 months are olanzapine (1.7%), risperidone (2.7%), quetiapine (1.3%), amisulpride (2.4%), clozapine (3.3%), oral typical (5.0%) and depot typical (3.8%).

Loss of libido/impotence was a frequent adverse event among patients in all treatment cohorts at baseline; range 46.2–56.5%. Decreases in the percentage of patients suffering loss of libido and impotence occurred in all treatment cohorts. Patients receiving olanzapine were less likely to suffer loss of libido/impotence compared with risperidone (OR = 1.38), amisulpride (OR = 1.50), clozapine (OR = 1.39), oral typical antipsychotics (OR = 1.71) and depot typical antipsychotics (OR = 1.37). No difference was observed between olanzapine and quetiapine.

Weight gain occurred in all treatment cohorts over the first 12 months and mean increases ranged from 1.7 kg in the quetiapine group to the largest mean gain of 4.2 kg in the olanzapine group. Regression analysis showed that there was significantly less weight gain in the risperidone, quetiapine and oral typical antipsychotic cohorts compared with olanzapine. No significant differences in weight change were found between olanzapine and clozapine, amisulpride and depot typical antipsychotics.

4.3 Sexual Dysfunction Trials: the First-Time Neuroleptic-Treated and the 12-Month Analysis Trials (Intercontinental) and the Tardive Dyskinesia and Sexual Dysfunction Trials (Pan European)

In the first-time treated patient trial, data from 570 patients were analysed. In as early as 3 months after antipsychotic initiation differences between treatment groups on neuroleptic-related loss of libido and sexual dysfunction existed, which became statistically significant at 6 months. Olanzapine showed the lowest prevalence of neuroleptic-induced sexual difficulties.[32]

In the 12-month analysis trial, the proportion of patients reporting sexual dysfunction during the 12-month period was highest for haloperidol (71.1%), risperidone (67.8%), quetiapine (60.2%) and olanzapine (55.7%). When compared with the olanzapine group the odds of a patient reporting problems were greater with risperidone (OR = 2.92; 95% CI 1.63, 2.49) and haloperidol (OR = 2.47; 95% CI 1.61, 3.77). Based on patient perception, the odds of emergent sexual dysfunction during 12 months of therapy was lower for olanzapine (13%) and quetiapine (22.9%) than with patients on haloperidol (30.9%) or risperidone (27.8%).[33]

5. ZODIAC

The main objective of ZODIAC[2,34] was to compare the rate of non-suicide mortality of patients initiating ziprasidone with those patients initiating olanzapine. Table V summarizes the main description of ZODIAC and the main outcomes.

Table V
figure Tab5

Description and main outcomes of the ZODIAC triala

6. Discussion

So, what have we learned from these recent effectiveness studies of antipsychotics? Efficacy studies have already demonstrated that clozapine is uniquely effective in refractory schizophrenia[35] and that olanzapine has some efficacy advantages over other drugs.[36] We also knew that clozapine and olanzapine were particularly and relatively more likely to cause metabolic adverse effects.[37] It was also accepted that SGAs generally caused less frequent and severe EPS (including TD) than FGAs.[38,39] What more was uncovered? What assumptions were challenged?

6.1 CATIE

In the CATIE programme, clozapine’s unique effectiveness in resistant schizophrenia was confirmed, while olanzapine’s improved outcomes compared with risperidone, quetiapine and ziprasidone were also noted, supporting findings from some prior efficacy studies.[36,40,41] Intriguingly, no advantage was seen in people changing antipsychotics compared with those staying on the original drug. Clozapine, olanzapine and quetiapine seemed particularly prone to cause metabolic adverse effects.

Despite these expected findings, results from CATIE, the apparently equal effectiveness and better cost-effectiveness of perphenazine have provoked strong reactions and extensive discussions among clinicians, academics, patients and policy makers. CATIE apparently did not support the hypothesis that SGAs are treatments of choice in schizophrenia. Instead the study supported the view that FGA and SGAs have similar therapeutic effect with diverse side effect profiles. Perhaps most importantly, at the end of the 18-month study, 74% of patients in phase 1 were not taking the same drug they started with, which tells us that switching or discontinuation in the maintenance phase of schizophrenia is the rule rather than the exception.[42]

6.1.1 Adverse Outcomes: Metabolic Considerations

Individuals with schizophrenia and affective disorders have an increased risk of death from medical causes and up to a 20% shorter lifespan compared with the general population.[43] More than two-thirds of patients with schizophrenia compared with half in the general population die of coronary heart disease.[43] One of the clearest and most compelling messages from CATIE is the issue of metabolic side effects. Except for clozapine, olanzapine clearly caused the heaviest burden of metabolic side effects.[8]

The risks of metabolic syndrome need to be assessed in line with the issue of mortality and schizophrenia, and the role of antipsychotic therapy on mortality. Hogan et al.[44] conducted a large population-based cohort study (the FIN11 study). Data for the study conducted in Finland, included 66 881 patients and data were collected from 1996 to 2006. The primary outcome measure was all-cause mortality with secondary outcomes of mortality as death due to suicide and ischaemic heart disease. These authors concluded that long-term use of antipsychotics is associated with a reduction in mortality among patients with schizophrenia compared with no or short-term use.[44] Clozapine was the most effective in terms of preventing death among patients with schizophrenia.[44] However, overall life expectancy for patients with schizophrenia was not shown to be further decreased with the increased use of atypical antipsychotics.[44] None of the antipsychotics had an effect on mortality due to ischaemic heart disease.[44]

6.1.2 Tardive Dyskinesia

CATIE was not designed to maximize detection of EPS. There was no wash-out phase at the start of CATIE, not even for anti-parkinsonian medication, which may have further diminished differences in EPS.[45] More patients discontinued perphenazine because of EPS than with SGAs but the excess was modest.[7] Patients with TD were excluded from the perphenazine arm, making a true comparison of relative risk of TD onset impossible. The length of drug exposure may have also been too short to assess its true long-term toxicity, as highlighted by Casey “It is a major error to conclude that perphenazine offers the same benefit-risk ratio as the atypical agent when the efficacy is compared to the acute EPS and tardive dyskinesia risk.”[46] CATIE does not contradict the evidence that strongly suggests a lower risk of TD with SGAs.[47] Some authors have suggested that lower doses of FGAs will reduce the incidence of TD. However, a study that evaluated the 12-month incidence of TD in first-episode patients prescribed a very low dose of haloperidol (mean dose = 1.67 mg/day) concluded that the incidence of TD was at least as high as in other samples treated with standard doses of conventional antipsychotics and unacceptably high for clinical use.[48]

6.1.3 Limitations of CATIE

Kraemer and colleagues, summed up the study limitations rather coarsely: “Too many drugs, too many strata, too many sites with inadequate numbers of subjects, too many outcome measures and too many statistical tests.”[49]

If we start with the study population, the mean time since first treatment was 24 years and mean time since first treatment with an antipsychotic was 14 years. This population is thus perhaps more representative of poor or partial responders.[50] Once enrolled, clinicians knew that there was a second and third phase of the trial; therefore, discontinuing medication in the first phase could have been influenced to some extent by the availability (and the explicit or implicit desire to recruit subjects) of a second and third phase.[50]

In terms of medication, in phase 1 of the trial, patients could have been randomized to the medication they had previously been taken. Of patients in phase 1 of CATIE, 15% were randomly assigned to the medication they had previously been receiving prior to the study. Patients assigned to risperidone or olanzapine who had been receiving those medicines prior to the study remained on their medication significantly longer than other patients. When these patients were removed from the intention-to-treat analysis, the overall test of the comparison of treatments was not statistically significant regarding the primary outcome measure of all-cause discontinuation.[51]

The mean modal doses prescribed during phase 1 were 20.1 mg/day for olanzapine, 20.8 mg/day for perphenazine, 543.4 mg/day for quetiapine and 3.9 mg/day for risperidone. During phase 2E the trial the doses were 332.1 mg/day for clozapine, 23.4 mg/day for olanzapine, 642.9 mg/day for quetiapine and 4.8 mg/day for risperidone.[4,5] These mean doses are not considered equivalent: the olanzapine dose is above the licensed dose of 20 mg/day, while the quetiapine and risperidone doses are modest in comparison (at least when the maximum doses are considered). More recently, however, the optimal dose of quetiapine has been suggested as being between 300 and 400 mg/day[52] and, similarly, a review was unable to support the use of high dose (>800 mg/day) quetiapine.[53] In addition, there is probably no particular advantage for higher doses of olanzapine compared with, for example, 10 mg/day.[54] In respect to dose then, the relatively higher doses of olanzapine might have added nothing to its effectiveness but added significant burden to adverse effects.

6.2 CUtLASS

The results of CUtLASS 1 rejected the hypothesis that SGAs are superior to FGAs in terms of improvement in Quality of Life Scale (QLS) scores. CUtLASS 2 showed that commencing clozapine in treatment-resistant patients led to significantly more improvement in symptoms but not QLS over 1 year compared with commencing one of the other SGAs. The results also show there is no evidence to suggest SGAs are more cost effective than FGAs, and that clozapine is associated with higher costs and higher quality-adjusted life-years (QALYs) than other SGAs for the 12-month period following a change in therapy.

6.2.1 Limitations of CUtLASS

Sulpiride, a selective dopamine D2 receptor blocker, is thought to pose less of a risk of EPS compared with other FGAs, which may be why it was chosen more often in CUtLASS compared with previous efficacy trials (haloperidol is most often used). Its presence in the FGA arm itself may pose a bias (as it is considered to have SGA properties); however, results from a systematic review have not supported this SGA property or superior efficacy.[55] When clozapine was used in CUtLASS 2, the mean doses were low and no clozapine levels were done to ensure the dose was at an effective therapeutic level.

In terms of the patient population, QLS scores were not well balanced at baseline. The proposed effect size of 5 points in the QLS may have been unrealistically large. The results of randomization were known to the clinician and participant, which may have led to bias.

6.3 SOHO

One of the most outstanding strengths of the SOHO trial was its high retention rate: at 12 months the retention rate was around 85%, whereas the expected drop-out rate would be around one-third.[56] Patients were also allowed to remain in the study even if they changed medication at any point. Broadly speaking, SOHO suggested relatively better effectiveness for clozapine and olanzapine compared with other SGAs and FGAs and reduced rates of TD for SGAs compared with FGAs. As in CATIE, quetiapine suffered somewhat compared with some other antipsychotics in respect to effectiveness.

6.3.1 Limitations of SOHO

SOHO allowed clinicians to select the treatment and therefore a bias in assignment of patients to groups could not be controlled for and confounding by indication is a real possibility. Direct comparisons between medication groups are therefore not entirely appropriate as clinicians would choose the medication they felt would provide maximum benefit. Over-sampling of the olanzapine cohort was included in the study because the main objective of the study was to compare olanzapine with other antipsychotics. Although this may not be representative of the outpatient population setting, it does allow for precise estimates of outcome compared with olanzapine.

When reviewing SOHO trial results chronologically, one would expect 12, 24 and 36 months’ results that continue to report the primary outcomes of Clinical Global Impressions (CGI) scale and EuroQol-5 Dimension; and continue to use the Pan European population as they did in the baseline and 6-month study. However, we are left with publications from the intercontinental study and several other reports with measures such as remission, recovery and relapse. One is left with the feeling of too many publications from the same population and publications that vary slightly from the main aim set out originally by SOHO (e.g. there was a polypharmacy arm that was ultimately not reported on).

6.4 ZODIAC

The ZODIAC study showed that there was no difference in all-cause mortality between ziprasidone and olanzapine. Admission to hospital was more likely with ziprasidone than with olanzapine.

6.4.1 Limitations of ZODIAC

Ziprasidone is associated with a risk of QTc prolongation. The study was not designed to examine the risks of rare cardiac events such as torsade de pointes. The fact that there was no difference in all-cause mortality may go some way to increasing ziprasidone’s use especially as it has a low risk of metabolic syndrome. ZODIAC also indicated that the relatively more adverse metabolic effects of olanzapine do not, at least in the short term, cause more cardiovascular mortality than the metabolically neutral ziprasidone.

7. Conclusion

Clozapine is the most effective antipsychotic, followed by olanzapine and then by all other SGAs and modest doses of FGAs. We do not believe CATIE, CUtLASS, SOHO or ZODIAC provide ‘new found’ effectiveness or tolerability of FGAs. The biggest surprise from the results of these large pragmatic studies is how surprised the field has been by the results.[57] There has never been acceptance by governing bodies of the superiority of SGAs other than for clozapine.

We could not recommend non-clozapine SGAs over FGAs on the basis that SGAs have efficacy for primary negative symptoms or cognitive impairment. We would, however, recommend the use of SGAs over FGAs (even at modest doses of the latter) because the lower risk of extrapyramidal effects and TD.

Should we start using the cheapest AP? More studies are needed to add to the limited body of evidence that compares the cost effectiveness of antipsychotics.[58] As the earlier SGAs lose patent protection and the drug costs become less of an issue, the pharmacoeconomic debate will probably not focus on drug cost.

Should we change the way we use clozapine? Clozapine remains unique in its effects in the treatment-resistant population. Patients who fail to respond to two antipsychotics should be offered clozapine. We could also probably say that one of the antipsychotics should be an SGA[58] and that data also support the notion that one of the drugs should be olanzapine.[59]

What type of trial may give us more conclusive answers? Swartz and colleagues[60] assessed and outlined eight principles for a practical clinical trial to maximize its impact on clinical decision making:

  • questions posed should be straightforward and relevant;

  • trials should be carried out in real-world settings to ensure results can be generalized;

  • study should be sufficiently powered to detect small to moderate outcomes;

  • randomization should be used;

  • there should be clinical uncertainty of the outcome;

  • outcomes should be simple and meaningful;

  • assessments and treatments should reflect best clinical practice;

  • subject and investigator burden should be minimized.

On the surface, recent real-world trials seemed not to have added new information to that available before they were conducted; clozapine is superior and other antipsychotics are more or less equal apart from the drugs’ side effect profiles. Often though, clinical trial results are not about ‘ground-breaking discoveries’ but rather adding layer upon layer to our knowledge and experience as clinicians. No one study can adequately answer the complex question of optimizing pharmacological treatment of schizophrenia. These trials have provided additional important factors to consider when choosing the very best therapy for patients and may thus be considered valuable and worthwhile (even cost effective) when one considers outcomes and costs of conducting these studies. They have reminded us of: the importance of TD and metabolic syndrome, the slight advantages olanzapine may have, that most patients will switch their medication, and the need for new and novel therapies especially to address cognitive and negative symptoms.