Introduction

In Germany, about 350,000 cancers are diagnosed each year and approximately 210,000 persons die from the disease. Epidemiological considerations indicate that at least 60% of these deaths are related to environmental factors in a broad sense and should be avoidable in principle. Under realistic assumptions, the factually achievable mortality reduction may range between 18–33% within medium-term periods (Willett et al. 1996; Becker 2001; Adami et al. 2001).

These figures have two implications: first, they demonstrate the enormous potential of primary prevention (20,000–60,000 avoidable incident cancers and cancer deaths in Germany annually) which has been insufficiently exploited so far. Second, the main proportion of cancer cases, however, cannot be avoided even if knowledge regarding primary prevention were optimally used. This underlines the great importance of “secondary prevention” with screening as a part of it.

Principles of screening

It has widely been accepted that; (a) a number of preconditions must be fulfilled to make screening for a specific cancer reasonable; and (b) in advance of its introduction as a population-based programme, a screening modality has to be evaluated with regard to its efficacy to reduce mortality from the respective cancer. In short, the preconditions state the circumstances under which early detection of the cancer of interest may be feasible and effective, and the false-positive rate limited and acceptable (Wilson and Jungner 1968; Morrison 1992). Previous evaluation of the efficacy of a screening modality is required because early detection of asymptomatic cancers does not necessarily imply a mortality reduction for the respective cancer site. Randomised trials provide the most reliable evidence, though other designs, e.g., case-control studies, are feasible as well, but may be considerably biased (Miller 1985; Morrison 1992; German readers should also see Becker 2002 for details).

In a series of expert meetings, the UICC has evaluated the evidence regarding screening trials for several cancer sites and published the results and recommendations in a number of publications (Miller 1978; Prorok et al. 1984; Chamberlain et al. 1986; Chamberlain and Miller 1988; Hakama et al. 1985; Hakama et al. 1986; Day et al. 1986; Day and Miller 1988; Miller et al. 1990; Miller et al. 1991).

Finally, there is wide acceptance that if screening is offered and promoted, it should be offered as an organised screening programme with attention to quality control (Hakama et al. 1985).

Established screening modalities

Given these preconditions, in its last evaluation in 1991, the UICC has judged as efficacious/effective only screening for cervix and breast cancers. A specific recommendation was given for Japan regarding stomach cancer due to the particularly high incidence in this country. This is not applicable to European countries which have much lower rates. Screening was not recommended for the skin, but it was stated that health promotion programmes advocating enhanced individual awareness may be beneficial, though data are not available which confirmed this. Meanwhile, the efficacy of the faecal occult blood test (FOBT) has been proven for colorectal cancer screening by three randomised trials (see below).

Cervical cancer

Numerous observational studies have accumulated a large body of evidence for the effectiveness of this screening approach (reported or reviewed in Hakama et al. 1985, 1986) and provided data about different screening modalities and the related mortality reduction.

Despite the simplicity of the test and its wide distribution, the quality of its performance, and thus the effectiveness of the early detection programmes, continues to be a matter of concern (see, for example, Anonymous 1985; Koss 1989; Miller 2002). The cytological smear is considered to be a routine procedure but the individual steps required (taking the smear, fixation, laboratory processing, interpretation) are susceptible to errors, and therefore the effectiveness of early detection is dependent upon a series of steps executed by qualified personel (Koss 1989). Guidelines for quality assurance have been released (Coleman et al. 1993) which have been adopted in different countries to various degrees (see below for Germany). Studies suggest that organised programmes might be superior to ‘opportunistic’ early detection in terms of factually achieved mortality reduction (Nieminen et al. 1999).

Breast cancer

Breast cancer screening by mammography alone or together with physical examination has been evaluated by several large randomised trials in different countries, most of which demonstrated a reduction of breast cancer mortality by up to 30% (Day et al. 1986; Day and Miller 1988). Based on this evidence, many countries introduced mammography screening with or without physical examination and for different age ranges, at least for the minimal age range 50–64 years (Shapiro et al. 1998; Ballard-Barbash et al. 1999; IARC 2002). Recently, the evidence has been reviewed by an international expert meeting which confirmed the conclusiveness of the studies in terms of breast cancer mortality reduction by mammography screening (IARC 2002). The experts concluded that mammography screening in the age range 50–69 years may reduce breast cancer mortality by about 25% and in programmes by 5–20%. Inadequate evidence is available for the efficacy of physical examination and breast self-examination.

Quality issues appear especially critical for breast cancer screening. Less experienced radiologists have a lower sensitivity and specificity of diagnosis (Esserman et al. 2002). Limited specificity implies the risk of false positive results, i.e., further assessment including biopsies which may accumulate over the attended screening rounds to a substantial amount (Christiansen et al. 2000). Guidelines for quality assurance have been released for various countries (for citations see IARC 2002). For the European Community, guidelines which have the character of recommendations have been issued by the European Commission (Perry et al. 2001).

Physical examination (PE) has never been evaluated against ‘no screening’ and no clear evidence about its efficacy is available. However, studies suggest some benefit (Stockton et al. 1997; Kuroishi et al. 2000; IARC 2002). It is used in many countries for early detection including Germany (see below) and needs consideration in the present context.

Colorectal cancer

For colorectal cancer, three studies proved the efficacy of FOBT (Hardcastle et al. 1996; Kronborg et al. 1996; Mandel et al. 1993, 1999). The studies demonstrated mortality reductions of up to 33% with annual screening and up to 21% with biannual screening.

Before FOBT was validated and incorporated into screening recommendations, digital rectal examination (DRE) was partially recommended for rectal screening. This approach was never evaluated by a randomised trial and no evidence about its efficacy is available (Cuzick 1999). A case-control study of deaths from distal rectal cancer found an odds ratio of OR = 0.96 (confidence interval = 0.56–1.7) for a screening history, i.e., a null effect. However, due to the limited size of the study, a modest effect could not be ruled out (Herrinton et al. 1995).

Prostate cancer

The most strongly debated approach is early detection of prostatic cancer by PSA testing. For this potential screening modality, two large randomised studies are under way, but no evidence about a mortality reducing effect yet exists. Since no proper quantitative data about its potential benefit are available and harm is known to be considerable (strong overdiagnosis and overtreatment leading to a high proportion of incontinence and impotence as outcome among the treated subjects), PSA testing is clearly not yet recommended (Auvinen et al. 1996; 2002).

A frequently used screening test is digital rectal examination (DRE) for which, however, no evidence about a mortality reducing effect exists (Bentvelsen and Schröder 1993). A case-control study among men with metastatic prostate cancers found an odds ratio of OR = 0.9 (confidence interval 0.5–1.7) for a screening history of at least one DRE versus none, which provides little support for efficacy. However, a small benefit cannot be ruled out (Friedman et al. 1991).

Skin cancer

No evidence exists regarding the potential efficacy of visual examination of the skin as a screening modality from a randomised trial. A case-control study on skin self-examination provided an odds ratio of OR = 0.37 (confidence interval = 0.16–0.84) suggesting a strong benefit (Berwick et al. 1996). However, due to potential biases in a case-control approach the quite strong effect may also be interpreted non-causally (Elwood 1996).

The German Statutory Early Detection Programme

In 1971, the so-called “statutory early detection programme” was established in West Germany by law. It states that every person who is health-insured with a statutory health insurance company and within a cancer-site and gender-specific age range is eligible to receive a specified annual early detection examination paid by the respective insurance company (Herwig 1975; Schenck and von Karsa 2000). Since about 90% of the German population are covered by statutory health insurances and the remaining 10% by so-called “private” health insurances (membership in a health insurance is mandatory) who joined the programme, the coverage is factually 100%.

The law allows that the programme may be regularly updated and assigns the responsibility for the decision about the target cancers and compensated modes of early detection to the so-called “Federal Board of Physicians and Health Insurances” (Herwig 1975). In the initial years, the programme comprised gynaecological and rectal examinations for females beginning at age 30 years, and digital rectal and prostate examinations for males beginning at 45 years old. Over the years, the board mentioned above expanded the programme to younger women and the coverage to further target sites, that is, the breast, colon, and rectum, including FOBT since 1977 (Table 1). The most recent expansions include full colonoscopy every 10 years from the age of 55 years since 2002 and biannual breast screening by mammography at ages 50–69 years from 2004.

Table 1 Target population, target organs, and methods of examination in the German “Statutory Early Detection Program” (source: Bundesausschuss 1996)

The programme was established relatively early compared to other countries, since decision-makers did not wait until the results of randomised trials confirmed the efficacy of screening modalities for the target cancers of interest. Nevertheless, the principles outlined in Wilson and Jungner (Wilson and Jungner 1968) were taken into account as guidelines for the decisions about the content of the programme leading to pragmatic considerations about technical and personal practicability, acceptability by healthy subjects, and reasonable sensitivity and specificity of the tests (Flatten 1988). However, the problem mentioned above that early detection does not intrinsically imply a benefit in terms of mortality reduction appears to have been ignored.

While the recent inclusion of colorectal screening by full colonoscopy followed this manner of decision-making, the decision about mammography screening was based on the consideration of the evidence from randomised studies, expert evaluations and pre-investigations. After an initial pilot study identified serious shortcomings in existing clinical mammography (Frischbier et al. 1994), model projects have been initiated and are currently running with the purpose of exploring whether organised and quality-controlled mammography screening according to the guidelines of the European Commission (Perry et al. 2001) is feasible under the conditions of the decentralised health care system in Germany (Junkermann et al. 2001). The first data indicate that the pilot studies achieved the required quality from the beginning.

As just mentioned, the statutory early detection programme is adjusted to the decentralised German health care system which is largely based on office-based physicians. It is factually conceptualised as an opportunistic screening programme. The basic shortcomings of this approach are poor compliance and lack of proper quality control.

Overall, compliance was low from the very beginning and considered unsatisfactory by the decision-making institutions. Only about 13% of males and 22% of females participated in 1972, the first year of action. The attendance increased slowly to about 50% of females and 30% of males in the late 1990s. However, outside of this programme, increasing numbers of “curative” mammographies developed as factual screening mammographies which are paid by the insurance companies as well but are not covered by the regular statistics of the programme. These peculiar circumstances are addressed in the Results section in the context of the appropriate assumptions on screening prevalence for the effectiveness assessment.

Early cancer detection which is structured in this way creates problems regarding establishing effective quality control. Beginning with the data of 1972, the Central Research Institute of Health Insurance Physicians issued regular annual reports about compliance and the results of the early detection programme (Herwig 1975 and subsequent annual issues). However, these data are not very specific and far from, for example, the requirements outlined in the guidelines for some of the target cancers (Coleman et al. 1993; Perry et al. 2001). Attempts have been undertaken to improve the quality of the existing opportunistic screening gradually (for cervical cancer see Schenk and von Karsa 2000; Miller 2002). Even with mammography screening, gradually upgrading of existing screening has been proposed, avoiding an organised approach. These attempts leave the epidemiological parameters for quality control largely out of consideration and apparently underestimate their relevance as surrogates for the actually interesting target quantity mortality reduction (see also Rittgen and Becker 2001).

In East Germany, screening was not organised in a national programme. Participation at cervical screening was encouraged and facilitated since, in the 1970s, the test was increasingly offered by the occupational health services. With the reunification of Germany, the West German programme was expanded to East Germany (Schenck and von Karsa 2000).

Material and methods

The age-standardised incidence rates for Germany and, if used, for other countries, have been taken from ‘Cancer in Five Continents, Vol I—VIII’ (Doll et al. 1966, 1970; Waterhouse et al. 1976, 1982; Muir et al. 1987; Parkin et al. 1992, 1997, 2002) with the ‘world population’ as standard. The incidence rate for cervical cancer in the USA for the years 1947–1950 was computed as a standardised rate, with the world population as standard, from the age-specific incidence rates of the SEER program presented in Appendix 2 of Devesa et al. (Devesa et al. 1987). The age-standardised incidence rate for 1960 in Germany/East was taken from Möhner et al. (Möhner et al. 1994) and for 1993–1997 communicated from the Common Cancer Registry of the five East German federal states.

The absolute number of cancer deaths and the overall cancer mortality rate for Germany in 1995 were taken from the German Cancer Atlas (Becker & Wahrendorf 1997). The mortality rates are likewise age-standardized with reference to the world population. Despite more recent data being available, the data of 1995 have been used to keep the figures comparable to the article on primary prevention in Germany (Becker 2001).

The effects of screening were computed as population preventable fraction PF = p×(1−RR) given in percent, where p denotes the proportion of the population under exposure (attending the screening) and RR denotes the rate ratio of mortality reduction under screening, i.e., 1−RR the preventable fraction (Rothman and Greenland 1998).

The benefit of cervical cancer screening was directly estimated from the incidence and mortality rates. However, the figures of the mortality statistics are rather unreliable for cervical cancer (ICD-8 180), at least in the years before 1970, due to misclassification of cervical cancer into the category of ‘unspecified uterine cancer’ (ICD-8 179 and 182). Thus, mortality from all uterine cancers (ICD-8 179, 180, and 182) were combined and only mortality among the young age groups of 20- to 44-year-old women was considered, which is likely to be attributable mainly to cervical cancer (Levi et al. 2000). These rates were fitted with piecewise regression (Kim et al. 2000) using a computer program made available in the Internet (Joinpoint Regression Program, Version 2.5, March 2000, National Cancer Institute; http://srab.cancer.gov/joinpoint/). Then the resulting parametric regression lines were used to extrapolate the age-standardised mortality rates for cervical cancer (ICD-8 180) over all age groups to the years before 1970. The parameters of this set of regression lines were kept fixed and a multiplicative adjustment factor was determined by a least squares fit to the observed mortality rates after 1970 (Fig. 2).

Results

In the following, past achievements and future potential are considered for the cancer sites covered by the programme.

Cervical cancer (ICD-9 180)

Initial efforts to offer cervical cancer screening were started in the 1960s. Thus, for the assessment of the presumed benefit of early cervical cancer detection, the data of the 1960s should provide a reasonable picture of cervical cancer incidence before the start of screening. Figure 1a shows the incidence data of East and West Germany in comparison to selected countries of Europe and North America. It demonstrates that: a) in Germany in the early 1960s, the incidence ranged around a level comparable to that of other countries (e.g., USA) or even higher; and b) it shows a strong decline of German cervical cancer incidence from 1960 to 1990 by 73% (see Table 2). Figure 1b shows the secular trend of cervical cancer mortality among young women (age groups 20–44 years) and shows a decline of 74%, consistent with the incidence data. Notice that mortality declined even before the start of screening (explained by increased diagnostic activity outside screening, changes in diagnostic criteria, and improved treatment (Pontén et al. 1995)) so that these figures clearly indicate an upper limit of benefit.

Fig 1a,b
figure 1

Evolution of age-standardized cervical cancer: a incidence (all age groups); and b mortality (20–44-year-old women, 3-years moving average) in different countries

Table 2 Age-standardised rates for cervical cancer incidence (all ages) in 1959–1961 and 1993–1997, and mortality (20- to 44-year-old women) in 1959–1961 and 1994–1996 with percentages of change in different countries

Regarding the consistency of these findings with the performance data of the statutory early detection programme, one has to take into account that despite annual attendance rates ranging around 45%, biannual participation reached about 55%, and less than 20% of women were reported as never attending screening (Robra 1990). For a lower limit, the benefit of a biannual screening (91% mortality reduction) and the observed attendance rate (35%) have been used. Notice that the potentially relevant poor sensitivity of the test (see Discussion section) has been ignored with this procedure.

For the 1950s and 1960s, the cervical cancer mortality data for all age groups are unreliable (Becker and Wahrendorf 1997). Thus, in order to assess the overall benefit of the screening, the data of the young women was fitted with piecewise regression, and the resulting regression lines fitted to cervical cancer mortality (all age groups) after 1970 (Fig. 2) to obtain an extrapolation to the previous decades. The resulting estimated cervical cancer mortality rate for 1960 has been used to assess the proportion of cancer deaths prevented by screening, yielding a range of 3.2–4.7%. Using the mortality data of 1969–1971 would have provided much lower values and underestimated the effect (Table 3).

Fig 2
figure 2

Observed and fitted/extrapolated mortality rates for cervical cancer in Germany (age standardised, world standard)

Table 3 Estimated achieved reduction of cancer mortality by past early detection activities in Germany (West)

Regarding future potential, the theoretical benefit of early cervical cancer detection (about 90%) is assumed to be achievable which allows for a further reduction of cervical cancer mortality by about 60%; however, this reduction decreases cancer mortality starting from a baseline which is already very low (about 1% of overall cancer mortality in 1995, see Table 4).

Table 4 Estimated future potential for cancer mortality reduction by screening modalities with established efficacy in Germany

Breast cancer (ICD-9 174)

Physical breast examination has been part of the early detection programme since the very beginning. However, since there is inadequate evidence for a mortality reduction by physical breast examination (IARC 2002), a 0% programme benefit cannot be excluded. In order to obtain an upper limit of a potential programme’s effectiveness, the mortality reduction potentially attributable to increased awareness of breast cancer may be taken into consideration. Increased awareness may shift breast cancers to earlier stages at the time of diagnosis as seen, for example, in the UK after the mid-1980s. Stockton et al. (Stockton et al. 1997), assessed the early diagnosis-related decrease of mortality to about 3.5–5%.

Mammography screening is not officially offered, but it is estimated that about 4–5 million “grey” mammographies are carried out annually (Schultz et al. 2001, page 7). Given a) the unsatisfactory quality of existing mammographic early detection of breast cancer as observed in a demonstration project (Frischbier et al. 1994) and b) the estimation of Blanks et al. (Blanks et al. 2000) that until now only a small fraction of changes in mortality can be attributed even to organised screening (6%), it is unlikely that the use of mammography has led to a considerable reduction of breast cancer mortality so far.

Overall, the programme effectiveness achieved so far may not exceed 6% mortality reduction as a generous upper limit providing an estimated maximal benefit of 0.4% (Table 4).

Regarding future potential, the benefit achievable by organised quality-controlled screening has been assumed to be about 35% among the participants in the age range 50–69 years, as assessed by the recent expert meeting (IARC 2002, p. 179) providing a breast cancer mortality reduction of about 15.7% over all age groups. With a participation rate of 50%, an overall cancer mortality reduction ranges around 0.7%, and with a participation rate of 70% it is around 1.0% (Table 4).

Colorectal cancer (ICD-9 153–154)

Based on the routinely collected data about participation in the early detection programme, Robra and Schwartz (Robra and Schwartz 1986) reported attendance rates of 15% (males) and 25% (females) for the year 1981. Taking the slightly increasing trend of compliance—at least among females—into account, we assumed attendance rates to FOBT of 15% among males and 35% among females (Table 3).

For efficacy, we used the 33% mortality reduction found for annual screening (Mandel et al. 1993) as the upper limit. The lower limit of zero takes into account that subjects may participate with irregular and sometimes long screening intervals and thus lose the benefit arising from regular participation.

DRE will not be taken into account separately, because: a) these cancers may also be found by FOBT; and b) even under a hypothetically assumed mortality reduction of 56% (lower confidence limit of the above-mentioned case-control study on DRE) among the distal rectal cancers (representing about 10% of all colorectal cancers), the benefit specific to DRE would be rather small and is well covered by the above-mentioned upper limit assumption.

Table 3 shows that, due to the different colorectal cancer mortality and attendance rates among males and females, the effect of screening may have been higher among females (0–11.6%) than males (0–5.0%). Since attendance at regular annual screening seems to have been rather low in the past (see Robra and Schwartz (1986)), a future increase of compliance at regular screening would probably provide almost the full effect of mortality reduction (16.5% or 23.1% with 50% or 70% compliance, respectively) implying a reduction of total cancer mortality by about 2.4% or 3.3%, respectively (Table 4).

Prostate cancer (ICD-9 185)

For estimating the past effect of DRE, we assumed a 10% mortality reduction as an upper limit of programme effectiveness assuming that the study of Friedman et al. (Friedman et al. 1991) might have been unable to confirm the observed effect (OR = 0.9, see above) statistically due to a too-small study size. The average participation rate of males was about 15% in the late 1980s. Despite the generous assumption, the potential reduction of mortality from prostate cancer is in the magnitude of 0–1.5% and for total cancer of 0–0.06% (Table 3). If one assumed—as a very extreme and unlikely maximal theoretical efficacy—the lower confidence limit (CL1) of CL1 = 0.5 in the DRE study (Friedman et al. 1991), providing a 50% mortality reduction, the upper limit of programme effectiveness in Table 3 would be 7.5% (prostate cancer mortality) and 0.31% (total cancer mortality), respectively (data not shown). PSA testing did not play a role in Germany in the 1980s which is the most relevant calendar time period for prostate cancer mortality in the mid-1990s. However, notice that for the year 2002 it is estimated that about 1.5 million tests are sold and most likely applied annually (personal communication).

Cancer of the skin (ICD-9 172–173)

Though no evidence about the efficacy of skin examination exists and a 0% effect may not be excluded, as an upper limit of programme effectiveness the OR = 0.37 of the case control study on skin self-examination (Berwick et al. 1996) wase incorporated into the evaluation of the potential past benefit. This is a rather strong assumption, since case-control studies tend to overestimate a screening effect (Begg et al. 1996; Miller 2002). Nevertheless, the mortality reducing effect is in the magnitude of 0–15.8% for cancers of the skin and, due to the small percentage of skin cancers among all cancers, around 0–0.15% for total cancer mortality (Table 3).

Overall evaluation

Taken together, the past benefit of the statutory early detection programme might range in the magnitude of 2.0–6.5%. Since the upper limit results from generous assumptions (breast, prostate, skin cancer) or inclusion of treatment effects (cervical and breast cancer), the true value might most likely be lower than the upper limit of 6.5% and closer to the estimates of the effect of cervical cancer screening (2.0–4.7%). The achievable future benefit, by exploiting the theoretical potential of screening more exhaustively, could provide a further cancer mortality reduction of about 3.4% (50% compliance) or 4.7% (70% compliance). Since the calculations behave additively, it can very simply be extrapolated that with a compliance of 90% the cancer mortality reduction would increase to about 6%.

Discussion

The methods used to assess past benefit and future potential of cancer screening in Germany may be considered to be crude, and they may be refined at some point or another. However, they appear to be sufficiently accurate to quantify the magnitude of the respective target quantities. Similar assessments are in agreement with the range presented above. An assessment carried out in the US in the context of a projection of the future potential of cancer control obtained a mortality reduction of 3% among women and zero among men (Greenland and Sondik 1986). Adami et al. (Adami et al. 2001) assumed a mortality reduction by cancer screening achieved in the past of about 6%, without, however, presenting the underlying quantitative assessment.

The estimated past achievement may be considered disappointing. However, these percentages mean—on the basis of the cancer mortality data of 1994–1996 (Table 4)—about 4,250–13,800 saved lives per year. On the other hand, the data on future potential indicate to what extent the available potential has not been exploited so far and imply that thousands of lives could have been saved annually with the existing screening methods even leaving full colonoscopy, which is part of the current programme, out of consideration.

In view of this partial failure of the programme, three issues must be addressed: compliance, the quality of the programme, and improvement and extension of available screening tools.

Organised screening

The remarkably poor compliance is apparently a characteristic of the opportunistic approach to screening. Other countries report attendance rates of approximately 80% (van Ballegooijen et al. 2000; Kuska 2000; Wang et al. 2001; Olsson et al. 2000) to 90% (Dean et al. 1999; van Ballegooijen et al. 2000) for organised screening programmes.

These rates are achieved on the basis of organised programmes which imply regular written invitations to the subjects eligible for screening. Beyond the purely quantitative aspect of high compliance, the organised approach allows for: a) focussing specifically on the target age range; b) strictly maintaining the screening intervals, i.e., avoiding too few or too many screening tests; and c) inclusion of social groups which have an above-average cancer risk, but are represented as below-average in opportunistic screening. All these topics may increase the effectiveness of the programme and, importantly, decrease misallocation of resources and, thus, save costs. Additionally, a revision of the offered screening methods based on the scientific evidence of their effectiveness would further improve the cost/benefit relationship.

Quality

Much concern has been raised about the quality of the programme. For example, at present about 14 million pap smears are taken annually (Bollmann 2001) and evaluated in about 2,000 cytological laboratories (Schenk and von Karsa 2000), leading to an average of about 7,000 smears per laboratory. This is below the recommended minimum of 20,000–30,000 smears per lab (Miller 2002). Reported values for the sensitivity of cytological screening are in the range of 20–30% (Schneider 1996; 2000), i.e., at the lower limit of internationally reported average values of 30–87% (Nanda et al. 2000). Taking the specificity of 96% (Schneider et al. 1996) to 99% (Schneider et al. 2000) into account, which is in agreement with international data (Nanda et al. 2000), and the fact that annual screening is recommended, the lifelong risk of a false-positive result is 36–84%.

For breast cancer, the estimated 4–5 million annual opportunistic mammographies are carried out by 2,500–3,000 licensed doctors plus a considerable unknown number of physicians in hospitals, leading to an average of less than about 1,670–2,000 mammographies per doctor per year. This is below the recommended minimum of 5,000 annual mammographies per doctor.

In the 1970s, digital rectal examination for prostate cancer was carried out for 77.9% of males by general practitioners and internists (Faul 1982) and provided a detection rate of 0.13% which is below the average of 0.85% in other countries (Bentvelsen and Schröder 1993). Generally, the organisational structures of the programme are unable to generate the data which are needed for regular in-time quality control (Kreienberg 2001; Schenk and von Karsa 2000).

Quality control has turned out to be crucial for the effectiveness of screening programmes. Screening tests may cause harm to the individuals undergoing the test. The only ethically justifiable way to offer screening is to provide it with highest possible quality and to perform routine in-time control. Excellent quality prevents expensive and time-consuming workup of false-positive diagnoses. It is thus not only ethical but additionally saves valuable resources. Again, the organised approach provides a superior frame for collecting the basic data for quality control.

Research on new or more efficacious screening modalities

Cervical cancer

A test would be desirable that indicates whether an oncogenic HPV virus has enhanced cell-cycle dysregulation and rendered infected cells susceptible to transformation, thereby facilitating the development of cancer. Such a test might be available now (Sano et al. 1998a and 1998b) and deserves careful evaluation. It should help to conduct cervical cancer screening more sensitively and more specifically and thus to enhance effectiveness and to reduce the costs of the programme.

Breast cancer

Even among small tumours (TNM stage pT1) found under regular high-quality mammography screening, a considerable portion (in the German pilot projects about 15%) have already disseminated into the lymph nodes implying an advanced stage of disease. Recent findings indicate that variants in genes coding for proteins regulating angiogenesis might affect the risk of early metastasis (Bange et al. 2002). The use of modern molecular biological tools in cancer screening might help to identify subjects who are at risk of early tumour spread and might benefit from a modified screening schedule.

Lung cancer

Currently, 85–90% of subjects diagnosed with lung cancer die of the disease. However, up to 70% of those diagnosed and confirmed as having stage 1a disease will survive 5 years. Recently, uncontrolled studies in Japan and the United States indicated that low-dose helical (spiral) computerised tomography of the lung is capable of detecting approximately four times as many small stage 1 lung cancers as chest X-rays, and that these patients appear likely to have a good prognosis (Henschke et al. 1999; Kaneko et al. 1996; Kakinuma et al. 1999; Sone et al. 1998). Before broad application, the new approach needs careful examination for several reasons: a) it is still unclear whether the technique selectively detects only those cancers with a good prognosis, leaving those with a bad prognosis occurring at about the same time in their natural history as at present, and thus leading to little overall benefit; b) the longer survival generally observed with early detection can be caused by advancing the time of diagnosis and may not be a result of prolonged lifetime as expected from effective screening; and c) a real benefit in terms of mortality reduction has still to be demonstrated. Currently, in Europe and North America, several randomised trials are in preparation and need funding aiming at a conclusive result in 5 year’s time (see, e.g., van Klaveren et al. 2001).

Conclusions

Screening partially requires an expensive medical infrastructure and is not without risks for the participants. The overall benefit is critically dependent upon the quality of the programme and its in-time control. Any benefit may be annulled by poor quality while costs are overflowing.

Well-organised high-quality screening may be a valuable part of cancer control. In order to preserve or to increase the impact of screening and to control its expenses: a) further research efforts are needed towards new or better targeted screening tools or modalities; b) the efficacy of new modalities has to be evaluated carefully in advance of a broad application: c) the programme has to be reconceptualised as organised screening based on scientifically justified characteristics (screening methods, screening frequency, target age groups); and d) in-time quality control based on the collection of the basic performance data must be an intrinsic part of the programme.