Does How You Measure Income Make a Difference to Measuring Poverty? Evidence from the UK

Hansen, Kirstine; Kneale, Dylan

doi:10.1007/s11205-011-9976-5

Does How You Measure Income Make a Difference to Measuring Poverty? Evidence from the UK

Published: 06 January 2012

Volume 110, pages 1119–1140, (2013)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Social Indicators Research Aims and scope Submit manuscript

Does How You Measure Income Make a Difference to Measuring Poverty? Evidence from the UK

Download PDF

Kirstine Hansen¹ &
Dylan Kneale^1,2

1407 Accesses
15 Citations
Explore all metrics

Abstract

Income is regarded as one of the clearest indicators of socioeconomic status and wellbeing in the developed world and is highly correlated with a wide range of outcomes. Despite its importance, there remains an issue as to the best way to collect income as part of surveys. This paper examines differences in how income is collected in a nationally representative UK birth cohort, the Millennium Cohort Study, looking at variations by questions asked and by respondent characteristics before then examining the implications different methods of collecting and reporting income may have for measuring poverty. Results show that less than a third of respondents give consistent information on income between measurement tools. Using multiple questions is associated with a substantially lower response rate but this method generally results in a higher estimate of family income than using a single question. This is particularly true for certain groups of the population—those on means tested benefits, in self-employment and in part-time employment. Not surprisingly then in our analysis of poverty, using a single question produces an inflated proportion of families who could be classified as living in poverty and is less associated with other measures of financial deprivation than the more conservative poverty measure based on multiple questions.

Missing Top Income Recipients

Article 19 March 2022

An integrated approach for a top-corrected income distribution

Article 02 August 2018

Differences Between Household Income from Surveys and Registers and How These Affect the Poverty Headcount: Evidence from the Austrian SILC

Article Open access 12 June 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Income is regarded as one of the clearest indicators of socioeconomic status in the developed^{Footnote 1} world and is highly correlated with outcomes across most life course domains including longevity, education, health, crime, family life, and happiness (for example Easterlin 2001; Hansen and Machin 2002; Lynch et al. 2000; Mayer 1997; Wilkinson 1992). Classical definitions linked income with consumption, so that income was defined as ‘the maximum value a man can consume during the week and still be as well off at the end of the week as he was at the beginning’ (Hicks 1939, p 172). The proliferation of consumer credit since this definition was coined may alter this meaning somewhat. Nevertheless, collecting information on the total monies entering a household or an individual’s budget is viewed as an important part of establishing socioeconomic status. For children, establishing household income can give a strong indication of the family’s potential purchasing power in terms of providing food, shelter, clothing, educational resources and other essential and non-essential items that will influence a child’s development.

Despite its importance, there remain issues as to the accuracy of income collected as part of surveys. These include the sensitivity of asking about income; whether weekly, monthly or annual income is reported; respondent knowledge of different types of income (gross compared to net for instance); and the more fundamental issue of whether respondents actually know their income. In addition, there is no consensus as to the best way to minimize these problems and therefore no agreement as to be best way to collect income data in surveys. Some surveys ask a single question, others use a number of more detailed questions; some ask for exact amounts, others for a banded range. These differences all lead to potential error in the data (Micklewright and Schnepf 2010; Moore et al. 2000).

This paper makes a unique contribution to the debate by examining whether different methods of collecting income within a single survey produce consistent estimates of income. This is possible within the Millennium Cohort Study (MCS), a UK nationally representative study described later in the paper, due to the different ways in which income is collected from the same respondents. We compare differences in how the same individual reports income in the most recent sweep of data collection (at age 7) when asked as a single question compared to multiple questions. We then look at whether the pattern varies across different sub-groups of the population, or by the period over which incomes are reported, for example weekly, monthly etc. The paper finally examines the implications any differences may have on the subsequent analysis of that data—in particular in an area of key substantive interest—the measurement of child poverty. We begin by examining known issues in the measurement of income.

1.1 Respondent Issues

The fact that people are said to be more reluctant to discuss their income than they are their sexual behavior (Gordon 1998) is testament to the sensitivity of the subject matter, although acceptability is just one aspect of respondent issues. In piloting questions on income for the 2001 UK census, Collins and White (1996) encountered respondents who were unsure as to the definition of household or family income (whether it represented earnings alone or total income); unsure of the meaning of net income (whether it referred to earnings after taxes or earnings after taxes and regular bills (disposable income)); and could not, in several cases, report on the partner’s income and thereby the family or household income (see also Atkinson and Micklewright 1983; Gordon 1998; Micklewright and Schnepf 2010).

Recall and reconciliation are other areas in which the respondent may introduce bias into the data (Atkinson and Micklewright 1983). The importance of recall in surveys is affected by the relevant measurement unit (Clarke et al. 2008); in the case of income it is usually annual income that is taken as a gold-standard (Becker et al. 2003).^{Footnote 2} This is generally not problematic for many salaried employees. However, for a large section of the population whose incomes come from a range of sources, recalling each source may be difficult. For those who are in short-term or seasonal employment, those reliant on several sources of income, or who otherwise have inconsistent forms of income, reporting an annual total figure can be problematic (Schrapler 2006). Even for salaried employees, providing both net and gross income estimates may be challenging. This moves from an issue of recall to an issue of reconciliation in reporting income. In the case of respondents who recently changed employment and income, providing one sum to reflect income during the period of change further compounds any possible existing difficulties in providing accurate information. For all respondents, regardless of earnings instability, providing a figure for annual income is a complex process involving issues of interpretation, understanding, retrieval, judgment, calculation, conversion and finally, communication (Collins and White 1996).

1.2 Measurement Issues: To Ask a Single Question?

There is no consensus as to the best way to collect income data in surveys. Large scale surveys in the UK collect information on income in a variety of ways—the majority either by single questions or by a set of detailed questions. Table 1 shows the different ways income is collected in a number of key UK studies. Some of this lack of consensus in measurement between surveys is likely due to the fact that surveys collect income data for different purposes (Davern et al. 2005; Micklewright and Schnepf 2010). However, even in multipurpose and multidisciplinary studies, such as the MCS, where income is not collected for a specific purpose, there is no agreement as to the best method.

Table 1 Collection of income data in major UK studies

Full size table

In reality single question and multiple question strategies both have their merits and shortcomings. Limiting a respondent to answering a single question on income can introduce difficulties in recall and reconciliation as discussed above. These difficulties are likely to be encountered differentially amongst various groups in the population resulting in differential item non-response across sub-populations. When coupled with more basic problems of study coverage and representativeness (Becker et al. 2003; Francesconi et al. 2009), this may lead to serious flaws in estimates.

In asking a single question, there are also considerations to be made as to whether this is asked for as a total figure or whether respondents should select a band. Banded data may improve response but may limit detail, although the loss of detail may not be so severe as to affect the data quality for the majority of the population (Micklewright and Schnepf 2010).^{Footnote 3} But banded data can also introduce respondent bias. In pilot studies, reported in Collins and White (1996), respondents were found to select the band below which their actual income lay because they were concerned that selecting the correct band would imply that they had an income close to the top limit of the band.

There are also issues related to the loss of detail encountered when using a single question even if not collected in bands, particularly if researchers are interested in the contribution different components of income make to the overall total. However, while providing additional detail, asking multiple questions on income can introduce new problems of unmanageable questionnaire length and heavy respondent burden (Micklewright and Schnepf 2010), in addition to vastly increased survey costs. In large scale longitudinal surveys, the implications of heavy respondent burdens can have a twofold effect through unit missingness as well as attrition in subsequent sweeps (Burchell and Marsh 1992). Moreover, many sections of detailed questions on income may be irrelevant for some respondents; questions on state benefits are likely to be irrelevant for the most advantaged while questions on dividend from stocks, shares and investments are likely to be irrelevant for the least advantaged respondents, for example. However, restricting these questions to certain respondents only could introduce further inaccuracies.

In addition, detailed questions on income may compound issues of acceptability in responding to these questions; while some respondents may be comfortable in providing a total figure for income, they may not be as comfortable providing details of the origin of each component of income. Not all researchers agree, some unequivocally state that asking a number of detailed income questions provides a more accurate response than a single income question. Of this persuasion Davern et al. (2005) argue the individual components allow policy makers to better understand the dynamics of poverty and income in determining eligibility for intervention programs. Yet this debate remains unresolved in the literature at present.

1.3 Other Measurement Issues: What to Count as Household Income and when to Count it?

The Canberra Group, an international expert group concerned by inconsistencies in the measurement of income, defined the essential elements included in a measure of household income as: (1) income from employment, (2) income from self-employment, (3) property income (including from stocks and shares), (4) income from social insurance benefits, (5) deductions from income (social insurance), and (6) social transfers in kind (for example government provided education and healthcare services) (The Canberra Group 2001).^{Footnote 4}

While suitable for standardizing the measurement of income for international comparison, in reality, such a framework is difficult to implement in the UK setting for a nationally representative population survey. Even if we limit the focus to the first four components alone, difficulties arise. In the UK context, measuring income from state benefits (which would include means tested benefits) directly from respondents is difficult as respondents may be unaware of the total benefits they receive. For example, housing benefit and council tax benefits (means-tested benefits that cover housing rents for those on no/low incomes) may constitute a large proportion of household income, but may be unknown to the respondent as they are often paid directly to landlords. This measure is also removed from the element of consumption, to which classical definitions of income, referred to earlier, are implicitly linked.

Seasonality in employment or in income (for example through performance related bonuses) can mean that results are dependent on when questions are asked. In analyses comparing results from questions on annual income and current income in the British Household Panel Survey, Boheim and Jenkins (2006) found small differences in the distributions, which did not alter the relative position of groups. However, within some groups, the differences were more pronounced with annual income reports producing lower values when harmonized with current values, notably in cases where the head of household was unemployed (Boheim and Jenkins 2006). Furthermore, it is easy to speculate that those in seasonal or unstable employment may also be those who have most difficulty in providing a single figure for annual income (as discussed earlier), and known correlates of irregular earnings include having low qualifications and being self-employed (Drewinka 2010).

The period of payment used to collect the information on income also has the potential to cause problems for a wide range of respondents^{Footnote 5} who may prefer to report income in the period in which they receive payment, as opposed to the period dictated by the researcher. This will then vary by sub-population, as those paid weekly will answer in weekly amounts, those monthly in monthly amounts. As the low paid are more likely to be paid weekly we will see systematic differences across different groups of the population. Moreover, respondents may prefer to report the figures they are most familiar with; often this means reporting gross annual income as an annual total but their net income on a monthly basis.

Each component of income will have substantial variation in terms of the response rate and the reliability. Hawkes and Plewis (2008) found that questions on income from self-employment had particularly low response rates in the MCS^{Footnote 6} and Moore et al. (2000) found that estimates of survey-collected self-employment income were substantially lower than independently verified self-employment income. Wages and salaries data collected in surveys generally have the highest levels of consistency with externally verified sources (Biancotti et al. 2008; Moore et al. 2000), while income from dividends and interest from investments have the lowest, with estimates being approximately half of the actual total (Moore et al. 2000). Income from pensions is found to have high levels of reliability in some studies (Biancotti et al. 2008), but others argue that in a number of cases income from pensions tends to be overstated (Moore et al. 2000).

This paper builds upon the literature and offers a unique contribution to the income debate by comparing differences in how the same MCS respondent reports income in the most recent sweep of data collection (at age 7) when it is asked as a single question as opposed to multiple questions. We then examine the implications that different methods of collecting and reporting income have for data collected from different groups within the population. We also briefly discuss the potential issue of measurement error in the collection of income data.

More specifically, based on the evidence presented above, we ask:

What differences can be observed in average annual income from using single question banded income data versus multiple questions?
- Does the pattern change across different sub-groups of the population?
- How does the pattern change by unit of measurement, for example weekly, monthly etc.?
What effect does using a banded single question versus detailed questions have on the analysis of a key area of substantive interest—the estimation of child poverty in the survey?

2 Data

The data used in this paper are from the most recent sweep of the UK Millennium Cohort Study (MCS), collected from families when children were aged 7.^{Footnote 7} This study recruited families of children born between 2000/1 in randomly selected electoral wards, disproportionally stratified to ensure adequate representation of children from disadvantaged and ethnic minority families. Information has been collected at 9 months, 3, 5 and 7 years, with the next sweep of data collection due when the children are aged 11 years. Initially, over 19,000 households were recruited into the study; by age 7 the number of participating families had dropped to 13,800. In the most recent sweeps, information on family income was collected through both a battery of detailed questions as well as a single question.^{Footnote 8} ^, ^{Footnote 9}

2.1 Detailed Income Questions

At the age 7 survey detailed income questions collect information on earnings and income from benefits as well as other sources. Each question typically asked first whether the respondent received a particular component of income before asking about the amount, whereupon respondents would report an exact amount as opposed to selecting a value from a set of bands. Respondents were then asked to confirm the unit in which they reported the component of income from a set range of options (weekly, monthly etc.), although this may not cover all arrangements. In an effort to improve the response rate for individual components of income, respondents who didn’t give an initial answer to the amount of income were given a number of follow-up options known as ‘unfolding brackets’. Respondents who were unable, or refused, to give an exact answer were asked a series of follow up questions designed to elicit a minimum and maximum number, defining a range within which the value lies (Centre for Longitudinal Studies 2009).

While income from earnings and benefits was collected in detail, income from other sources was collected less comprehensively, and was collected from main respondents only who could only give a monthly amount. These included income from investments, studentships and cash from parents.^{Footnote 10}

2.2 Single Question

In addition to the detailed questions at age 7 each main respondent (who is usually the mother) was presented with a card that showed incomes in weekly, monthly and annual amounts. They were asked ‘Which of the groups on this card represents you [^and your husband/wife]’s total take-home income from all these sources and earnings, after tax and other deductions’. Respondents were given a choice of nineteen brackets (see Table 2), although the question was not asked of partner respondents (usually fathers) and therefore assumed that a mother was aware of both her own and, where applicable, her partner’s income. The question asked about income after deductions, and while respondents were given a choice to report a total income in weekly, monthly or annual amounts, they were nevertheless expected to know and report different components of their income in the same periods of receipt.^{Footnote 11} At the age 7 sweep in 2008, the single question achieved a response rate of 89%, which was substantially higher than the 58% achieved for income from the multiple questions.^{Footnote 12}

Table 2 Income bands used in single question and components of income included from multiple questions

Full size table

The single question followed the detailed questions on income. It can therefore be expected that the multiple questions served as a prompt for what should go into the report of net family income. However, this was not made explicit. For example receipt of housing benefit was included in the detailed questions but not the amount. It is expected that housing benefits will also be excluded from the answers to the single question, although respondents were not explicitly reminded to exclude housing benefits from their total calculation in responding to the single question. Show cards were shown to respondents with the intervals displayed in weekly, monthly or annual amounts, although no record exists of which period respondents chose to report their answers.

3 Results

3.1 What is the Impact of Using Single Question Banded Income Data Versus Multiple Questions for Measuring Income?

To examine our first research question, we compare the descriptive statistics from five different calculations of income based on data handling conventions (detailed below) using information from both the detailed set of questions and the single income question.

1.
Income as collected with the single question in banded groups (calculation 1). Main respondents were asked to report a single band for total net household income (after taxes and other duties) from a choice of twenty bands (see Table 2). Different bands were used for couple and lone parent households. Respondents could choose bracket values that corresponded to the household income in weekly, monthly or annual multiples, although for consistency with our other definitions, we examine annual amounts of income. Income brackets were narrower towards the bottom of the income distribution compared to the top; the top bracket was open at £100,000 or more per annum. Respondents were also given the choice of don’t know or could refuse to answer—around a quarter of those without a valid response represented refusals.
2.
Income as collected with the single question in banded groups continuously using the mid-point of the interval as the value (calculation 2).^{Footnote 13} This calculation of income uses the same information as above, although we apply mean substitution for the band intervals, taking the mid-point of the interval.
3.
Income collected from multiple questions, and including the income of any respondent with a valid answer to any of the component questions as having a valid income response (calculation 3). This calculation of income represents an aggregate sum from all components of income as reported, with sums multiplied to an annual amount. Respondents who gave a valid response to any component of income were included in this calculation of income—this includes respondents giving valid amounts for income from employment, self-employment, state benefits, dividends, among other sources (see Table 2). This method of calculating income includes reports from partners on income where appropriate. No checks are imposed for the validity of the data including checks for logical inconsistencies; this calculation of income instead is indicative of the potential sample size should imputation techniques be employed.
4.
Income collected from multiple questions, excluding incomes from respondents with any missing component parts and logical inconsistencies in benefits and other information (calculation 4). This calculation of income represents an aggregate sum from all components of income as reported above, with sums multiplied to an annual amount. This calculation includes only those cases who indicated whether they received a given source of income or not, the amount from that source of income, as well as the frequency of payment for each potential component included in the calculation (see Table 2). Logical consistency checks were also imposed to filter out inconsistencies in the data. For example, those whose gross income is less than their net income; couples with a valid main but no valid partner report; those who claim certain benefits but give no amount; and those who give inflated estimates of benefits (which in some cases, for example child benefit, are not means tested and paid in universal amounts); were excluded from this calculation whereas they were included in the above calculation. The effect of filters in depleting sample size of income derived from using multiple questions poses one of the main disadvantages of collecting income information in this way.
5.
Income collected from multiple questions, excluding incomes from respondents with any missing component parts and logical inconsistencies in benefits and other information, but including information from brackets data (calculation 5). This calculation is derived in the same way as calculation 4 above. However, we boost the number of valid responses by using information derived from a series of follow up questions referred to as ‘unfolding brackets’. Respondents who are unable, or refuse, to give an exact answer are asked a series of follow up questions designed to elicit a minimum and maximum number defining a range within which the value lies. We take the midpoint of this bracket.

Reassuringly, examining Table 3 shows that the different calculations of income produce similar results in terms of the mean value of income. They vary from the lowest, £29,329 (using calculation 3) to the highest, £32,936 (using calculation 5). However, the sample sizes do vary across calculation methods—from over 12,250 using the single question to under 6,900 using multiple questions after the elimination of missing elements and logical inconsistencies. When we apply listwise deletion in the bottom part of Table 3, we find that many of the differences are caused by variations in the sample composition—the median and mean values now lie within £1,000 of each other regardless of the definition of income used. However, there remain considerable differences in the value of the lowest quartile which is approximately £3,000 higher using the multiple questions as opposed to the single question. Consequently, the 90:10 index of inequality is substantially lower using the multiple questions as opposed to the single question. Using the brackets increases the sample size by almost 150, and makes a moderate difference to the upper quartile and mean. The correlation coefficient of 0.63 between the income derived from multiple questions and from the single question indicates a strong, but imperfect, correlation between measurement tools.

Table 3 Descriptive information for differing calculation methods of income

Full size table

3.2 Does the Difference Between Using Single Question Banded Income Data and Multiple Questions for Measuring Income Vary by Sub-Group or Period of Measurement?

Having charted the differences seen in the income data by using the separate calculations of income we now want to examine variations in this pattern. We are particularly interested in whether income, measured in these different ways, varies across sub-groups of the population. We do this because different groups in the population are more or less vulnerable than others, have more or less income than others and vary in their degree of need and their likelihood of experiencing poverty. It may be that some methods of recording income work better for some groups rather than others. For this reason we compare the descriptive statistics using calculation methods 2 (taking the single question using the mid-point of the intervals) and 5 (using multiple questions with restrictions to filter out inconsistencies but including information from brackets). We examine these calculation methods for different groups including: single parents versus those in couples; workless families compared to families where at least 1 person works; those reliant on state benefits versus households who do not claim any means tested benefits; those self-employed and those not self-employed; part-time workers (defined as those working less than 30 hours per week) compared to full-time workers; and variations across different ethnic groups.^{Footnote 14} We also compare the similarity in estimates by unit of pay and period of pay. All these variations are shown in Fig. 1).

We can see the sub-groups of the population that might be considered more vulnerable such as those on means tested benefits, part-time workers, non-whites and the self-employed are more likely to record higher incomes through the multiple questions than the single question measurement. The difference is largest for couple households where both are in self-employment or lone parent families headed by a parent in self-employment, where the multiple questions appear to capture a substantially higher mean level of income (approximately £46,000) than data from the single banded question (approximately £32,500), a discrepancy of over £13,000.

When the period of reporting (defined as 1 week, 4 weeks, calendar month or annual) is examined the results show that higher values of income are recorded through the multiple questions than the single question when respondents report their income annually or in a 4 week period. But respondents who report their income by calendar month report higher incomes using the single question.

So far we have examined differences in income recorded by respondents using a range of definitions based either or the single banded income question or the multiple questions by comparing the income at various points of the distribution. An alternate way to illuminate differences between income measurements is to apply the upper and lower limits of the twenty income bands that are used in the single question on to the continuous data from the multiple questions, so that the data from both calculation methods are now grouped according to the same limits, and examine the differences. Figure 2 shows the difference between the income distributions from the single question (using calculation 2) and multiple questions (calculation 5), expressed as the difference in number of income bands. A band in this case represents an absolute range of income (see Table 2).^{Footnote 15} A negative value indicates that the income band derived from the multiple questions is lower than that derived from the single questions. In total 29% of the cohort fell into the same bracket using a single question, 26% had a lower income band using the multiple questions and 44% had a higher income band using the multiple questions. In a substantial number of cases (19%), the difference was greater than two bands, equating to a potential difference of £48,000 or more in some cases.

We have examined descriptively differences in income recorded by respondents using a range of definitions. Results have shown that there are non-negligible variations between the income measures for certain sub-groups of the population and by the unit of time respondents record their income over. However, as many of the factors that are examined may overlap, we explore these relationships more robustly using a multinomial probit regression model (presented in Table 4).^{Footnote 16} Our model examines the relative probability of reporting a higher income band or lower income band using the multiple questions compared to reporting the same band using both the multiple and single questions (the latter being the baseline category). The process is similar to constructing a (binary) probit regression model, although as we have a nominal variable, as opposed to a binary, the models for predicting the probability of ‘higher band versus the same’ and ‘lower band versus the same’ are run simultaneously, with the added constraint that resulting predicted probabilities sum to one across all categories. In our model, we enter all of the factors contained in Fig. 1 as predictors and present, in Table 4, the relative magnitude of our predictors in determining the probability of achieving, for example, a higher income band with the multiple questions than a single question, compared to the probability of achieving the same or lower band, respectively.

Table 4 Multinomial probit regression showing estimation of income using single versus multiple questions

Full size table

The results confirm earlier descriptive analyses that those with more diverse sources of income are more likely to report higher incomes using the multiple income questions. Those in self-employment, part-time employment and those in receipt of means tested benefits are significantly more likely to report higher incomes using the multiple questions. Net of other factors, lone parents are more likely to report the same income across measurements. Ethnicity is generally not statistically significantly predictive of response patterns, except that in comparison to white families, black families are less likely to report income within the same bracket across measurements.^{Footnote 17}

3.3 What Implications does Using a Banded Single Question Versus Detailed Questions have for Measuring Child Poverty?

Looking at the differences in reported income according to measurement method allows us to gain insight into which groups are most likely to give inconsistent income estimates. Nevertheless, we remain unable to comment on which of these methods is likely to provide the most accurate estimate.^{Footnote 18} However, due to the extensive range of information collected in the MCS we are able to look at how the different measures of income are associated with different financial wellbeing measures. In the first instance we examine how the classification of families in poverty in the MCS varies according to measurement instrument. Furthermore, we speculate on the accuracy of those classified as living in poverty (or not) by comparing our results with other, more subjective measures of financial wellbeing collected in the MCS.

We classify households as living in poverty based on a similar definition as the European Commission (2010) definition, a relative classification of poverty in which equivalised household incomes that fall below 60% of median income are classified as being in poverty. Our definition approximates the DWP practice of comparing net income to the median using a modified version of the OECD equivalised scales and focusing on families rather than the household. This resembles the procedure adopted by Ketende and Joshi (2008) but here we use a separate median threshold for households who are in receipt of housing benefits for whom housing costs are not included in our family income estimates (£206pw), compared to those who are not in receipt of housing benefit (£244pw) (Adams et al. 2010).^{Footnote 19}

The analysis continues to compare banded data and continuous data, although if we were to use the banded data as the basis for our poverty cut-off point, we risk significantly underestimating or overestimating the proportion in poverty by ignoring the within band distribution^{Footnote 20} (for example up to 10% of the cohort fall within the same income band). In order to obtain a more accurate estimation of the numbers classed as ‘in poverty’ from the banded data, we use interval regression to obtain predicted values. Interval regression is used to model banded data, and holds an advantage over OLS when estimating the distributions at the bottom and top of the interval (Ketende and Joshi 2008; Micklewright and Schnepf 2010). Interval regression also has advantages where the bottom or top thresholds of the interval are unknown (or censored), as in the case of our single income variable.

We construct an interval regression model using a number of predictors (region and sample stratum, worklessness, self-employment, part-time working, educational level (of main respondent), age (of main respondent), family structure, ethnicity (of child), unit of payment of main component of income, housing tenure and receipt of means tested benefits) to obtain predicted values of income. As the predicted values reflect the choice of predictors used, to facilitate comparison, we compare the predicted results from the interval regression with the predicted results from an OLS model that uses the continuous income from the battery of questions with brackets (calculation 5 earlier).^{Footnote 21} In addition, we also compare the impact of using the observed continuous income (calculation 5 earlier), and for reference the banded single question data using mid-points (calculation 2 earlier), on estimates of poverty. We equivalise our four income values using OECD scales (Adams et al. 2010) and present the results in Table 5.

Table 5 Per cent in poverty by different measurement tools and derivations of income

Full size table

By comparing the predicted values from the two different data sources in the models constructed, we see that a greater number of families would be classed as being in poverty using the data from the single question compared to the multiple questions (looking at either predicted or observed values). Essentially, the higher level of income estimated using data from the multiple questions leads to a more conservative estimate of poverty.^{Footnote 22}

Due to the range of alternative measures of financial wellbeing available in the MCS we can additionally compare how well our measures of poverty (using the different calculation methods of income) correspond to measures of poverty defined in more subjective ways. We do this in Table 6 which shows the positive predictive power of our poverty definitions in detecting other indicators of financial deprivation: free school meal eligibility and uptake (FSM), reports of financial wellbeing (financially ‘just about getting by’ or ‘finding it difficult’), receipt of housing benefit, and all three indicators combined (termed indicator of deprivation).^{Footnote 23} This is a simple measure which identifies the percentage of people who are identified as being in poverty using our income measures who are predicted to be in poverty using the alternative measures of financial wellbeing.

Table 6 The positive predictive values of poverty defined using the different income measures against poverty defined using other measures of financial wellbeing

Full size table

The results show the definitions of poverty derived from multiple questions have higher positive predictive values across measures of financial wellbeing than definitions of poverty derived from a single question. In other words, those we define as being in the poverty category from the multiple questions have a higher likelihood of being in a deprived category on another indicator of financial wellbeing, than poverty definitions derived from the single income question.

While our analysis is highly dependent on our choice of indicators of financial well-being, the evidence suggests that income measured through a single question may be overestimated, and as such, the higher levels of poverty estimated through a single question may not necessarily be corroborated by other measures of financial wellbeing. However, we do exercise caution in drawing our conclusions as the definition of poverty does not correlate perfectly with other indicators of financial wellbeing—for example in the latest official records, 50% of households with children defined as being ‘in income poverty’ (before housing costs) were not in receipt of housing benefit and 13% were not in receipt of any means tested benefits (Adams et al. 2010). Furthermore, a large body of literature exists that cautions against treating measures of income poverty and other measures of deprivation synonymously (for example Whelan et al. 2004; Saunders and Adelman 2006).

4 Summary

Family income is measured in a variety of different ways across national surveys impeding comparability between sources. Here, we examined differences between two of the main distinctions in measurement methods—whether to use a single variable with a choice of intervals or multiple questions examining each portion of income in detail. Each has its own merits. Researchers wishing to examine certain portions of income will need to use information from multiple questions. However, for many studies, researchers may not need the additional detail, and respondent time may be better spent addressing other questions. Additionally, our results show that using multiple questions is associated with a substantially lower response rate especially when safeguards to ensure logical consistency are imposed.

Given that income represents a key variable of interest in many studies, obtaining accurate measurements is of great importance, and we demonstrate that in the case of the MCS, the collection of income is associated with differential estimates of income and measurement error. If the variable in question is used as a dependent variable in a regression and if the measurement error is ‘classical’, i.e. a random error, then the regression coefficients are unbiased, albeit with larger standard errors. But the error may not be ‘classical’, that is, it may be systematically related to the true unobserved value. Or, if used as an explanatory variable, the error even if classical causes ‘attenuation bias’, i.e. the coefficient is biased downwards and may not fully account for confounding effect of income on other variables (for example Bound et al. 2000).

Our analyses of the differences in the estimate of income by measurement method suggest that the measurement error may be unequally distributed among socioeconomic and socio-demographic groups. Use of the multiple questions generally results in a higher estimate of family income than using a single question. This is particularly true for certain groups of the population—those on means tested benefits, those in self-employment and those in part-time work and on low incomes. This could indicate systematic bias introduced by one or other measurement instruments. As the battery of income questions result in higher income estimates, particularly for those in self-employment and on low incomes, it could be assumed that the results from the battery of questions provide the most accurate answer. While it is only through external verification that such an assumption could be qualified, we do explore this further looking at the definition of poverty. The results show that using a single question generally produces a higher proportion of families who could be classified as living in poverty. These results concur with a number of other studies, where the use of a single question was found to overestimate the rate of poverty (Davern et al. 2005). In the MCS when examining other measures of financial deprivation, the more conservative estimate of poverty using the multiple questions shows greater concurrence with the other measures of financial wellbeing as indicated by higher positive predictive power. The evidence suggests that use of multiple questions to collect income information, as opposed to a single question, may reduce the possibility of measurement error among lower income groups and those with diverse income sources.

Notes

In the developing world, expenditure is regarded as a clearer indicator of socioeconomic status (for example Van de Poel et al. 2008).
Although Britain varies somewhat in the high prevalence of ‘current’ measures of income (the amount of income last received, reported in a variety of units) as opposed to annual (Boheim and Jenkins 2006).
Although this depends on the number and width of bands, and the density of income within bands.
Elements excluded from this measure of income were irregular payments such as lottery winnings, inheritance or retirement and redundancy pay outs.
Although according to Hurd et al. (2003) there is very little literature on these effects.
This only included sweeps 1 and 2 which did not include questions on the amount of state benefits received.
Although the focus of this paper is on data collected at age 7, Hansen and Kneale (2011) examine trends between the third sweep in 2006 (MCS3) collected at age 5 and the age 7 sweep collected in 2008 (MCS4). They show consistency in the reporting of income over the different sweeps. The majority of respondents who reported higher incomes using the multiple questions at MCS4 also did so at MCS3.
Although this varies from sweep to sweep [for more detailed information see Hansen and Kneale (2011)].
Our measure is family rather than household income as it refers only to the parental unit of the child and any dependent children in the household who are siblings of the cohort member (biological, adopted, step or foster), excluding other adults. MCS collects employment information for all household members over the age of 15. As such, it may be questionable to classify 16–18-year-olds as dependent if they are employed. However, as the employment status theoretically treats any paid employment, from a paper round upwards, as being ‘in employment’, then we treat any 16–18-year-old who is a sibling of the cohort member as dependent member of the household because of this ambiguity.
Although in the case of investments, dividends and studentships, these may not necessarily be paid on a monthly basis to respondents.
A different set of income bands was given to main respondents in couples and those who were lone parents (Centre for Longitudinal Studies 2009).
Once logical inconsistencies and missing data were excluded from the data.
For the top brackets (which are not closed, accounting for 0.4% of lone parents and 2.0% of couples), we selected the mean value from the continuous income for those whose selected the top bracket and whose income also fell into the top bracket. For the lowest bracket (0.8% lone parents and 0.4% of couples), we selected the mid-point.
Based on the ethnic group of the child.
Note, income bands varied in width ranging from the smallest for lone parents of £1,000 at the bottom of the income distribution, to the open bands for families at the top of the income distribution.
We initially tested a multinomial logistic regression model but were unable to satisfy the Independence of Irrelevant Alternatives test using the Hausman test.
The negative coefficient on the workless family variable being in the opposite direction to the coefficient on families claiming means-tested benefits is an unexpected result. For this reason we also ran the same regression as Model A but this time included an interaction term in an attempt to illuminate this issue but the interaction term was not significant. This was further explored by examining the predicted probabilities for each variation of worklessness and means tested benefits. The results (not shown here) indicate that those families in receipt of means tested benefits, but where at least one partner worked, are particularly likely to record a higher income using the multiple questions (52%), but those who were workless and do not claim means tested benefit are among the least likely to (27%) and to report a lower band using the multiple questions (56%). Although this analysis compares the cleaned (banded) family income variable from the multiple questions with the single question variable, the latter result for workless families not claiming means tested benefits could suggest a residual effect of underreporting of benefits income. Nevertheless, clearly these results demonstrate that the consistency in reporting income between measurement instruments is very much dependent on socioeconomic characteristics.
This will be possible if MCS data are matched to administrative records on income but this has not been done to date. We did carry out analysis which compare MCS income data to income data recorded in the FRS. We also considered the validity of our income estimates using other sources. We identified a number of possible comparisons (Table 1) although were unable to find a suitable match. We identified the closest match as the Family Resources Survey (FRS) collected by the Department of Work and Pensions, and selected only those families with a child aged 6–8 years. However, even after weighting, the FRS average household income estimate was much higher at £40,863 than any estimate for the MCS (there were similar discrepancies for the median and quartile values). Likely reasons for this discrepancy include the estimation of housing benefit in FRS incomes and the more detailed collection of incomes from ‘other’ sources. However, this does not necessarily affect the generalizability of our results to other surveys, as our focus is on within survey differences in income by measurement instruments.
This innovation allows for income from housing benefits not being included in our family income measure.
We estimate regressions using the log of income to reflect the shape of the within-band distribution within the critical band.
As we are only interested in the predicted values and not in the effect of the covariates, we do not present the full output.
All our estimates of poverty are also lower compared to other estimates in the literature because of our treatment of those in receipt of housing benefits, our calculation of the OECD equivalisation factor, and our choice of income predictors (Ketende and Joshi 2008). In addition, we make no correction for non-response on our income variables here, which may bias the sample composition. As is the case elsewhere in the literature, the results for income measured through a single question using either the band mid-point or predicted income from interval regression are almost identical (Ketende and Joshi 2008).
Families are eligible for free school meals and housing benefit if they are in receipt of unemployment benefits or low income benefits. Free school meals are a lunchtime meal provided to children on school days. Housing benefit is given to families to assist with accommodation rental costs; both are used in the UK as indicators of poverty.

References

Adams, N., Barton, A., Bray, S., Johnson, G., & Matejic, P. (2010). Households below average income: An analysis of the income distribution 1994/95–2008/09. London: Department for Work and Pensions.
Google Scholar
Atkinson, A. B., & Micklewright, J. (1983). On the reliability of income data in the Family Expenditure Survey 1970–1977. Journal of the Royal Statistical Society Series A, 146(1), 33–61.
Article Google Scholar
Becker, I., Frick, J. R., Grabka, M. M., Hauser, R., Krause, P., & Wagner, G. G. (2003). A comparison of the main household income surveys for Germany: EVS and SOEP. In R. Hauser & I. Becker (Eds.), Reporting on the income distribution and poverty: Perspectives from a German and European point of view. Berlin: Springer.
Google Scholar
Biancotti, C., D’Alessio, G., & Neri, A. (2008). Measurement error in the Bank of Italy’s Survey of household income and wealth. Review of Income and Wealth, 54(3), 466–493.
Article Google Scholar
Boheim, R., & Jenkins, S. P. (2006). A comparison of current and annual measures of income in the British Household Panel Survey. Journal of Official Statistics, 22(4), 733–758.
Google Scholar
Bound, J., Brown, C., & Mathiowetz, N. (2000). Measurement error in survey data: Report No. 00-450. Ann Arbor, MI: Population Studies Center, University of Michigan.
Burchell, B., & Marsh, C. (1992). The effect of questionnaire length on survey response. Quality & Quantity, 26, 233–244.
Article Google Scholar
Centre for Longitudinal Studies (2009). Millennium cohort study sweep 4: Questionnaire documentation. http://www.cls.ioe.ac.uk/core/documents/download.asp?id=1235andlog_stat=1.
Clarke, P. M., Fiebig, D. G., & Gerdtham, U.-G. (2008). Optimal recall length in survey design. Journal of Health Economics, 27(5), 1275–1284.
Article Google Scholar
Collins, D., & White, A. (1996). In search of an income question for the 2001 census. Survey Methodology Bulletin, 39(7), 2–10.
Google Scholar
Davern, M., Rodin, H., Beebe, T., & Thiede Call, K. (2005). The effect of income question design in health surveys on family income, poverty and eligibility estimates. HSR: Health Services Research, 40(5), 1534–1552.
Article Google Scholar
Drewinka, S. (2010). Cross-sectional variation in individuals’ earnings instability. Review of Income and Wealth, 56(2), 291–326.
Article Google Scholar
Easterlin, R. A. (2001). Income and happiness: Towards a unified theory. The Economic Journal, 111(473), 465–484.
Article Google Scholar
European Commission. (2010). Income poverty and material deprivation in European countries. Luxembourg: Publications Office of the European Union.
Google Scholar
Francesconi, M., Sutherland, H., & Zantomio, F. (2009). A comparison of earnings measures from longitudinal and cross-sectional surveys: Evidence from the UK. Colchester: Institute for Economic and Social Research.
Google Scholar
Gordon, D. (1998). Measuring income in the Breadline Britain 1998 survey. In J. Bradshaw, D. Gordon, R. Levitas, S. Middleton, C. Pantazis, S. Payne, & P. Townsend (Eds.), Perceptions of poverty and social exclusion 1998. Bristol: Townsend Centre for International Poverty Research, University of Bristol.
Google Scholar
Hansen, K., & Kneale, D. (2011). Measuring income in the millennium cohort study: Bands, units and groups—does it make a difference? CLS Working Paper. London: Centre for Longitudinal Studies, Institute of Education.
Hansen, K., & Machin, S. (2002). Spatial crime patterns and the introduction of the UK Minimum Wage. Oxford Bulletin of Economics and Statistics, 64(Supplement), 677–697.
Article Google Scholar
Hawkes, D., & Plewis, I. (2008). Missing income data in the millennium cohort study. London: Centre for Longitudinal Studies, Institute of Education.
Google Scholar
Hicks, J. R. (1939). Value and capital: An inquiry into some fundamental principles of economic theory. Oxford: Clarendon Press.
Google Scholar
Hurd, M. D., Juster, F. T., & Smith, J. P. (2003). Enhancing the quality of data on income. The Journal of Human Resources, 38(3), 758–772.
Article Google Scholar
Ketende, S. C., & Joshi, H. (2008). Income and poverty. In K. Hansen & H. Joshi (Eds.), Millennium cohort study third survey: A user’s guide to initial findings. London: Centre for Longitudinal Studies, Institute of Education.
Google Scholar
Lynch, J. W., Smith, G. D., & Kaplan, G. A. (2000). Income inequality and mortality: Importance to health of individual income, psychosocial environment, or material conditions. British Medical Journal, 320, 1200–1204.
Article Google Scholar
Mayer, S. E. (1997). What money can’t buy: Family income and children’s life chances. Cambridge, Massachusetts: Harvard University Press.
Google Scholar
Micklewright, J., & Schnepf, S. (2010). How reliable are income data collected with a single question? Journal of the Royal Statistical Society Series A, 173(2), 409–429.
Google Scholar
Moore, J. C., Stinson, L. L., & Welniack, E. J. J. (2000). Income measurement error in surveys: A review. Journal of Official Statistics, 16(4), 331–362.
Google Scholar
Saunders, P., & Adelman, L. (2006). Income poverty, deprivation and exclusion: A comparative study of Australia and Britain. Journal of Social Policy, 35(4), 559–584.
Article Google Scholar
Schrapler, J.-P. (2006). Explaining income non-response—a case study by means of the British Household Panel Study (BHPS). Quantity and Quality, 40, 1013–1036.
Article Google Scholar
The Canberra Group. (2001). Final report and recommendations. Ottawa: The Canberra Group, Expert Group on Household Income Statistics.
Van de Poel, E., Hosseinpoor, A. R., Speybroek, N., Van Ourti, T., & Vega, J. (2008). Socioeconomic inequality in malnutrition in developing countries. Bulletin of the World Health Organisation, 86(4), 241–320.
Google Scholar
Whelan, C. T., Layte, R., & Maître, B. (2004). Understanding the mismatch between income poverty and deprivation: A dynamic comparative analysis. European Sociological Review, 20(4), 287–302.
Article Google Scholar
Wilkinson, R. G. (1992). Income distribution and life expectancy. British Medical Journal, 304, 165–168.
Article Google Scholar

Download references

Acknowledgments

The authors would like to thank John Micklewright for early discussions about the issues covered in this paper and to he and Heather Joshi for comments on an earlier draft of this paper. Thanks too to the anonymous referees and the Editor of Social Indicators Research for their useful comments and suggestions.

Author information

Authors and Affiliations

Centre for Longitudinal Studies, Department of Quantitative Social Science, Institute of Education, University of London, 20 Bedford Way, London, WC1H 0AL, UK
Kirstine Hansen & Dylan Kneale
International Longevity Centre UK, 11 Tufton Street, Westminster, London, SW1P 3QB, UK
Dylan Kneale

Authors

Kirstine Hansen
View author publications
You can also search for this author in PubMed Google Scholar
Dylan Kneale
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dylan Kneale.

Additional information

Kirstine Hansen and Dylan Kneale are joint first authors.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hansen, K., Kneale, D. Does How You Measure Income Make a Difference to Measuring Poverty? Evidence from the UK. Soc Indic Res 110, 1119–1140 (2013). https://doi.org/10.1007/s11205-011-9976-5

Download citation

Accepted: 05 December 2011
Published: 06 January 2012
Issue Date: February 2013
DOI: https://doi.org/10.1007/s11205-011-9976-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Does How You Measure Income Make a Difference to Measuring Poverty? Evidence from the UK

Abstract