1 Introduction

Due to the COVID-19 pandemic, stringent control measures were implemented globally in 2019 end (Mahato & Pal, 2022). To contain and delay the spread of the COVID-19 pandemic, various countries have taken several measures, such as canceling large public events, travel restrictions, social distancing and mandatory face masks in public places. These measures have been effective in slowing the spread of the virus among people and in controlling pollution and emissions from human activities. The data of China Statistical Yearbook from 2019 to 2021, the concentration of PM2.5, PM10 and other air pollutants decreased year by year (Zeng & Bao, 2021), with the average concentration decreasing by 20% and 10%, which showed significant changes in air quality compared with before the COVID-19 pandemic (Mostafa et al., 2021). More stringent curfews have been imposed in Thailand in response to the COVID-19 pandemic, nitrogen dioxide level in 2020 declined by 9.1% and 9.8% for the whole of Thailand and Bangkok metropolitan area (Oo et al., 2021). During the COVID-19 pandemic, a nationwide lockdown was imposed in India, pollution levels in cities dropped sharply and air quality improved markedly. Concentrations of PM10 and PM2.5 have witnessed a maximum reduction (> 50%) in comparison with the pre-lockdown phase (i.e., 2019) and the reduction about 60% and 39% (Mahato et al., 2020). The lockdown was considered an effective measure to control air pollution. It can reduce health risks from epidemics and improve air quality. The change in air quality before and after the COVID-19 lockdown has made people realize that air pollution has a large negative impact on daily life (Bang & Khadakkar, 2020), it will prompt a rethinking of past attitudes and perceptions toward improving air quality.

Recent studies have proved that individual preferences are temporally stable (De Andrés Calle et al., 2020; Hynes et al., 2021a). Wunsch et al. (2022) tested the reliability of individual preferences and welfare estimates of coastal climate change over the course of the COVID-19 pandemic and found that preferences were stable in the context of the COVID-19 pandemic. Hynes et al. (2021b) found that both preferences and WTP remained relatively stable in the face of major public health crises and economic turmoil. Sereenonchai et al. (2020) indicated that increased WTP for self-protection of urban respondents result from severe haze experience. Another research on preference stability has focused on the extent to which individuals respond differently to choice tasks within the same set of tasks. Hoeffler and Ariely (1999) investigated the impact of several dimensions of experience (effort, choice, and experience) on preference stability. The results reveal that the type of experience and its corresponding effort have a large impact on the process of preference development. Duersch et al. (2017) analyzed the stability of preferences experimentally, by repeatedly eliciting ambiguous attitudes toward multiple 3-color Ellsberg urns over a period of two months. Schaafsma et al. (2014) tests the temporal stability of stated preferences and willingness to pay values from a choice experiment in a test–retest. The results show that the estimated preference and scale parameters are different, indicating that the choice has changed between the two surveys. Few studies explore the stability of preferences for air quality improvement, especially the change in public preferences and willingness to pay for air quality improvement in the context of a major crisis. This study attempts to fill this gap and explore changes in public preference and willingness to pay for air quality improvement before and after the COVID-19 pandemic.

As air is a public good, its value cannot be directly assessed by market. The stated preference method can be used to obtain the public preference degree and assess the economic value of public goods (González et al., 2019). Previous studies on the stability measurement of environmental goods preferences mainly focus on the conditional valuation (CV). Gregory (1986) estimated the temporal reliability of learning in the range of CVM. Skourtos et al. (2010) summarized 20 studies of temporal stability based on CVM, noting significant differences in the time periods studied (from 2 weeks to 20 years) but mostly finding support for stability within 5 years. There were also some studies using CE retest procedures to examine the temporal stability of preferences (Bliem et al., 2012), and find the stability between different researchers and samples (Lew & Wallmo, 2017). Compared to CVM, the choice experiment involves hypothetical scenarios where respondents need to make trade-offs between multiple attributes and reveal their preferences and valuations of these attributes (Birol et al., 2006). Given that air is characterized by multiple properties and levels (Tang & Zhang, 2016), it is of particular interest to study changes in public preferences for air quality improvement based on CE. Hoeffler and Ariely (1999) found that respondents construct their preferences in different contexts and adjust their decision-making behavior over time and these preferences change over time. To reflect the changing preferences of respondents, we used a choice experiment to test the stability of public air quality improvement preferences and WTP in the wake of the onslaught of a global crisis. With two choice experiment questionnaires, which reflecting changes in public air quality improvement preferences and willingness to pay, on samples from the same region, decision-makers identified the importance of public’s attribute and decision-making preference motives when making decisions.

Some researchers confirmed the temporal stability of preferences in CE, but they frequently ignored the role of complex psychological factors, such as environmental attitudes (Hoyos et al., 2015), psychological distance (Singh et al., 2017), and risk perceptions (Roder et al., 2019), and failed to analyze the impact of psychological factors during the COVID-19 pandemic and whether it affects the stability of individual preferences. The existing studies explore the public’s risk perceptions on air pollution (Hodgson & Hitchings, 2018), and how these perceptions influence their behavior in response to the environment change (Wang & Watanabe, 2019). There are few studies to incorporate the public’s risk perceptions in special cases into econometric models to explain temporal stability of preferences. Especially, people collect information about psychological factors in the environment without relying on psychometric scales in particular environmental contexts. In this study, the LCM was used to capture preference heterogeneity and its sources of individual characteristics, and the psychological factors of risk perception were incorporated into the model. Comparison of the parameter estimation results of the LCM after the COVID-19 pandemic and before the COVID-19 pandemic, and analysis of the mechanisms affecting the change in the preferences of public heterogeneous groups. Psychological factors taken into account in the COVID-19 pandemic will enrich the behavioral features of preferences, understand individual preference heterogeneity, and improve the interpretation of valuation models.

In this paper, we conduct using the same questionnaire and choice set at two different time points before and after the COVID-19 pandemic to test the change of public preferences and WTP for air quality improvement, and compare the impact of risk perception on the heterogeneity of public preferences before and after the COVID-19 pandemic by constructing a latent class model. This research enriches the non-market value evaluation and preference motivation research in the air quality, specifically include (1) explore the change of public preference heterogeneity and willingness to pay for air quality improvement under the influence of the COVID-19 pandemic; (2) explore the changes in sources of preference heterogeneity before and after the COVID-19 pandemic, especially how public’s risk perceptions affect their preference for air quality improvement. Previous studies on air quality improvement preference were limited to one-time choice experiment, without considering the impact of environmental changes on public preferences and willingness to pay. The results of this study can be incorporated into environmental protection data. By learning from current experiences, management systems and policies, policymakers consider reducing the level of air pollutants and increasing the research and development of clean energy technologies, it is of great significance to change the traditional government-led regulatory air pollution control model and promote the multi-party cooperative environmental governance system of the government. Especially, integrating time-varying environment with existing policies will help enhance public satisfaction and acceptance of air quality improvement policies in a specific period.

The rest of this paper is arranged as follows. Section 2 elaborates on the properties and levels of the choice experiment, the experimental design of CE, the generated questionnaire and data collection. Section 3 briefly introduces the Random utility model and the latent class model approach. Section 4 presents the model results, including reliability and validity tests for the risk-aware construction, preference heterogeneity analysis, and stability tests for the WTP valuation. Section 5 discusses the broader implications of the results. Section 6 presents the conclusions, policy recommendations, directions for future research and research limitations.

2 Choice experiment and data collection

2.1 Choice experiment

The fundamental purpose of the choice experiment is to elicit public preferences for diverse air quality improvement policies. To understand the preferences and support for air quality improvement policies of the respondent, the alternatives were described by five attributes: clean air days, hazy days, mortality, policy delay and annual payment. The selection of these attributes and their levels are based on previous research, consultations with scientists and government officials, and a pilot test (Mao et al., 2020a). A summary description of the above attributes and their levels is shown in Table 1.

Table 1 Attributes and levels used in the CE

After determining the attributes and levels used in the experiment, there are 405 (34 × 5) alternatives combined by different attributes and their levels. It is cumbersome and cost prohibitive for respondents to compare and select from so many alternatives. The D-efficient fractional factorial principal effect design is applied to reduce the number of alternatives. A total of 15 choice sets are generated. These choice sets are then randomly allocated into five blocks, each of which is assigned three choice sets, the choice set consists of a status quo option and two alternatives presented in Table 2. Respondent will be assigned a task with three choice sets and then choose one of the three options in the given choice set as their preferred option.

Table 2 Sample choice set

2.2 Survey and data collection

This study chooses the city of Harbin, the capital of Heilongjiang province, with an area of 10,192 km2 in northeastern China, as the study area. By the end of 2020, the city will have 5.53 M permanent urban residents. It is an important economic and cultural center in Northeast China and manufacturing base for the country. The “2020 Statistical Yearbook of Harbin” issued by the Harbin Municipal People's Government, the average PM2.5 concentration in Harbin was 53 μg/m3, higher than the 10 μg/m3 recommended by the World Health Organization (WHO) (http://www.harbin.gov.cn). Harbin was chosen as an empirical study area due to its representative and typical characteristics. The urban area of Harbin mainly covers 9 districts, and six urban districts located in the inner city of Harbin are selected as the research area, which contains most of the city's resident population (i.e., Songbei District, Daoli District, Nangang District, Pingfang District, Xiangfang District and Daowai District).

Based on the feedback from a pre-survey of 100 questionnaires conducted in September 2019 and conducted extensive consultations with government officials and experts in environmental science, we corrected inaccurate statements and questions to ensure the efficiency and operation of the survey. The official survey collected data from six urban districts in Harbin in October 2019 and October 2021 to gather information on the stability of public air quality preferences before and after the COVID-19 pandemic. Different samples were randomly interviewed in the same survey area, the researchers were able to answer the respondents' questions to obtain direct information feedback and improve the effectiveness of the questionnaire.

The final questionnaire was divided into four parts: the first part includes the topic of the survey, the status and hazards of air pollution. The second part encompasses the questions related to the public’s risk perception, including the degree of air pollution control (Risk controllability, RC), perceived risk effect of air pollution (Perceived risk effect, PRE), what extent he/she trust in the air quality improvement policies launched by the government (Government trust, GR) and acceptability of air pollution effect (Acceptability pollution, AP). Details of the constructions involved in the questionnaire and their measurement metrics are reported in Table 3. The third part contains three choice sets generated by the choice experiment approach to elicit individuals’ preferences and WTP. The final part of the questionnaire covers the socio-economic status of the respondents, including gender, age, education, income, number of children, number of cars, etc.

Table 3 Reliability and validity tests for the measurement model

3 Methodology

3.1 Random utility model

Choice experiment, as an important method to evaluate the economic value of non-market goods or services, is based on the random utility theory (Lancaster, 1966) and consumer theory in economics. The utility that an individual obtains from a commodity or service depends on the characteristics (i.e., attributes and their levels) of the commodity or service, it will be weighed by comparing the differences among its multiple attributes, the change of any attribute level may lead to the change of individual’s preference.

According to the random utility theory, the utility (\({U}_{nij}\)) that individual \(n\) chooses alternative \(i\) from the choice set \(J\) can be divided into a deterministic element, \({V}_{nij}\), and a stochastic element, \({\varepsilon }_{nij}\).

$${U}_{nij}={V}_{nij}+{\varepsilon }_{nij}$$
(1)

For a given choice set \(J\), the probability that respondent \(n\) chooses alternative \(i\) but not alternative \(j\) is given as follows:

$$P_{nij} = \exp^{{V_{nij} }} /\mathop \sum \limits_{j = 1}^{J} \exp^{{V_{nij} }}$$
(2)

3.2 Latent class model

Based on the responses from CE surveys, multiple discrete choice models can be employed to analyze public preferences. The most widely used is the conditional logit (CL) model, which offers an overview of the average preferences (Andreopoulos et al., 2015). The CL model assumes that the preferences are homogenous across respondents, and alternatives are independent and uncorrelated (Xie, 2010). This assumption has been challenged since the fact that individuals’ preferences are heterogeneous, particularly in complex environmental problems (Chen & Ting Cho, 2019). To overcome the limitations of the CL model and illustrate motivations of behavior and preferences among individuals or groups, the latent class model has been developed (Greene & Hensher, 2003). In contrast to the CL model, the LCM was a semi-parametric model, it does not require the a priori selection of individual-specific characteristics (Boxall & Adamowicz, 2002). In the LCM, the sample consists of a finite number of identifiable classes. These classes differ substantially in their preference structure, each class is characterized by relatively homogeneous preferences. The LCM can estimate the probability of individual choice in each class and explain the formation mechanism of preference heterogeneity (Meldrum, 2015).

In the LCM, an individual \(n\) belongs to a latent class \(s\). The utility function becomes:

$${U}_{nij\left|s\right.}={V}_{nij\left|s\right.}+{\varepsilon }_{nij\left|s\right.}={\beta }_{s}{X}_{nij}+{\varepsilon }_{nij\left|s\right.}\begin{array}{cc}& s=\mathrm{1,2},\cdots ,S\end{array}$$
(3)

where \({X}_{nij}\) is a vector of air quality improvement attributes. \({\beta }_{s}\) is a vector of coefficient assigned to each attribute in class \(s\). The choice probability of individual \(n\) in class \(s\) takes the following form:

$$P_{nij\left| s \right.} = \frac{{\exp \left( {\mu_{s} \beta_{s} X_{nij} } \right)}}{{\mathop \sum \nolimits_{j = 1}^{J} \exp \left( {\mu_{s} \beta_{s} X_{nij} } \right)}}$$
(4)

where \({\mu }_{s}\) is the scale parameter. Respondents are classified into one of the \(S\) latent classes according to the class membership likelihood function \(M\), which is determined by the respondent characteristics \({S}_{n}\). The membership likelihood function is given by:

$${M}_{ns}={\lambda }_{s}{Z}_{n}+{\xi }_{ns}$$
(5)

where \({\lambda }_{s}\) (s = 1, 2, …, S) is the vector of class-specific coefficients, \({Z}_{n}\) represents the characteristics of individual \(n\), including socio-economic and risk perception factors, \({\xi }_{ns}\) are the error elements. The probability that an individual \(n\) is in class \(s\) is given by the following function:

$$H_{ns} = \frac{{\exp \left( {\lambda_{s} Z_{n} } \right)}}{{\mathop \sum \nolimits_{s = 1}^{S} \exp \left( {\lambda_{s} Z_{n} } \right)}}$$
(6)

Combining conditional choice Eq. (4) with membership classification Eq. (6), the unconditional probability function is given as:

$$\begin{aligned} P_{nij} = & \mathop \sum \limits_{s = 1}^{S} P_{{_{nij\left| s \right.} }} H_{ns} \\ = & \mathop \sum \limits_{s = 1}^{S} \left[ {\frac{{\exp \left( {\mu_{s} \beta_{s} X_{nij} } \right)}}{{\mathop \sum \nolimits_{j = 1}^{J} \exp \left( {\mu_{s} \beta_{s} X_{nij} } \right)}}} \right] \times \left[ {\frac{{\exp \left( {\lambda_{s} Z_{n} } \right)}}{{\mathop \sum \nolimits_{s = 1}^{S} \exp \left( {\lambda_{s} Z_{n} } \right)}}} \right] \\ \end{aligned}$$
(7)

3.3 Willingness to pay

Based on the coefficient estimated by LCM that provides useful information on the preference heterogeneity, the value of marginal WTP change in each attribute can be calculated by the ratio of the coefficients of any interest attribute \(k\) and the negative price attribute \(p\).

$${\text{WTP}}_{k\left| s \right.} = - \frac{{\beta_{k\left| s \right.} }}{{\beta_{p\left| s \right.} }}$$
(8)

where \({\beta }_{k/s}\) and \({\beta }_{p/s}\) are the coefficients on air quality improvement and price attributes in class \(s\).

4 Results

4.1 Descriptive statistics

The formal survey was administered twice in October 2019 and October 2021 using a same choice experimental questionnaire and face-to-face interviews survey. The samples of both surveys are residents living in the six main districts of Harbin, and involved a total of 380 (2019) and 350 (2021) respondents. After screening the incomplete and non-effective questionnaires, a total of 372 (2019) and 335 (2021) questionnaires with 1116 (372 questionnaires × 3 choice sets) and 1005 (335 questionnaires × 3 choice sets) valid observations are used for further analysis.

Table 4 shows the socio-economic demographics of the two samples. The degree of similarity between the sociodemographic characteristics across both surveys can be formally tested by the Chi-square test. The last column in Table 4 shows the results of the Chi-square test: gender, age, education, children, cars and income distributions are not significantly different between the two samples. The social and economic characteristics of both samples are similar to the population in Harbin. The gender of respondents is close to the average, with about 57% (2019) and 56% (2021) women, 43% (2019) and 44% (2021) men in the sample. The largest share of respondents in both surveys had an age structure between 18 and 40 years, the age category “ > 60” years and “ < 18” years is lower. In the interview survey, the response rate of older and children is generally low based on a lack of understanding of the questionnaire and the ability to complete the questionnaire independently. The majority of respondents in both surveys have advanced education (College & above), with 65% (2019) and 68% (2021), respectively. Most respondents had a child or none, about 56% (2019) and 57% (2021) of respondents own a car. The largest share of respondents in both surveys has a monthly personal income between 2000 and 5000 CNY, with 43% (2019) and 45% (2021) income of less than 2000 CNY.

Table 4 Descriptive statistics on socio-economic characteristics of respondents

4.2 Risk perception measurement

4.2.1 Construct validity and confirmatory factor analysis

Risk perception was first used in the field of psychology. It is the embodiment of the public’s understanding, recognition and assessment of risk, including a judgment of the probability of risk occurring at a particular time and place and the possible consequences (Slovic et al., 1987). It is difficult to directly characterize and measure public perception of risk, because risk perception reflects the psychological changes that occur in the face of a risky event, it is a process by which the public experiences, understands, and reacts to a risky event. Many scholars have used psychometric models to measure people’s judgments about the riskiness of diverse hazards (Wachinger et al., 2013). Previous studies divided risk perception into several dimensions and used a five-point Likert scale to scientifically analyze how risk is perceived (Wang & Watanabe, 2019) and what factors affect the public's perception of risk (Yan et al., 2019). Based on previous studies, this study divided the psychological scale of air pollution risk perception into four structures. Perceived risk effect of air pollution indicates that people's perceptions of air pollution consequences (Mao et al., 2020b). Risk controllability indicates that the degree to risk events from air pollution is controlled (Li et al., 2022). Government trust is the public’s confidence in the government to reduce the risks posed by air pollution (Pu et al., 2019). Acceptability of pollution means the public's ability to withstand the damage caused by air pollution (Fischhoff, 1978). The estimation results are reported in Table S1.

The Cronbach’s α coefficients of latent variables show that the measured items have good internal consistency. The values for Kaiser–Meyer–Olkin (KMO) are greater than 0.7 and Bartlett’s test of sphericity (p = 0.000 < 0.01) shows suitability for the confirmatory factor analysis (CFA). The KMO and Bartlett’s test of sphericity results as seen in Table S2.

The average variance extracted (AVE) for each factor was assessed for the convergent validity. Before and after the COVID-19 pandemic, AVE values of the constructs all exceed 0.5, the marginal value suggested by Fornell and Larcker (1981). Composite reliability (CR), another indicator of convergent validity, are all higher than 0.7 for all reflective constructs. These results suggest an acceptable convergence validity. The AVE values for any two constructions are compared with the correlations calculated between these two constructions. Table 5 shows strong evidence of the effectiveness of the discriminant.

Table 5 Correlations of the constructs

The goodness-of-fit metric used to consider model validity shows an acceptable fit to the measured model. The acceptable values of GFI, CFI and NFI should be greater than 0.9. REMSA is acceptable between 0.03 and 0.08, and the χ2/df value at or less than 5 is considered a better fit (Fornell & Larcker, 1981). The results of the fit to the measured model as seen in Table S3.

4.2.2 Respondents’ risk perception scores

We construct the weights of the risk perception evaluation metrics from the above evaluation results in Table 3. To explore the hypothesized effect of risk perception, the scores of these four constructions are assigned to values of 1, 2, and 3, representing high, medium, and low levels, on a tertiary level. Sequentially, these scores are used in the next analysis of preference heterogeneity. The estimation results are presented in Table 6.

Table 6 Weights of indictors of risk perception

4.3 Preference heterogeneity

To further analyze the characteristics of preference heterogeneity among different groups and to compare with the results before the COVID-19 pandemic, this study estimated the LCM based on data from 2019 to 2021, and compared the information criteria statistics including the Bayesian information criterion (BIC), the Akaike information criterion (AIC) and the AIC with a penalty factor of 3 (AIC3) of the models with different segmentation. The results of three information criteria under different class numbers of the model as seen in Table 7. The AIC decreases as more classes are added, and the marginal reduction diminishes after the three classes are included. The BIC decreases until two classes are included, then increases as more classes are added to the model, AIC3 is minimized with the 3-class model. Of all the models, the 3-class model is suggested for our analysis.

Table 7 LCM classification

Figure 1a shows the relative importance of attributes in the three-class model in 2019. Of the respondents, 50.47% are assigned class 1, 29.16% belong to class 2, and 20.37% belong to class 3, as indicated by the size of the circle. Class 1 contains the majority of respondents who list clean air days as their preferred attribute, with other attributes having a much smaller relative importance. Respondents from this class pay more attention to the intuitive sensation of air quality and labelled as “sensation preferable.” Class 3 members have a high preference for mortality attributes and are labeled as “health preferable.” Class 2 members have a high preference for the price attribute, while having a low preference for other air quality improvement attributes, thus the Class 2 are called “price sensitive,” and consider payment as the most important determinant.

Fig. 1
figure 1

a Relative importance of the attributes in 3-class LCM. b The relative importance of the attributes in 3-class LCM

Figure 1b shows the relative importance of attributes in the three-class model in 2021. Of the respondents, 66.70% are assigned to class 1, 27.57% to class 2 and 5.73% to class 3, as indicated by the size of the circle. As in 2019, class 1 contains the majority of respondents who regard clean air days as the preferred attributes, with other attributes having a smaller relative importance. Respondents from this class pay more attention to the intuitive sensation of air quality and labeled as “sensation preferable.” Class 2 members have an especially high preference for mortality attributes. This class is called “health preferable.” Class 3 members have a high preference for the price attribute contra a low preference for other air quality improvement attributes “price sensitive,” the most important determinant for class 3 is the payment.

Next, we further analyze and compare the results of respondents’ preference heterogeneous for air quality improvement under the context of different air quality levels before and after the COVID-19 pandemic. The parameter estimation results of the LCM are reported in Table 8.

Table 8 Parameter estimation results of the LCM

The first part of Table 8 shows the utility coefficients of the attributes associated with the air quality improvement policy, which reveal heterogeneity in the preferences for the air quality improvement policy. All parameters in class 1 are statistically significant in 2019 and 2021, and consistent with the expected signs, indicating that the utility of respondents will increase with the decrease in the hazy days, mortality, length of policy delay, and annual payment, the increase in the clean air days. In class 2, the utility coefficients for all but the annual payment attribute are not significant in 2019, indicating that respondents receive low utility from air quality improvements. The coefficient of the annual payment is negative, indicating that decreasing the paid price can increase the probability of this type of respondents to choose the air quality improvement plan. The coefficient of mortality is higher than other attributes in 2021, indicating that individuals or households who are more concerned about the health effects of air pollution fall into this group. In class 3, all coefficients of 2019 are significant, with mortality and policy delay being higher than the other attributes. The annual payment attribute coefficients for 2021 are significant; while, the other attributes are not. The results of the 2019 and 2021 by LCM indicate a clear heterogeneity in the preferences of different classes of respondents for these attributes when faced with various air pollution control options.

The second part of Table 8 shows the parameter estimation of the classification members, which reveals the source of individual preference heterogeneity.

  1. (1)

    For the “sensation preferable,” the gender coefficient is negative; while, the education, number of children, perceived risk effect and government trust coefficients are positive in 2019. In 2021, the gender and age coefficients are negative; while, the education and government trust coefficients are positive. As a result of the COVID-19 lockdown and the shutdown of factories and enterprises, air quality has improved and people's perceived risk effect has decreased. Youngsters have a significant willingness to improve air quality after the COVID-19 pandemic. Females with higher levels of education and stronger trust in the government have a direct sensation related to air quality, regardless of whether they have experienced the COVID-19 pandemic.

  2. (2)

    For the “price sensitive,” the number of children, income, and acceptability of pollution coefficients are positive; while, the education, number of cars and perceived risk effect coefficients are negative in 2019. The coefficients of gender and age are positive; while, the coefficients of education, income, government trust and risk controllability for 2021 are negative. Respondents are more willing to spend less to support air quality improvement policies due to social issues such as employment and income instability caused by the COVID-19 pandemic.

  3. (3)

    For the “health preferable,” the perceived risk effect and risk controllability coefficients are positive; while, the gender, number of cars and acceptability pollution coefficients are negative in 2019. The coefficients for the number of children, income and risk controllability are positive; while, the coefficients for gender, age and acceptability of pollution for 2021 are negative. This may be because the COVID-19 pandemic has made people more aware of the importance of health, so respondents made health a top priority.

4.4 Marginal WTP values

The marginal WTP estimator provides the implied benefits of various changes in attribute levels and captures their differences under air quality improvement scenarios of 2019 and 2021 (Hoyos et al., 2015). Table 9 presents the marginal WTP for the attributes related to the air quality improvement policy and gives an estimate of the marginal WTP value.

Table 9 Marginal WTP estimations

In the 2019 LCM, respondents are willing to pay 170.20 CNY per year for a shorter policy delay. Respondents are willing to pay 48.81 CNY per year to increase the number of clean air days by 1% and 36.03 CNY per year to reduce the mortality by 1%, respondents are willing to pay 27.26 CNY per year to decrease the number of haze days. In the 2021 LCM, the marginal WTP of the attributes is consistent with the 2019 order. Compare with the weight WTP value of 2019, the marginal WTP of policy delay decreased by 45.81 CNY per year, the clean air days increased by 5.21 CNY per year, the mortality decreased by 18.21 CNY per year, and the haze days decreased by 12.62 CNY per year. Comparison of results based on marginal WTP for 2019 and 2021 suggests that the WTP of respondents will decrease after the COVID-19 pandemic as air quality improves to some extent.

5 Discussion

The COVID-19 pandemic has a huge economic (Zambrano-Monserrate et al., 2020), political (Luan et al., 2021) and health (Singh & Mishra, 2021) impact on a global scale and has seriously affected human development. In addition to its adverse effects on life and health, it has positive environmental implications and could become a reflection on the future of human behavior toward nature (Singh & Mishra, 2021). In the COVID-19 pandemic early stages, governments took drastic measures such as lockdowns, suspension of public transport operations and self-restraint orders to control the spread of the virus, leading to global reductions in pollutant emissions and marked improvements in air quality. Mandal et al. (2021) indicated that the cessation of vehicular traffic and industrial shutdowns have a significant environmental impact, short-term lockdowns can be effective in freshening the air in these mega-cities. This large-scale global outbreak has brought additional information to policymakers on the importance of air pollution management. It is worth noting that public perceptions of changes in air quality should be considered alongside government capacity and priorities, the role that trust in government may play an incentive to achieve results.

Research on whether preferences change over time is important when they are used to inform decision-making (Wunsch et al., 2022). The existing literature showed that preferences are not affected by the valuation environment and preferences do not change over time (Czajkowski et al., 2016), especially during periods of uncertainty such as the COVID-19 pandemic (Wunsch et al., 2022). People will face insecure employment opportunities and income prospects in uncertain times, which better reflects people's preference and willingness to pay for air quality improvement policies in different backgrounds compared with safe times. This study is comparable between the two surveys in terms of choice levels and willingness to pay for air quality improvement options, as well as socio-economic characteristics. Due to the sufficiently long times interval, the presented results are consistent with the desired target outcome. It is clear from the respondents’ responses to the questions that changes in perceptions of air quality and individual economic status and perceived risk in the face of a major public health event have an impact on the results of the study.

This research found that the category characteristics of public preference for air quality improvement are similar. The sources of influencing preference heterogeneity before and after the COVID-19 pandemic are different, mainly reflected in risk perception and socio-economic characteristic variables. To further examine the origin of preference heterogeneity, the inclusion of risk perceptions in the latent class model better explains the significant heterogeneity in the preferences of individuals' choice behavior toward air pollution management policies. This is also supported by the results of previous studies (Meldrum, 2015). The reasons for the change in perceived risk effects of the “Sensation preferable” group after experiencing the COVID-19 pandemic may include: First, during the COVID-19 pandemic, the youngsters went out more frequently, paid more attention to air quality and the degree of government control of the epidemic (Strommer et al., 2022), and believed the government could improve air quality as well as control the epidemic after the COVID-19 pandemic (Völker & Lienhoop, 2016). The results are similar to those found by Singh and Mishra (2021). They suggested that the lockdown experience may have affected the public's perception of the fragility of nature and increased public awareness of environmental protection. Second, female age and personal status, especially those related to family and work, showed a higher risk sensitivity due to their experience of lockdown during COVID-19 pandemic (Giordani et al., 2021). The health preference group experienced a significant change in their perceived risk impact and acceptability of pollution after the COVID-19 pandemic, the high transmission and mortality rates of the COVID-19 pandemic, which made people aware of the importance of health (Connor et al., 2020). Singhal (2020) found that in the face of a major public health crisis and future uncertainty, the public has a new understanding of the value of health and life, and people realize the necessity and urgency of improving air quality. Severe air pollution can cause a variety of diseases that threaten people's health, such as lung and respiratory diseases. After people suffering from the health hazards of air pollution, more attention is paid to reducing the health risks. Price sensitive groups experienced changes in perceived risk effect, risk controllability, government trust and pollution acceptability after the COVID-19 pandemic. The low-income people can’t afford expensive governance solutions, they pay more attention to price. The impact of the pandemic lockdown, factory shutdowns and increased unemployment has caused people who are concerned about spending on necessities to not consider additional spending (Schnitzler et al., 2021).

Acquiring the average marginal WTP for various attributes can provide important insights for policymakers to design scientific and reasonable air pollution treatment projects. In terms of the marginal WTP, there is a statistical difference between the coefficients of the two groups for the health risks associated with the COVID-19 pandemic and the uncertainty of future income concerns. With the exception of the clean air day attribute, the marginal WTPs for the other air quality improvement attributes are higher in 2019 than 2021 in “Sensation preferable” and “Health preferable.” Respondents prefer improving current air quality conditions over maintaining the status quo and are willing to pay a certain amount for air pollution treatment. According to the marginal WTP estimates for the attributes, respondents are more willing to pay a certain amount to increase the number of “clean air days” before and after the COVID-19 pandemic. The closure of many businesses and factories, the significant reduction in the concentration of air pollutants (Adams, 2020), the change in air quality evident to respondents, and the increase in the number of clean air days, making the public feel the comfort of clean air gas, prompting them to give more to achieve better goals. The willingness to pay for the attributes “Hazy days,” “Mortality” and “Policy delay” are lower after the COVID-19 pandemic. Respondents fell that heavy pollution is less of a threat to their daily lives than before the COVID-19 pandemic, a fact that is consistent with previous results showing reduced concerns about heavy pollution. Sekar et al. (2022) indicated that pollutant levels decrease after lockdown and that public perception is highly dependent on actual air quality. The immediate threat to human life posed by the COVID-19, there will be concerns about the impact of the pandemic on life and health, with respondents prioritizing the reduction of hazard and less concerned about mortality due to air pollution (Becchetti et al., 2022). Having learned how the government responds to the COVID-19 pandemic, respondents have adjusted their level of confidence in the government's ability to use its tax contributions responsibly to achieve the changes assumed in the survey, which means that the speed and effectiveness of policy implementation determine the amount of public funding support for air quality improvements. The COVID-19 pandemic has changed the marginal WTP of the public for air quality improvement, which supports policy makers in prioritizing different air quality improvement objectives related to these attributes, the need for policy makers to tailor air quality improvement policies to public demand preferences in return for greater public support.

6 Conclusions and policy recommendations

This paper used a choice experiment to test the stability of public air quality improvement preferences and WTP following the onslaught of a global crisis. The results suggested that the category characteristics of public preference for air quality improvement are similar in the face of a major public health crisis and an uncertain future. The sources of influencing preference heterogeneity before and after the COVID-19 pandemic are different, mainly reflected in risk perception and socio-economic characteristic variables. During the COVID-19 pandemic, the marginal WTP is lower than before. Most previous studies have only focused on the public preference heterogeneity and motivation sources under the same background, and there are few studies on the impact of environmental changes on public preference and willingness to pay, as no research has been able to test this stability during such a large global shock, previous studies of air quality improvement preferences are limited to one-time choice experiment surveys that do not take into account the effects of environmental changes on public preferences and willingness to pay. This study attempts to fill this gap to probe into the public’s preferences temporal stability for air quality improvement, and the preferences heterogeneity for air quality improvement policies and the impact of WTP in the context of the COVID-19 pandemic.

This study findings have important implications for both theory and policy making. Theoretically, no previous work has been able to test the change in preferences during large global shocks. This study explores the change in the COVID-19 pandemic on public preferences and willingness to pay, as well as the impact of public risk perception on preference heterogeneity, through a choice experiment that enriches the study of air quality value assessment and provides a new theoretical perspective on environmental protection. In terms of policy making, by comparing the changes in air quality before and after the COVID-19 pandemic, the impact on the heterogeneity of public preferences will provide policy makers with more accurate and effective public information on improving air quality and provide reference for policy formulation, better link air quality improvement with public participation. The adoption of low-energy technologies in the traditional industries of the future, while developing renewable energy sources and increasing the use of clean energy. Secondly, the government should actively develop clean energy vehicles, reduce the purchase tax of new energy vehicles and introduce subsidy policies, increase public transportation (such as shared bicycles, subways, and buses). Thirdly, the government should guide the public to separate waste and improve waste disposal techniques to minimize waste generation and maximize raw materials. Finally, strengthen education on air quality improvement in the context of normal virus prevention and control, publicize and report the harm of the novel coronavirus and the widespread impact of air pollution, solve unexpected public health in a timely manner or environmental pollution incidents to reduce the threat to people's health.

Turning to future research, this study selected only residents of the Harbin area as a sample for the survey and assessed their preferences for air quality improvement and factors influencing their willingness to pay. With the different radiation ranges in the polluted areas around Harbin, the public will be exposed to different levels of pollution, and their preferences and willingness to pay for air quality improvement will also be different. Future research will compare preferences and willingness to pay for air quality improvements among residents around Harbin to better inform and assist policy making and to increase the applicability of our research conclusions in different regions. This study analyzed the impact of latent variables of risk perception on the public’s preference for air quality improvement and willingness to pay; while, the psychosocial drivers of individual behavior are complex. It will be very meaningful to further explore the socio-economic or other potential psychological sources (such as psychological distance) that give rise to this heterogeneity in follow-up studies, which will help environmental managers to develop corresponding air quality improvement measures according to the preferences of groups with different socio-economic or psychological characteristics.