Abstract
Beginning in 2005, the Securities and Exchange Commission (SEC) mandated firms to include a “risk factor” section in their Form 10-K to discuss “the most significant factors that make the company speculative or risky.” In this study, we examine the information content of this newly created section and offer two main results. First, we find that firms facing greater risk disclose more risk factors, and that the type of risk the firm faces determines whether it devotes a greater portion of its disclosures towards describing that risk type. That is, managers provide risk factor disclosures that meaningfully reflect the risks they face. Second, we find that the information conveyed by risk factor disclosures is reflected in systematic risk, idiosyncratic risk, information asymmetry, and firm value. Overall, our evidence supports the SEC’s decision to mandate risk factor disclosures, as the disclosures appear to be firm-specific and useful to investors.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Risk factors are looked upon as boilerplate, ‘Have we considered all our bases just in case the worst case does happen?’… The irony of it is that risk factors are almost meant not to be read, or relied upon.–Tom Taulli, IPO analyst (Reuters 2005)
Beginning in 2005, the Securities and Exchange Commission (SEC) required all firms to include a new section in their annual filings (Secttion 1A of the Annual Report on Form 10-K) to discuss “the most significant factors that make the company speculative or risky” (Regulation S–K, Item 305(c), SEC 2005). Prior to this change, companies were only required to provide this information in registration statements for equity and debt offerings.Footnote 1 By mandating this disclosure for all firms, the SEC is suggesting that risk factor disclosures are informative and that investors benefit from this information.
Critics of the new disclosure requirements argue that risk factor disclosures are unlikely to be informative for at least two reasons. First, firms do not have to estimate the likelihood that a disclosed risk will ultimately be realized. Second, firms do not have to quantify the impact that a disclosed risk might have on their current and future financial statements. Thus, managers simply disclose all possible risks and uncertainties, regardless of the likelihood that they will ultimately affect the firm, and the disclosure surrounding each of these risks and uncertainties is likely to be vague and boilerplate in nature (Reuters 2005). The SEC warned firms in 2010 to “avoid generic risk factor disclosure that could apply to any company” (SEC 2010), and has repeatedly called for increased focus and specificity in risk factor disclosures through the comment letter process (CFO 2010). Seven years after the adoption of the SEC’s risk factor disclosure requirement (Regulation S–K, Item 305c), we have little evidence on whether these newly created disclosures are informative. Our study addresses this gap in the literature.
Understanding the information conveyed by risk factor disclosures is important to regulators, investors, and academic researchers, as on average these disclosures represent 11.0 % of the words in firms’ Form 10-K. Regulators are often criticized for failing to mandate sufficient disclosure regarding firms’ risk and uncertainties and historically react to periods of market volatility by requiring enhanced risk disclosures. In light of the recent financial crisis, the SEC has made firms’ risk factor disclosures a focus of its comment letter process, asking firms to provide more details about the specific risks they face. In fact, the SEC recently added a project to its agenda that revisits the risk factor disclosure rules (CFO 2010). Therefore, it is important to document whether risk factor disclosures convey information that is useful to investors.
Academic research is also interested in the effect of disclosure on investors’ risk perceptions. Although theoretical work suggests that increased disclosure reduces the cost of capital, empirical studies have found mixed evidence. Kothari et al. (2009a) argue that the reason prior empirical findings are mixed is because disclosure tone affects the relationship between disclosure and the cost of capital. While they predict that negative/pessimistic disclosures increase the cost of capital, they are unable to find evidence of this association when the source of the disclosure is the firm itself. The newly created risk factor disclosure section provides a powerful test of the effect of firm-provided disclosure tone on cost of capital for two reasons. First, these disclosures are likely to convey information about firms’ systematic risk, which is a major component of the cost of capital. Second, by definition, the tone of this section is negative/pessimistic.Footnote 2
Our research offers two important results. First, we find a positive association between risk factor disclosures and pre-disclosure measures that proxy for firm risk. To provide further assurance that this association is not spurious, we decompose the risk factor disclosures into five subcategories: (1) financial, (2) tax, (3) legal, (4) other-systematic, and (5) other-idiosyncratic. We find that when firms’ pre-disclosure risk proxies are more strongly associated with a given subcategory of risk, firms devote more of their disclosures towards describing that risk.Footnote 3 Overall, our evidence suggests that—contrary to critics’ assertions—risk factor disclosures are not boilerplate but instead meaningfully reflect the risks a firm faces.
Second, we examine whether investors incorporate the unexpected portion of these disclosures into their assessments of firm risk and value. We find a positive association between the unexpected portion of risk factor disclosures and the post-disclosure level of market beta and stock return volatility, suggesting that the disclosures are positively associated with investors’ assessment of firms’ fundamental risk. That is, investors revise their estimate of the level of the risk parameters in the distribution function of future cash flows.
After controlling for the effect of risk disclosure on post-disclosure beta and stock return volatility, we find a negative association between the unexpected portion of risk factor disclosures and the post-disclosure level of information asymmetry among investors (i.e., bid-ask spread). This suggests that, after controlling for the fact that risk factor disclosure increases the market’s assessment of a firm’s risk, the public availability of risk factor disclosure decreases information asymmetry among that same firm’s shareholders.
To further ensure that our results are capturing investor reaction to the risk factor disclosures (rather than events subsequent to the 10-K release), we document a negative association between the unexpected portion of risk factor disclosures and short-window abnormal returns around the Form 10-K release date. These results remain after controlling for market risk, size, growth, changes in analysts’ estimates of future cash flows, and other information in the 10-K. This suggests that investors incorporate the information from risk factor disclosures into stock price.
Our research contributes to the emerging literature on the effect of disclosure tone on firms’ cost of capital. Theoretical and empirical research that examines the effect of disclosure on market-based assessments of firm risk generally assumes that the tone of disclosure is neutral (Barry and Brown 1985; Lang and Lundholm 1996; Botosan 1997). When disclosure tone is neutral, the expectation is unidirectional—disclosure decreases the cost of capital. Kothari et al. (2009a) argue that disclosure tone affects the association between disclosure and firms’ cost of capital in a directional manner. Although Kothari et al. find that negative/pessimistic disclosure increases cost of capital when a combined set of information sources is analyzed (i.e., corporations, analysts, and business press), they are unable to document this relation when the source of the disclosure is the corporation itself (i.e., MD&A disclosure). They suggest two possible explanations for this result: (1) the software they use to analyze tone is inaccurate, or (2) incentives bias managers against providing informative negative/pessimistic disclosures (Kothari et al. 2009a, p. 1661). By using the newly created risk factor section as a powerful setting where the tone is clear, we provide evidence that negative/pessimistic disclosure is positively related to market-based assessments of firm risk, even when the source of the news is the corporation itself. This is an interesting finding since negative/pessimistic disclosures increase investors’ assessment of systematic risk, and thus the incentive might be strong for managers to resist disclosure. Despite this incentive, we find that the information is, on average, informative.
Second, we contribute to the risk disclosure literature in two key ways. We document that the type of risk a firm faces (i.e., financial, tax, legal, other-systematic, and other-idiosyncratic) affects the type of risks that are disclosed. That is, we provide direct evidence that managers’ qualitative risk factor disclosures about risks and uncertainties are not boilerplate on average but instead meaningfully reflect the specific risks the firm faces across a broad spectrum of risk types. Second, we show that qualitative risk disclosures are incorporated into firms’ market-based measures of risk and stock price. Although prior research has examined mandatory quantitative risk disclosures (Item 7A of Form 10-K) (Schrand 1997; Tufano 1998; Roulstone 1999; Wong 2001; Linsmeier et al. 2002), it is an empirical question whether and how investors would use qualitative disclosures that do not provide investors with (1) the likelihood a risk occurs or (2) the quantified impact a risk would have on the firm’s financial statements.
Third, we contribute to the emerging literature on textual analysis. In a contemporaneous paper, Li (2008a) counts risk words in the 10-K during time periods prior to the newly created risk factor disclosures and finds that, when risk sentiment increases, future earnings decrease and investors underreact (or fail to react altogether) to the disclosures. We complement Li (2008a) by providing evidence that investors do at least partially react to risk disclosures, at least during time periods after the disclosures became more prominent and concentrated in one section of the Form 10-K. Kravet and Muslu (2013) count risk sentences in the entire 10-K across years before and after the newly created risk factor section. They find that changes in the number of risk sentences are associated with investors’ assessments of firms’ idiosyncratic risk (i.e., stock return volatility), as well as analyst earnings forecast properties. We complement Kravet and Muslu (2013) in several ways. By focusing strictly on the risk factor section during time periods when it was mandated, we are uniquely positioned to assess the merits of the new risk factor disclosure section, and we confirm their results related to investors revisions of idiosyncratic risk. We also show that investors incorporate the disclosures into nondiversifiable, systematic risk measures, as well as into firm value. Finally, rather than relying on investors’ reaction to the disclosures, we use a word counting and classification scheme to provide direct evidence that managers’ risk disclosures are not boilerplate but instead reflect the actual risks the firm faces. Overall, our study complements these contemporaneous papers to show a more complete picture of the effects of qualitative risk factor disclosures. Further, since our study examines the determinants of the newly created risk factor section when all firms were mandated to provide this disclosure, we are uniquely able to address the debate on the usefulness of a mandatory risk factor disclosure and the criticism currently facing the SEC.
Moving beyond risk disclosures to textual analysis more broadly, Lehavy et al. (2011) find that the readability of 10-K filings affect analyst forecast dispersion, accuracy, and effort. Similarly, You and Zhang (2009) show that investors underreact to disclosures provided in longer and more complex 10-K filings. We find that risk factor disclosures have an incremental effect on market-based assessments of risk, after controlling for these readability and complexity measures.
Finally, the SEC has recently incorporated a project into its agenda to re-examine the disclosure rules related to risk factors. Its concern is that companies are not accurately disclosing their firm-specific risk but instead are providing “generic” and “mind-numbing risk factor discourse” (CFO 2010). By providing evidence that risk factor disclosures are positively related to pre-disclosure measures that proxy for firm risk, while simultaneously documenting that this disclosure is incorporated into post-disclosure market measures, we provide the SEC with empirical evidence suggesting that firms are disclosing at least some of their specific risks and uncertainties in the newly created risk factor section, and this disclosure is informative to investors. However, our results do not necessarily imply that critics who call for improvements to risk factor disclosure are wrong. Requiring firms to provide even more firm-specific information (and less generic, boilerplate language) could further benefit users, as our results suggest that even in their potentially “flawed” state, the disclosures are informative.
2 Hypothesis development
2.1 Risk factor disclosures and pre-disclosure proxies for firm risk
Prior academic literature on corporate disclosure suggests that managers have a self-serving bias to disclose favorable information about the firm. Accounting information is used by current and potential creditors in debt contract negotiation, by boards of directors in assessing executive performance and compensation, by labor unions in contract renegotiations, and by capital markets in setting firm value and in determining the cost of capital (Watts and Zimmerman 1986; Fields et al. 2001; Kothari et al. 2009a). Similarly, this information will also impact manager’s career concerns (Kothari et al. 2009b), and could also reveal proprietary information to a firm’s competitors or investors (Scott 1994; Ke et al. 2003). As a result, managers are likely biased against providing unfavorable disclosures (Kothari et al. 2009a). Since risk factor disclosure is designed to convey information regarding unfavorable risks and uncertainties of the firm, it seems reasonable that managers would avoid providing meaningful risk factor disclosure and instead have a bias towards vague and boilerplate risk factor disclosures that “attempt to catalog every conceivable problem that might surface: valued employees quitting, tie-ups with suppliers or clients disintegrating, even acts of God, terrorism, and war” (Spindler 2006).
A review of recent 10-K filings presents exactly this sort of vague and boilerplate risk factor disclosure. For example, the 2009 risk factor disclosure for Remington Arms Company states:
Our business is subject to economic, market and other factors beyond our control or ability to predict. … Any significant or prolonged economic downturn could have a material adverse impact on our financial condition and results of operations. … Moreover, additional risks and uncertainties not currently known to us or that we currently deem to be immaterial may also materially and adversely affect our business and operations. These risks and uncertainties include, but are not limited to, weather and other Acts of God that could result in the loss or destruction of warehoused inventory and stoppages in our ability to manufacture our key products for a sustained period of time. (SEC 2009)
It seems likely that this type of risk factor disclosure is not informative about the firm’s specific risks and could be why the SEC states that the disclosures should “avoid generic risk factor disclosures that could relate to any company” (SEC 2010). This example also illustrates that risk factor disclosures may be used by managers to catalog all possible risks and uncertainties, regardless of the likelihood that they will ultimately affect the firm. Consistent with this view, a corporate securities lawyer recently wrote that the disclosures seem to be a way for managers to effectively tell investors: “Seriously–anything can happen. … By investing in our business, you are agreeing that we owe you no duty of care other than not being crooks. We can promise you nothing else” (Corporate Counsel 2006).
Although managers face business and career incentives not to disclose negative information (Watts and Zimmerman 1986; Scott 1994; Fields et al. 2001; Kothari et al. 2009a, b), they also face legal incentives to at least partially provide meaningful, firm-specific risk factor disclosures. Managers may be exposed to high litigation costs if they fail to disclose bad news in a timely manner (Skinner 1994). If a material risk comes to fruition and managers failed to disclose it, they could be sued, face legal liability, and owe damages. Furthermore, lawyers could use anecdotal evidence and the SEC’s criticisms regarding risk factor disclosures against the manager to argue that the firm’s risk factor disclosures were a laundry list of generic risks that did not adequately warn investors of the material risk. For instance, in a recent class action securities lawsuit, plaintiffs argued that Credit Suisse concealed the degree of its risk exposure to mortgage-backed securities in its SEC filings. A lead plaintiff’s complaint heavily referenced an SEC comment letter discussing various deficiencies with Credit Suisse’s 2007 annual report specifically mentioning issues with the risk factor section. Credit Suisse settled the lawsuit in 2011, paying $70 million to the plaintiffs.Footnote 4
If there is an insufficient amount of variation in risk factor disclosures across firms or if variation in the disclosures is not related to variation in risks faced, there should be no association between risk factor disclosures and pre-disclosure measures that proxy for firm risk. On the other hand, if the disclosures are firm-specific, we should find a positive relation between disclosure levels and pre-disclosure, firm-specific risk proxies. That is, managers should provide risk factor disclosures when firms’ quantitative measures suggest that they face more risk. Ultimately, it is an empirical question whether risk factor disclosures are meaningful. Accordingly, we test the following hypothesis:
Hypothesis 1
The length and content of risk factor disclosures are associated with pre-disclosure proxies for firm risk.
2.2 Risk factor disclosures and post-disclosure market-based measures of firm risk
In H1, we investigate whether managers’ risk factor disclosure meaningfully reflects the pre-disclosure risks a firm faces. To the extent that it does, this suggests that managers do not simply disclose a laundry list of generic or boilerplate risks which do not vary across firms. However, H1 does not assess whether managers use risk factor disclosures to only discuss risks that the market already knows about. If so, investors would not find the disclosures informative, as there would be little information conveyed to cause investors to revise their ex ante beliefs about the firm’s risk. Theoretical research suggests that incremental disclosure reduces the cost of equity in one of three ways. First, higher levels of disclosure reduce information asymmetry among current and prospective shareholders, and thus disclosure improves the liquidity of a security (Amihud and Mendelson 1986; Easley and O’Hara 2004). Second, investors may have difficulty estimating firms’ true expected rates of return on securities, and these errors in estimation are nondiversifiable and thus positively related to firms’ cost of capital (Klein and Bawa 1976; Barry and Brown 1985). Third, more precise disclosures decrease the assessed covariance between a firm’s cash flows and the market’s cash flows (i.e., market beta), and thus increased precision of disclosure reduces the cost of equity (Lambert et al. 2007). In sum, the theoretical literature suggests that the effect of disclosure on firm risk is unidirectional. That is, increases in disclosure reduce the cost of capital.
In contrast, Kothari et al. (2009a) note that prior empirical tests of this hypothesis have found some evidence (Lang and Lundholm 1996; Botosan 1997; Botosan and Plumlee 2002, among others), but the results across the literature are mixed. They argue that one explanation for this lack of broad-based empirical support could be that the relation between disclosure and firms’ cost of capital (as well as stock return volatility) is directional. That is, “… on average, unfavorable information would be associated with an increase in cash flow risk and, conversely, favorable information would be associated with a decline in the risk of expected future cash flows” (p. 1646). Their argument is, in part, based on the fact that prior research suggests that, on average, unfavorable news is associated with higher volatility of future earnings, and thus unfavorable disclosure is likely to decrease the certainty with which investors can forecast a firm’s future cash flows (Chan 1988; Ball and Kothari 1989).
The newly created risk factor section of the 10-K is a unique form of disclosure where all the news relates to “unfavorable” information. More specifically, by definition, the information in this section relates to uncertainty about future firm performance. Post-disclosure measures of market beta (for systematic risk) and stock return volatility (for idiosyncratic risk) should proxy for investors’ ex ante, expected risk associated with the firms’ future cash flows, after incorporating the information conveyed by risk factor disclosures. Therefore, if the disclosures are informative and investors incorporate this information into their risk assessments, we should find a positive association between risk factor disclosures and post-disclosure market-based measures of firm risk (i.e., market beta and stock return volatility). Thus, our second hypothesis:
Hypothesis 2
The unexpected portion of risk factor disclosure is positively associated with post-disclosure market-based measures of firm risk (i.e., beta and stock return volatility).
2.3 Risk factor disclosures and post-disclosure information asymmetry
Theoretical research argues that voluntary disclosure reduces information asymmetries among informed and uninformed investors (Diamond and Verrecchia 1991; Kim and Verrecchia 1994). Consequently, these studies suggest that “for firms with high levels of disclosure, investors can be relatively confident that any stock transactions occur at a ‘fair price,’ increasing liquidity in the firm’s stock (Healy and Palepu 2001, p. 429). Consistent with these arguments, prior empirical studies document a negative relation between the level of disclosure and firms’ bid-ask spread. For instance, Leuz and Verrecchia (2000) find that firms listed on the Neuer Market have lower bid-ask spreads than firms listed on the Frankfurt Exchange and attribute this finding to differences in disclosure requirements between the two.
A maintained assumption of this prior literature is that incremental disclosure is credible and informative and not boilerplate in nature or otherwise not meaningful. For reasons discussed in H1 and H2, we expect that managers will provide risk factor disclosures that meaningfully reflect the risks their firm faces and that the disclosures will be informative to market participants. Thus, we also predict that higher levels of risk factor disclosure will reduce information asymmetries among informed and uninformed investors, result in greater liquidity, and therefore will be negatively associated with information asymmetry (as measured by firms’ bid-ask spread). It is important to note, however, that in H2 we predict that risk factor disclosures will increase investors’ perception of a firm’s risk, and prior literature suggests that, when investors perceive a firm to face more risk, the bid-ask spread increases because informed investors may have an even greater information advantage (Kyle 1985; Demsetz 1986; Jayaraman 2008). This presents an empirical challenge when assessing the effects of risk factor disclosure on information asymmetry because it implies that we already know that the phenomenon we have tested with H2 (i.e., investors’ perceptions of risk) also has an effect on the relation between risk disclosures and information asymmetry. Our interest is not in testing two hypotheses surrounding the same phenomenon, but rather more generally on whether risk factor disclosures are informative and thus whether the disclosures reduce the information gap between informed and uninformed investors conditional on investor risk perceptions. Therefore, we control for both pre-disclosure and post-disclosure measures of firm risk. Our third hypothesis follows:
Hypothesis 3
After controlling for investor risk perceptions, the unexpected portion of risk factor disclosure is negatively associated with post-disclosure market-based measures of information asymmetry (i.e., bid-ask spread).
2.4 Investor reaction to risk factor disclosures
So far we predict that managers provide informative risk factor disclosures, and investors incorporate this information into their post-disclosure assessments of firm risk and information asymmetry. However, we have not explicitly examined the timeliness with which investors incorporate this information. This is an important point, as it is possible that investors do not use the information provided in risk factor disclosures. Instead, the information could be correlated with subsequent information released by the firm, analysts, or the business press over the course of the following year for which we do not adequately control, and this subsequent release of information could manifest itself in post-disclosure market-based measures of firm risk at that time. By examining returns around the 10-K release date, we attempt to overcome this potential correlated omitted variable problem.Footnote 5
Beginning with Ball and Brown (1968) and Beaver (1968), prior event study literature uses short-window tests around accounting report release dates to determine whether and how investors react to accounting disclosures. We expect that investors react to the information conveyed by risk factor disclosures at the time the disclosure is made available. We expect that risk factor disclosure will be negatively related to short-window abnormal stock returns around the 10-K release date based on the predicted increase in firms’ risk (H2). Although we predict that information asymmetry will improve (H3), we expect that risk factor disclosures have an overall negative association with short-window stock returns for at least two reasons. First, our tests of H3 control for the extent to which increases in firm risk result in an increase in information asymmetry. Thus, it is not clear ex ante whether the overall effect of risk factor disclosures on information asymmetry would be an increase or decrease. Second, the relation between information asymmetry and a firm’s cost of capital (and thus its market value over short windows around the 10-K release) is still debated in the literature. Lambert et al. (2012) argue that, in models of perfect competition, information asymmetry per se does not affect the cost of capital. Therefore, we predict that risk factor disclosures should be negatively related to market returns. Thus, our final hypothesis:
Hypothesis 4
The unexpected portion of risk factor disclosures are negatively associated with abnormal returns surrounding the Form 10-K release date.
3 Research design and empirical results
3.1 Textual analysis data collection process
Appendix 1 describes the data collection process we use to extract qualitative risk disclosures from firms’ Form 10-K and describes how we then calculate the measures used in our study. Additionally, Fig. 1 presents a summary illustration of the specific sequence of steps used in our textual analysis. As can be seen in Fig. 1, annual Form 10-K filings are downloaded from the SEC’s Electronic Data Gathering and Retrieval (EDGAR) database and are then processed to generate appropriate counting measures that objectively quantify firms’ risk disclosures.Footnote 6 We wrote our text analysis program to identify the various Form 10-K subsections based on the assumption that most EDGAR filings contain visual clues for human readers to recognize section boundaries (i.e., subsection titles, boldface fonts, extra spacing, etc.). Our program searches for various combinations of these clues in the HTML code for each filing and partitions the subsections accordingly. Full details of our programming procedures are outlined in Appendix 1. To verify that our program is accurate, we randomly selected 300 firms in the EDGAR population and manually examined the words from Item 1A Risk Factors, Item 7 MD&A, and Item 7A Market Risk, sections respectively. For cases where text could be extracted, we found that, in over 98 % of the cases across all three subsections, the software extracted the correct subsection, and only the correct subsection, of text.
After having extracted the appropriate subsection of text, we then compute three measures for each subsection: (1) total word count, (2) total key word count, and (3) key word count by risk subcategory. We use the natural logarithm of each of these three measures. We also formally calculate “unexpected disclosure” measures, which control for the risk factor disclosure in the prior year as well as the quantitative determinants of the disclosure. We do not, however, use a simple change in the disclosure measures because Nelson and Pritchard (2007) show that once risk disclosures are made, they very rarely are removed in a subsequent year. Thus, a simple change approach could be misspecified if investors have more reason to be alarmed both when disclosures are removed as well as when they are significantly increased.Footnote 7
Our list of key words is presented in Appendix 3. We develop our key words in two steps. First, we begin with key risk words used by prior literature (Nelson and Pritchard 2007). Then, our software identified a list of additional words that repeatedly appeared in firms’ risk factor sections using a document clustering approach known as the Latent Dirichlet Allocation (Blei et al. 2003). To our knowledge, we are the only study to apply this methodology to risk disclosures, and consequently our key word list of over 300 words is larger than most others in the literature.Footnote 8
To classify each identified key word into subcategories, we first identify those key words associated with (1) financial risk, (2) litigation risk, and (3) tax risk. In general, words were classified as financial risk if they related to liquidity, debt, covenants, or capital structure. Similarly, as in Nelson and Pritchard 2007), words were classified as litigation risk if they related to legal matters, lawsuits, intellectual property, or environmental issues. Finally, words were classified as tax risk if they related to the accounting for income taxes or tax avoidance in general. This process covered approximately 30 % of the key word list. With the remaining 70 %, we classified the terms as either “other-systematic” or “other-idiosyncratic” based on whether the key word most closely related to an economy-wide risk or a firm-specific risk, respectively.
3.2 Sample selection and descriptive statistics
As we are interested in the effects of the newly created risk factor section, we begin our sample selection with the first year in which the SEC mandated this disclosure. Accordingly, our sample of firms begins in 2005. We begin with the overall universe of Compustat firms and then remove firm-year observations that do not have an industry classification or that are missing necessary control variables for our multivariate regression models. As previously mentioned, our textual analysis software requires firms to have a machine-readable (i.e., HTML) filing on the SEC’s EDGAR database that includes the newly created “Risk Factors” disclosure. Accordingly, we remove firm-year observations that do not have filings with risk factor disclosures that allow for textual analysis, though in Panel B we consider whether this leads to a bias in our sample by comparing it to the Compustat universe of firms. Finally, we remove all observations that lack necessary equity market data from CRSP and truncate all necessary data at the top and bottom 1 % to mitigate the influence of outliers. The final sample consists of 9,076 firm-year observations. Details concerning the sample selection for our main empirical tests are described in Panel A of Table 1.
Panel B of Table 1 presents an industry breakdown of our sample into 48 industry groupings (see Fama and French 1993). Panel B also reports the industry groupings for the overall universe of Compustat firms. The most striking observation is that the trading industry is under-represented in our sample relative to the overall Compustat universe by 9.0 %. However, as discussed, our sample requires that firms have a filing in the SEC’s EDGAR database that includes the newly created “Risk Factors” disclosure. General Instruction J to Form 10-K states that asset-backed issuers are not required to disclose the risk factors item. Thus, the under-representation of firms in the trading industry is not surprising.Footnote 9 The remaining industries in our sample are relatively close to the overall composition of the Compustat universe, except for the banking industry, which is over-represented relative to the overall Compustat universe by 4.7 %. To reduce concerns regarding the effects of industry concentration, we control for differences in industry in all our regression models using industry fixed effects.
Table 2 presents descriptive statistics for the 9,076 firm-year observations. Panel A presents descriptive statistics for our primary risk factor disclosure variables and shows that our sample firms, on average, write 4,902 words in the risk factor section (Item 1A) of their Form 10-K. This is a strikingly large portion of the 10-K (i.e., 4,902/45,911, or 10.7 % of the entire document). For comparison purposes, the risk factor section contains approximately one-half the number of words as the MD&A section (Item 7). In addition to the number of overall words, we examine how many of the key words from Appendix 3 appear in the text. We find that, on average, 293 of the key words from Appendix 3 appear in the risk factor section, and most of these key words relate to “other-systematic” (36 %) and “other-idiosyncratic” risks (33 %). We also find that 15, 13, and 2 % of these words specifically relate to litigation, financial, and tax risks, respectively.
Panel B of Table 2 provides the descriptive statistics for our other variables. The average firm in our sample has a log market value of equity of 6.463 (which is equivalent to $640 million), and size appears to vary considerably across the sample. Additionally, the average firm has a book to market ratio of 0.608 and an annual stock return of -0.003 %. This suggests that the severe market downturn in 2008 overwhelms the generally positive returns from the years 2005–2007. The firms in our sample appear to be profitable, as average net income deflated by the beginning of period market value of equity is 0.006. Finally, the firms in our sample appear to have similar market-based risk measures as in prior research. For example, the mean standard deviation of abnormal daily returns is 0.031 for the 250 trading days following the release of the 10-K and 0.025 for the 250 trading days prior to the release of the 10-K. This is comparable to Kothari et al. (2009a), where the mean of daily return volatility is 0.027. Similarly, the mean value of market beta in the 250 trading days after the 10-K release date is 0.915, which is less than 1, indicating that firms in our sample are slightly less risky than average. The mean bid-ask spread for the 250 trading days following the release of the 10-K is 0.68 %. Overall, the evidence from Table 2 suggests that the descriptive statistics of our sample firms are representative of prior studies and are consistent with the time period during which our sample spans. Consequently, our results should be generalizable.
3.3 Risk factor disclosures and pre-disclosure proxies for firm risk
In our first hypothesis, we predict that risk factor disclosures will be positively associated with pre-disclosure proxies for firm risk. To examine H1, we model risk factor disclosures as a function of these measures identified by prior literature that relate to systematic, idiosyncratic, financial, legal, and tax risk (Fama and French 1993; Jones and Weingram 1996; Nelson and Pritchard 2007). Specifically, to test H1, we estimate the following regression equation:
where the independent variables are defined in Table 2, and the dependent variables are allowed to vary as follows:
RF_DISC = variable capturing risk factor disclosures, which we vary as follows:
-
(1)
ALL_WORDS = the log of the total number of words that appear in the firm’s risk factor disclosure section,
-
(2)
KEY_WORDS = the log of the total number of key words identified in Sect. 3.1 that appear in the firm’s risk factor disclosure section,
-
(3)
RF_SYS = the percentage of the total number of key words identified in Sect. 3.1 that relate to systematic risks and appear in the firm’s risk factor disclosure section,
-
(4)
RF_IDIO = the percentage of the total number of key words identified in Sect. 3.1 that relate to idiosyncratic risks and appear in the firm’s risk factor disclosure section,
-
(5)
RF_FIN = the percentage of the total number of key words identified in Sect. 3.1 that relate to financial risks and appear in the firm’s risk factor disclosure section,
-
(6)
RF_TAX = the percentage of the total number of key words identified in Sect. 3.1 that relate to tax risks and appear in the firm’s risk factor disclosure section,
-
(7)
RF_LR = the percentage of the total number of key words identified in Sect. 3.1 that relate to litigation risks and appear in the firm’s risk factor disclosure section.
We expect the coefficient on SIZE (β1) to be negative if larger firms are more stable and thus have lower risk (Fama and French 1993). However, if larger firms face a greater risk of litigation and thus would be more likely to disclose risk factors (i.e., the political cost hypothesis from Watts and Zimmerman 1986), then we would find a positive coefficient on SIZE. Therefore, we do not sign the coefficient on SIZE. The coefficient on BTM (β2) will be negative if growth opportunities are positively related to firm risk (Fama and French 1993). However, if market participants have a negative outlook for the future prospects of firms with a high BTM, the firm could be riskier, and we would find a positive coefficient on BTM. Therefore, we do not sign the coefficient on BTM. If realized returns are a proxy for investors’ expected rate of return (i.e., cost of capital) beyond those controls already incorporated in the model, then the coefficient on RET (β3) would be positive (Vuolteenaho 2002). On the other hand, the coefficient on RET (β3) should be negative if recent negative stock returns indicates that firms have become more fundamentally risky (Nelson and Pritchard 2007). Thus, we make no prediction for RET (β3). LEV (β4) should be positively related to risk factor disclosures if higher levels of debt increase a firm’s default risk. STDERET (β5) should be positively related to risk factor disclosures since stock return volatility is often used as a proxy for firm risk (Kothari et al. 2009a, among others). Similarly, BETA (β6) should be positively related to risk factor disclosures since market beta is often used as a proxy for firm risk (Fama and French 1993, among others).
Consistent with Nelson and Pritchard (2007), we expect a negative coefficient on SKEW (β7) since higher skewness in stock returns is likely to be associated with recent large and positive stock price movements and thus recently released information that reduces the firm’s uncertainty. TURN (β8) should be positively related to risk factor disclosures if turnover is a proxy for information uncertainty (Jiang et al. 2005). The coefficient on BIGN (β9) is expected to be negative if firms that hire the services of a big N auditor are associated with increased audit quality and therefore less risk (Mansi, Maxwell, and Miller 2004). However, the coefficient on BIGN could be positive if big N auditors face more litigation risk and therefore require their clients to disclose more risks and uncertainties in their risk factor section (Lennox 1999). Therefore, we do not make a prediction for the coefficient on BIGN. We expect a negative coefficient on ETR (β10) if firms that engage in greater tax avoidance are subject to greater audit risk from the Internal Revenue Service (Guedhami and Pittman 2008). We expect a negative coefficient on DNI (β11) if firms with lower profitability are riskier. The coefficients on NUMEST (β12) and INSTOWN (β13) could be positive if a sophisticated investor base demands that firms provide greater disclosures and transparency (Lang and Lundholm 1996). On the other hand, these coefficients could be negative if a sophisticated investor base faces less information asymmetry with the firm and thus there is less demand for the risk factor disclosure (Healy and Palepu 2001). Thus, we make no prediction on the coefficients for NUMEST and INSTOWN. Finally, to control for varying macro-economic factors across time and across industry, and since boilerplate and generic disclosures are likely to be sticky across time and across industry, we include both industry and year fixed effects.
3.3.1 Association between risk measures and subcategories of risk
Appendix 3 provides the list of key risk words that we use in our analysis and separates these key words into the following five subcategories discussed in Sect. 3.1 and Appendix 1: (1) financial risk (RF_FIN), (2) tax risk (RF_TAX), (3) litigation risk (RF_LR), (4) other-systematic risk (RF_SYS), and (5) other-idiosyncratic risk (RF_IDIO). Furthermore, prior literature suggests that risk from financial leverage increases both systematic risk and idiosyncratic risk (Hamada 1972; Modigliani and Miller 1958; among others). Thus, in our analyses on the subcategories of risk, we also examine our systematic (RF_SYS) and idiosyncratic (RF_IDIO) risk measures after adding financial risk (RF_FIN) to each of them. We do not include the tax and litigation risk categories in either systematic or idiosyncratic measures because it is not clear from prior literature whether these categories are primarily driven by firm-specific characteristics.
If risk factor disclosures are informative about the specific risks a firm faces, we should find that firms that face greater exposure to a subcategory of risk are more likely to devote a greater portion of the risk disclosures towards describing that risk. Specifically, we expect market risk (BETA) to be more strongly associated with our “other-systematic” risk subcategory (RF_SYS). Similarly, we expect abnormal stock return volatility (STDERET) to be more strongly associated with our “other – idiosyncratic” risk subcategory (RF_IDIO). Finally, we expect financial leverage (LEV) to be more strongly associated with the financial risk subcategory (RF_FIN), effective tax rate (ETR) to be more strongly associated with the tax risk subcategory (RF_TAX), and, consistent with the political cost hypothesis (Watts 1977, Watts and Zimmerman 1986), firm size (SIZE) to be more strongly associated with the litigation risk subcategory (RF_LR).
3.4 Results on risk factor disclosures and pre-disclosure proxies for firm risk
Table 3 presents multivariate evidence regarding H1.Footnote 10 The first column of Table 3 uses the key word count of the risk factor section as the dependent variable. The evidence suggests that nine out of the thirteen risk proxies are statistically associated with risk factor disclosure. Specifically, we find that firms increase their risk factor disclosure when they have higher expected returns (RET), have more leverage (LEV), have higher stock return volatility (STDERET) and turnover (TURN), engage a Big N auditor (BIGN), and have greater analyst following (NUM_EST). Additionally, we find that firms increase their risk factor disclosures when they have lower profitability (NI), lower effective tax rates (ETR), and lower stock return skewness (SKEW). The results are similar in the second column of Table 3 when we use total word count as the dependent variable rather than key word count. Overall, these results are consistent with H1 and suggest that firms with more risks disclose more risk factors.
Table 4 presents multivariate evidence regarding H1 across our five identified risk subcategories. In all five cases, we find that the magnitude and statistical significance of the coefficient on the risk proxy that most closely captures a given subcategory of risk is largest when the dependent variable in our regression analysis is the portion of the disclosure that is allocated to that subcategory.Footnote 11 Specifically, for the “other—systematic” risk subcategory (RF_SYS), we find that market beta (BETA) is positively associated with the portion of disclosure allocated to systematic risk (β6 = 0.0100), and the coefficient and statistical significance of BETA is larger than for any of the other four risk subcategories. Similarly, for the “other—idiosyncratic” risk subcategory (RF_IDIO), we find that abnormal stock return volatility (STDERET) is positively associated with (and the coefficient and statistical significance is largest for) the portion of disclosure allocated to idiosyncratic risk. For the financial risk subcategory (RF_FIN), we find that financial leverage (LEV) is positively associated with (and the coefficient and statistical significance is largest for) the portion of disclosure allocated to financial risk. For the tax risk subcategory (RF_TAX), we find the effective tax rate (ETR) is negatively associated with (and the coefficient and statistical significance is largest for) the portion of disclosure allocated to tax risk. Finally, for the litigation risk subcategory (RF_LR), we find that firm size (SIZE) is positively associated with (and the coefficient and statistical significance is largest for) the portion of disclosure allocated to litigation risk.
In the last two columns of Table 4, we examine systematic and idiosyncratic risk after adding financial risk to each of them. We continue to find that BETA is more strongly associated with systematic risk, while STDERET is more strongly associated with idiosyncratic risk. Overall, the evidence from Table 4 suggests that firms with greater subcategories of risk are more likely to devote a greater portion of their risk disclosures towards describing that risk and, consistent with H1, provides further assurance that managers provide informative risk factor disclosures.
3.5 Risk factor disclosures and post-disclosure market-based measures of firm risk
In our second hypothesis, we predict that risk factor disclosures are positively associated with post-disclosure market-based measures of firm risk (i.e., beta and stock return volatility). In order to test H2, we follow the research design of Kothari et al. (2009a) and modify it to include the pre-disclosure risk controls from Eq. (1). Specifically, we estimate the following regression:
where all variables are as defined in Eq. (1), and RF_DISC = log of the word (key word) count in the risk factor disclosure section, and MDA_DISC = log of the word (key word) count in the MD&A section. If risk factor disclosures are positively associated with post-disclosure market-based measures of firm risk (H2), we expect that β1 > 0. Since our interest is in risk factor disclosures where the tone is negative, and not in disclosures regarding the firm’s current and future operating performance made in MD&A (Bryan 1997; Clarkson et al. 1999; Li 2010; Brown and Tucker 2011), we control for the number of words (and the number of key words relating to risk) in MD&A disclosures at time t.Footnote 12 In addition, since prior research suggests that the complexity and length of 10-K disclosure is associated with investors’ risk perceptions (You and Zhang 2009; Lehavy et al. 2011), we include controls for the length of disclosure in the entire 10-K (ALL_DISC) as well as the fog index of the disclosures in the risk factor section (RF_FOG), in the MD&A section (MDA_FOG), and in the rest of the 10-K (ALL_FOG).Footnote 13 Finally, we use industry and year fixed effects since boilerplate or generic disclosures are likely to be sticky across time for the same firm or are likely to be homogenous across firms in the same industry, as well as standard errors that control for both serial correlation and heteroscedasticity.
Finally, there are two important research design choices in all of our market-based tests (i.e., tests of H2 using Eq. (2), as well as tests of H3 and H4 in Eqs. (3) and (4), which follow). First, even though the tone of risk factor disclosure is negative by definition, it is possible that firms convey less negative information than investors expected and therefore the overall news from the disclosure is positive. That is, it is important to control for investors’ expectations of risk factor disclosures. To do so, we control for pre-disclosure measures that proxy for firm risk (i.e., all of the determinants of risk disclosure from Eq. (1) and tests of H1). By doing so, our measure for risk factor disclosures (RF_DISC) captures the portion of risk factor disclosures that are “unexpected.” See Sect. 4.2 for sensitivity tests where we formally estimate a two-stage regression model. Second, we model our dependent variable as the level of post-disclosure risk at time t + 1. By including a control for the level of risk at time t, this ensures that our model is capturing the change in the risk measure subsequent to the release of the risk factor disclosure. That is, our model is equivalent to a pure changes model, except that in a pure changes model the coefficient on the lagged value of the dependent variable is constrained to equal one rather than being allowed to vary. Not surprisingly, all of our results are quantitatively and qualitatively unchanged if our dependent variable is the change in risk from time t to time t + 1. For additional discussion of the pure changes model and tabulated regression results, see Sect. 4.3.
3.5.1 Association between market-based risk measures and subcategories of risk
As previously mentioned, we decompose the risk factor disclosures into subcategories based on different types of risk, including systematic and idiosyncratic risks. Our two market-based measures of risk are market beta and stock return volatility, and prior literature suggests that market beta mostly captures systematic risk while stock return volatility mostly captures firm-specific idiosyncratic risk. Thus, we re-estimate Eq. (2) and decompose the key word count into our measures of “other-systematic” (RF_SYS) and “other-idiosyncratic” (RF_IDIO) risk.Footnote 14 If investors incorporate risk factor disclosures into post-disclosure measures of firm risk, the association between risk factor disclosures and BETA (STDERET) should mostly be driven by the RF_SYS (RF_IDIO) component of risk disclosures.
3.6 Results on risk factor disclosures and post-disclosure market-based measures of firm risk
Table 5 presents multivariate evidence regarding H2. The first column of Table 5 uses market beta (BETA) as the dependent variable and the key word count of the risk factor section as the independent variable of interest. We find a positive and significant association between RF_DISC and BETA (β1 = 0.0227, std. error = 0.0072, t-statistic = 3.15). The third column of Table 5 uses abnormal stock return volatility (STDERET) as the dependent variable rather than BETA. As with BETA, we find a positive and significant association between RF_DISC and STDERET (β1 = 0.0006, std. error = 0.0002, t-statistic = 3.00). This suggests that, if a firm moves from the 25th percentile to the 75th percentile of risk disclosure, investors incorporate an additional 2.5 % (1.9 %) into their assessment of the following period’s BETA (STDERET).Footnote 15 The results are unchanged in the second and fourth columns when we substitute total word count as the independent variable rather than key word count. Overall, these results are consistent with H2 and suggest that investors incorporate the information conveyed by risk factor disclosures into post-disclosure market-based measures of risk.
As previously discussed, we expect that the positive association between risk factor disclosures and post-disclosure market beta is primarily driven by systematic risk factor disclosures, while the positive association between risk factor disclosures and abnormal stock return volatility is primarily driven by firm-specific idiosyncratic risk factor disclosures. Table 6 presents multivariate evidence regarding H2 across these two risk subcategories. The third column in Table 6 presents results for market beta when both systematic and idiosyncratic risk disclosure measures are included in the regression model. As expected, the coefficient on systematic risk disclosure is positive and significant (β1 = 0.0382, std. error = 0.0103, t-statistic = 3.71), while the coefficient on idiosyncratic risk disclosure is not statistically significant. This suggests that, when managers disclose more systematic risk factors, investors respond by incorporating this risk into the systematic market-based measure BETA. On the other hand, the sixth column in Table 6 presents results for abnormal stock return volatility when both systematic and idiosyncratic risk disclosure measures are included in the regression model. As expected, the coefficient on idiosyncratic risk disclosure is positive and significant (β1 = 0.0005, std. error = 0.00025, t-statistic = 2.00), while the coefficient on systematic risk disclosure is not statistically significant. This suggests that, when managers disclose more idiosyncratic risk factors, investors respond by incorporating this risk into the idiosyncratic market-based measure STDERET. Taken together, the evidence from Tables 5 and 6 suggest that investors incorporate the information conveyed by risk factor disclosures into post-disclosure market-based measures of firm risk.
3.7 Risk factor disclosures and post-disclosure information asymmetry
In our third hypothesis, we predict that—conditional on their impact on investors’ risk perceptions as tested above in H2—risk factor disclosures will be negatively associated with information asymmetry. To examine H3, we follow the prior literature and use bid-ask spread as a proxy for information asymmetry (i.e., Khan and Watts 2009, Lang et al. 2012) and add additional control variables designed to capture a firm’s information environment (Stoll 2000). In order to ensure we capture the incremental association between risk disclosures and information asymmetry, we also include the control variables for pre-disclosure risk and performance from Eq. (1), as well as post-disclosure risk. Specifically, we estimate the following regression:
where all variables are as defined in Eq. (2), SPREAD = the average daily difference between a firm’s bid and ask price measured over 12-months, and INV_PRC = the inverse of fiscal year ending stock price. We include post-disclosure market-based measures of risk to control for the effect of disclosures on firm risk, which can in turn affect bid-ask spread. If risk factor disclosures are negatively associated with post-disclosure market-based measures of information asymmetry (H3), we expect that β1 < 0. As before, we control for the disclosures that are made in MD&A at time t, use industry and year fixed effects, and use standard errors that control for both serial correlation and heteroscedasticity.
3.8 Results on risk factor disclosures and post-disclosure information asymmetry
Table 7 presents evidence regarding H3. The first column of Table 7 uses the key word count of the risk factor section as the independent variable of interest. We find a negative and significant association between RF_DISC and SPREAD (β1 = −0.0440, std. error = 0.0069, t-statistic = 6.38). This suggests that, if a firm moves from the 25th percentile to the 75th percentile of risk disclosure, the following period’s SPREAD is lower by 2.7 %. The results are unchanged in the second column when we substitute total word count as the independent variable rather than key word count. Overall, the results are consistent with H3 and suggest that, after controlling for the fact that risk factor disclosure increases the market’s assessment of a firm’s risk, risk factor disclosure also decreases information asymmetry among that same firm’s shareholders.
3.9 Investor reaction to risk factor disclosures
In our final hypotheses, we predict that risk factor disclosures will be negatively associated with abnormal returns at the 10-K filing date. We follow prior literature that examines information content around the 10-K release (Bernard and Stober 1989; Han et al. 1992; Wahlen 1994, among others) and estimate the following regression equation:
where RF_DISC and MDA_DISC are defined as in Eq. (3), CAR = abnormal return calculated over the three days around the Form 10-K filing date and adjusted for the effects of market returns, size, and growth (Fama and French 1993). We obtain the Form 10-K filing date with the text analysis described in Appendix 1. As before, we control for disclosures made in MD&A and use industry and year fixed effects, as well as standard errors that control for serial dependence and heteroscedasticity. If risk factor disclosures are negatively associated with abnormal returns around the Form 10-K release date (H4), we expect that β1 < 0.
Short-window returns at the 10-K release date could be a function of changes in either (1) firms’ expected future cash flows or (2) investors’ assessment of firm risk (Kothari 2001). Thus, we include controls for the information in the 10-K that is likely to lead to changes in firms’ expected future cash flows. First, we control for firms’ earnings surprise (ΔNI), defined as change in net income before extraordinary items divided by lagged market value of equity. Earnings surprises are likely to be correlated with changes in expected future cash flows. Second, we control for the change in analysts’ consensus earnings estimate for the next year (ΔEST), where the change is measured as the difference between the consensus estimate prior to and after the Form 10-K release date. If our short-window abnormal returns results are attributable to changes in markets’ perceptions of firm risk (and not cash flows), we should continue to find a negative association between risk factor disclosures and abnormal returns after controlling for ΔNI and ΔEST. Finally, we include controls for the effect of firms’ information environment on returns around the 10-K release date. Specifically, we include LOSSES, a dummy variable that takes on a value of one if there is one or more loss years (NI < 0) over the previous 5 years, and AVG_ACC, defined as the average absolute value of accruals calculated using the statement of cash flows deflated by average total assets.
3.10 Results on investor reaction to risk factor disclosures
Table 8 presents evidence regarding H4. The first column shows a negative and significant association between the key word count from risk factor disclosures (RF_DISC) and CAR (β1 = −0.0073, std. error = 0.0025, t-statistic = 2.92). This suggests that, if a firm moves from the 25th percentile to the 75th percentile of risk disclosure, the abnormal return around the 10-K release date is lower by 68 basis points. The results are similar in Column 2 when we use total word count. Overall, the results are consistent with H4 and suggest that investors incorporate the information conveyed by risk factor disclosures into firm stock price.
4 Sensitivity analysis and robustness
4.1 Controls for risk disclosures in Item 7A of Form 10-K
As previously mentioned, risk factor disclosure in Item 1A of Form 10-K conveys information about anything that makes the company “speculative or risky.” Thus, the tone of the disclosure is, by definition, negative, and so this disclosure provides a powerful setting in which to examine whether disclosure tone affects the cost of capital and firm value. However, another section of the Form 10-K conveys qualitative and quantitative disclosure about risk—Item 7A “Quantitative and Qualitative Disclosures about Market Risk” (hereafter referred to as “market risk” disclosures).
In our primary regression analysis, we do not control for market risk disclosures since, just as with the risk factor disclosures in Item 1A, they provide qualitative disclosure relating to items that make the company risky. Thus, including a control for these disclosures could “over-control” for the information we are trying to capture with the Item 1A Risk Factor disclosure. However, these market risk disclosures are different from risk factor disclosures in that, in addition to providing qualitative risk disclosures about a narrow set of risks, they also quantify the impact that these certain risks have on firms’ financial statements. Since our measures are designed only to capture qualitative disclosures, this additional quantitative information contained by the disclosures in Item 7A should not affect our results. Nevertheless, we control for the information in this section in order to provide further assurance that our results strictly hold for qualitative risk factor disclosure rather than a mixture of qualitative and quantitative disclosures.
In untabulated results, we re-test all of our hypotheses but include controls for the Item 7A market risk disclosures. Specifically, when our variable of interest is the total word count for the risk factor section, we control for the total word count in the market risk section, and when our variable of interest is key word count in the risk factor section, we control for the total number of key words in the market risk section. All of the results from our hypothesis testing are quantitatively and qualitatively unchanged by these additional controls for market risk disclosures.
4.2 Expectations model for risk factor disclosures
In our market tests of H2 through H4, we control for the market’s pre-disclosure expectations of disclosure by controlling for pre-disclosure measures of firm risk (i.e., all of the variables from the risk factor determinants model used to test H1). Therefore, the coefficient loading on our variable of interest is purged of the effect of the market’s expectations of risk factor disclosures and instead captures the effect of the abnormal amount of risk factor disclosure. To provide further assurance, we re-examine our hypotheses with a two-stage regression model where we formally calculate abnormal risk factor disclosure. Specifically, we estimate our determinants model in Eq. (1) after also including lagged risk factor disclosure as a determinant. We take the residual from this regression and label it “unexpected risk factor disclosure.” Finally, we use this unexpected risk factor disclosure variable to re-test all of our hypotheses related to H2 through H4. All of the results from our hypothesis testing are unaffected, providing further assurance that our results are not sensitive to explicitly controlling for the market’s expectations regarding risk factor disclosure.
4.3 Alternate regression specifications
In our primary tests, we examine the effect of disclosure tone on the future level of market beta, stock return volatility, and information asymmetry. To ensure this is capturing a response to risk disclosures, we include controls for the lagged level of market beta, stock return volatility, and information asymmetry. This allows the coefficient on the lagged level to vary, depending on the persistence of the variable in question. In contrast, a pure changes model tests a similar construct but instead constrains the coefficient on the lagged value of the dependent variable to equal one.
As a final robustness test, we re-estimate our hypotheses by substituting the change in the dependent variable for the level of the dependent variable. For brevity, and to be comprehensive with all of our robustness checks in this section, we also include controls for market risk disclosures (Sect. 4.1) as well as the two-stage expectations model for the risk factor disclosures (Sect. 4.2), although it should be noted that the results from Tables 5 through 8 are unchanged if we simply change the dependent variable from a level to a change. In untabulated results, all of our results for H2 through H4 continue to hold under a pure changes model, providing further assurance that our results are not due to correlated omitted variables.
5 Conclusion
This study examines the information content of risk factor disclosures for market-based risk, information asymmetry, and stock returns and offers four results. First, we find that risk factor disclosures are positively associated with pre-disclosure measures that proxy for firm risk, and this association varies predictably across subcategories of firm risk (i.e., systematic, idiosyncratic, financial, tax, and litigation risks). This suggests that—contrary to critics’ assertions—risk factor disclosures are not boilerplate. Instead, managers appear to provide informative disclosures. Second, we find that risk factor disclosures are positively associated with post-disclosure market-based measures of firm risk. This suggests that market participants incorporate the information conveyed by risk factor disclosures into their assessments of firm risk. Third, after controlling for the investors’ assessment of fundamental risk, we find that risk factor disclosures are negatively associated with post-disclosure information asymmetry. This suggests that, while risk factor disclosures increase the market’s assessment of a firm’s fundamental risk, the public availability of the disclosures simultaneously decreases information differences across that firm’s shareholders (after controlling for the effect of fundamental risk on information asymmetry). Finally, we find that risk factor disclosures are negatively associated with abnormal returns at the Form 10-K release date. Overall, our results suggest that managers provide useful risk factor disclosures and investors incorporate this information into market values.
Future research might examine whether firms that provide incremental risk disclosures, and thus increase the market’s assessment of their overall risk, are more likely to experience negative future outcomes such as decreased investment or increased probability of bankruptcy and credit default. Additionally, our study focuses strictly on the effect of risk factor disclosures on the equity market’s assessment of firm risk. Future studies could examine whether risk factor disclosures are informative to debt markets either through investors’ pricing of debt or through future changes in credit ratings.
Notes
Some firms voluntarily provide risk disclosures in MD&A if they also provide forward-looking statements about future performance. That is not our focus. We focus on the newly created risk factor disclosure section because it is mandatory for all firms, and throughout all of our tests, we explicitly control for MD&A risk disclosures (and their risk related keywords). We acknowledge that some firms may have moved their voluntary risk disclosures from MD&A to the risk factors disclosure section after it was mandated. Thus, for these firms, the newly created section may not provide completely new information. Nevertheless, our tests are designed to incorporate investors’ expectations of risk disclosure and suggest that the newly created disclosures are informative.
We acknowledge that it is possible for firms to provide risk factor disclosure that describes a negative/pessimistic event that is actually less negative/pessimistic than the market expects it to be. Thus, for our market tests, we explicitly control for investors’ expectations of risk factor disclosure. See Sects. 3.1 and 3.3.
Appendices 1 and 2 describe our text analysis procedures and our methods for classifying key words into the five risk subcategories. We first classify key words into financial, tax, and legal risk subcategories. With the remaining words, we classify them as “other-systematic” if they relate to economy-wide risk and “other-idiosyncratic” if they relate to firm-specific risk. As shown in Table 2, 69 % of keywords in the average risk factor disclosure are comprised of words that fall into the “other” categories.
Cornwell et al. v. Credit Suisse Group et al., No. 08 Civ. 3758, 2010 U.S. Dist.
We acknowledge that short-window returns at the 10-K release date could be a function of changes in either (1) firms’ expected future cash flows, or (2) the assessment of firm risk. We interpret our results as being related to firm risk. For more assurance regarding this interpretation, in Table 8 we control for both firms’ earnings surprise and changes in analysts’ estimates of future earnings, as well as other variables in prior literature that could indicate a change in firms’ cash flows. Our tests suggest that the inferences with respect to abnormal returns reflect a change in investors’ perception of firm risk.
In our main tests, we only use qualitative disclosure information from Item 1A “Risk Factors” and Item 7 “Management’s Discussion & Analysis of Financial Condition and Results of Operation” (MD&A). In sects. 4.1 and 4.3, we also control for disclosures in “Quantitative and Qualitative Disclosures about Market Risk” (Item 7A) and our results are unaffected.
Alternative methods to measure changes in textual risk disclosures include the rate of change in the frequency of specific words used within text or the frequency of word groups within a sentence (Brown and Tucker 2011; Nelson and Pritchard 2007). We choose an expectations model as our proxy for changes in risk factor disclosure due to its relative empirical simplicity and since it explains approximately 80 % of the variation in risk factor disclosures (see Table 9).
We acknowledge that there is no consensus in the literature as to which key word list is most appropriate. There are two other relevant studies that report their list of key risk words. Nelson and Pritchard identify 75 risk factor terms, and Kravet and Muslu (2013) identify 20 risk-related keywords based on their reading of 100 randomly selected annual reports. Our key word list includes the vast majority of these identified key words but is expanded considerably using the Latent Dirichlet Allocation method described in the paper.
Specifically, the trading industry includes the following firm descriptions: “security and commodity brokers,” “closed-end management investments,” “trusts,” and “unit investment trusts.”
Throughout the analysis, we evaluate the effects of multicollinearity with variance inflation factors (VIFs). In their textbook, Kutner et al. (2004) indicate that multicollinearity is not a problem when VIFs are less than 10. The results indicate that multicollinearity is not a serious concern in any of our multivariate regressions. Thus, for expositional purposes, we do not tabulate or discuss these results for each model.
Since each left-hand side variable is specified as a percentage of the total key words in the risk factor section, and since the sample (and dependent variables) are the same across all regressions, the coefficients for a particular risk proxy are comparable across all of the regressions and indicate the percentage increase in total key words resulting from that risk proxy.
We acknowledge that we do not explicitly control for the MD&A tone. As noted by Kothari et al. (2009), this is not easy to do as it requires software-reading technologies that are not particularly accurate. However, we follow prior literature that assesses the tone of MD&A (Tetlock 2007; Kravet and Muslu 2013) and count the number of words that relate to risk, assuming that the context of these words is negative/pessimistic (i.e., our variables MDA_DISC, MDA_SYS, MDA_IDIO, MDA_FIN, MDA_LIT, and MDA_TAX).
The fog index is defined by Li (2008b) as (words per sentence + percent of complex words) * 0.4.
As before, we also add financial risk to each of these measures since prior literature shows that financial risk affects both systematic and idiosyncratic risk. Similarly, we do not include legal or tax risk in either of these categories since it is difficult to determine whether these risks are firm-specific, and prior literature does not provide much guidance in this respect.
For comparison purposes, Kothari et al. (2009) examine the effect of negative/pessimistic disclosure across three sources of disclosure (corporations, analysts, business press), and their results suggest that moving from the 25th percentile to the 75th percentile increases firms’ cost of capital by 2.0%. However, as previously mentioned, they find no such relation when the source of the disclosure is the corporation itself.
To ensure that the removal of plain text filings does not bias our sample, in Sect. 3, we compare our sample to the overall universe of Compustat firms. We find that our sample is generalizable across industries and years. In addition, we include industry and year fixed effects in all of our multivariate analyses.
References
Amihud, Y., & Mendelson, H. (1986). Asset pricing and the bid-ask spread. Journal of Financial Economics, 17, 223–249.
Ball, R., & Brown, R. (1968). An empirical evaluation of accounting income numbers. Journal of Accounting Research, 6, 159–178.
Ball, R., & Kothari, S. P. (1989). Nonstationary expected returns: Implications for tests of market efficiency and serial correlation in returns. Journal of Financial Economics, 25, 51–74.
Barry, C., & Brown, S. (1985). Differential information and security market equilibrium. Journal of Financial and Quantitative Analysis, 20, 407–422.
Beaver, W. (1968). The information content of annual earnings announcements. Journal of Accounting Research, 6, 67–92.
Bernard, V., & Stober, T. (1989). The nature and amount of information in cash flows and accruals. The Accounting Review, 64, 624–652.
Blei, D., Ng, A., & Jordan, M. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.
Botosan, C. (1997). Disclosure level and the cost of capital. The Accounting Review, 72, 323–350.
Botosan, C., & Plumlee, M. (2002). A re-examination of disclosure level and the expected cost of equity capital. Journal of Accounting Research, 40, 21–41.
Brown, S., & Tucker, J. (2011). Large sample evidence on firms’ year-over-year MD&A modifications. Journal of Accounting Research, 49, 309–346.
Bryan, S. (1997). Incremental information content of required disclosures contained in management discussion and analysis. The Accounting Review, 72, 285–301.
CFO (2010). SEC pushes companies for more risk information. CFO Magazine, August 2, 2010.
Chan, K. (1988). On the contrarian investment strategy. Journal of Business, 61, 147–163.
Clarkson, P., Kao, J., & Richardson, G. (1999). Evidence that management discussion and analysis (MD&A) is a part of a firm’s overall disclosure package. Contemporary Accounting Research, 16, 111–134.
Corporate Counsel (2006). MD&A Risk Factors (Nelson Rocks Preserve-Style. By Broc Romanek and Dave Lynn. www.thecorporatecounsel.net, November 30, 2006.
Demsetz, H. (1986). Corporate control, insider trading, and rates of return. American Economic Review, 76, 313–316.
Diamond, D., & Verrecchia, R. (1991). Disclosure, liquidity, and the cost of capital. Journal of Finance, 66, 1325–1355.
Drake, M., Roulstone, D., Thornock, J. (2011). The demand for mandatory disclosure: evidence from investors’ use of the EDGAR database. Working paper, Ohio State University.
Easley, D., & O’Hara, M. (2004). Information and the cost of capital. Journal of Finance, 59, 1553–1583.
Fama, E., & French, K. (1993). Common risk factors in the returns on stock and bonds. Journal of Financial Economics, 33, 3–56.
Fields, T., Lys, T., & Vincent, L. (2001). Empirical research on accounting choice. Journal of Accounting and Economics, 31, 255–307.
Guedhami, O., & Pittman, J. (2008). The importance of IRS monitoring to debt pricing in private firms. Journal of Financial Economics, 90, 38–58.
Hamada, R. S. (1972). The effect of the firm’s capital structure on the systematic risk of common stocks. Journal of Finance, 27, 435–452.
Han, B., Jennings, R., & Noel, J. (1992). Communication of nonearnings information at the financial statement release date. Journal of Accounting and Economics, 15, 63–86.
Healy, P., & Palepu, K. (2001). Information asymmetry, corporate disclosure, and the capital markets: A review of the empirical disclosure literature. Journal of Accounting and Economics, 31, 405–440.
Jayaraman, S. (2008). Earnings volatility, cash flow volatility and informed trading. The Accounting Review, 46, 809–851.
Jiang, G., Lee, C., & Zhang, Y. (2005). Information uncertainty and expected returns. Review of Accounting Studies, 10, 185–221.
Jones, C., & Weingram, S. (1996). The determinants of 10b-5 litigation risk. Stanford Law School: Working paper.
Ke, B., Huddart, S., & Petroni, K. (2003). What insiders know about future earnings and how they use it: Evidence from insider trades. Journal of Accounting and Economics, 35, 315–346.
Khan, M., & Watts, R. (2009). Estimation and empirical properties of a firm-year measure of accounting conservatism. Journal of Accounting and Economics, 48, 132–150.
Kim, O., & Verrecchia, R. (1994). Market liquidity and volume around earnings announcements. Journal of Accounting and Economics, 17, 41–68.
Klein, R., & Bawa, V. (1976). The effect of estimation risk on optimal portfolio choice. Journal of Financial Economics, 3, 215–231.
Kothari, S. P. (2001). Capital markets research in accounting. Journal of Accounting and Economics, 31, 105–231.
Kothari, S. P., Li, X., & Short, J. (2009a). The effect of disclosures by management, analysts, and business press on cost of capital, return volatility, and analyst forecasts: a study using content analysis. The Accounting Review, 84, 1639–1670.
Kothari, S. P., Shu, S., & Wysocki, P. (2009b). Do managers withhold bad news? Journal of Accounting Research, 47, 241–276.
Kravet, T., & Muslu, V. (2013). Textual risk disclosures and investors’ risk perceptions. Review of Accounting Studies. doi:10.1007/s11142-013-9228-9.
Kutner, M., Nachsheim, C., & Neter, J. (2004). Applied linear statistical models (5th ed.). New York, NY: McGraw-Hill Irwin.
Kyle, A. (1985). Continuous auctions and insider trading. Econometrica, 6, 1315–1335.
Lambert, R., Leuz, C., & Verrecchia, R. (2007). Accounting information, disclosure, and the cost of capital. Journal of Accounting Research, 45, 385–420.
Lambert, R., Leuz, C., & Verrecchia, R. (2012). Information asymmetry, information precision, and the cost of capital. Review of Finance, 16, 1–29.
Lang, M., Lins, K., & Maffett, M. (2012). Transparency, liquidity, and valuation: International evidence on when transparency matters most. Journal of Accounting Research, 50, 729–774.
Lang, M., & Lundholm, R. (1996). Corporate disclosure policy and analyst behavior. The Accounting Review, 71, 467–492.
Lehavy, R., Li, F., & Merkley, K. (2011). The effect of annual report readability on analyst following and the properties of their earnings forecasts. The Accounting Review, 86, 1087–1115.
Lennox, C. (1999). Audit quality and auditor size: An evaluation of reputation and deep pockets hypotheses. Journal of Business, Finance, and Accounting, 26, 779–805.
Leuz, C., & Verrecchia, R. (2000). The economic consequences of increased disclosure. Journal of Accounting Research, 38, 91–124.
Li, F. (2008a). Do stock market investors understand the risk sentiment of corporate annual reports? Working paper, University of Michigan.
Li, F. (2008b). Annual report readability, current earnings, and earnings persistence. Journal of Accounting and Economics, 45, 221–247.
Li, F. (2010). The information content of forward-looking statements in corporate filings: a naïve Baysian machine learning approach. Journal of Accounting Research, 48, 1049–1102.
Linsmeier, T., Thornton, D., Venkatachalam, M., & Welker, M. (2002). The effect of mandated market risk disclosures on trading volume sensitivity to interest rate, exchange rate, and commodity price movements. The Accounting Review, 77, 343–377.
Mansi, S., Maxwell, W., & Miller, D. (2004). Does auditor quality and tenure matter to investors? Evidence from the bond market. Journal of Accounting Research, 42, 755–793.
Modigliani, F., & Miller, M. (1958). The cost of capital, corporation finance and the theory of investment. American Economic Review, 48, 261–297.
Nelson, K., & Pritchard, A. C. (2007). Litigation risk and voluntary disclosure: The use of meaningful cautionary language. Working paper, Rice University.
Reuters (2005). “Refco risks boiler-plate disclosure.” By Scott Malone. Friday, October 21, 2005. http://w4.stern.nyu.edu/news/news.cfm?doc_id=5094.
Roulstone, D. (1999). Effect of SEC financial reporting release No. 48 on derivative and market risk disclosures. Accounting Horizons, 13, 343–363.
Schrand, C. (1997). The association between stock-price interest rate sensitivity and disclosures about derivative instruments. The Accounting Review, 72, 87–109.
Scott, T. (1994). Incentives and disincentives for financial disclosure: Voluntary disclosure of defined benefit pension plan information by Canadian Firms. The Accounting Review, 69, 26–43.
SEC (2005). Securities and exchange commission final rule, release no. 33–8591 (FR-75). http://www.sec.gov/rules/final/33-8591.pdf.
SEC (2009). Remington Arms Company, Inc. Form 10-K Filing for the year ended December 31, 2009, SEC EDGAR database.
SEC (2010). Form 10-K instructions. www.sec.gov/about/forms/form10-k.pdf.
Skinner, D. (1994). Why firms voluntarily disclose bad news. Journal of Accounting Research, 32, 38–60.
Spindler, J. (2006). Is it time to wind up the Securities Act of 1933? Regulation, Winter 2006–2007.
Stoll, H. (2000). Friction. Journal of Finance, 55, 1479–1514.
Tetlock, P. (2007). Giving content to investor sentiment: The role of media in the stock market. Journal of Finance, 62, 1139–1168.
Tufano, P. (1998). Who manages risk? An empirical examination of risk management practices in the gold mining industry. Journal of Finance, 51, 1097–1137.
Vuolteenaho, T. (2002). What drives firm-level stock returns? Journal of Finance, 57, 233–264.
Wahlen, J. (1994). The nature of information in commercial bank loan loss disclosures. The Accounting Review, 69, 455–478.
Watts, R. (1977). Corporate financial statements: Product of market and political processes. Australian Journal of Management, 2, 53–75.
Watts, R., & Zimmerman, J. (1986). Positive accounting theory. Englewood Cliffs, NJ: Prentice Hall.
Wong, M. H. F. (2001). The association between SFAS No. 119 derivatives disclosures and the foreign exchange risk exposure of manufacturing firms. Journal of Accounting Research, 38, 387–417.
You, H., & Zhang, X. (2009). Financial reporting complexity and investor underreaction to 10-K information. Review of Accounting Studies, 14, 559–586.
Acknowledgments
We are grateful for the helpful comments and suggestions of an anonymous reviewer, Kris Allee, Terry Baker, Liz Chuk, James Cotter, Jon Duchac, Fabio Gaertner, Ronen Gal-Or, Andrea Kelton, Feng Li, Russell Lundholm, Bill Marcum, Dale Martin, Rick Mergenthaler, Karen Nelson, Deon Strickland, Jake Thornock, the doctoral students at the University of Arizona, workshop participants at Wake Forest University, and conference participants at the 2011 AAA Annual Meeting and the 22nd Annual Conference on Financial Economics and Accounting (FEA) at Indiana University. Finally, we thank both the Securities and Exchange Commission (SEC) and the Institute of Chartered Accountants in England and Wales (ICAEW) for their interest in this paper. Portions of this paper were included in the ICAEW’s October 2011 report titled “Reporting Business Risks: Meeting Expectations.” Professor Lu gratefully acknowledges assistance from the National Science Council of Taiwan (grant number NSC100-2410-H-002-025-MY3).
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: Summary of textual analysis procedures to generate risk factor disclosure measures
1.1 Data collection technique
In this appendix, we discuss the textual analysis procedures employed in the collection of risk disclosures from annual filings in the SEC’s Electronic Data Gathering and Retrieval (EDGAR) database (for a thorough treatment on who uses the EDGAR database to obtain information, and the most commonly retrieved forms, see Drake et al. 2011). Figure 1 illustrates the specific sequence of steps used in our textual analysis. As presented in Fig. 1, annual filings are downloaded and processed to generate appropriate counting measures that can objectively quantify firms’ risk disclosure. The rest of this section presents our system design in detail.
1.2 File collection
Our system starts by collecting all relevant 10-K filings and storing them in a relational database. Since subsequent analysis and processing may require several runs of prototyping and testing, we create our own repository in order to increase the efficiency of subsequent activities. EDGAR provides convenient filing download mechanisms based on the File Transfer Protocol (FTP). The downloader first retrieves all index files beginning in 2005. All 10-K filings are downloaded according to the index files. The procedure ensures that all annual filings that were uploaded to the EDGAR database between January 1, 2005 and December 31, 2009 are collected. In total, 44,998 10-K filings were downloaded and stored in our repository.
1.3 File preprocessing
The goal of file preprocessing is to extract important items, including risk factors, from individual filings for subsequent text analysis. The EDGAR system requires firms to upload their Form 10-K reports in either (1) plain text format or (2) HTML format. Our software is unable to reliably scan plain text filings, so we delete these filings from our sample. These filings account for 21 % of the population.Footnote 16 On the other hand, our software is able to reliably scan HTML filings, and we use an automatic procedure to extract text from the following subsections of the Form 10-K: risk factors (Form 10-K, Item 1A), MD&A (Item 7), and market risk disclosures (Item 7A).
Our item extraction procedure is based on the assumption that HTML filings contain visual clues for human readers to recognize item boundaries easily. These visual clues include the use of subsection titles, boldface fonts, extra spacing, and so on. Most filings also use standard item headings that start with “Item,” followed by an item number and a description (e.g., “ITEM 1A. RISK FACTORS”). The HTML format, however, allows visually the same display to be achieved using different tags. One example is that < b > and < strong > have the same visual effect in most browsers. The flexibility in composing HTML filings creates a technical challenge when designing the item extraction procedure. Our design overcomes this challenge by first converting an HTML file into an intermediate representation that combines HTML tags that have similar effects for human readers when deciding item boundaries. A list of candidate item heading locations than can be identified using the intermediate representation. Finally, items are extracted based on the locations of candidate heading locations. We describe the three steps in details below.
In the first step, the input HTML filing is parsed into a tree structure, where the leaf nodes are text segments of the content and the internal nodes are HTML tags such as “b,” “title,” or “li.” The tree structure allows us to associate the characteristics of a text segment by traveling upward and inspecting the parent nodes.
To facilitate subsequent extraction, we further convert the parse tree into a flat structure by traversing the tree and calculating two scores for each text segment: (1) an emphasizing score and (2) a segmentation score. Both of these scores are important to ensure that we have fully extracted all of the text within a Form 10-K “Item” or subsection. The emphasizing score is designed to determine the prominence of the text within the disclosure section. To compute the emphasizing score, we examine the set of HTML tags surrounding the text. Examples include the “strong” tag and the “underline” style within a “div” tag. A complete list of emphasizing tags can be found in Appendix 2. Emphasizing scores are computed for each text segment according to the number of emphasizing tags in its parent nodes. A positive emphasizing score indicates that the text segment is visually more prominent and may be the heading of an item.
The segmentation score is designed to indicate whether and how the text is visually broken down into sections. HTML tags such as “<br>” and “<p>” are two common tags used to separate text. By tracking the number of “separation tags” in the parent nodes when traversing the parse tree, we can detect the locations where text segments are visually separated. The segmentation score is the number of increased or decreased “separation tags.”
The second step is to construct candidate item heading locations using the flat structure. We assume that the item headings are visually more prominent (emphasizing score > 0) with a text segment that “looks like” item headings (using regular expression “(Item\s +\d\D?\.?),” case ignored). The output of this step is a list of locations that may be the beginning of an item.
The last step is to extract items of interest using the list of candidate item heading locations. To extract the risk factor subsection (Item 1A), we scan the text around the candidate item heading locations for the keyword “risk factors.” The text from a matching item heading location until the beginning of the following candidate headings locations are assigned to the risk factor subsection. Other items of interests are processed in the same manner.
Extracted items go through consistency checks before recording back to the database. The assumption is that Item 1A (Risk Factors) should precede Item 7 and Item 7A. Item heading locations that do not match the ordering are rejected. We conduct performance analysis to ensure the quality of the item extraction procedure. Two aspects are of particular interest. The first aspect is the proportion of HTML filings that can be extracted. The proportion is referred to as the coverage of selected items. The following table summarizes the coverage of our procedure:
Fiscal year | 10 K Filings | HTML filing | Extracted Item 1A | Extracted Item 1A and Item 7 | Extracted Item 1A, Item 7, and Item 7A |
---|---|---|---|---|---|
2005 | 8,996 | 6,326 | 4,255 | 3,593 | 3,558 |
2006 | 8,941 | 6,842 | 5,833 | 4,833 | 4,764 |
2007 | 7,318 | 7,005 | 5,936 | 5,021 | 4,863 |
2008 | 9,236 | 8,624 | 7,081 | 5819 | 5,480 |
Totals | 34,491 | 28,797 | 23,105 | 19,266 | 18,665 |
Coverage | 80 % | 67 % | 65 % |
Among all the filings collected, 34,491 filings are from fiscal years 2005 to 2008. Eighty-three percent (28,797) of them are HTML filings. Our procedure is able to extract 80 % of the risk factor subsections from the HTML filings. If we restrict to the subset that have successfully extracted both Item 1A and Item 7, then the number drops to 67 %. Our procedure can extract Item 1A, Item 7, and Item 7A from 65 % of HTML filings
The second performance aspect of importance is the precision of the extracted items. We visually inspect 300 random filings from the subset that Item 1A was extracted using the procedure. We were unable to extract nine of these filings.Footnote 17 Among the remaining 291, all of them correctly contain only the appropriate subsection. Thus, the precision of Item 1A extraction is 100 %. Similarly, our software was unable to extract Item 7 from 47 firms in this subset. Among the remaining 253, 249 of them were extracted correctly. Thus, the precision for Item 7 is 98 % (249/253). Finally, our software extracted Item 7A from 281 of the 300 firms in this subset and five extracted items contain other subsections, for a precision of 98 % (281/286)
Overall, the results show that our software extracted the risk factors from 80 % of HTML filings. Moreover, for over 98 % of the extracted items, the right subsection, and only the right subsection was extracted. Among the small number of incorrectly extracted subsections, most of them contain the text from the target item, with small chunks of text from other subsections
1.4 Text quantification using risk keywords
The extracted items need to be quantified in order to be included in our empirical models. We first count the total words within a subsection and label it as word count (ALL_WORDS). Then, we identify key words using the predefined dictionary from Appendix 3. We developed our list of key words in three steps. First, we begin with key risk words used by prior literature (Nelson and Pritchard 2007). Second, we add additional key words to the list that, based on our review of risk factor disclosures, appear to be common across firms. Third, we classify the list of key words as relating to financial, litigation, tax, other-systematic, or other-idiosyncratic risk subcategories as described in Sect. 3.1 of the text. To enhance the coverage of our key words, we reviewed important terms identified by a document clustering approach, known as the Latent Dirichlet Allocation (Blei et al. 2003). By inspecting important terms in the same cluster, keywords that were previously missing were included.
Finally, the text quantification module computes the frequency of each term and aggregates the number of key words according to the risk subcategory (KEY_WORDS). To increase the precision of the key word matching process, the text quantification module allows term matching to be case sensitive. This constraint is especially useful for acronyms such as IRS (i.e., Internal Revenue Service) and EU (i.e., European Union). If there are variations of terms that need to be matched, these variations are explicitly specified in the keyword list. For example, the criterion “(lease|leases|leasing)” is used to match the variation of “lease.” By performing these techniques, we compute three measures for each extracted subsection of the Form 10-K: (1) total word count (ALLWORDS), (2) total key word count (KEYWORDS), and (3) key word count (KEYWORDS) by risk subcategory.
Appendix 2
HTML Tags Used to Identify the Text from Applicable Subsections of Form 10-K.
This appendix provides the specific HTML tags that were used to determine the “emphasizing score” and the “separation score.” These tags are used to help us fully extract on those subsections of the Form 10-K that are used in our analysis. For example, the emphasizing tags help to identify when text is bolded or underlined, and the separation tags help to identify when text is segmented by paragraph breaks or section headings. This procedure is fully explained in Appendix 1, Textual Analysis Data Collection Process.
Emphasizing tags | Separation tags | ||
---|---|---|---|
Tag | Attribute | Tag | Attribute |
b | br | ||
strong | dd | ||
h1 | dl | ||
h2 | dt | ||
h3 | form | ||
h4 | hr | ||
h5 | li | ||
u | p | ||
font | style = (bold|underline) | pre | |
div | style = (bold|underline) | table | |
p | style = (bold|underline) | tbody | |
td | |||
th | |||
title | |||
tr | |||
ul | |||
div | . |
Appendix 3
See Table 9.
Rights and permissions
About this article
Cite this article
Campbell, J.L., Chen, H., Dhaliwal, D.S. et al. The information content of mandatory risk factor disclosures in corporate filings. Rev Account Stud 19, 396–455 (2014). https://doi.org/10.1007/s11142-013-9258-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11142-013-9258-3