1 Introduction

The amount of runoff from a forested watershed changes depending on the degree of tree removal (e.g., Bosch and Hewlett 1982). The forest ecosystem can greatly influence the water balance in a watershed, which, at the annual time scale, may be expressed as:

$$ P=E+{Q}_{\mathrm{R}} $$
(2.1)

where P is precipitation, E is evapotranspiration, and Q R represents runoff. Evapotranspiration may be further subdivided into three components:

$$ E={E}_{\mathrm{T}}+{E}_{\mathrm{I}}+{E}_{\mathrm{F}} $$
(2.2)

where E T is transpiration from forest ecosystem, E I is interception loss, and E F is evaporation from forest floor.

As Oki and Kanae (2006) state, water resources engineers consider P − E as a measure of the maximum renewable freshwater resources for a watershed (blue water). Thus, the accurate measurement of P and E, with the latter being mostly comprised of E T and E I for closed-canopy forests (e.g., Wilson et al. 2001), is critical for estimating runoff generation in forest ecosystems.

Interception loss , E I, is usually estimated as the difference between rain falling on the forest canopy and the proportion of that rain delivered to the forest floor (Helvey and Partic 1965; Carlyle-Moses and Gash 2011). Rain depth is usually measured using a tipping-bucket rain gauge or some type of tipping-bucket flow meter (e.g., Reid and Lewis 2009). However, tipping-bucket rain gauges and flow meters commonly underestimate inflow rates, especially for higher intensity rains and flow inputs (e.g., Edwards et al. 1974; Iida et al. 2018; Shimizu et al. 2018), and, as a consequence, a minimum 10% uncertainty in E I estimates should be expected when uncorrected gauge and flow meter data are used (Iida et al. 2012). Transpiration, E T, is evaluated using sap flow measurements with sensors inserted into the boles of trees (e.g., Kumagai et al. 2014); however, this technique may underestimate sap flux densities . For example, Steppe et al. (2010) determined that laboratory-derived sap flux densities and thus, by extension, transpiration rates of freshly cut stem segments of Fagus grandifolia trees were underestimated by 60%. In order to evaluate E I and E T correctly, the measurement uncertainties associated with tipping-bucket gauges and flow meters , as well as sap flow sensors, must be defined and taken into consideration when performing instrument calibration.

In this chapter, we describe in detail how to calibrate tipping-bucket rain gauges , tipping-bucket flow meters, and sap flow sensors (Fig. 2.1). Additionally, based on laboratory calibrations, we evaluate the effect of applying calibration on E I estimates. Tree-to-tree and site-specific differences in calibration results of sap flow sensors and the degree of uncertainty that can be expected in estimating E T are also evaluated and discussed.

Fig. 2.1
figure 1figure 1

Calibration procedures for (a) tipping-bucket rain gauges and flow meters and (b, continued on the following page) three sap flow techniques of thermal dissipation (TD) method , heat field deformation (HFD) method , and heat ratio (HR) method

2 Correction of Tipping-Bucket Rain Gauges and Flow Meters for Interception Loss Estimates

When rain falls on a forest, a proportion of the rain is stored on the tree canopies and boles that comprise some of this storage being evaporated back to the atmosphere. This interception loss (E I) can be appreciable, accounting for 10–50% of precipitation (Roth et al. 2007). Although some researchers evaluate E I based on the estimation of rainwater stored on the tree by detecting stem compression (e.g., Friesen et al. 2008) or a much more promising method of detecting shifts of tree sway frequency using accelerometers (e.g., van Emmerik et al. 2017; see Chap. 6 of this volume), E I in forests is more as commonly estimated as (e.g., Carlyle-Moses et al. 2018):

$$ {E}_{\mathrm{I}}=P-\left({T}_{\mathrm{f}}+{S}_{\mathrm{f}}\right) $$
(2.3)

where T f is throughfall and S f is stemflow .

It is well known that T f commonly exhibits high spatial heterogeneity with some areas of the forest floor receiving T f inputs greater than P (e.g., Lloyd and Marques 1988; Carlyle-Moses and Lishman 2015). In order to obtain spatially representative T f measurements, large numbers of gauges and/or gauges designed for integrating the spatial variability of T f (e.g., trough-style gauge) are required (Carlyle-Moses et al. 2014; Su et al. 2016). Iida (2009), for example, found that for precise estimates of T f (error ≤ 5%) more than 1 m2 of gauge collection area must be distributed beneath the forest canopy of interest. Thus, the heterogeneity associated with forest overstory and the associated water routing through the canopy mean that meeting the statistical objectives of a T f measurement campaign will often require many more gauges than that required to estimate precipitation (e.g., Kimmins 1973; Carlyle-Moses et al. 2004; Ziegler et al. 2009). The minimum number of gauges required to estimate T f for a desired error (ε) and confidence level (t) may be estimated using the following (Kimmins 1973; Puckett 1991):

$$ {n}^{\prime }=\frac{t^2\bullet {CV}^2}{\varepsilon^2} $$
(2.4)

where n is the estimated number of gauges required to sample T f and CV is the coefficient of variation (%) of T f measurements.

Stemflow represents water that is routed by the tree canopy to the bole where it then flows to the forest floor. Like T f, S f can be highly spatially variable at the plot scale since S f yields can vary greatly among and within tree species due to factors such as tree age, canopy structure, and canopy rain shadows (Levia and Germer 2015). The quantitative importance of S f at the individual tree scale may be expressed using the funneling ratio (Herwitz 1985) and at the forest-scale using the stand-scale funneling ratio (Levia and Gemer 2015; Carlyle-Moses et al. 2018). The funneling ratio represents the ratio of the S f volume generated by a tree or trees to the volume of rain that would have been captured by a rain gauge having a diameter equal to that of the tree bole(s) at breast height (DBH). Funneling ratios at both the tree and forest stand-scale are often much greater than unity (e.g., Levia and Germer 2015; Carlyle-Moses et al. 2018), indicating that S f volume is often much more than P at basal area scale. Thus, in order to obtain reasonable estimates of T f and S f, and therefore E I, highly variable and often voluminous water inputs need to be measured with a high degree of accuracy.

Rainfall event scale E I is commonly estimated based on event-scale measurements of P, T f, and S f (e.g., van Dijk et al. 2015), employing storage-type gauges and large-capacity collection reservoirs. However, for a better understanding of the interception process, including the impact of meteorological factors and tree characteristics, more work examining the dynamics of E I is necessary at the intra-event time scale using high temporal resolution T f and S f measurements by tipping-bucket rain gauges and flow meters (e.g., Reid and Lewis 2009; Iida et al. 2017) as well as instrument systems utilizing ultrasonic rangefinders to monitor changes in collected flow depths (Turner et al. 2019). In such cases, the well-known systematic biases of tipping-bucket rain gauges and flow meters must be considered. When the bucket mechanism of a tipping-bucket gauge or flow meter is filled with water, it tips the other bucket into position to continue receiving water input. The water flux is then measured as the number of tips over a duration with this information being stored on a datalogger. If, however, water flows continuously during the period between one bucket tipping and the other bucket being brought into position, a certain amount of water does not flow into either bucket and is not registered with the degree of water input underestimation increasing with increasing intensity of inflow. This systematic underestimation has been reported and dynamically calibrated (e.g., Edwards et al. 1974). Moreover, the static amount of a tip (c), which is the volume associated with one tip of the tipping-bucket mechanism under very low intensity inflows, is frequently found to be different from that stipulated by the gauge or flow meter manufacturer (c m) (Shedekar et al. 2016; Iida et al. 2018; Shimizu et al. 2018). Thus, static calibration is critical for measuring water fluxes correctly with tipping-bucket rain gauges and flow meters. The necessary calibrations for several tipping-bucket and flow meter models have been reported (Iida et al. 2012, 2018; Shimizu et al. 2018; see Table 2.1). Based on these results and the established need to correct for the under-catch of these gauges and flow meters, we explain and detail the procedure necessary to properly calibrate these instruments.

Table 2.1 A list of calibration curves for various tipping-bucket rain gauges and flow meters

2.1 Static Calibration Methods

The calibration results of three tipping-bucket rain gauge models and eight tipping-bucket flow meter models amalgamated from the existing literature and this study are presented in Table 2.1. At first, static calibration, which quantifies the volume of one tip (c) directly, should be conducted (Fig. 2.1). Once c has been established, the relationship between the inflow rate and the actual amount one tip is determined by dynamic calibration.

Since manufacture stated tip-inducing volumes (c m) for the tipping-bucket rain gauges are all <16 mL (Table 2.1), we used injectors to fill the tipping-bucket mechanism with water on a drop by drop basis in order to obtain the volume of water required to cause a single tip (c). More specifically, c was found as the difference in the mass, using an electronic scale, of the injector plus the water within the injector at the start of the calibration procedure and the mass of the injector plus the water remaining in the injector once a tip had occurred. However, tipping-bucket flow meters have stated c m values  > 50 mL and, in order to avoid injection errors while determining c for these instruments, a beaker was used to help facilitate the calibration procedure. When calibrating the U50 flow meter, for example, 40-mL of water was initially added to the bucket with the injector being used for subsequent additions of water until c was realized.

2.2 Dynamic Calibration Methods

For dynamic calibration, it is imperative that a constant inflow rate (q, mL s−1) be obtained. The key to maintain a stable q during the calibration procedure is to keep a constant difference in head between the inlet and outlet of water flow. A simple apparatus, depicted in Fig. 2.2, is used to generate q. The constant difference in head can be attained by maintaining an overflow of water (Fig. 2.2). The value of q is given as the mass of stored water, measured by an electronic balance, over a certain duration determined by a stopwatch. For each intensity of q, at least 10 tips are recorded as the times of each tip with 10 Hz with a datalogger (type CR1000, Campbell Scientific Inc., Logan, Utah). The average time between tips (t, s) is derived, while the volume associated with a tip (v, mL) is found as:

Fig. 2.2
figure 2

Simple instrument to generate constant water flow for tipping-bucket rain gauge and flow meter calibrations

$$ v=q\bullet t. $$
(2.5)

Constant inflow-volume relationships may vary even for the same type of tipping-bucket rain gauge or flow meter model. Shiraki and Yamato (2004) proposed that a common relationship for the same model of gauge or flow meter can be obtained by scaling q and v with c, with Q = q/c and V = v/c. The best fit of the Q − V relationships for all tipping-bucket rain gauges and flow meters evaluated is a quadratic curve (Table 2.1).

$$ V=d\bullet {Q}^2+e\bullet Q+1 $$
(2.6)

where d and e are the fitting parameters.

As aforementioned, the underlying cause of the underestimation in water input rates is the failure of capturing q during the time of the tip (Δt). If Δt is constant to Q, the Q − V relationship must be fitted with a linear equation (V = Δt ∙ Q + 1). However, since a quadratic curve has been found to be a better fit of the Q-V relationship and not the linear line, this implies that Δt decreases with increasing Q (Iida et al. 2012). Shiraki et al. (2018) measured Δt by high-speed-digital video, and confirmed the decreasing trend in Δt with increasing Q. Shiraki et al. (2018) also noted that the movement of stored water resulting from the high kinetic energy of Q, decreases the value of V compared with the predicted value from a linear relationship derived using a smaller range of Q.

Constant inflow, Q, can be obtained from Eq. 2.6 and that V can be equated with Q ∙ t:

$$ Q=\frac{-\left(e-t\right)-\sqrt{{\left(e-t\right)}^2-4d}}{2d} $$
(2.7)

For the application of the calibration curves to data obtained in the field, Q is first calculated by substituting t into Eq.2.7, while the volume associated with a single tip is obtained as the product of c, Q, and t (Eq. 2.5), that is, v = q ∙ t = c ∙ Q ∙ t.

2.3 Dynamic Calibration of Tipping-Bucket Rain Gauges and Flow Meters

Of all the tipping-bucket rain gauges and flow meters evaluated, the U100 tipping-bucket flow meter shows the most significant underestimation of V for smaller values of Q (Table 2.1, Fig. 2.3). For this model, a 10% underestimation of V (i.e., V = v/c = 1.1) occurs when Q > 0.12 s−1. For tipping-bucket rain gauges, the OT and OC models have a relatively small amount of underestimation, and 10% errors are not detected for Q values less than 0.24 s−1 (Fig. 2.3). If U100 has c = 100 mL and Q = 0.12 s−1 then q (q = c ∙ Q) is 12 mL s−1. Additionally, if the c of the OT and OC rain gauges are 0.5 and 0.2 mm, respectively, then Q = 0.24 s−1 and thus q = 0.12 mm s−1 and 0.048 mm s−1, respectively.

Fig. 2.3
figure 3

Relationships between normalized inflow rate (Q) and one-tip volume (V) for a range of Q from (a) 0 to 1 and (b) 0 to 0.25 s−1. Refer to Table 2.1 for details of target tipping-bucket rain gauges and flow meters and calibration curves

For almost all types of large tipping-bucket flow meters (c m ≥ 150–200 mL), Shimizu et al. (2018) suggest that the Q − V relationships obtained through laboratory tests can be plotted within the range of ±2% of that for the I400 model (Table 2.1) when Q is less than 0.1–0.2 s−1 (corresponding to 0.1–0.2 Hz tipping rate). This indicates the calibration equation for I400 can be applicable for most types of large tipping-bucket flow meters. However, our calibration results for the U100 model found that the Q − V relationship exceeds the +2% range of the I400 gauge when Q ≥ 0.1 s−1 (Fig. 2.3b). Considering that a high frequency tip rate occurs more readily on a tipping-bucket flow meter with a relatively small c m, precaution must be taken when applying the equation for the I400 flow meter to flow meters with c m ≤ 100 mL.

Similar to the U100 gauge, the Q − V relationships for the OT and OC tipping-bucket rain gauges lie outside of the −2% range of the I400 flow meter even when Q ≈ 0.1 s−1 or less (Fig. 2.3b). The DI rain gauge (Table 2.1), however, was found to have a Q − V relationship that was within the +2% range when Q < 0.22 (Fig. 2.3b). These findings suggest the difficulty in establishing a representative dynamic calibration equation which would cover most tipping-bucket rain gauges. Shiraki and Yamato (2004) proposed a common calibration eq. (V = 0.353Q + 1) for tipping-bucket rain gauges; however, it should be noted that the equation lies within the lowermost of the plots in Fig. 2.3b. As such, the application of the Shiraki and Yamato (2004) equation underestimates the water input for all tipping-bucket rain gauges. Determining if a common calibration equation that could be applied to most types of tipping-bucket rain gauges can be derived should be revisited once laboratory and field tests for larger samples of several of tipping-bucket rain gauges and flow meters has been accomplished.

2.4 Cautions for Applying the Calibration to Measured Data

Applying static calibration to tipping-bucket rain gauges and flow meters is critical to measure q correctly. Shimizu et al. (2018) reported a maximum difference between c and c m of +2.5% for larger tipping-bucket flow meters. Maximum differences between c and c m among the tipping-bucket rain gauges and 200-mL flow meters calibrated by Iida et al. (2012) were −15% and +3.2%, respectively. Additionally, Iida et al. (2018) found that the difference between c and c m was −16% for the DI-type tipping-bucket rain gauge. These examples clearly illustrate the need for the static calibration of all types of tipping-bucket rain gauges and flow meters.

High-frequency data (e.g., 10 Hz) were used for the dynamic calibrations of the tipping-bucket rain gauges and flow meters. However, field measurements are usually recorded as the accumulated number of tips, n t, during a given duration, D. In these circumstances, instead of actual t, the average time between tipping T, found as T = D/n t, is substituted into (Eq. 2.7) and the corrected v is derived. However, if D is longer than a suitable range, T can be overestimated and the degree of correction to v would be smaller than that based using actual t. Iida et al. (2012) investigated the suitable range of D for a temperate forest in Japan and a tropical forest in Cambodia. The D values obtained were up to 60 min and 10 min for P and T f measurements in the temperate and the tropical forest, respectively. Stemflow measurements required D < 150 s for a tree having a DBH of 25.5 cm and funneling ratios > 10 in the tropical forest, while in the temperate forest D = 600 s was suitable for a tree with a DBH of 27.1 cm and funneling ratios < 20 (Iida et al. 2012). However, these examples of D may not be applicable to measurements of P, T f and S f for different forest ecosystems. As such, we recommend that D be set to the shortest time interval permissible based on the measurements of t for interception studies.

2.5 Effect of Dynamic Calibration on Interception Loss Estimates

Interception loss (E I) is calculated as the difference between P and the sum of T f and S f (see Eq. 2.3). P is the largest component of (Eq. 2.3), and therefore, in most cases, the amount of correction by dynamic calibration is largest for P. Thus, if the dynamic calibration is not applied, E I would be underestimated. However, there are some cases that E I is overestimated for combinations of different types of tipping-bucket rain gauges (Fig. 2.4, Iida et al. 2018). We investigated the nine combinations of P and T f measurements with three tipping-bucket rain gauge models (type OT, OC, and DI; Table 2.1). When the same model of tipping-bucket rain gauge is used to measure both P and T f, the effects on E I are relatively small (Fig. 2.4a, b, and c). The best scenario is the combination of both P and T f measured with the OT gauge, in which a maximum effect on E I of +4% is detected (Fig. 2.4c). A +4% error, however, is not negligible, and thus, even under optimum conditions, tipping-bucket gauges require correction using the dynamic calibration curve. The fact that evaluation of spatially representative T f needs many tipping-bucket rain gauges, it can be expected that lower-cost tipping-bucket rain gauges, such as the DI type, would be used for T f measurements. However, a −20% error in E I (20% overestimation) is found when T f is measured with the type DI gauge (Fig. 2.4c), having the largest systematic bias among the three tipping-bucket rain gauges evaluated (Fig. 2.3a). If the DI-type tipping-bucket rain gauge is also used to measure P, E I is underestimated by as much as 40% (Fig. 2.4a).

Fig. 2.4
figure 4

Effect of dynamic calibrated gross rainfall (P) and throughfall (T f) on interception loss (E I). (a) P is measured by Rain Collector II, Davis Instruments (DI), (b) RG-3M, Onset Computer Corp. (OC), and (c) OW-34-BP, Ota keiki seisakusho Co., Ltd. (OT). 10% effect means 10% increase in E I after applying the correction, that is, 10% underestimation. Data are cited from Iida et al. (2018)

This section discussed the error associated with tipping-bucket rain gauges and flow meters at the rainfall-event scale. If, however, the dynamics of the interception process during the rainfall event are to be investigated (Reid and Lewis 2009, Iida et al. 2017), then dynamic calibrations at the hourly time scale are highly recommended.

3 Calibration of Sap Flow Sensors Measuring Forest Transpiration

Sap flux density , F D, measurements are widely used to evaluate forest transpiration, E T (Wilson et al. 2001; Kumagai et al. 2008; Oishi et al. 2008). Transpiration from a tree, using sap flow (Q T) as a surrogate, can be estimated as the product of sapwood area (A S) and F D averaged over sapwood area, \( \overline{F_{\mathrm{D}}} \):

$$ {Q}_{\mathrm{T}}={A}_{\mathrm{S}}\bullet \overline{F_{\mathrm{D}}}. $$
(2.8)

Transpiration can be estimated by scaling Q T up from the tree to the stand level (e.g., Kumagai et al. 2008). Both A S and \( \overline{F_{\mathrm{D}}} \) are required in order to estimate E T, and a detailed analysis is necessary to derive the number of samples required to estimate the representative values of A S and \( \overline{F_{\mathrm{D}}} \) (Kumagai et al. 2005a, b).

The focus of this section is on the required calibration associated with three different sap flow techniques, namely, (i) thermal dissipation (TD) (Granier 1985), (ii) heat ratio (HR) (Burgess et al. 2001), and (iii) heat field deformation (HFD) (Nadezhdina et al. 2012) (Figs. 2.1 and 2.5a). All three techniques insert probes and a heater into a stem and detect the F D by using heat as a tracer. Recently, some studies report that TD underestimates F D (e.g., Steppe et al. 2010; Peters et al. 2018) and uncertainties of measuring F D with HFD (e.g., Steppe et al. 2010; Fuchs et al. 2017). We sampled stem segments of Japanese cedar (Cryptomeria japonica) planted in the nursery of the Forestry and Forest Products Research Institute (FFPRI), Tsukuba, approximately 50 km northeast of Tokyo, and in the Nagasaka Experimental Watershed, northern Honshu island (Table 2.2). An artificial Q T is generated by applying negative pressure at the upper surface of the segment with a vacuum pump and calibration of the three techniques was conducted in the laboratory.

Fig. 2.5
figure 5

Schematic diagram showing (a) sap flow sensors of three methods of thermal dissipation (TD) , heat field deformation (HFD) , and heat ratio (HR) and (b, c, d) arrangements of sensors used in calibration. Black colored probe includes heater. Arrangements of sensors for tree segments of (b) F1 to F4, (c) F5 to F8, and (d) N1 to N5 are indicated (refer to Table 2.2). Note that the measurement points are marked as black squares for sensors of HFD and HR methods (b and d). Sensors indicated by arrows are used to calculate the coefficient of a (b and d)

Table 2.2 Characteristics of the Japanese cedar test trees

3.1 Characteristics of the Three Sap Flow Techniques

The thermal dissipation, TD , method proposed by Granier (1985) inserts a pair of thermocouples into a plant stem (Fig. 2.5a). The upper probe measures the temperature of an included heater to generate constant heat of 0.2 W, while the lower probe measures the xylem temperature as a reference. The space between probes should be more than 10 cm (Iida and Tanaka 2010), while the length of the probes is 2.0 cm. The temperature difference between the probes (ΔT) increases during low F D and decreases with increasing F D. Granier (1985) obtained the following calibration curve between the parameter K TD and F D:

$$ {F}_{\mathrm{D}-\mathrm{TD}}=42.84\bullet {K_{\mathrm{TD}}}^{1.231}=42.84\bullet {\left(\frac{\varDelta {T}_0-\varDelta T}{\varDelta T}\right)}^{1.231} $$
(2.9)

where F D-TD is the F D measured by the TD method (cm3 cm−2 h−1), and ΔT 0 is ΔT when F D = 0. We employed handmade sensors described by Kumagai et al. (2005a). The TD sensor detected the average F D across the range of probe.

The heat field deformation (HFD) method also uses heat to measure F D. A continuously powered heater is inserted at the center of the upper and lower temperature probes with an additional temperature probe installed tangentially next to the heater (Fig. 2.5a). F D is calculated by the following Nadezhdina et al. (2012):

$$ {F}_{\mathrm{D}-\mathrm{HFD}}=\frac{D\left({K}_{\mathrm{HFD}}+{dT}_{\mathrm{s}-\mathrm{a}}\right){Z}_{\mathrm{ax}}}{dT_{\mathrm{as}}\bullet {Z}_{\mathrm{tg}}\bullet SW}\bullet 3600. $$
(2.10)

where F D-HFD is F D measured by HFD (cm3 cm−2 h−1), dT s-a is the temperature difference between the upper and tangential probes (°C), or can be equated with the difference between dT sym and dT as, which represent the temperature differences (°C) between the upper and lower probes and between the tangential and lower probes, respectively, Z ax is the difference between the heater and upper or lower temperature probe (1.5 cm), Z tg is the distance between the heater and the tangential probe (0.5 cm), and SW is the width of sapwood (cm), while K HFD is the dT s-a when F D is equal to zero.

Nadezhdina et al. (2012) found that the ratio of dT sym to dT as (dT sym/dT as) has a linear relationship with F D and that K HFD may be equated with the y-intercept value of the linear regression between dT sym/dT as (dependent variable) and dT as (independent variable). Since the absolute value of dT s-a is equal to that of dT as when F D is zero, the absolute value of y-intercept derived from the linear regression between dT sym/dT as and dT s-a must be equal to that between dT sym/dT as and dT as. The HFD method determines K HFD objectively based on two linear regressions which meet the condition of having the same absolute values of y-intercepts, representing an advantage of the HFD method (Nadezhdina et al. 2012). We used a commercial HFD sensor (HFD8, ICT international Pty. Ltd.). The length of the heater and temperature probes are 11.7 and 9.7 cm, respectively, and eight thermistors are positioned in a 1-cm span from the probes. We inserted temperature probes into tree stem segments to locate the first thermistor at the depth of 0.5 cm in sapwood.

The heat ratio, HR , method, which is an alternate version of heat pulse method, calculates heat pulse velocity (HPV r, cm h−1) based on the temperature changes of the xylem caused by induced heat by a heater and measured with two temperature probes located at the same distance from the heater (Fig. 2.5a). HPV r is then found as (Marshall 1958; Burguess et al. 2001):

$$ {HPV}_{\mathrm{r}}=\frac{D}{x}\ln \left(\frac{v_1}{v_2}\right)\bullet 3600. $$
(2.11)

where D is thermal diffusivity of fresh wood (2.5 × 10−3 cm2 s−1; Marshall 1958), x is the distance between the heater and the temperature probe (0.5 cm), v 1 and v 2 are the increase of xylem temperature (°C) by heat detected by the upper and lower temperature probes, respectively.

HPV r is affected by the wounding effect caused by the nonconducting wood around the probes, which results from the mechanical damage to the xylem tissue during sensor installation (Swanson and Whitfield 1981; Burgess et al. 2001). Burgess et al. (2001) proposed the following equation to correct for the wounding effect:

$$ {HPV}_{\mathrm{c}}=B\bullet {HPV}_{\mathrm{r}}. $$
(2.12)

where HPV c is corrected HPV r by considering the wounding effect, B is the correction coefficient. In this study, we assume the wound width of 0.17 cm and applied a B value of 1.7283 (Burgess et al. 2001). Sap flux density , F D, measured with the HR method (F D-HR) is then derived as:

$$ {F}_{\mathrm{D}-\mathrm{HR}}=\frac{\rho_{\mathrm{b}}}{\rho_{\mathrm{s}}}\left({m}_{\mathrm{c}}+\frac{c_{\mathrm{dw}}}{c_{\mathrm{s}}}\right){HPV}_{\mathrm{c}}. $$
(2.13)

where ρ b is dry wood density of Japanese cedar (0.314 g cm−3; Fujiwara et al. 2004), ρ b is density of sap assumed to be equal to the density of water (1.0 g cm−3), m c is the mass of the water content of sapwood relative to dry weight of sapwood (1.78), c dw is the specific heat capacity of oven-dried wood (J g−1 °C−1), and c s is the specific heat capacity of sap, assumed to be equal to that of water (4.186 J g−1 °C−1).

The ratio of c dw to c s is often assumed to be constant at 0.33 (=1.380/4.186; Dunlap 1912; Edwards and Warwick 1984; Steppe et al. 2010). We used a commercial HR sensor (type SFM1, ICT international Pty. Ltd.), whose sensor length is 3.5 cm including two thermistors to detect v at different depths. We installed sensors into stem segments to detect F D-HR at the depth of 1 cm of sapwood.

3.2 Artificial Sap Flow Generated by a Vacuum Pump

In order to calibrate the sap flow techniques, an artificial and known Q T is required. Granier (1985) generated Q T by applying positive water pressure on the tree segment. Similar methods of generating Q T using positive pressures have also been proposed (Herbst et al. 2007; Steppe et al. 2010; Hubbard et al. 2010; Bosch et al. 2014; Fuchs et al. 2017; Ouyang et al. 2018; Peters et al. 2018). In some instances, a negative pressure has been used to control Q T (Taneda and Sperry 2008; Bush et al. 2010; Schmidt-Walter et al. 2014). Additionally, the amount of transpiration from a tree, which equates with Q T, has been found by monitoring the change in mass of trees planted in a lysimeter (Lu and Chako 1998; Mcculloh et al. 2007). Takeuchi et al. (2017) excavated a tree, including the root-ball—a common practice for transplanting trees—with Q T equated to the changes in the total mass of the tree and root-ball, a technique the authors termed the “weighing root-ball” method. Sun et al. (2012) estimated Q T using potometer experiments, in which cut foliated tree branches were submerged into water and the amount of absorbed water was measured, while Lopez et al. (In press) cut Eucalyptus grandis trees ranging from 3 to 6 cm in diameter and submerged them into a fixed-volume reservoir with Q T found as the change in the volume of water held in the reservoir.

In this study, we apply negative pressure to generate Q T within a cut stem segment of Japanese cedar (Fig. 2.6) (Shinohara et al. 2016). An attachment designed to clean the inside of the PVC pipe (Fig. 2.6) was fixed to the upper surface of the cut segment and connected to a vacuum pump. Then, the segment was suspended by the cable ties, and its lower surface was submerged into free water surface. Depending on the pressure gradient, which was typically between 0.03 and 0.14 MPa/m, the lower surface absorbs KCl solution. This water surface was kept at a constant level with a Mariotte’s bottle. The water flow, Q T, was then found as the decrease in weight of the Mariotte’s bottle with an electric balance with a 10-sec temporal resolution.

Fig. 2.6
figure 6

A vacuum pump system to generate artificial sap flow within a stem segment, originally devised by Shinohara et al. (2016)

Precise calibration of the sap flow techniques is accomplished by comparing the actual F D at the sensor position to the F D measured by the sensor. When cut stem segments are used, the actual value of Q T is obtained and the gravimetric sap flux density , F D-G, is derived within a segment as Q T/A S (Fig. 2.1, Eq. 2.8). For each of the segments calibrated, a clear radial trend in F D-HFD was found (Fig. 2.7a, Table 2.2), and different F D-TD were measured among the four cardinal directions (i.e., north, east, south, and west) (Fig. 2.7b, Table 2.2). Similar results of radial and azimuthal distribution of the F D within stem segments have been reported by Steppe et al. (2010). Thus, it is likely that the actual F D derived at sensor position is not equal to F D-G and that the sensor output at the single position is not sufficient to perform suitable calibrations. In this study, measurements of F D variations over sapwood area for all cut stem segments were made (Fig. 2.5b, c, d).

Fig. 2.7
figure 7

Panel (a) shows radial distribution of sap flux density detected by HFD sensor (F D-HFD) in cut segment of F4 (see Table 2.2). F D-HFD was shown as relative to F D-HFD measured at 0.5 cm depth (see Fig. 2.5B). Panel (b) indicates azimuthal variation in sap flux density measured by four TD sensors (F D-TD) installed in a cut segment of N2 (see Table 2.2). North, east, south, and west F D-TD are shown as relative to north F D-TD (see Fig. 2.5d)

3.3 Calibration of the TD Method

Tree stem segments with SW being approximately 2.0 cm (equal to the senor length) were selected for the TD calibrations (Table 2.2). The four TD sensors were inserted into the stem segments so that the azimuthal variation of F D (north, east, south, and west directions) was accounted for and the averaged F D-TD is compared with F D-G (Fig. 2.1). Note that when SW is less than 2.0 cm, the correction proposed by Clearwater et al. (1999) should be applied. The F D-TD values were found to be within ±30% of the F D-G of stem segments sampled at Nagasaka site (Fig. 2.8a). Thus, for the Nagasaka site stem segments, clear underestimations of F D-TD were not found. However, for stem segments at the FFPRI nursery, most F D-TD values were found to be less than F D-G and underestimations of F D-TD were detected. Relatively high tree-specific variations among the F D-TD data were found for the four FFPRI stem segments, illustrating the difficulty of establishing a calibration equation. Almost all the variations in the relationship between the parameter K TD and F D-G for the nine stem segments (87% of data plotted in Fig. 2.8b) are in the range of the reported calibration curves.

Fig. 2.8
figure 8

Comparisons of gravimetric sap flux density (F D-G) and (a) measured sap flux density with TD method (F D-TD), and (b) parameter K TD. Open and black symbols show the data measured at the segments of Japanese cedar sampled in FFPRI nursery and Nagasaka site, respectively. Black curve is the original calibration by Granier (1985), and the gray curves are previously published calibrations for softwood species (Sun et al. 2012; Bosch et al. 2014; Ouyang et al. 2018; Peters et al. 2018)

3.4 Calibration of the HFD Method

It is important to recognize that the heat field deformation, HFD, method measures point F D at the detection depths in rather than measuring the average F D along the TD sensor. We derived the gravimetric F D values at the HFD sensor measurement points, F D-G-HFD, as follows: Four FFPRI stem segments and five Nagasaka stem segments (Table 2.2) were used with F D measured in each of the four cardinal directions with a HR sensor (Fig. 2.5b) for the FFPRI stem segments, while for the Nagasaka stem segments, four TD sensors were installed in each direction (Fig. 2.5d). The ratio of the average of two-direction values measured by the HR or TD method to the average of all four direction values (a) was calculated (2.5b, d). The radial distribution of F D is obtained from F D-HFD measurements at the different depths within the sapwood (Fig. 2.5b, d). We calculate the ratio of F D-HFD at a certain depth (i) to the average F D-HFD for all depths within the sapwood (r i), and gravimetric sap flux density at the detection point of the HFD sensor, F D-G-HFD, at the depth of i can be obtained as a ∙ r i ∙ F D-G (Fig. 2.1).

The plots of F D-HFD were found to be distributed around the 1:1 relationship with F D-G-HFD, however, relatively high dispersion of F D-HFD with values overlying +30% or underlying −30% was also found (Fig. 2.9a), indicating large tree-to-tree differences. Focusing on FFPRI stem segments, the HFD method clearly underestimated F D-G-HFD with the linear relationship of F D-G-HFD = 1.33 F D-HFD, R 2 = 0.72, being derived (Fig. 2.9b). However, when the Nagasaka stem segments are considered, the values show relatively high variability and trends of overestimation or underestimation were not detected (F D-G-HFD = 0.94 F D-HFD, R 2 = 0.69).

Fig. 2.9
figure 9

Relationship between gravimetric sap flux density at sensor position of heat field deformation (HFD) method (F D-G-HFD) and measured sap flux density with HFD (F D-HFD) for (a) all cut segments and (b) FFPRI nursery segments (F1 to F4 in Table 2.2). The dashed line shows the linear regression for FFPRI stem segments (y = 1.33x, R 2 = 0.72)

3.5 Calibration of the HR Method

The heat ratio , HR, method was calibrated using the four FFPRI stem segments (Table 2.2, Fig. 2.5b). Similar to the HFD method , the HR sensor measures point F D. The gravimetric F D at the detecting point of the HR sensor (F D-G-HR) is calculated using the similar methodology as for the HFD calibrations. In this case, four-direction values of F D-HR were measured, and we compared the averaged F D-HR values among four directions with the gravimetric sap flux density at the measuring depth of HR, F D-G-HR, obtained as r i ∙ F D-G (Fig. 2.1). A very high correlation between F D-HR and F D-G-HR was found, with almost all F D-HR values being larger than the F D-G-HR values, thereby indicating the tendency of the HR method to overestimate F D (Fig. 2.10). When the HR method was applied to the Japanese cedar trees in the FFPRI nursery, reasonable estimations of F D were obtained by the linear regression: F D-G-HR = 0.69 F D-HR, R 2 = 0.83. In this study, the wounding width of 0.17 cm was assumed resulting in B = 1.7283. F D-HR can vary depending on B (see Eq. 2.12 and 2.13). There are some possibilities that the actual width of the wounding is smaller than 0.17 cm and the overestimation of the wounding effect on F D-HR would occur. For future calibrations, directly measurements of the wounding width are required.

Fig. 2.10
figure 10

Relationship between gravimetric sap flux density at the detecting depth of heat ratio (HR) method (F D-G-HR) and measured sap flux density with HR (F D-HR). The dashed line indicates the regression line (y = 0.69x, R 2 = 0.83)

3.6 Summary of Sap Flow Calibration for the Three Techniques

We strongly suggest that the radial and azimuthal variations in F D occur in the cross-section of cut segments (Fig. 2.7). As a consequence, these variations must be measured in order to compare sensor output with the gravimetric values of F D. Although the Nagasaka stem segments showed relatively low coefficients of determination associated with the calibration curves of the TD and HFD methods, any tendencies, that is overestimations or underestimations, were not found (Fig. 2.8a, 2.9a). These results indicate that stand-scale measurements of F D with the TD and HFD methods would be accurate and representative and that no calibrations were required at the Nagasaka site. However, for FFPRI stem segments, the TD and HFD methods underestimated F D by 30% or more (Fig. 2.8a and 2.9b), while for the HR method at this site, an overestimation of approximately 30% was found (Fig. 2.10). Thus, if no corrections were applied to these stem segments the TD and HFD methods would have underestimated E T by at least 30% and often more, while the HR method would have resulted in an overestimate of E T of approximately 30%.

Compared with the TD and HFD techniques, the HR method shows very small tree-to-tree differences in calibration results (Fig. 2.10). HR is the only technique among three methods that is based on the physics of heat dissipation (e.g., Fuchs et al. 2017). Sun et al. (2012) showed that the accuracy of the TD method is not dependent on differences in xylem anatomy of the target tree species (i.e., diffuse-porous, ring-porous, and tracheid). However, Fan et al. (2018) suggest that the point or “discrete” F D measurements derived using the HR and HFD methods are affected by the differences in both hydraulic conductivity and thermal diffusivity between the earlywood and latewood of pine trees. The tree-to-tree and site-specific differences found in this study are probably caused by the heterogeneity of the xylem anatomy including the spatial distribution of the earlywood and latewood. The precise explanation of the uncertainty of sap flow techniques is still unknown (e.g., Steppe et al. 2010), although site- and species-specific calibrations are likely suitable to obtain the most accurate values of F D and E T (e.g., Peters et al. 2018; Flo et al. 2019).

4 Future Directions in Calibration Studies for Tipping-Bucket Rain Gauges, Tipping-Bucket Flow Meters, and Sap Flow Techniques

The underestimation of rainfall and/or water inflow by tipping-bucket rain gauges and flow meters, respectively, creates a systematic bias dependent on the intensities of inflow. Fortunately, the establishment of correction curves is relatively straightforward, and the equipment required to generate constant inflow (Fig. 2.2), which is a key part of the calibration system, is relatively easy to construct. It is highly recommended to calibrate tipping-bucket rain gauges and flow meters when used for E I studies. With the exception of Takahashi et al. (2011), Iida et al. (2012), Iida et al. (2018) and Shimizu et al. (2018), among a few others, relatively few studies have evaluated the effects of applying calibration curves on P, T f, S f, and/or E I. Given the findings of work reviewed in this chapter, the degree of systematic underestimations on T f and S f, and uncertainties in E I caused by tipping-bucket rain gauges and flow meters should be investigated for many types of forest ecosystems. We should note that the biases are larger for higher inflow rates (i.e., heavy rainfall) and over shorter time scales (i.e., hourly scale rather than event scale). Studies examining the intrastorm dynamics of interception processes with tipping-bucket rain gauges and flow meters are especially encouraged to utilize calibration curves (e.g., Iida et al. 2017). If tipping-bucket rain gauges and flow meters are already installed in the field without any established calibration curves, in situ static calibrations are strongly recommended. If practicable, the tipping-bucket gauges could be returned to the laboratory for dynamic calibrations, or, following Shimizu et al. (2018), a generalized correction equation could be employed for larger flow meters with c m ≥ 150–200 mL. If the target equipment is out of the application range of the generalized equation, we argue choosing similar types of gauges from Table 2.1 and estimating the potential degree of underestimation. We believe that researchers should recognize the substantial effects of systematic biases of tipping-bucket rain gauges and flow meters on P, T f, S f, and E I, and hope that all manufacturers will provide more detailed calibration results for users in near future.

The IPCC’s Fifth Assessment Report (AR5) predicts that extremely large rainfalls will occur more frequently in mid-latitude and tropical areas (Stocker et al. 2014). In Japan, as elsewhere, disasters can be triggered by the localization of heavy rainfall (Tokyo Climate Center, Japan Meteorological Agency 2018). Valid measurements of rainfall are a fundamental requirement for a more precise understanding of the phenomena of localized heavy rainfall. The total degree of underestimation of a rainfall event is affected by not only the inflow intensity but also total number of tips. As such, parallel measurements of rainfall with a 0.5 mm or less tipping-bucket rain gauge and a 1.0 mm or more gauge at the same location are reasonable path forward, provided that data from the lager tipping-bucket rain gauge is selected for events with extremely high intensity. Tipping-bucket rain gauges which regulate high inflow rates using a siphon system are commercially available (e.g., type TB3, HyQuest Solutions Pty. Ltd., Australia). Shedekar et al. (2016) dynamically calibrated the TB3 and showed that the measurement error decreases and stabilizes with increasing inflow rates. Electronic weighing rain gauges, which store rainwater and weigh its amount with a load cell, are available (e.g., Seibert and Morén 1999). Turner et al. (2019) developed an instrument system to measure the change in the depth of stemflow by ultrasonic rangefinders. Theoretically, weighing and depth-measurement approaches have some advantage over tipping-bucket gauges, especially at higher rainfall intensities due to the lack of a tipping-bucket mechanism. However, very few comparisons of rainfall measurements have been made between gauges with inflow regulated and those that have nonregulated flow. Accordingly, further work could compare the differences to interception loss measurements among various tipping-bucket, weighing, and/or depth-measurement gauge combinations under various climate settings. Regardless, as regions of the world experience higher intensity rainfall events, the importance of gauge-corrected measurements from sound calibration procedures will only increase in the future, especially in water-stressed areas where rainfall regimes are changing.

Spurred by Steppe et al. (2010), a large number of TD calibration studies (e.g., Peters et al. 2018) have reported the underestimation of F D. Such findings raise the importance of TD sensor calibration to accurately estimate E T. Accordingly, we highlight the necessity of using stem segments in calibration procedures. We would note that the artificial Q T can be measured directly for stem segments, but artificial F D cannot be obtained from the single sensor measurement due to the azimuthal and radial trends in F D. For TD sensors detecting F D along a sensor length of 2.0 cm, we recommend using stem segments with a sapwood width of ~2.0 cm and applying enough TD sensors to take the azimuthal variation into account. On the other hand, HFD and HR methods detect F D at the certain point of sensors. Thus, careful calculations of F D at each detecting point are required for HFD and HR calibrations. As adopted in this chapter, the HFD method is relatively advantageous to evaluate the radial trend of F D over the cross section of segments. To increase the number and quality of calibrations for HFD and HR methods , we make a call for future studies to pinpoint the actual F D at any given sensor position. In addition, although there are numerous calibration studies, the mechanism causing the uncertainties in sap flow techniques is still unknown and should be clarified.

Despite the existence of numerous laboratory-based TD calibrations existed, the application of calibration equations to living trees in forest settings is very limited, with the notable exception of Steppe et al. (2010). Thus, we make a call for both sap flow sensor calibration studies and their applications to field data in order to improve E T estimates. Wilson et al. (2001) estimated E using both the watershed water balance and eddy covariance methods and evaluated its three components: E I, E T, and E F (Eq. 2.2). Based on the comparison between E and the sum of E T, derived using the TD method, computed E I and measured E F, they suggest the possibility of E T being underestimated using TD method . Shimizu et al. (2015), however, reported the sum of the three components correspond well with E obtained from both the water balance and eddy covariance methods. Similar findings were reported by Oishi et al. (2008). Compared with the number of studies whose focus is on calibration for the TD method, only a few studies evaluate how well E corresponds to the sum of its components, including E T. In particular, no studies have reported the application of TD calibration to improve imbalances between E and the sum of E I, E T, and E F. Based on the laboratory calibrations of the TD method, its future application to actual forests is highly recommended to evaluate E T correctly and to derive a suitable strategy for the precise understanding of the hydrologic cycle in forested watersheds.