Introduction

Most of Iran is covered in desert and it ranks as the 56th lowest jungle country in the world. Iranian land cover from forests is less than 0.25 ha ind−1, considerably less than international standard 0.8 ha ind−1 (Khosroshahi and Ghavami 2005). Northern Iran borders the southern shore of the Caspian Sea, and the regional climate supports a considerable amount of temperate and dense forest (2.1 million ha out of 14.2 million ha across the country) with broad leaf trees from the third geological age, which play a vital role in Iranian ecosystems. Mazandaran Province is located beside the Caspian Sea (Fig. 1). The forests of Mazandaran extend to about 1.5 million ha and are the most significant forests in northern Iran.

Fig. 1
figure 1

Mazandaran Province inside Iranian map

Mazandaran forests are mostly located at high altitude (more than 2500 m asl. in some parts) and as such are low touched forests. Unfortunately, hundreds of hectares of these forests are burned every year [about 300–400 ha (BanjShafieia et al. 2010; HassanNayebi 2003)]. Many researchers indicate that human activity is the main cause of fire ignition in the area. The U. N. Food and Agriculture Organization’s (FAO) reports in 2002 and 2010 stated that approximately 0.06 % of Iran’s forest burns every year.

Forest fires are an extremely complex phenomenon and occur naturally from events such as lightning strike, but there are other causes such as human negligence, accidents and intentional human activity; for example, arson (illegal fires started deliberately with malicious intent). Research into forest fires has been done for many countries including Iran. Such research has mostly focused on modeling forest fires to minimize the impact of expected and unexpected fires on forests. The spatial pattern of fire occurrence is a major focus of studies on the dynamics of forest fires and their role in determining landscape structure and composition of vegetation species within a forest environment (Turner and Romme 1994; Guyette et al. 2002; Ryan 2002; Larjavaara et al. 2005; Mermoz et al. 2005). There are many inherent stochastic factors that contribute to fire occurrence; therefore researchers have applied stochastic models. Moreover, several searches in the U.S. have shown that fires caused by arson (which accounted for up to 80 % of all fires reported in the U.S.) cluster in dimensions of time (Prestemon and Butry 2005), space and space–time (Gralewicz et al. 2011). This has contributed to the increasing popularity of spatio-temporal point process models as an alternative to more computationally demanding deterministic models.

Studies evaluating histories of fire occurrences began early in the twentieth century, mostly in the U.S. and Canada. These early studies analyzed the extent of fire occurrences (for an excellent review see McKenzie et al. 2000). A large number of studies on forest fires apply a static analysis rather than attempting a dynamic approach that takes into account the stochastic spatial distribution of forest fires. Some of most influential dynamic attempts to study forest fires include statistical modeling of forest fires initiated by Dayananda (1977), which introduced stochastic modeling of the pattern of fire occurrence in a given forest. Pew and Larsen (2001) analyzed spatial and temporal patterns of human-caused fires using a GIS dataset. Martínez and Chuvieco (2003) employed a spatial distribution pattern model to classify Spanish municipalities according to their actions regarding properties of forest fire. Peng et al. (2005) used a temporal model to estimate conditional occurrence of natural fires in a Los Angeles forest. Brillinger et al. (2006) evaluated the probability of fires occurring in some specific points in a Californian forest. Yang and He (2007) analyzed human-caused fires in a spatial pattern using the Poisson process model. Amatulli et al. (2007) reported that fires caused by human activity in the Arago’n forest were more spatially diffuse than those caused by lightning. Benavent-Corai et al. (2007) demonstrated that human impact had important implications in terms of decreasing inter-event intervals and increasing the sparking frequency for forest fire modeling. Following the method introduced by Martínez and Chuvieco (2003), Chas-Amil et al. (2010) employed spatial analysis to characterize the distribution of causes and motivation for intentional fire lighting and wild fires in Galicia (northwestern Spain). Couce and Knorr (2010) employed a cellular automata (CA) model to adjust the spatial distribution of more than 750,000 African wild fires detected in 2003. Wang and Anderson (2010) used K-function and kernel estimation methods to determine that annual differences in spatial patterns of fires caused by humans tended to be more clustered with more complex spatial patterns than those caused by lightning. Moreover, the models demonstrated that human-caused fires in 2003–2007 were highly concentrated in southern Alberta, Canada, indicating the existence of an interaction between space and time. Gralewicz et al. (2011) presented a method to identify baseline expectations and ignition trends between 1980 and 2006 across 1-km spatial units. Jiang et al. (2012) determined that the traditional Poisson model overestimated fire occurrence during the fire season (May through August) in a Canadian forest.

Some interesting research has been done on forest fire spatial distribution and its impact on Iranian ecology. Azizi and Yousofi (2009) studied the ecological impact of forest fires reported between December 16–21, 2005 in forests in Gilan and Mazandaran Provinces. Mohammadi et al. (2010) used GIS information with the Analytical Hierarchy Process (AHP) method to evaluate the impact of vegetation, physiographic and weather factors, human activity and distance to rivers and roads on fires in a part of the Paveh forest. Ardakani et al. (2011) demonstrated that data generated by the Moderate Resolution Imaging Spectroradiometer (MODIS) were a valuable source of data used to investigate different phases of fire management in Iranian forests. Following the work of Mohammadi et al. (2010), Mahdavi et al. (2012) employed the AHP method to study a forest in Ilam Province.

We employed a spatio-temporal shot-noise Cox process introduced by Møller and Diaz-Avalos (2010) to: (i) identify influential and significant factors and covariates that impact on Mazandaran forest fires; (ii) determine spatial and temporal structures of fire occurrence in Mazandaran forests; and (iii) explain spatial and temporal variation in occurrences of these fires.

In “Materials and methods” section reviews some concepts of spatial point process modeling, including shot-noise Cox processes, which play a crucial role in the rest of this article. In “Results” section gives results from application of a spatio-temporal shot-noise Cox process to fires reported in 2005–2011 in Mazandaran forests, and in “Conclusion and discussion” section presents a discussion and conclusions based on these findings.

Materials and methods

Spatio-temporal shot-noise Cox-processes

Spatial point processes are a form of mathematical model, which take into account GIS information to identify observed spatial pattern in terms of them being random, regular, or cluster patterns of specific objects in a plane or in a space. Spatio-temporal point processes are an extension of spatial point processes that take into account time as well as space in their models and are termed time–space processes. Standard practice in analysis of spatio-temporal point processes is the employment of a first step to form an estimate of the average function λ(ut) for u ∊ R 2(ut), t ∊ Z. This average function can then be employed to study several properties of the spatio-temporal point process. Then, if the estimated average function is deemed to be uniform, \( \lambda (\varvec{u},t) \equiv \lambda , \) it is used to investigate inter-point interactions by estimating various summary statistics such as the F-function (distribution function for distance between a given point and the nearest event; for example, empty space) and the L-function (distribution function for distance between a given event and a nearest event, say, nearest-neighbor). These statistics are compared with their expected values for the homogeneous Poisson point process that serves as the null hypothesis of complete spatial randomness (CSR); that is, absence of an interaction (Ripley 1976; Diggle 1983). Therefore, testing the CSR hypothesis is an important part of exploratory data analysis in a spatial point process. If the CSR hypothesis is rejected, then one proceeds to investigate the spatial correlations in a given pattern and period of time.

We used the Spatstat package (Baddeley and Turner 2005) available in the R statistical software system for analysis. A spatio-temporal shot-noise Cox process is a spatial point process which can be viewed as a doubly stochastic Poisson process. The time dependent average function of a spatio-temporal shot-noise Cox process can be restated as:

$$ \lambda \left( {\varvec{u},t} \right) = \lambda_{1} \left( \varvec{u} \right)\lambda_{2} \left( t \right)S\left( {\varvec{u},t} \right) = \lambda_{1} \left( \varvec{u} \right)\lambda_{2} \left( t \right) = \mathop \sum \limits_{i = - \infty }^{\infty } \mathop \sum \limits_{{\varvec{q} \in \varPhi_{{\mathbf{u}}} }}^{{}} \frac{{\psi \left( {\varvec{u} - \varvec{q},t - i} \right)}}{\omega } $$
(1)

where, \( \lambda_{1} (\varvec{u}) \) and λ 2(t) are non-negative deterministic functions (which usually are determined by regression methods, see below), \( S(\varvec{u}, t) \) is a spatio-temporal process with unit mean (i.e., \( E(S(\varvec{u}, t)) = 1 \)), Φ u is a stationary Poisson process with intensity \( \omega > 0 \) and \( \psi ( \cdot , \cdot ) \) is a joint density on \( \Re^{2} \times Z \) with respect to the product measure of Lebesgue measure on \( {\Re }^{2} \) and counting measure on Z. The kernel \( \psi ( \cdot , \cdot ) \) is separable whenever \( \psi (\varvec{u},t) = \phi (\varvec{u})\chi (t) \) for all \( (\varvec{u},t) \in \Re^{2} \times Z \) where \( \phi ( \cdot ) \) is a density function on \( {\Re }^{2} \) and \( \chi (t) \) is a probability density function on Z. See Møller and Waagepetersen (2007) for more details on the spatio-temporal shot-noise Cox processes.

Suppose that some related covariate information about temporal and spatial components of the observed pattern is given. Moreover, suppose that such covariate information can be restated into discrete random variables \( V^{(1)} (\varvec{u}), \ldots ,V^{(p)} (\varvec{u}) \) for spatial information \( \varvec{u} \) and \( T^{(1)} (t), \ldots ,T_{(m)} (t) \) for temporal information t). Using such covariate variables, one may consider two following Poisson regression models for spatial intensity function \( \lambda_{1} (\varvec{u}) \) and temporal intensity function \( \lambda_{2} (t) \)

$$ ln \left( {\lambda_{1} \left( \varvec{u} \right)} \right) = \beta_{0} + \mathop \sum \limits_{j = 1}^{p} \mathop \sum \limits_{i = 1}^{{I_{j} }} \beta_{i}^{\left( j \right)} I\left( {V^{\left( j \right)} \left( \varvec{u} \right) = i} \right) + Error $$
(2)
$$ ln \left( {\lambda_{2} \left( t \right)} \right) = \alpha_{0} + \mathop \sum \limits_{i = 1}^{a} \alpha_{i} \cos (i\eta_{t} t) + \mathop \sum \limits_{j = 1}^{b} \alpha_{j} \sin (j\eta_{t} t) + \mathop \sum \limits_{i = 1}^{m} \gamma_{i} T^{\left( i \right)} \left( t \right) + Error $$
(3)

where, \( I( \cdot ) \) stands for the indicator function, {1,2,… I j } represents support of discrete covariate \( V^{(j)} (\varvec{u}), \) and \( \eta_{t} = 2\pi /365 \) for years with 365 days and \( \eta_{t} = 2\pi /366 \) for leap years. For identifiability of regression coefficients, one has to impose the bounds that \( \varSigma_{i = 1}^{{I_{j} }} \beta_{i}^{{^{(j)} }} = 0 \) for all j = 1,2,… p. Regression coefficients β and α are estimated by least squares, while the constant coefficient m is determined by examining the Akaike information criterion (AIC).

For simplicity, this article assumes a separable kernel with form:

$$ \psi \left( {\varvec{u},t} \right) = \phi \left( \varvec{u} \right) \times \chi \left( t \right) = \frac{{\exp \left\{ { - \left| {\frac{{\left| \varvec{u} \right|\left. \right|^{2} }}{{2\delta^{2} }}} \right.} \right\}}}{{2\pi \delta^{2} }} \times \frac{{2(t^{*} - t)}}{{t^{*} (t^{*} - 3)}},\, \quad t = 1, \ldots ,t^{*} - 1 $$
(4)

where, last significant lag t * is determined by the autocorrelation function of the stochastic counting process N t , which represents the number of observed forest fires in an interval [0,t].

Inhomogeneous K- and L-functions

Møller and Waagepetersen (2004) introduced the following inhomogeneous K- and L-functions to investigate existence of random, regular, or cluster patterns for specific objects in a plane or space, under shot-noise Cox process X t .

$$ K\left( {r,t} \right) = 2\pi \mathop \smallint \limits_{0}^{r} sg(s,t) ds,\,\,\,\,\, L\left( {r,t} \right) = \sqrt {\frac{{K\left( {r,t} \right)}}{/pi}} $$
(5)

where, \( r = ||\varvec{u}_{1} - \varvec{u}_{2} || \) and t = |t 1 − t 2| and g(s,t) is the cross pair correlation function:

$$ g\left( {\left( {\varvec{u}_{1} , t_{1} } \right), \left( {\varvec{u}_{2} , t_{2} } \right)} \right) = 1 + \delta \psi { \star }\hat{\psi }\left( {\varvec{u}_{1} - \varvec{u}_{2} ,t_{1} - t_{2} } \right) $$
(6)

where \( { \star } \) denotes convolution and \( \hat{\psi }(\varvec{u}, t) = \psi ( - \varvec{u}, - t). \)

Stoyan and Stoyan (2000) and Diggle et al. (2005) developed a 95 %-inter-quantile envelope for the L(r) − r function based upon an MCMC simulation study. Using such a 95 %-inter-quantile envelope one may test the null hypothesis, H 0: the model fits the data, at the 0.05 confidence level. Whenever the observed L(r) − r function falls into the 95 %-inter-quantile envelope obtained by simulating points under the fitted process X t , one fails to reject the null hypothesis and concludes that the model fit is adequate.

Annual data for Mazandaran forest fires

This investigation used data on starting locations of 531 forest fires in Mazandaran forests reported from March 2006 to March 2010 (1826 days). The starting locations of these fires were used as spatio-temporal points. Calculations were based on the supposition that the stochastic counting process N t stands for the number of daily forest fires reported in the region. Bessie and Johnson (1995), and Møller and Diaz-Avalos (2010) processed information on vegetation type, elevation, slope, exposure and temperature as covariate information that may impact on a forest fire in a given forest. Unfortunately, for this study information on vegetation type was not available from the relevant database. Therefore, this article considers altitude, \( V^{\left( 1 \right)} \left( \varvec{u} \right), \) slope-exposure, \( V^{\left( 2 \right)} \left( \varvec{u} \right), \) and average daily temperature (available at: http://www.mazandaranmet.ir/), of the region, T(t), as covariate information that may affect a forest fire in the region. Altitude of the region averages approximately 1340 m asl. (Fig. 2a, b) and can be classified into 5 classes (Table 1).

Fig. 2
figure 2

a Topographic map of urban and rural regions in Mazandaran Province; b elevation map, in meters, above sea level in the area; c the 20 slope-exposure categories; d spatial pattern of started forest fire from March 2006 to 2010

Table 1 Altitude categories classification

Based on frequencies of slope and exposure, the area was classified into the 20 slope-exposure categories given in Table 2 and illustrated in Fig. 2c. The spatial pattern of started forest fires from March 2006 to March 2010 in the area has been given by Fig. 2d.

Table 2 Slop-exposure categories classification

Figure 3 represents average daily temperature, T(t), of the region (Fig. 3a), the square root of the number of daily reported fires in the region, i.e., \( \sqrt {N_{t} } , \) for \( t \in [0,1826] \) (Fig. 3b), and autocorrelation of stochastic counting process N t for different lags (Fig. 3c).

Fig. 3
figure 3

a Average of daily temperature T (1)(t), for t ∈ [0, 1826]; b squared root of number of daily reported fire, i.e., \( \sqrt {N_{t} } , \) for ∊ [0, 1826]; and c autocorrelation stochastic counting process N t for different lags

Using the above autocorrelation plot for different lags (Fig. 3, left panel), one cannot observe any significant autocorrelation after lag 32. Therefore, t * in Model (3) can be taken equal to 32.

Results

Using the Spatstat package (Baddeley and Turner 2005, 2006) against starting location of such reported fire along with the above mentioned covariate variables \( V^{(1)} (\varvec{u}),\,V^{(2)} (\varvec{u}) \), and power of daily temperature T(t), i.e., \( T^{(i)} (t) \equiv T^{(i)} (t) \), for i = 1,2,…, m. The spatial intensity function \( \lambda_{1} (\varvec{u}) \) and temporal intensity function \( \lambda_{2} (t) \) can be estimated as the following:

$$ ln \left( {\hat{\lambda }_{1} \left( u \right)} \right) = \mathop \sum \limits_{i = 1}^{5} \beta_{i}^{\left( 1 \right)} I\left( {V^{\left( 1 \right)} \left( u \right) = E_{i} } \right) + \mathop \sum \limits_{i = 1}^{20} \beta_{i}^{\left( 2 \right)} I\left( {V^{\left( 2 \right)} \left( u \right) = SE_{i} } \right) + Error $$
(7)
$$ ln \left( {\hat{\lambda }_{2} \left( t \right)} \right) = - 0.838\cos \left( {\eta_{t} t} \right) - 0.330\sin \left( {\eta_{t} t} \right) + 0.643\cos \left( {2\eta_{t} t} \right) - 0.719\sin \left( {2\eta_{t} t} \right) + \mathop \sum \limits_{i = 1}^{6} \gamma_{i} T^{\left( i \right)} \left( t \right) + Error $$
(8)

where, \( E_{1} , \ldots ,E_{5} \) and \( SE_{1} ,SE_{2} , \ldots ,SE_{20} \), respectively, represent values of discrete covariate V (1) and V (2), given above, and other estimated parameters are given by Table 3.

Table 3 Estimated parameters

The maximum value for the exponent on daily temperature, m = 6, was found by taking the largest value of m that minimizes the AIC (AIC = 2581.697 for m = 6, while the AIC equaled 2615.269, 2586.638, and 2581.697 for m = 3, m = 4, and m = 5, respectively). Goodness of fit of the above spatio-temporal shot-noise Cox process model in spatial and spatio-temporal approaches was evaluated using a plot of the L(r) − r function and a 95 %-inter-quantile envelope obtained from 39 simulations from the fitted model. Figure 4 illustrates these plots.

Fig. 4
figure 4

Observed L(r) − r (solid line indicated by ‘obs’) along with their 95 % inter-quantile envelopes (grey region indicated by `lo’ and `hi’) obtained from 39 simulated observations from the fitted shot-noise-Cox model. Part a shows such L(r) − r from spatial model, while part b represents such estimations from spatio-temporal model

Using regression parameters of estimated spatial average function \( \lambda_{1} \) and temporal average function \( \lambda_{2} (t) \) given by Eq. 2, one can conclude that: (1) risk of forest fire increased in areas with an altitude between 1350 m to 3000 m above sea level (corresponding to high positive values of \( \beta_{3}^{(1)} \) and \( \beta_{2}^{(1)} \) and (2) the risk of forest fire in areas with a southern exposure and slope gradient of between 30° and 50°, northern exposure and slope gradient between 0° and 50° and eastern exposure and slope gradient between 00 and 300 (corresponding to high positive values of \( \beta_{16}^{(2)} ,\beta_{19}^{(2)} ,\beta_{3}^{(2)} ,\beta_{7}^{(2)} ,\beta_{11}^{(2)} ,\beta_{1}^{(2)} \) and \( \beta_{6}^{(2)} \) which are significantly higher than other areas.

Figure 4 shows that the L(r) − r function for the spatial shot-noise Cox model is far away from the edges of its 95 % inter-quantile envelope (Fig. 4a). While the L(r) − r function for the spatio-temporal shot-noise Cox model in most regions is close to the center line (red dash line) of its 95 % inter-quantile envelope (Fig. 4b). Therefore, one can conclude that the spatio-temporal shot-noise Cox model provides a more appropriate model compared to the spatial shot-noise Cox process. Moreover, Fig. 4 also demonstrates that the pattern of the spatio-temporal shot-noise Cox model for forest fires in the region is more aggregated than in an inhomogeneous Poisson process. Such aggregated spatial distribution dramatically increases in radiuses below 1 km away from the points for locations of fires (beginning part of solid line in Fig. 4b). This specific spatio-temporal and aggregate model verified the periodic spatial distribution of fire occurrence in the region and the fact that points close to fire areas have a higher chance of fire.

Figure 5 is a topographical map and realization graphs of the above estimated spatial average function λ 1(u) and temporal average function λ 2(u).

Fig. 5
figure 5

a Topographic map of estimated spatial intensity function λ 1(u); b realization of estimated spatial intensity function λ 1(u); and c realization of estimated temporal intensity function λ 2(u)

Comparing topographic maps of urban and rural regions in Mazandaran Province (Fig. 1) with the estimated spatial average function λ 1(u) (Fig. 5a), it can be concluded that the risk of forest fire in the eastern half and southwest of the area (rural and tourist regions) are considerably higher than the risk in other areas. The rate of occurrence of forest fires in the period of study can be determined by the estimated temporal average function λ 2(t) (Fig. 5b). Realization of the estimated temporal average function λ 2(t) is given by Fig. 5c.

Conclusion and discussion

This article employed a spatio-temporal shot-noise Cox process to: (i) determine factors that may impact forest fires and (ii) to identify areas at higher risk of fire in the forests of Mazandaran Province.

Using the fitted model, one may conclude that: (1) daily temperature (with polynomial order 6), altitude and slope-exposure impacted occurrence of forest fires in the region; (2) forest fire occurrence had an aggregated spatial distribution, which means that points close to areas of fire had more chance of fire occurrence compared to other areas. Moreover, fire chance dramatically increased in radiuses below 1 km away from areas of fire; (3) the risk of forest fire in the east and southwest of the area (rural and tourist regions) was considerably higher than the risk in other areas. These two observations were also reported in other research by HassanNayebi (2003) and, in a different context by Majidi et al. (2011); (4) fire occurrence was considerably higher in southwest Mazandaran Province (where elevation was lower than average in the province, at 1340 m above sea level), similar findings were reported in Ardakani et al. (2011); (5) the rate of forest fire occurrence stayed above average during drought periods (September 2008 and September 2009, Tirandaz and Eslami 2012); (6) the periodic spatial distribution of fire occurrence in the region has also been reported by the U.N. FAO (2002, 2010); (7) risk of forest fire is significantly decreased in areas with an altitude higher than 3000 m above sea level.

The Model (7) provides useful spatial and temporal information that is not obvious or cannot be inferred from visual examination of raw data. Such quantitative knowledge could lead to the development of fire-response and fire-suppression strategies appropriate to specific regions within the province. The fitted model can be improved by using other general information on fires such as evaluations for area burned, date and time of ignition, weather conditions, causes and motivation, fire fighting measures and vegetation type as well as detailed information on the forest land affected (e.g., ownership, forest biomass of the area and estimated losses). Unfortunately, such covariate information was not available in the Iranian forest fire database. More information would improve statistical forest fire models and consequently facilitate better fire management. Certainly, effective management practice requires an understanding of the role of biological and physical factors in patterns of fire occurrence in space and over time. Moreover, high quality information about space–time dynamics of fires can be developed by statistical models, which can be applied for long-term resource management of forest ecosystems and human property (McKenzie et al. 2000).