Introduction

Phosphorus (P) deficiency is the major nutritional limitation for agricultural soils in the tropics (Grierson et al. 2004). In this context, understanding the soil P cycle is necessary to establish management efficient strategies to increase plant P availability, especially in low-input production systems. The P cycle can be divided into biological and geochemical processes (Smeck 1985; Frossard et al. 2000) that regulate the availability of soil P (Cross and Schlesinger 1995). The method chosen for evaluating soil P pools greatly influences estimates of soil P availability. The Hedley method of sequential extraction or fractionation of P (Hedley et al. 1982) enables characterization of the different fractions of inorganic and organic P obtained based on solubility of inorganic and organic P in sequential extracting solutions based on changes different pH and extracting strength. Resin inorganic phosphate (Pi) represents an index of exchangeable P that any plant would have access to simply by removing inorganic P from the soil solution by uptake (Sato and Comerford 2006) Bicarbonate Pi and bicarbonate organic phosphate (Po) fractions index P that can be released by ligand exchange with the bicarbonate ion (Sato and Comerford 2006). Hydroxide Pi, hydroxide Po, sonic Pi, sonic Po, hydrochloride (HCl) Pi, and residual P fractions are index P forms that are much more difficult to move into the soil solution P (Cross and Schlesinger 1995).

In tropical soils, especially those that are highly weathered and clayey, the fixation capacity of P is high, reducing P availability to plants. Thus, the soil with strong buffering capacity competes with the plants for the added P, and the soil becomes a sink. In these unfertilized soils, the availability of P is highly dependent on the mechanisms that the plant roots use for accessing the range of soil P sources, both organic and inorganic. The Hedley P fractionation is an effective method for contrasting soil P forms between and among different soils and soil treatments. Fractionation of soil P alone does not permit evaluation of interactions between P fractions. Path analysis with structural equations has been used to study the magnitude of interactions between the different P fractions in temperate soils (Tiessen et al. 1984; Zheng et al. 2002). This modeling method permits the decomposition of the correlations among the observed variables into direct and indirect effects in a model where the regressions of the relationships between the variables can be simultaneously evaluated (Malaeb et al. 2000; Arhonditsis et al. 2006; Prober and Wiehl 2011). In addition, an essential characteristic of structural equation modeling (SEM) is the ability to include latent variables (not measured) together with the measurable variables in hypothetical models. The latent variables are used in the models to represent potential underlying causes, while the measurable variables serve as indicators of the effects or manifestations of the latent factors (Grace and Bollen 2008). One of the advantages of SEM is its ability to rank the descriptive ability of different models, thus enabling comparisons between the models (Mitchell 1992).

Beck and Sanchez (1994) also used path models (observed variable models) to analyze the transformations between P fractions in a tropical Ultisol. But as they did not use structural models (latent variable models) to evaluate the different P pools as latent hypotheses with multi-indicators (P fractions). Significant uncertainties remain and pathways of P transformations in tropical soils worldwide remain poorly understood. Accordingly, the present study aimed to use the method of SEM with latent variables, at the exploratory level, to evaluate hypothetical models of the P cycle and to determine the interactions among the pools of P and to identify which pools act as a P sink or source in unfertilized tropical soils. We evaluated three models in this study. Firstly, we formulated an a priori basic structural model based on our knowledge of soil P dynamics relationships among five latent variables (P pools), described by multiple indicator variables (P fractions): we expected a direct effect of organic pool, inorganic pool, occluded pool and primary mineral pool on available pool (Fig. 1). Secondly, the basic model was split into two sub-models to assess changing from a multi-indicator to single-indicator model.

Fig. 1
figure 1

The hypothesized structural model for the soil P cycle

Materials and methods

The soil P data were collected from the literature (see Appendix Table 7) and were restricted to studies that used the sequential fractionation technique developed by Hedley et al. (1982) and modified by Tiessen and Moir (1993). The fractions of P used were resin Pi, bicarbonate P (Pi and Po), hydroxide P (Pi and Po), sonic P (Pi and Po), HCl Pi, HCl Phot, and residual P. The soil properties used in the correlations with the P fractions were pH, clay content, organic carbon (C), total nitrogen (N), iron oxides (Feox), and aluminum oxides (Alox). The data set (N = 81) covers a wide range of unfertilized soil types under different land use systems in tropical regions.

Factor and path analysis

Initially, the relations between the P fractions and various soil properties (pH, clay, organic C, total N, Feox, and Alox) were determined using Pearson correlation coefficients and multiple regression analyses. We made the assumption that the correlation structure observed from the literature data is the same that we would obtain from a single random sample. We used correlation analysis to determine the intensity ratio of all of the P fractions, while the backward multiple regression analysis was used with the resin Pi fraction as the dependent variable and the other P fractions as independent variables. The P fractions (independent variable) in the soil that did not contribute significantly to the estimation of the resin Pi fraction at P ≤ 0.05 were eliminated using stepwise regression. The model fit was measured by the determination coefficient (R 2) and by the significance of the regression and β coefficients of the direct effect of the independent variables (predictors).

The path analysis was conducted according to Williams et al. (1990) and performed in two stages: the first stage included all of the P fractions except the sonic Pi fraction (eliminated by the test of multicollinearity; data not shown), and the second stage encompassed the P fractions selected only by backward multiple regression. In the path analysis, each normal equation represents a decomposition of the correlation coefficient into direct (path coefficient) and indirect effects between the resin Pi fraction (dependent variable) and the other P fractions (independent variables). The path analysis results were determined using the following equations (Williams et al. 1990):

$$ {\text{r}}_{19} = \varvec{P}_{{{\mathbf{19}}}} + {\text{r}}_{12} P_{29} + {\text{r}}_{13} P_{39} + {\text{r}}_{14} P_{49} + {\text{r}}_{15} P_{59} + {\text{r}}_{16} P_{69} + {\text{r}}_{17} P_{79} + {\text{r}}_{18} P_{89} $$
(1)
$$ {\text{r}}_{29} = {\text{r}}_{12} P_{19} + \varvec{P}_{{\mathbf{29}}} + {\text{r}}_{23} P_{39} + {\text{r}}_{24} P_{49} + {\text{r}}_{25} P_{59} + {\text{r}}_{26} P_{69} + {\text{r}}_{27} P_{79} + {\text{r}}_{28} P_{89} $$
(2)
$$ {\text{r}}_{39} = {\text{r}}_{13} P_{19} + {\text{r}}_{23} P_{29} + \varvec{P}_{{\mathbf{39}}} + {\text{r}}_{34} P_{49} + {\text{r}}_{35} P_{59} + {\text{r}}_{36} P_{69} + {\text{r}}_{37} P_{79} + {\text{r}}_{38} P_{89} $$
(3)
$$ {\text{r}}_{49} = {\text{r}}_{14} P_{19} + {\text{r}}_{24} P_{29} + {\text{r}}_{34} P_{39} + \varvec{P}_{{\mathbf{49}}} + {\text{r}}_{45} P_{59} + {\text{r}}_{46} P_{69} + {\text{r}}_{47} P_{79} + {\text{r}}_{48} P_{89} $$
(4)
$$ {\text{r}}_{59} = {\text{r}}_{15} P_{19} + {\text{r}}_{25} P_{29} + {\text{r}}_{35} P_{39} + {\text{r}}_{45} P_{49} + \varvec{P}_{{\mathbf{59}}} + {\text{r}}_{56} P_{69} + {\text{r}}_{57} P_{79} + {\text{r}}_{58} P_{89} $$
(5)
$$ {\text{r}}_{69} = {\text{r}}_{16} P_{19} + {\text{r}}_{26} P_{29} + {\text{r}}_{36} P_{39} + {\text{r}}_{46} P_{49} + {\text{r}}_{56} P_{59} + \varvec{P}_{{\mathbf{69}}} + {\text{r}}_{67} P_{79} + {\text{r}}_{68} P_{89} $$
(6)
$$ {\text{r}}_{79} = \,{\text{r}}_{17} P_{19} + {\text{r}}_{27} P_{29} + {\text{r}}_{37} P_{39} + {\text{r}}_{47} P_{49} + {\text{r}}_{57} P_{59} + {\text{r}}_{67} P_{69} + \varvec{P}_{{\mathbf{79}}} + {\text{r}}_{78} P_{89} $$
(7)
$$ {\text{r}}_{89} = {\text{r}}_{18} P_{19} + {\text{r}}_{28} P_{29} + {\text{r}}_{38} P_{39} + {\text{r}}_{48} P_{49} + {\text{r}}_{58} P_{59} + {\text{r}}_{68} P_{69} + {\text{r}}_{78} P_{79} + \varvec{P}_{{\mathbf{89}}} $$
(8)

where r ij is the simple correlation coefficient between the P fractions and the resin Pi, P ij are the path coefficients (direct effect), and r ij P ij are the indirect effects of the P fractions on the resin Pi. The subscripts are (1) bicarbonate Pi, (2) bicarbonate Po, (3) hydroxide Pi, (4) hydroxide Po, (5) sonic Po, (6) HCl Pi, (7) HCl Phot, (8) residual P, and (9) resin Pi. The model residual (U) was calculated using the following equation:

$$ U = \sqrt {1 - R^{2} } $$

where R 2 is the determination coefficient. The correlation, multiple regression, and path analyses were done using the software SAEG 8.0 (SAEG Inst. Inc.).

Exploratory factor analysis was used to test the hypothesis that functional P pools can be distinguished (non-measurable variables—latent) based on the pattern of covariance between the P fractions. Following this system, pools of P are formed as a function of their solubility in the range of extractions represented by the Hedley procedure as described above (Gijsman et al. 1996; Novais and Smyth 1999). A basic assumption of the factor analysis is that some latent structure (pool of P) exists in the set of selected variables (P fractions), accordingly we used four factors (latent variables) considered sufficient to represent the latent structure of the data. A factor loading of ≥0.60 was used as a selection criterion to interpret the role that each variable (P fraction) plays in the definition of each factor (pool of P), considering the sample size of the database (N = 81) at a significance level of 5 %. The factor loadings are the correlation of each variable with the factor; therefore, they indicate the degree of correspondence between the variable and the factor (Hair and Anderson 2010). The factor analysis was done using the GENE 7.0 software package (GENE Inst. Inc.).

Structural equation modeling

The results of the isolated multiple regression, path, and factor analyses served as a basis for the development of candidate models of the P cycle in the soil. These models constitute null hypotheses, and the method of SEM was used for model evaluation. SEM is included in the class of generalized linear models, allowing for the simultaneous testing of a set of regression equations. As a flexible multivariate analysis method that includes factor and path analysis, SEM is ideally suited to evaluate the relative importance of the pathways in hypothetical models and to compare models with experimental data (Mulaik 2009; Allison et al. 2007). The SEM variables are classified into two major groups according to two criteria. The variables may be observable (P fractions) or latent (pools of P). The former are based on direct measurement, while the latter are identified only by their effect on the observable variables. The latent variable (or structural) model specifies the causal relationships among the latent variables, enabling the study of the interdependence between the variables, including the determination of whether they are related (Arhonditsis et al. 2006). In addition, this model also allows for testing of the indirect effects that may be mediated by another intermediate variable (Doblas-Miranda et al. 2009).

Structural equation modeling diagrams typically use two symbols to indicate the relationship between different variables: a single arrow, which indicates a direct action or causal relation of the variable from which the arrow departs (exogenous variable) on the variable to which the arrow points (endogenous variable), or a double arrow, indicating a covariance between the variables involved. Because one of the goals of SEM is to determine the regression coefficients for each of these relationships, each endogenous variable in the model and each arrow that points to it are considered. As a result, we obtain a set of equations, namely structural equations that, in turn, will produce a system of linear equations in which the unknown factors are the coefficients of multiple regressions (Kline 2005; Mulaik 2009).

For testing hypothetical models of the P cycle, at the exploratory level, we first designed a preliminary basic structural model (Model generating) with five latent variables to evaluate the relative importance of P pools affecting the soil available P. These included the organic pool, the inorganic pool, the occluded pool and the primary mineral pool as exogenous variables, and the available pool as an endogenous variable (Fig. 1). The exploratory analysis of the basic hypothesized structural model was done in five steps, according to Malaeb et al. (2000): (1) model specification, (2) model identification, (3) parameter estimation, (4) testing model fit, and (5) re-specification of the model. Thus, the basic model was split into two sub-models and tested again using the same data. The goal in this process is to obtain a model that is consistent with the data based on comparisons between the covariances in the data and those implied by the model (Travis and Grace 2010). The matrix representation of the structural equation models is shown in the Appendix 2.

The SEM was implemented using the software AMOS v 21.0 (IBM; SPSS Inc., Chicago, IL, USA). We used the method of Maximum Likelihood to estimate the parameters, and to assess the general model fit, we applied the Chi square test (χ 2). Because of the Chi square test’s sensitivity to sample size, the Goodness-of-Fit Index (GFI), the comparative fit index (CFI), the Akaike Information Criterion (AIC) and the root mean square error of approximation (RMSEA) were also considered as alternative measures of model fit (Malaeb et al. 2000; Grace 2006), assuming that the observed variables follow a multivariate normal distribution. Both univariate and multivariate normality were tested (skewness and kurtosis), followed by appropriate transformations of all of the observed variables; namely, we used the Log 10 transformation for the resin Pi, bicarbonate (Pi and Po), sonic Po, and HCl Pi fractions, and we used the Log (x + 1) transformation for the hydroxide P (Pi and Po), HCl Phot, and residual P fractions. The multivariate kurtosis was assessed using the Mardia test (Mardia 1974). In the χ 2 test, if the null hypothesis was not rejected (P values >0.05), the model was assumed to have a good fit. The RMSEA values less than 0.07 suggest an adequate model fit, and similarly in the AIC index the preferred model is the one with the lowest value, while in the GFI and CFI indices the cut-off criterion ≥0.95 is indicative of good fit (Hooper et al. 2008).

Results and discussion

Factor and path analysis

The soils used here exhibited a wide variation in chemical and physical properties (see Appendix Table 8) and in P concentrations (Table 1). This variation allowed use of correlation and regression analyses to assess how the P fractions vary in relation to soil properties and to determine the interrelationships between the different P fractions. pH values are positively related to the changes in total soil N and organic C but are negatively related to the levels of Fe and Al oxides. Carbon shows a weak relationship with clay. In contrast, Fe and Al are closely related to the variation in clay content (see Appendix Table 9).

Table 1 Soil phosphorus fractions values of the literature survey

The interactions between the different P fractions and other soil properties were also evaluated with correlation analysis (Table 2). This analysis revealed a close relationship of the organic P fractions with organic C and total N, and positive relationships with the pH and clay content. Additionally, organic C, total N, and pH also have a positive influence on the inorganic P fractions. The resin Pi is negatively correlated with the Fe oxide content. Turner and Engelbrecht (2011), using NaOH–EDTA extraction and solution 31P nuclear magnetic resonance spectroscopy in soils under tropical forest in Panama, found close and positive associations of the total soil inorganic and organic P with soil pH and total soil C, but clay content did not appear to influence the contents of soil organic P as determined by simple correlations. However, Harrison (1987) reported weak relationship between organic C and organic P in tropical soils unlike temperate soils.

Table 2 Pearson correlation coefficients between P fractions and selected soil properties

Another analysis using multiple regressions of the selected variables shows that only the contents of organic C and total N had significant direct effects on most of the P fractions (see Appendix Table 10). The accumulation of organic C results in an increase in sonic Po at the expense of labile Po (bicarbonate Po). Resin Pi and hydroxide Pi were also positively related to organic C. The total N content was related to the reduction of bicarbonate Pi and hydroxide Pi amounts but increased Po amounts (bicarbonate and hydroxide). The clay content and pH had an insignificant effect on all of the P fractions. No significant direct effects of the soil properties were found on HCl Pi, HCl Phot, and residual P amounts. The total organic P fraction (bicarbonate, hydroxide, and sonic) was positively related to the soil properties, but only the total N had a direct effect on this fraction.

Table 3 shows the interrelationships of the different P fractions as determined by simple correlations. Close associations are observed between the P fractions. These Pearson correlation coefficients were comparable with those reported by Negassa and Leinweber (2009) for subtropical and tropical soils. However, detailed information regarding the interactions between the different fractions cannot be obtained from a simple correlation analysis. Factor analysis can reduce the dimensionality of the problem by using fewer latent or unobserved variables to explain variation in the measured variables. Thus, using factor analysis we obtained four factors that conceptually clarify the grouping of P fractions (Table 4): factor 1 represents all of the Po fractions, factor two represents the Pi fraction of the primary minerals, factor three represents the occluded P fractions, and factor 4 represents the Pi fraction of the secondary minerals. Similar results were reported by Tiessen et al. (1984) for temperate soils.

Table 3 Pearson correlation coefficients for nine soil P fractions (All the coefficients are significant at P < 0.001)
Table 4 Factor analysis of soil P in different fractions

The relationships between the resin Pi fraction and the other P fractions were evaluated using a series of multiple regressions in a path analysis (Table 5). Only the fractions of organic P (bicarbonate Po, hydroxide Po, and sonic Po) had a significant direct influence on the resin Pi fraction. Of these fractions, only the organic fraction of bicarbonate Po showed a negative effect. Furthermore, the bicarbonate Po, sonic Po, and residual P fractions had a significant positive indirect influence on the resin Pi fraction via the hydroxide Po fraction. Similarly, the hydroxide Po, HCl Phot, and residual P fractions had a significant positive indirect influence on the resin Pi via the sonic Po fraction.

Table 5 Path analysis direct effects (diagonal, bold) and indirect effects (off-diagonal) of different soil P fractions on resin Pi

In the backward regression model only the bicarbonate Po, hydroxide Po, sonic Po, and HCl Pi fractions were identified as statistically significant for predicting resin Pi fractions, with a coefficient of determination (R 2) of 0.84 and a residue (U) of 0.40. The fractions of organic P explain 71 % of the variation in the resin Pi fraction (see Appendix Table 11).

Backward stepwise regression analysis was used to decompose the correlation coefficients between the resin Pi and P fractions into direct and indirect effects (Table 6). The effect of bicarbonate Po on resin Pi (D = 0.35) was negative and not significant. The significant positive correlations found in the correlation analysis (Table 3) between bicarbonate Po and resin Pi and between sonic Po and resin Pi were attributed to the significant indirect effect of hydroxide Po (D = 0.51 and 0.53, respectively; Table 6). Although all of the P fractions analyzed were significantly related to the resin Pi, only hydroxide Po and sonic Po had significant direct effects on the resin Pi, and hydroxide Po had the highest direct effect (D = 0.60).

Table 6 Path analysis direct effects (diagonal, bold) and indirect effects (off-diagonal) of organic P fractions and HCl Pi on resin Pi

Structural equation modeling

The multiple regression, factor, and path analyses were able to distinguish between the direct and indirect effects of the soil properties on the P fractions, as well as to support a hypothetical distinction between the pools of P in the soil. When these statistical techniques are used in combination with the method of SEM, the levels of interdependence between the pools of P are quantified, and thus, the cause and effect relationship related to the transformation pathway of the element in the soil can be inferred. Figures 2 and 3 show the versions of the structural model of the P cycle based on factor analysis. The biological and geochemical processes of P in the soil are shown in the conceptual model A, in which all of the organic P fractions are grouped in the organic pool and all of the inorganic labile fractions are grouped in the inorganic P pool. The recalcitrant fractions (also included the sonic Po fraction) are associated with the occluded pool, the HCl Pi fraction is associated with the primary mineral pool, and the resin Pi fraction constitutes the most exchangeable pool (Fig. 2). Model A showed an unsatisfactory overall adjustment based on the Chi square test (χ 2 = 165.61, df = 17, P < 0.0001) and the other tests (GFI = 0.818; CFI = 0.777; CFI = 0.818; AIC = 221; RMSEA = 0.331). Conversely, all of the regression coefficients between the measured variables (P fractions) and the latent variables (pools of P) were highly significant (P < 0.001), which implies that all of the P fractions were appropriate indicators of the pools of P, demonstrating a plausible causality structure via the SEM method. Moreover, the close relationship between the P fractions and the pools of P reveals the functional nature of these pools when one considers that no clear boundaries exist between the quantified P fractions by the method of sequential extraction. The covariance values of all of the P pools (exogenous latent variables) were also highly significant (P < 0.001), but the regression coefficients of the relationship of these P pools with the resin P (endogenous latent variables) were not significant. The high covariance between the P pools (exogenous latent variables) affected the relationship of each of these pools with the most exchangeable P pool, yielding an overall apparently null effect for all of the P sources.

Fig. 2
figure 2

Model A, structural equation model for the soil P cycle. All measured variables (in boxes) are represented as effect indicators associated with latent variables (in circles). The numbers correspond to the standardized parameters estimated (P < 0.001). Error variables \( \left( {\varepsilon_{1} - \varepsilon {}_{9}} \right. \, and\left. { \, \zeta_{1} } \right) \) are standardized values. Model χ 2 = 165.61, df = 17, P < 0.0001

Fig. 3
figure 3

Model B, structural equation model for the soil P cycle. All measured variables (in boxes) are represented as effect indicators associated with latent variables (in circles). The numbers correspond to the standardized parameters estimated (P < 0.001). Double asterisks significant at P < 0.01. Error variables \( \left( {\varepsilon_{1} - \varepsilon {}_{6}} \right. \, and\left. { \, \zeta_{1} } \right) \) are standardized values. Model χ 2 = 105.13, df = 7, P < 0.0001

We evaluated the different sub-models assuming that at least one of the P sources should have a significant relationship with the resin P pool. Model B is a simplified version of the complete structural model, consisting only of the organic and occluded pools as exogenous variables (Fig. 3). In this model, the regression coefficient of the relationship between the organic pool and the resin P pool was significant (P = 0.003). In contrast, model B also showed an unsatisfactory overall fit (χ 2 = 105.13, df = 7, P < 0.0001; GFI = 0.842; CFI = 0.761; AIC = 133; RMSEA = 0.419).

Model C, a variation of model B shown in Fig. 4, illustrates the logical dependency between the P fractions (measurable variables) and available P pool (endogenous latent variable). This model is a latent variable model with single indicator (Resin Pi) and it is not saturated, as not all of the paths are specified, and represents a testable hypothesis. The general fit of the model was satisfactory (χ 2 = 3.44, df = 3, P = 0.329; GFI = 0.986; CFI = 0.999; AIC = 39; RMSEA = 0.043). All the regression coefficients of the relationships between the P fractions were significant. The available P pool was directly dependent on the organic P fractions (P < 0.05). The hydroxide Po and sonic Po fractions had a positive direct effect, and the overall effect of sonic Po was greater [β = 1.67 + (1.88 × 0.95) = 3.46] than the direct effect of that fraction. The bicarbonate Po had a direct negative effect and indirect positive effect via hydroxide Po and sonic Po on the available P pool, but its overall positive effect was low [β = (0.37 × 1.88) + (0.11 × 1.67) + (0.11 x 1.67 × 1.88) − 0.91) = 0.17]. The indirect effects of the recalcitrant fractions were complex. The residual P had a negative effect via hydroxide Po and a positive effect via sonic Po, but its indirect overall effect was positively very low [β = (−0.28 × 1.88) + (0.20 × 1.67) + (0.20 × 0.95 × 1.88) = 0.16]. In contrast, HCl Phot showed high overall positive effect [β = (0.55 × 1.67) + (0.55 × 0.95 × 1.88) = 1.90]. The residual P had an overall negative effect on hydroxide Po very low [β = (0.20 × 0.95) − 0.28 = −0.09].

Fig. 4
figure 4

Model C, structural equation model for the soil P cycle. Hypothesized model relating labile and stable P fractions (measured variables) to available P pool (latent variable). The numbers correspond to the standardized parameters estimated (P < 0.001) and the R-squared values (numbers in bold). Asterisk, double asterisk significant at P < 0.05 and 0.01 respectively. Error variables \( \left( {\varepsilon_{1} - \varepsilon {}_{3}} \right. \, and\left. { \, \zeta_{1} } \right) \) are standardized values. Model χ 2 = 3.42, df = 3, P < 0.329

The model C (Fig. 4) of the P cycle fitted by the SEM method is theoretically consistent and is likely general, considering that the present study is a comprehensive analysis at an exploratory level of the P fractions in a range of unfertilized tropical soils, but this model needs confirmation using independent data. The weak overall adjustment of the tested structural models A and B (Figs. 2, 3) highlights the need for future studies to improve the adequacy of these models or that they are simply inadequate. However, it should be noted that a well-adjusted model is not necessarily a valid model (Kenny 2012). A good general fit only shows that the model adequately describes the observed data (Mitchell 1992). A model that is not well adjusted can have parameters that are all statistically significant (Kenny 2012). In this case, within certain limits, the hypothetical model can be used to describe the relationships between the variables that comprise it (Fig. 3), where the regressions of these relationships are evaluated simultaneously, allowing for the comparison of the relative importance of the indicator variables (Malaeb et al. 2000). Moreover, the close correlation between the measurable variables (P fractions, Table 3) and the covariance between the latent variables (P pools) affects the overall degree of fit of the hypothetical structural models evaluated by the χ 2 test (Figs. 2, 3). This type of overall fit index is affected by the size of the correlations in the model. Large correlations result in low fit indices (Kenny 2012). Thus, we must consider that this is perhaps a problem inherent to the structural models of P fractions in the soil, based on the sequential extraction method. In this method, the P fractions are not true independent variables. Thus, given that the variables that contribute the most to any high collinearity of the data are discarded, the hypothetical model of the P cycle can be evaluated using the set of regression coefficients to make inferences about the direction and strength of relationship between their variables, as obtained using the SEM method (Tiessen et al. 1984; Beck and Sanchez 1994; Zheng et al. 2002).

Our analysis of the P cycle models, based on all of the statistical techniques, showed that the resin Pi (indicator of available P pool) was closely related to the Po fractions. Thus, there is strong support for the control of available P pool by the mineralization of organic P, from which the moderately stable fraction of hydroxide Po acts as the primary source of available P pool and as a transitional fraction of the other Po fractions. In model C (Fig. 4), the labile bicarbonate Po seems to act as a sink of available P pool, considering the dependency of available P pool with respect only to the organic P fractions (indicators of organic P pool). The sonic Po is also revealed as a fraction with an important role in the overall dynamics of P in tropical soils, although this fraction is considered as the recalcitrant/stable form of Po (Cross and Schlesinger 1995). This fraction showed a close relationship with the other recalcitrant P fractions in the soil. In part, this may occur because the HCl Phot and residual P fractions have a considerable and very stable Po content (Tiessen and Moir 1993; Cross and Schlesinger 1995; Oberson et al. 2001; Linquist et al. 1997; Frizano et al. 2002, 2003; Giardina et al. 2000).

In general, the total organic P made up 25.5 % of the total P, and the average proportion of the hydroxide Po of the total organic P was 64.5 % (Table 1). Other studies reported similar values of the proportion of organic P (24.9–28.7 %) in strongly weathered soils, determined by both ignition and extraction procedures (Sharpley et al. 1987; Harrison 1987; Turner and Engelbrecht 2011). Using path models, Tiessen et al. (1984) and Beck and Sanchez (1994) showed that in highly weathered Ultisols, 44–80 % of the variation in resin P was explained by the organic fractions (bicarbonate and hydroxide). Soil organic P is a major source of plant-available P in low input tropical agriculture and forests (Negassa and Leinweber 2009; Reed et al. 2011). The transformation pathway of P in the soil, especially the magnitude of influence of the organic P on the available P, depends on the type of soil, climate, type of P sources and land use history. But as a number of factors contribute to transformations of P fractions, it is difficult to exactly determine the kinetics of exchange reactions (Negassa and Leinweber 2009).

The models tested in this study may be used like tool in understanding of some processes of P cycling in determining the function and productivity of tropical forest and perennial plants, considering that the Ultisols and Oxisols are the dominant tropical orders and are characterized by relatively low P availability (Reed et al. 2011). As occluded pool of P was the sparingly available P and organic pool of P was considered reversibly available (Model C, Fig. 4), the uptake of P by plants would be regulated by the turnover of organic P and the rapid recycling of P from accumulated litter (Turner and Engelbrecht 2011). However, what is the real contribution of soil organic P to the nutrition of ecosystems is not yet known. Therefore, further research is now required to refine the model C (Fig. 4) to maximize the efficiency of P use in low-input agricultural systems.

Conclusion

We proposed and analyzed conceptual models of P cycling in unfertilized tropical soils, finding that the organic P pool constitutes the main source of available P pool with resin P fraction as its single indicator. This result implies that caution should be used in low-input agricultural systems, when using conventional methods of soil fertility analysis that are not sensitive enough to detect changes in the forms of soil P.

Structural equation modeling proved to be a suitable tool with which to understand the various possible cycles of P in the tested soils, indicating the degree to which the alterations in the P level of a pool can affect the other P pools. Thus, SEM can be used not only to evaluate hypotheses but also to help improve them in an exploratory manner.