1 Introduction

Masonry exhibits distinct directional properties due to the influence of the mortar joints. Depending upon the orientation of the joints to the stress directions, failure can occur in the joints alone or simultaneously in the joints and the blocks. Nowadays, in developed societies, there is an increased interest about reliable and robust computational models for the modeling of masonry structures. This is primarily due to the growing interest of protecting heritage structures. The main characteristic of the majority of these structures is that they are mostly made of masonry materials.

To fulfill this need, in the last decades a large number of macro and micro computational models have been proposed both for linear and nonlinear analysis of such structures [16]. Detailed and in-depth state-of-the-art reports can be found in Lourenço [7], Roca et al. [8], Asteris et al. [9, 10], Sarhosis [11] and Plevris and Asteris [12].

The successful implementation of these advanced models requires reliable analytical constitutive rules such as constitutive equation models of masonry material failure, which is not the case at the present. To date, research work has been focused on isotropic masonry models, which are based, primarily, on models applied for concrete [13]. Regarding non-isotropic masonry, there are a few research works in which failure criteria approximate the failure surface by employing different forms of quadratic polynomials. Many variations of such models have been published to date, including those that define the failure surface through the use of different functions for each quadrant [1419]. However, for certain load cases, such models overestimate strength [20].

The main disadvantage of the above criteria is that the corresponding failure surface is a synthesis of at least four individual surfaces (one for each quadrant). This leads to the formation of singular points, namely “corners” (curves at which the individual surfaces intersect) where the flow vector is not uniquely defined [21, 22], thus leading to indeterminate direction of straining. The existence of singular points causes significant problems to nonlinear analysis [23]. In order to avoid the formation of such points, the use of a closed continuous yield surface described analytically by a single function (cubic polynomial) has been recently proposed [2426].

During the last decades, neural networks have been used extensively for various applications in civil engineering [2730] and earthquake engineering [31, 32], including many attempts on modeling the materials’ constitutive rules [3338]; however, to the authors’ knowledge, there have not been many attempts to apply a neural network (NN) for the prediction of masonry behavior in general. The only proposal in this area is that of a previous work by the authors [12] in which a novel method has been proposed by applying neural networks (NNs) to approximate the masonry failure surface [limited only for the case of biaxial compressive (only) stress state]. The method comprises a series of NNs that are trained with available experimental data. The results demonstrate the great potential of using NNs for the approximation of masonry failure surface under biaxial compressive stress.

In the present paper, following the work undertaken above, the use of neural networks is proposed for modeling the anisotropic masonry behavior under biaxial stress state. In particular, a novel procedure based on NN techniques has been introduced for acquiring the failure/yield surface of masonry material in a dimensionless form, taking into account available experimental results from the literature as input data for the training and validation of NNs. It is the first time this has been achieved in a dimensionless form, for any quadrant of the masonry stress state.

2 Methodology

The highly anisotropic brittle nature of masonry renders complicated, difficult and expensive the realization of reliable experimental tests under conditions of biaxial stress, and, even more, under conditions of biaxial tension or heterogeneous stress. The angle of the applied loading to the bed joint plays a significant role in the behavior of the brick masonry disks. In general, the highest strength of masonry is observed when the compressive load is perpendicular to the bed joints or in other words when the principal tensile stress at the center of the disk is parallel to the bed joints. In this case, failure occurs through bricks and perpendicular joints. On the other hand, the lowest strength is observed when the compressive load is parallel to the bed joints or in other words when the principal tensile stress at the center of the disk is perpendicular to the bed joints. In this case, failure occurs along the interface of brick and mortar joint.

The methodology of the present work is similar to another study by the authors [12]. That previous work focused only on the compression–compression area, where both principal stresses were compressive, i.e. it focused on the third quadrant of the σ Ισ ΙΙ plane (compression–compression), without having to do with the other three quadrants. The present work extends the investigation to all four quadrants, namely the (A) tension–tension, (B) compression–tension, (C) compression–compression and (D) tension–compression quadrant. The former work utilized the experimental data reported by Page [3941], which have been already used by many other researchers [26, 42, 43]. Ratios of vertical compressive stress σ Ι to horizontal compressive stress σ ΙΙ of 1, 2, 4, 10 and ∞ (uniaxial σ Ι) have been used in conjunction with a bed joint angle θ with respect to σ Ι, in directions of 0°, 22.5°, 45°, 67.5° and 90°. A minimum of four tests were performed for each combination of σ Ι/σ ΙΙ and θ.

The present study applies neural networks in order to approximate the experimental failure curves of a brittle anisotropic material such as masonry. The aim of the study was to introduce an anisotropic (orthotropic) neural network—generated 3D failure surface under biaxial stress for masonry for any angle of the joints to the vertical compressive load, as described in detail in the next paragraphs. First, for each angle θ (0°, 22.5°, 45°, 67.5°, 90°) of the joints to the vertical compressive load, a neural network was trained with the experimental data of Page [3941] as inputs (5 NNs in total). Then each one of the five NNs was asked to produce the whole 2D failure curve for each angle θ as its output, filling also the gaps between the experimental points, thus “enriching” the experimental data with appropriate approximations. Then another, bigger, “global” NN was trained using the results of the five NNs as inputs with the angle θ as an input, also. The new NN was then asked to fill also the gaps between the angles θ, providing the whole 3D failure surface (a tube) for any angle θ (0°–90°) and any ratio of σ I/σ II, for all four quadrants of stress.

3 Biaxial testing procedure

Masonry is a composite, multi-phase material that depicts distinct brittle and strongly anisotropic nature, making complicated, difficult and expensive, the realization of reliable experimental tests under conditions of biaxial stress, and, even more, under conditions of biaxial tension or heterogeneous stress. In the next sections, the process of both the preparation of test specimens and the tests conducted under biaxial stress state are presented in detail, with special attention to the parameters that affect the results. At this point, it should be emphasized that available experimental results obtained by other researchers are used in this paper.

3.1 Preparation of test specimens

Two main methodologies are used in the literature in order to obtain the final test specimens with the correct shape and size, presented in Samarasinghe [44], Page [3941] and Tasios and Vachliotis [45].

3.1.1 The first methodology

Specimens with horizontal and vertical joints (Fig. 1a) are constructed with dimensions greater than those of the final specimen, and then a square with the desired dimensions is drawn on the surface of the wall by a pencil at the appropriate orientation to achieve the correct layup angle as shown in Fig. 1b.

Fig. 1
figure 1

Preparation of the masonry specimens. a Wall panel before saw cut, b saw-cut specimen

Angle θ denotes the bed joint angle with respect to the horizontal direction, as shown in the figure, which can be also defined as the angle between the direction of the bed joints and one of the edges of the finished test specimen. After a time-span of 14 days, the larger wall panels were cut to the required size and shape by a “Clipper” saw (Fig. 1b) which has the capacity to hold impregnated diamond edge circular blades of varying thickness and diameters. Two days prior to the date of testing, the “compressive edge” of the panel (the side on which the compressive load was to be applied) was capped with 1:1 (cement/sand) mortar [12].

3.1.2 The second methodology

According to Page [3941], all specimens are constructed directly to their final shape and size as follows: All brickwork is constructed horizontally on a rigid form with bricks glued to a Perspex backing sheet to ensure a constant joint thickness. Panels are made with varying bed joint angles by cutting individual bricks to the required shape before casting.

In the present study, five layup angles were selected for biaxial tests, namely 0°, 22.5°, 45°, 67.5° and 90°.

3.2 Testing rig

A biaxial stress state is induced in the panel by loading with hydraulic jacks in two orthogonal directions, as shown in Fig. 2. A constant load ratio is maintained during each test by means of the spreader beam. The load in each direction is monitored by load cells immediately adjacent to the specimen.

Fig. 2
figure 2

Biaxial testing rig

4 Biaxial compression tests

For all panels that were tested by Page [3941], ratios of compressive stress σ I to horizontal compressive stress σ II of 1, 2, 4, 10 and infinity (i.e. uniaxial σ Ι) were used in conjunction with bed joint angle θ with respect to the σ Ι direction of 0°, 22.5°, 45°, 67.5° and 90°. Figure 3 shows the saw-cut specimens and stress directions for the five cases θ = 0.0°, θ = 22.5°, θ = 45.0°, θ = 67.5° and θ = 90.0°. The cases a(0°)–e(90°), b(22.5°)–d(67.5°) are symmetric to each other, that is why they have been put one under the other in the figure. Principal stress ratios of 0 (i.e. uniaxial σ ΙI), 0.1, 0.25, 0.5 and 1 were obtained from the results using the symmetry of the panels and the loading. A minimum of four tests were performed for each combination of σ Ι/σ ΙΙ and θ. The failure envelopes that Page obtained by plotting mean curves for each bed joint angle can be found in Page [3941] and Plevris and Asteris [12]. These failure envelopes are based on the peak strength values obtained from running bond masonry panel tests in which a uniform loading was applied proportionally.

Fig. 3
figure 3

Saw-cut specimens and stress directions. a θ = 0.0°, b θ = 22.5°, c θ = 45.0°, d θ = 67.5°, e θ = 90.0° [3941]

It should be noted that there are no real experimental data for the cases (d) θ = 67.5° and (e) θ = 90.0°, because these cases are equivalent to the cases (b) θ = 22.5° and (a) θ = 0.0°, respectively, if we reverse σ Ι and σ II, as shown in Fig. 3.

5 Artificial neural networks

In the present study, we use Back-Propagation Neural Networks (BPNNs), in which the output values are compared with the correct answer to compute the value of a predefined error function. The error is then fed back through the network. Using this information, the algorithm adjusts the weights of each connection in order to reduce the value of the error function by some small amount. The procedure is similar to the one used in a previous work by the authors [12], where more details can be found. The transfer function used in the present study is the hyperbolic tangent function, the same for all the hidden and the input layer, which yields output values in the interval [−1, 1], while its derivative yields output values in the interval [0, 1]. The transfer function for the output layer is a linear function. This scheme has been used in all the NNs of the study.

6 Five NN models (for each angle θ)

6.1 NN architecture

For all five NNs, we used a BPNN with one hidden layer, one input layer and one output layer. The input layer had one node (neuron) which corresponds to the angle φ (in the range [0, 2π]), which defines the ratio σ II/σ I, while the output layer had also one node which corresponds to the distance (radius) r of the point on the failure curve to the origin of the axes. These two important parameters (φ and r) will be described in detail in the next paragraphs. The hidden layer of each NN had 4 nodes for all θ (θ = 0°, 22.5°, 45°, 67.5°, 90°), ending in a 1-4-1 BPNN architecture for all five cases. These values have been chosen after some trial tests on various network architectures. The input and output values are normalized before the NN training and the inverse normalization is performed in order to take the NN results for other data afterward. It should be noted that the very same NN architecture was used for all five cases with no exceptions.

6.2 Preparation of the NN input data

The experimental data of Page [3941] have been used as inputs for the first five NN models of the present study. The figure below shows the original data for the θ = 0° case (in a normalized form, shown as blue points in the figure) together with the average values for each σ I/σ ΙΙ ratio (shown as red squares) (Fig. 4).

Fig. 4
figure 4

Normalized experimental data and average values for the θ = 0.0° case

Table 2 (in the “Appendix”) shows the corresponding analytical data for the θ = 0° case. The table consists of four parts, which correspond to four different types of test, published in three different papers, as shown below:

  1. (A)

    1st quadrant: biaxial tension (σ I ≥ 0, σ II ≥ 0), Page [39]

  2. (B)

    2nd quadrant: biaxial heterossemous stresses (compression–tension) (σ I ≤ 0, σ II ≥ 0), Page [41]

  3. (C)

    3rd quadrant: biaxial compression (σ I ≤ 0, σ II ≤ 0), Page [40]

  4. (D)

    4th quadrant: biaxial heterossemous stresses (tension–compression) (σ I ≥ 0, σ II ≤ 0), Page [41]

The first two columns of the table contain the original experimental data, namely the failure principal stresses σ Ι and σ II in MPa. The next two columns (columns 3, 4) contain the same data in a dimensionless form where the stresses have been divided with the stress f wc which is the masonry strength for the case σ Ι = 0 for Case C (compression–compression case). The value of f wc has been calculated as the average of the four values (highlighted in bold in the above table) as 7.56 MPa.

In Parts B, C, D we have 2, 3 or 4 tests for every σ I/σ II ratio. The tension–tension experimental test is the most difficult to perform and also the values are small and the data points more dense, so there is only one test for every σ I/σ II ratio. The next two columns (columns 5, 6) are the averages of the tests for each loading case, i.e. for each σ II/σ I ratio. For Part A, the average values coincide with the values themselves, as there is only one test performed for each σ I/σ II ratio.

For the data to be suitable for the neural network training, a conversion to polar coordinates (r, φ) is needed, where the distance (or radius, column 8) r is given by

$$r = \sqrt {\sigma_{{\text{I},\text{aver}}}^{2} + \sigma_{{\text{II},\text{aver}}}^{2} }$$
(1)

where σ I,aver and σ II,aver are the average stresses for each loading case (columns 5, 6).

Since we are trying to describe a full circle, the polar angle value φ has to be within [0, 2π) range. Based on the values of σ I and σ II, we need to calculate the value of the polar angle φ. This can be done easily using the inverse tangent function (Arctan), which returns values within the (−π, π) range, as follows:

$$\varphi_{A} = \text{Arctan} \left( {\frac{{\sigma_{{\text{II}}} }}{{\sigma_{\text{I}} }}} \right)$$
(2)
$$\varphi_{B} = \varphi_{C} = \pi + \text{Arctan} \left( {\frac{{\sigma_{{\text{II}}} }}{{\sigma_{\text{I}} }}} \right)$$
(3)
$$\varphi_{D} { = 2\pi + }\text{Arctan} \left( {\frac{{\sigma_{{\text{II}}} }}{{\sigma_{\text{I}} }}} \right)$$
(4)

For cases where σ I = 0, the above formulas do not apply and it is obviously φ = π/2 (for σ II > 0) or φ = 3π/2 (for σ II < 0). The angle φ is given in column 7 of the table. The figures below show the corresponding diagrams for the other four θ cases (θ = 22.5°, θ = 45°, θ = 67.5°, θ = 90°) (Figs. 5, 6, 7, 8).

Fig. 5
figure 5

Normalized experimental data and average values for the θ = 22.5° case

Fig. 6
figure 6

Normalized experimental data and average values for the θ = 45.0° case

Fig. 7
figure 7

Normalized experimental data and average values for the θ = 67.5° case

Fig. 8
figure 8

Normalized experimental data and average values for the θ = 90.0° case

It should be noted that the data for θ = 45° are symmetric with respect to the 45° diagonal line (σ I = σ II), due to not only the nature of the loading but also to the geometry of the masonry panel where for this special case (Fig. 3c) the bed joint orientation is equal to angle θ = 45°. Τhe data for θ = 0° and θ = 90° that correspond to complementary angles are also mutually symmetric to the line σ I = σ II. The same also applies for the data for θ = 22.5° and θ = 67.5. These special conditions will be studied in detail in the next paragraphs.

For each of the five angles θ (θ = 0°, 22.5°, 45°, 67.5°, 90°), a neural network is trained with the angle φ as its input and the radius r as its output. Only the average values of each test (denoted as “square” points in the above figures) are used as training data for the neural networks. This proved to be the best strategy for handling this kind of problem, in order to avoid overtraining problems with the NN.

6.3 Properties of the curves: NN corrections

The failure curves have some distinct characteristics, and the NN results have to be in some cases corrected in order to follow the rules and comply with the characteristics. These characteristics are presented below.

6.3.1 Consistency for φ = 0 and φ = 2π

The case φ = 0 (start of circle) is in fact the same point as the case φ = 2π (end of circle), and the results in these two points should coincide. Each curve has to be “closed,” i.e. the two radius r 0 and r 2π for φ = 0 and φ = 2π, respectively, have to be exactly the same. Also the tangents of the curve at φ = 0 and φ = 2π have to coincide. The problem is depicted schematically in Fig. 9a.

Fig. 9
figure 9

Consistency for φ = 0 and φ = 2π. a Initial curve, b corrected curve

For this reason, and in order to improve the performance of the NN, two measures have been taken:

  1. 1.

    Each NN is trained with two identical circles of data (instead of one circle which would normally be the case), so that the input corresponds to the angle φ in the range [0, 4π] and as a result the data points are doubled. This measure helps the NN show a consistent behavior at φ = 2π (end of circle). The number of data points for each case is given in Table 1.

    Table 1 Number of data points for NN training, for each case
  2. 2.

    The first measure mentioned above gives very good results, and it improved dramatically the performance of the NN. Although the problem is minimized, it still exists, in the sense that there is still a slight numerical difference between the NN-obtained r 0 and r 2π . In order to obtain a perfect, closed curve, a second measure is also taken. If r 0 is the predicted distance for the case φ = 0 and r 2π is the predicted distance for the case φ = 2π, then we apply the following correction formulas:

    $$\lambda = \frac{{r_{2\pi } }}{{r_{0} }}$$
    (5)
    $$k_{1} = \frac{1 + \lambda }{2}$$
    (6)
    $$k_{2} = \frac{1 + \lambda }{2\lambda }$$
    (7)
    $$r_{\varphi }^{{\prime }} = r_{{\varphi }} \cdot \left[ {k_{1} + \frac{\varphi }{2\pi } \cdot \left( {k_{2} - k_{1} } \right)} \right]$$
    (8)

    For φ = 0 or φ = 2π, Eq. (8) gives (r 0 + r 2π )/2 which makes a closed curve, as shown in Fig. 9b. This correction is applied to the results of all NNs.

6.3.2 Symmetry for θ = 45°

For the special case of masonry with bed joint angle θ = 45°, the masonry panel exhibits a symmetry which leads to a symmetric failure curve to the line σ Ι = σ ΙΙ as shown in Fig. 10.

Fig. 10
figure 10

Symmetry to the σ Ι = σ ΙΙ line, for the case θ = 45.0°

Since the training data had already been symmetric, there were only slight differences in the NN results and the symmetry was only slightly disturbed. In any case, special care has been taken in order to obtain a perfectly symmetric result for this case. In particular, the results in the two sides of the symmetry line were both taken into account and the average was finally considered, ensuring that the final result would be perfectly symmetric.

6.3.3 Symmetry for complementary angles (θ and 90° − θ)

It has been noted that the failure curves that correspond to complementary angles (θ and 90° − θ), such as θ = 0° and θ = 90° as well as θ = 22.50° and θ = 67.5°, are mutually symmetric to the line σ Ι = σ ΙΙ as shown in Fig. 11.

Fig. 11
figure 11

Symmetry to the σ Ι = σ ΙΙ line, between curves corresponding to complementary angles (θ and 90° − θ)

Although the results obtained by the NNs are satisfactory, we apply a special correction in order to ensure that the results for the case 0° are exactly equivalent to the results of the case 90° and also the results for the case 22.5° are equivalent to the results of the case 67.5°. In particular, the results of the two equivalent cases were both taken into account and the average was finally considered, ensuring that the result would be perfectly symmetric between the two cases.

6.4 Final approximation results of the five NNs

The five NNs were trained with the input and output data of Tables 1, 2, 3, 4, 5 (last two columns) and then each NN was asked to produce the full curves for each bed joint angle, for a set of 80 segments (81 points). Although the NN was trained for two circles of data, it was asked to report results only for the first circle (in the range [0, 2π]). The results are shown in the figures below.

In all the figures (Figs. 12, 13, 14, 15, 16, 17, 18), the black dots denote the input data, i.e. the data corresponding to the last two columns of Tables 2, 3, 4, 5, 6. The blue curve denotes the NN prediction of the fitting curve. It can be seen that the NN managed to fit all the training data with very good accuracy, providing smooth curves in all cases.

Fig. 12
figure 12

NN approximation result for θ = 0° (σ ΙΙ vs σ Ι, dimensionless)

Fig. 13
figure 13

NN approximation result for θ = 0° (Radius r vs Angle φ)

Fig. 14
figure 14

NN approximation result for θ = 22.5° (σ ΙΙ vs σ Ι, dimensionless)

Fig. 15
figure 15

NN approximation result for θ = 45° (σ ΙΙ vs σ Ι, dimensionless)

Fig. 16
figure 16

NN approximation result for θ = 67.5° (σ ΙΙ vs σ Ι, dimensionless)

Fig. 17
figure 17

NN approximation result for θ = 90° (σ ΙΙ vs σ Ι, dimensionless)

Fig. 18
figure 18

NN approximation result for the case θ = 45°, zoomed in

Figure 15 shows the θ = 45° case, where it can be seen that the curve is perfectly symmetric to the σ Ι = σ ΙΙ line, because of the extra measures that have been taken. Figure 18 shows the same case (θ = 45°), where we zoom in the tension–tension area. It is clear that the result is a perfectly closed curve. By comparing Fig. 12 (θ = 0°) to Fig. 17 (θ = 90°) and Fig. 14 (θ = 22.5°) to Fig. 16 (θ = 67.5°), it can be seen that the results for complementary angles are indeed symmetric to the σ Ι = σ ΙΙ line.

7 “Global” neural network model

In the previous part of the study, five NN Models were trained, each NN for one angle θ (0°, 22.5°, 45°, 67.5° and 90°). The final purpose of the study is to add also the angle θ as a parameter of the problem, thus creating a model that will be able to predict the entire failure curve not only for the predefined angles, but for any angle θ, from 0° to 90°.

In order to achieve this, after the five NNs were trained and they had provided their results, another bigger, “Global” NN was trained which took the results of the five NNs as inputs with the angle θ as an additional input, also. The new NN was then asked to fill also the gaps between the angles θ, providing the whole 3D failure “tube” for any angle θ and any angle φ (ratio of σ II/σ I), for all four quadrants.

The Global NN is also a BPNN with one hidden layer containing 10 nodes (2-10-1 architecture). The two inputs are the angles φ (0 to 4π—two full circles) and θ (0° to 90°), while the output is the distance (radius) r. The transfer function of the global NN is the hyperbolic tangent function that was used also in the first five NNs. For the global NN, we use all the results of the other five NNs as training patterns. This means that we have 161*5 = 805 training patterns, as we use 161 points (80*2 + 1) for every angle θ (0°, 22.5°, 45°, 67.5° and 90°).

8 Global NN results

8.1 Failure tube in 3D

The global NN was trained and then it was asked to produce the whole 3D failure surface. Although the global NN was trained in two full circles ([0, 4π]), in order to achieve better performance and to manage to “close” the curve for the first circle, it was asked to produce results only for the first circle ([0, 2π]). Specifically, the NN was asked to produce points where the angle φ was divided again in 160 (80*2) segments (161 points, each segment is equivalent to 360°/80 = 4.5°), while angle θ is divided in 40 segments (41 points, each segment is equivalent to 90°/40 = 2.25°). The figure below shows the result of the NN approximation in 3D. The red points (dots) denote the initial training set of the first five NNs, i.e. the average values gathered from the experimental results and used for the training of the initial NNs (Fig. 19).

Fig. 19
figure 19

Global NN approximation result of the 3D failure surface in terms of principal stresses (tube)

8.2 Closed failure surface in 3D

Knowing the principal stresses σ I, σ II and the angle θ, we can obtain the corresponding values of σ x , σ y , τ xy  = τ, using the following well known formulas:

$$\sigma_{x} = \frac{1}{2}\left( {\sigma_{\text{I}} + \sigma_{II} } \right) + \frac{1}{2}\left( {\sigma_{I} - \sigma_{II} } \right) \cdot \text{cos}\left( {2\theta } \right)$$
(9)
$$\sigma_{y} = \frac{1}{2}\left( {\sigma_{I} + \sigma_{II} } \right) - \frac{1}{2}\left( {\sigma_{I} - \sigma_{II} } \right) \cdot \text{cos}\left( {2\theta } \right)$$
(10)
$$\tau = - \frac{1}{2}\left( {\sigma_{I} - \sigma_{II} } \right) \cdot \text{sin}\left( {2\theta } \right)$$
(11)

This way we can convert the stresses from the system of the triplet (σ I, σ II, θ) to the one of the triplet (σ x , σ y , τ). Figure 20 shows the failure curves in the 3D system of the triplet (σ x, σ y, τ), where an “onion” is formed. Again, the red points (dots) denote the initial training set of the first five NNs (primary training data).

Fig. 20
figure 20

Global NN approximation result of the 3D failure surface (“onion”) in terms of normal stresses, two different views

Figure 21 shows the zoomed-in picture of the tensile area of the failure onion.

Fig. 21
figure 21

The tensile area of the 3D failure “onion,” zoomed in

9 Comparison with other criteria: discussion

The results are compared to the main analytical criteria used to define the masonry failure surface [12]. To better facilitate the comparison, classical failure criteria have been used from the literature such as:

  • The Von Mises modified failure criterion, which has been adopted by many researchers [4650].

  • The second-order polynomial, which comprises a simplified version of the cubic tensor polynomial, to be presented afterward. It applies to anisotropic materials and is commonly used to define the failure of masonry materials [26].

  • The third-order polynomial or cubic tensor polynomial. This criterion comprises the generalization of the aforementioned failure criterion and has been applied to anisotropic composite materials [20, 51, 52] including masonry materials [24, 26].

All three failure criteria assume plane stress state and are expressed in terms of non-dimensionless normal stresses regarding the uniaxial compressive strength perpendicular to the bed joints of masonry \(f_{{\text{wc}}}^{90^\circ }\), as shown in [12]. Extensive and in-depth state-of-the-art reports on all the above criteria used for masonry can be found in [53, 54].

Figures 22, 23, 24, 25 and 26 show the comparison between the results obtained using the proposed NN methodology, in relation to the available analytical failure criteria, which were presented above, and the experimental results obtained by Page [3941]. Although some of the analytical failure criteria perform well in specific cases of the θ angle, and for specific quadrants of the stress state, it is often observed that in other cases they perform much worse, as they do not have the flexibility to adjust and give the whole picture for all angles and all quadrants. On the other hand, as shown in the figures, NN performs very well and shows the best overall (most balanced) performance compared to the other criteria.

Fig. 22
figure 22

Comparison of results for θ = 0.0°

Fig. 23
figure 23

Comparison of results for θ = 22.5°

Fig. 24
figure 24

Comparison of results for θ = 45.0°

Fig. 25
figure 25

Comparison of results for θ = 67.5°

Fig. 26
figure 26

Comparison of results for θ = 90.0°

10 Conclusions

Despite the plethora of analytical models that have been proposed for the modeling of masonry material, there is no analogous progress in models for the description of its failure surface, which is a prerequisite for a comprehensive and reliable implementation of them.

In the present study, an NN procedure for the modeling of the anisotropic masonry behavior under biaxial stress state is presented. In particular, a novel procedure based on neural network techniques has been introduced in order to model the failure/yield surface of masonry material in a dimensionless form. To this end, available classical experimental results from the literature are used as input data for the training as well as the validation of NNs.

Being aware that masonry is a multi-phase material that exhibits distinct brittle and anisotropic nature with wide scatter in the values of its mechanical characteristics, the proposed NN procedure has been shown to be reliable and robust, providing valuable results.

In particular, the derived failure surface, on the one hand, fits the experimental data with high accuracy and on the other hand it is provided in a dimensionless form which is very important as it can be easily applied to other masonry materials of similar geometry and properties. Furthermore, the derived surface provides valuable information about areas of the masonry failure that have not been investigated until now (failure under heterosemous stresses or furthermore failure under biaxial tension for any tilt angle of the bed joints) helping us to better understand the whole fracture mechanics phenomenon.