1 Introduction

The ways and channels of information transmission have changed fundamentally in the information society [1]. A great deal of information spreads rapidly to all corners of society in various ways. The explosive growth of information received in our daily lives occurs. The image has become one of the important carriers of information dissemination by virtue of its simple, easy to understand and intuitive features in the dissemination of information [2]. According to the relevant statistics, the results show that more than 70% of the information is transmitted in an image in the information society [3]. However, the information contained in the image is not all the information needed by the receiver, so it is necessary to extract the information from the image. It is found that the information in the image is often concentrated in certain areas, and they only need to separate the information contained in the specific region from the image to obtain relevant information in the process of image information extraction [4]. There are many ways to extract the information from the image, but the most widely used method is image segmentation [5].

Generally, it includes image segmentation, image analysis and information extraction from three to low level in the process of extracting information from images using image segmentation [6]. Image segmentation is the basis of image information extraction, and the result will directly affect the subsequent steps of image information extraction [7]. The user generally divides the image according to the image features such as gray level, texture or color in the process of image segmentation. And then valuable images are extracted from these regions [8]. Image segmentation has become the main means of image information extraction after decades of development, which involves more and more fields, [9] such as military, industrial and medical fields. Image segmentation has been a very active topic in the academic field in addition to the field of engineering. The related research theories and achievements are also very fruitful. It is because of the image segmentation in the field of application and academic fields are of great value that all over the world have carried out in-depth research and the relevant papers and research results emerge [10]. However, there is still no general image segmentation algorithm for image segmentation. Its application is limited to specific areas [11]. The purpose of this paper is to improve the algorithm of image segmentation based on Improved Fuzzy clustering, which provides a theoretical basis for the general image segmentation algorithm.

2 Fuzzy clustering and image segmentation

2.1 Theoretical basis and analysis method of fuzzy clustering

Fuzzy clustering theory is a branch of fuzzy mathematics, which was proposed by an American Professor Zadeh in 1965 for theoretical characterizations of fuzzy phenomenon. This theory is a generalization of the traditional set theory [12]. The relationship between elements and sets is very clear, and there is no ambiguity in the traditional set theory [13]. Fuzzy set theory is different and the relation between the elements and the set is used to measure the degree of relationship between them. An element can belong to more than one set at the same time. It is precisely because of this that the fuzzy set theory has a wider range of applications and adaptability. After a long period of development, the theory of fuzzy sets has formed a complete theoretical system [14].

The definition domain of fuzzy set theory is similar to the classical set theory. U is set the theoretical field of fuzzy set theory, which represents the range of the research object. The membership function g(u) is used to characterize whether the element u in the fuzzy set is membership domain U. g(u) maps u to a real number on a closed field [0,1]. Formula is expressed as follows:

$$\begin{aligned} g:U \to \left[ {0,1} \right] \hfill \\ g \to g\left( u \right) \hfill \\ \end{aligned}$$
(1)

In the formula, the A of the function g(u) on U is the fuzzy set of U. The higher the value of membership, the higher the degree of correlation between elements and sets are. When the value is 1, the element is dependent on the set completely. When the value is 0, the element is not related to the set.

In order to make the relationship between the elements in the domain and the membership more image, Professor Chad gives different representations according to different fields. When there are finite discrete elements in the universe, the fuzzy set of the Analects is expressed in the following formula:

$$A = \sum\limits_{i = 1}^{n} {g\left( {u_{i} } \right)} /u_{i}$$
(2)

In the formula, g(ui)/ui represents the membership of the element u in the domain, and ∑ represents the content of a fuzzy set.

When the field is continuous, the fuzzy set is represented by the following formula:

$$A = \int\limits_{u \in U} {g\left( u \right)} /u$$
(3)

When a fuzzy set is used to describe a fuzzy thing, the definition of membership function is the key. At present, there are three definitions, and they are normal, small and large. The formula is as follows:

Normal type:

$$g\left( u \right) = \exp \left[ { - \left( {\frac{u - a}{b}} \right)^{2} } \right]$$
(4)

Small size:

$$g\left( u \right) = \left\{ \begin{aligned} \left[ {1 + a\left( {u - c} \right)^{b} } \right]^{ - 1} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} u \ge c \hfill \\ {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} 1{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} u < c \hfill \\ \end{aligned} \right.$$
(5)

Partial large:

$$g\left( u \right) = \left\{ \begin{gathered} \left[ {1 + a\left( {u - c} \right)^{b} } \right]^{{ - 1}} \quad u < c \hfill \\ 1\quad u \ge c \hfill \\ \end{gathered} \right.$$
(6)

The operation of fuzzy sets should conform to the following rules. Idempotent rate: \(A \cup A = A,\quad A \cup A = A\). Distribution rate: \((A \cup B) \cap C = \left( {A \cap C} \right) \cup \left( {B \cap C} \right),\quad (A \cap B) \cup C = \left( {A \cup C} \right) \cap \left( {B \cup C} \right)\). Exchange rate: \(A \cup B = B \cup A,\quad A \cap B = B \cap A\). Absorptivity: \(\left( {A \cup B} \right) \cap A = A,\quad \left( {A \cap B} \right) \cup A = A\). Recovery rate: \(\left( {A^{C} } \right)^{C} = A\). Dual rate: \(\left( {A \cup B} \right)^{C} = A^{C} \cup B^{C} ,\quad \left( {A \cap B} \right)^{C} = A^{C} \cap B^{C} ,\). Rate of combination: \(\left( {A \cup B} \right) \cup C = A \cup \left( {B \cup C} \right),\quad \left( {A \cap B} \right) \cap C = A \cap \left( {B \cap C} \right)\). Two grade rate: \(A \cup \phi = A,\quad A \cap \phi = \phi ,\quad A \cup U = U,\quad A \cap U = U\).

The method of fuzzy cluster analysis based on fuzzy clustering theory is called fuzzy clustering analysis [15]. The fuzzy elements can’t be classified into a certain category. The traditional clustering analysis method is not suitable for the situation of fuzzy boundary. The fuzzy clustering can get the membership degree of each component, and it can reflect the nature of the elements of the elements and establish the uncertain relationship between elements and sets [16]. Fuzzy clustering analysis is more in line with the real situation of the objective world, so it becomes the mainstream method of cluster analysis. Related research and results are also more. At present, there are four types of methods for fuzzy analysis. They are hierarchical clustering analysis method, clustering analysis method based on equivalence relation, clustering analysis method based on graph theory and clustering analysis method based on objective function. In the four basic cluster analysis methods, the calculation of the first three clustering methods is too large, and it is not suitable for practical applications which require real-time or data. Finally, a clustering analysis method based on objective function is introduced to transform the problem into a constrained nonlinear programming problem. And then the optimal objective function is used to optimize the clustering elements. The iterative method is used to search the optimal solution of the objective function in the process of practical application. Therefore, with the rapid development of information technology, the fuzzy clustering analysis method based on objective function has been widely concerned in the practical application.

2.2 Image segmentation research

At present, there are many researches on image segmentation algorithms. The image segmentation algorithm has reached thousands of species after decades of development. There are hundreds of papers on image segmentation algorithm every year. However, there is no objective and unified image algorithm up to now [17]. There are many reasons for this. According to the definition of the relevant research on the image, the set of the whole picture isR, and the image is divided into non-empty set R1, R2,…, Rn. Then the nonempty set must satisfy the following conditions: (1) \(\cup_{i = 1}^{n} = R_{i} = R\). (2) To all i and j, i ≠ j, \(R_{i} \cap R_{j} = \phi\). (3) To all i = 1, 2,···, n, P(Ri) = TRUE. (4) When i ≠ j, \(P\left( {R_{i} \cup R_{j} } \right) = FLASH\). (5) To i = 1, 2,···, n, it is China unicom. So far, none of these five conditions can be satisfied simultaneously, and any segmentation algorithm has its limitations. On the other hand, the characteristics of different areas of the image are also very different. No image segmentation algorithm can adapt to a variety of conditions. Therefore, the development of image segmentation algorithm is still a huge challenge [18].

Although there are many kinds of image segmentation algorithms, these algorithms will consider the following problems. The first is whether the image segmentation area is complete, and the second is how to avoid excessive image segmentation. Thirdly, it is the running time of image segmentation algorithm. That is the performance of image segmentation algorithm. Finally, it is how to realize the non-threshold of image segmentation algorithm. According to the problem of image segmentation algorithm, the current image segmentation algorithms are divided into several categories according to the different emphasis. These kinds of image segmentation algorithms are based on threshold segmentation algorithm, image segmentation algorithm based on region segmentation, image segmentation algorithm based on edge detection and image segmentation algorithm based on Clustering and some image segmentation algorithms based on specific theory. These several kinds of image segmentation algorithm are common algorithms. And so many image segmentation algorithms also led to the image segmentation algorithm evaluation criteria are different. Up to now, there are still no objective and unified algorithm evaluation criteria to evaluate different image segmentation algorithms [19]. At present, the evaluation criteria of image segmentation algorithms have high evaluation value only in a specific environment. It can be roughly divided into two categories. One is the subjective evaluation. That is to say, through the eyes of the image segmentation algorithm to evaluate the results, the method is simple and easy to operate. The other is objective evaluation. This method is evaluated by setting the objective criteria, such as noise immunity. This method is easy to carry out quantitative analysis, but the operation is complex and easily influenced by objective conditions. It can be seen from research on the current image segmentation algorithm that image segmentation algorithm research still needs a long way.

3 Image segmentation algorithm based on Improved Fuzzy Clustering

3.1 Fuzzy C means clustering

Clustering analysis is an important and used widely method in the field of image segmentation. Whether the image is color or gray can use clustering analysis for image segmentation. Fuzzy C means clustering image segmentation algorithm is the most widely used image segmentation algorithm in clustering analysis (Fuzzy C-means, FCM). Fuzzy C mean clustering image segmentation algorithm was first proposed by Dunn in 1974, and Bezdek has supplemented and improved it. The improved fuzzy C mean clustering algorithm is an image segmentation algorithm based on the optimization objective function. As an unsupervised fuzzy clustering algorithm, it is widely used in medical diagnosis, target recognition and image segmentation.

The FCM algorithm was used to solve the extreme value of the objective function in the double layer iteration in this paper. The inner iteration was used to correct the clustering center and the membership matrix in the process of FCM operation. Outer iteration was used to determine whether the iteration to meet the convergence requirements mainly. After the completion of the iteration, the membership degree of the target elements can be obtained in the membership matrix, and then the membership of the elements can be determined. The algorithm is as follows, and the FCM algorithm defines the following form of the objective function.

$$J = \sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{c} {\mu_{ij}^{m} \left\| {X_{i} - V_{j} } \right\|}^{2} }$$
(7)

The objective function must satisfy the conditions:

$$\sum\limits_{j = 1}^{c} {\mu_{ij} = 1,\forall i \in \left\{ {1,2, \cdots ,n} \right\}}$$
(8)

In formula (7) and (8), μij represents the membership of the target element i on class j. Xi represents the feature vector of element i, and Vj represents the clustering center of the class j. m is the weight coefficient. In general, m = 2.

Then, the iterative formula is deduced by Lagrange’s conditional extreme value method.

$$F = \sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{c} {\mu_{ij}^{m} \left\| {X_{i} - V_{j} } \right\|}^{2} } + \lambda \left( {1 - \sum\limits_{j = 1}^{c} {\mu_{ij} } } \right) = \lambda \left( {1 - \sum\limits_{j = 1}^{c} {\mu_{ij} } } \right) + \sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{c} {\mu_{ij}^{m} d_{ij}^{2} }^{{}} }$$
(9)

If we want to solve the extreme value of the function J, the first order necessary conditions must satisfy the following conditions:

$$\frac{\partial F}{\partial \lambda } = 1 - \sum\limits_{j = 1}^{c} {\mu_{ij} = 0}$$
(10)
$$\frac{\partial F}{{\partial \mu_{ij} }} = m\left( {\mu_{ij} } \right)^{m - 1} \left( {d_{ij} } \right)^{2} - \lambda = 0$$
(11)

According to the formula (11):

$$\mu_{ij} = \left[ {\frac{\lambda }{{m\left( {d_{ij} } \right)^{2} }}} \right]^{{\frac{1}{m - 1}}}$$
(12)

The formula (12) is added to the formula (10):

$$\sum\limits_{j = 1}^{c} {\mu_{ij} = \left( {\frac{\lambda }{m}} \right)^{{\frac{1}{m - 1}}} \left\{ {\sum\limits_{j = 1}^{c} {\left[ {\frac{1}{{d_{ij}^{2} }}} \right]^{{\frac{1}{m - 1}}} } } \right\} = 1}$$
(13)

Thus get:

$$\left( {\frac{\lambda }{m}} \right)^{{\frac{1}{m - 1}}} = \frac{1}{{\sum\limits_{j = 1}^{c} {\left[ {\frac{1}{{d_{ij}^{2} }}} \right]}^{{\frac{1}{m - 1}}} }}$$
(14)

The formula (14) is added to the formula (12):

$$\mu_{ij} = \frac{1}{{\sum\limits_{i = 1}^{c} {\left[ {\frac{{d_{ij}^{2} }}{{d_{ij}^{2} }}} \right]^{{\frac{1}{m - 1}}} } }}$$
(15)

Since dij may be equal to 0, it is necessary to discuss separately. Set \(\forall i \in \left\{ {1,2, \cdots ,n} \right\}\) to Ti, then \(T_{i} = \left\{ {j\left| {1 \le j \le c,d_{ij} = 0} \right.} \right\}\). The complement is \(T_{i}^{c}\), then \(T_{i}^{c} = \left\{ {1,2, \cdots ,c} \right\} - T_{i}\). In order to minimize the objective function J, the μij must satisfy the following conditions:

$$\left\{ \begin{gathered} \mu _{{ij}} = \frac{1}{{\sum\limits_{{k = 1}}^{c} {\left( {\frac{{d_{{ij}} }}{{d_{{ik}} }}} \right)^{{\frac{2}{{m - 1}}}} } }}\quad T_{i} = \phi \hfill \\ \mu _{{ij}} = 0,\forall j \in T_{i}^{c} ,\sum\limits_{{j \in T_{i} }}^{c} {\mu _{{ij}} = 1\quad T_{i} \ne \phi } \hfill \\ \end{gathered} \right.$$
(16)

Then the same method can be used to obtain the iterative formula of the cluster center Vj:

$$V_{j} \frac{{\sum\limits_{i = 1}^{n} {\mu_{ij}^{m} x_{i} } }}{{\sum\limits_{i = 1}^{n} {\mu_{ij}^{m} } }}$$
(17)

The FCM algorithm starts from the random initial value by formulas (16) and (17) approaching the extreme value.

After determining the algorithm flow of FCM, the rules are as follows. First, set the number of clusters c and parameters m. Secondly, cluster center Vj is initialized. And then iterate in accordance with the FCM algorithm process until the objective function satisfies \(\left| {J_{t} - J_{t + 1} } \right| < \varepsilon\). ɛ is a very positive number of admissible errors. When the result is satisfied, the result of clustering is proved to be good. Then, the new membership matrix is calculated according to the formula (16). Based on the current membership matrix, a new cluster center is calculated by formula (17). When the FCM algorithm converges, we can get all kinds of clustering centers and the membership values of the target elements for all kinds of objects.

3.2 Sub-graph decomposition and region merging

According to the FCM algorithm, we must first determine the number of clusters in the calculation, and then it can be calculated on this basis. However, the number of clusters is difficult to determine in the actual operation process, especially in the automation system is more difficult to achieve. Therefore, it is necessary to improve the FCM algorithm [20]. This paper mainly aimed at the improvement of the number of clusters, and the purpose was to make the FCM algorithm has the ability to automatically determine the number of clusters. Firstly, the image was decomposed by using the characteristics of the two dimensional space. The decomposition method was a four fork tree structure until the decomposition of the image histogram could meet the conditions after the decomposition. Then the clustering number of two FCM segmentations was carried out to achieve the initial image segmentation. Finally, according to the similarity of the adjacent region of the original image, the region was merged to complete the final segmentation of the image. This procedure avoided the direct determination of the number of clusters.

  1. (1)

    Sub graph decomposition and the initial segmentation. The gray histogram of the image contains a wealth of information. Among them, the wave crest and trough position provides the most valuable information. Threshold based image segmentation method is based on the peak and trough position to determine the threshold. However, there are two problems in the analysis of image gray histogram. One is a false peak and trough. If the gray histogram is smoothed, it is possible to filter out the true peaks and troughs. The other is that if the histogram is uniform, it is difficult to provide valuable information. So we need a standard to measure the uniformity of gray histogram (Fig. 1).

    Fig. 1
    figure 1

    Image segmentation algorithm

    In order to solve the above two problems, this paper put forward two indexes to measure the gray histogram of the image. An index is image gray histogram information entropy. According to the theory of Shannon’s information theory, the entropy of image gray histogram is defined as follows:

    $$\inf = - \sum\limits_{i = 0}^{255} {p_{i} \times \log_{2} p_{i} }$$
    (18)

    In formula (18), pi = ni/ni, where ni represents the number of pixels of the first level gray value, and n is the total image of the image. The range of the pixel value of the digital image is generally 0 ~ 255, so the range if i is 0 ~ 255. This parameter can be used to measure the concentration of image pixels. The higher the value of information entropy is, the higher the degree of concentration of the image pixels is, and the more scattered it is. Experiments with two pixels of the same image can prove this conclusion, which can be seen in Fig. 2.

    Fig. 2
    figure 2

    Gray histogram of two pixels with the same pixel

    The entropy of the histogram of A is 6.9965 and the entropy of the histogram of B is 6.3989. As can be seen from Fig. 2, pixel gray value figure A is about 100. But the pixels are scattered with a peak of around 3000. Pixel gray values of figure B are mostly concentrated in about 200 with the peak value of about 10,000.

    Another indicator is the number of image gray histogram clustering peaks. In view of the gray histogram of image, it can’t deal with the false peaks and troughs. In this paper, a new method is used to calculate the number of wave crests. The specific steps of the method are as follows:

    • First step: the 256 level histogram of image is conducted to get the array hist. hist contains a total of 256 elements, and each element represents the number of pixels in the gray level of the image.

    • The second step is to conclude normalized histogram:

      $$normal\_hist = {{hist} \mathord{\left/ {\vphantom {{hist} {\hbox{max} \left( {hist} \right)}}} \right. \kern-0pt} {\hbox{max} \left( {hist} \right)}}$$
      (19)
    • Third step: all elements of the normalized histogram take the clustering number of 2 for FCM clustering. The initial clustering center is {0} or {1}, and tolerance error is 0.001.

    • Fourth step: the elements are classified according to the principle of maximum membership degree to obtain cluster_hist. The number of peaks of cluster_hist is calculated as peak_n, and is the final value of the wave number.

    In order to prove the validity of the method, the results were verified by an example. The results are shown in Fig. 3.

    Fig. 3
    figure 3

    Original histogram of image and histogram after clustering

    It can be seen from Fig. 3 that the original histogram has no small fluctuations after clustering, and has good regularity, and the number of peaks is also easy to statistics, which shows that the method is effective.

  2. (2)

    Regional consolidation. After image segmentation, region merging should be carried out according to the similarity of adjacent regions. The criterion for judging the similarity of image gray histogram is the distance between the images, and the definition of PAP distance is as follows:

    $$\rho \left( {r,q} \right) = \sum\limits_{i = 0}^{255} {p_{ir} \times p_{iq} } {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} ,{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} p_{ir} = {{n_{ir} } \mathord{\left/ {\vphantom {{n_{ir} } {n_{r} }}} \right. \kern-0pt} {n_{r} }},p_{iq} = {{n_{iq} } \mathord{\left/ {\vphantom {{n_{iq} } {n_{q} }}} \right. \kern-0pt} {n_{q} }}$$
    (20)

    In the formula, nr represents the number of pixels in the region r. nir is the number of pixels in the region r of i gray level. The meanings of nq and niq ibid are as above. The greater the distance, the higher the similarity of the two histograms is.

3.3 Image segmentation algorithm based on improved fuzzy clustering

There are two main steps in image segmentation algorithm based on improved fuzzy clustering. The first step is mainly to the image in accordance with the four fork tree structure graph decomposition. Secondly, the fuzzy C means clustering image segmentation based on the fixed number of clusters is performed. The image is divided into 2 × 2 before image decomposition. Then, the gray histogram of each sub graph is analyzed. If the sub-graph satisfies one of the following two conditions:

  1. (1)

    The information entropy inf of sub-graph histogram is smaller than the original histogram inf, and the wave number peak_n ≤ 2.

  2. (2)

    The area of the sub-graph is less than the threshold value, and the value of alpha can be adjusted according to the histogram of the minimum area. Here the uniform value is 1024.

It can be used to cluster the number of fuzzy C mean clustering of 2, and then the principle of maximum membership degree is classified. If the histogram of the sub-graph does not satisfy one of the above two conditions, the sub-graph is further decomposed until the condition is satisfied. The allowable error in this process is 0.001, and the initial cluster centers are {0, 255}. Detailed flow is shown in chart 4 (Fig. 4).

Fig. 4
figure 4

Segmentation process based on sub-graph decomposition

The second step is regional consolidation, and detailed flow is shown in Fig. 5. Because of the algorithm segmentation on the original image after decomposition, it is inevitable that there are many sub graphs in some areas. Therefore, it is necessary to merge the segmented regions. The merger process must meet one of the following two conditions:

Fig. 5
figure 5

Regional consolidation process

  • First condition: the distance between adjacent regions of gray histogram is ρ(r, q) > ββ = 0.22. Detailed flow is shown in chart 4.

  • Second condition: the segmented area is < γγ = 50, and the region is similar to the gray value mean.

In order to verify the effectiveness of the fuzzy C mean clustering image segmentation algorithm based on sub-graph decomposition and region merging, the above algorithms were also analyzed.

4 Experimental analysis

4.1 Algorithm analysis

This paper used the classic rice image as the experimental image in the process of image segmentation algorithm. Some related data needed to be given in the course of the experiment. The fuzzy clustering number was 2 in the image segmentation algorithm. The initial cluster center was {0, 255} and the allowable error e = 0.001, and the weight function was m = 2. Figure 6 is the rice diagram in the experimental process of the decomposition of the 4 tree sub-graph.

Fig. 6
figure 6

Rice diagram of the fork tree sub-graph decomposition of the structure of Fig. 4

Figure 6 shows the root node of the original 1. Nodes 2, 3, 4, 5 represented the original decomposition into a total of 2 × 4 sub- graph. The node 6 ~ 14 was a sub-graph of 4, and the 5 was again decomposed into a total of four pairs of sub-graphs of × 2. And so on, node 7 and 9 were the same true. Table 1 shows the information entropy, the peak value and the area statistics of each section.

Table 1 The data of each node of the four fork tree

We can see from Table 1 that the two images in the four picture only nodes, 3, represented by the two maps to meet the conditions in the decomposition of the original image. Their information entropy was less than the original number and the wave number was ≤ 2. The node 4, 5 did not meet the conditions need further decomposition. Nodes 7 and 9 also did not meet the conditions for further decomposition in the second round of decomposition. The condition could satisfy the conditions of the node number of FCM segmentation. This result was consistent with the decomposition of the four fork tree sub-graph in Fig. 6.

The number of the first round was 135 and the number of regions in the second round was about 86, and the third round was 74 in the decomposition process of the whole rice graph. The results show that the region merging can effectively avoid over segmentation and make the segmentation result more accurate.

4.2 Objective analysis

The objective index mainly included the running time and the segmentation accuracy of the image segmentation algorithm in the objective index analysis. These two indicators were mainly compared with the artificial clustering algorithm (FCM algorithm), F_stl algorithm and SCG algorithm. Run time used Rice diagram, Bird diagram and Lena diagram for verification, and the correct rate of segmentation used Rice diagram verification. Set the algorithm to divide the actual number of pixels, the rice grain was n1, the total image of the prime number was m1, the actual number of pixels in the background was n2, the total image of the background was m2, then there were the following definitions, the accuracy of the element segmentation was n1/m1; the background segmentation accuracy rate was n2/m2; the overall segmentation accuracy rate was (n1 + n2)/(m1 + m2).

It can be seen from Table 2 that the longest time was the SCG algorithm, followed by the F_stl algorithm, once again to determine the number of artificial clustering algorithm, and the shortest algorithm was the three image segmentation algorithm in this article. The data statistics show that the algorithm in this paper had a great advantage in computing time.

Table 2 Comparison of running time

Table 3 shows the statistical accuracy of the four algorithms for the correct segmentation of Rice graphs. From the elements, the background as well as the overall correct segmentation rate from high to low, they were the algorithm, the F_stl algorithm, the algorithm to determine the number of clusters, SCG algorithm. The experimental results show that the proposed algorithm was more accurate and more advantageous than other algorithms.

Table 3 Comparison of segmentation accuracy

5 Conclusions

Image segmentation is one of the main means to extract valuable information from images, and it also has double value in application and academic fields. Relevant research results are also very much. But so far, there is no objective and uniform image segmentation algorithm. In view of this, an improved fuzzy clustering algorithm for image segmentation was proposed in this paper. In this paper, the method of sub graph decomposition and region merging is used to improve the clustering method of fuzzy C value clustering algorithm. Then the improved image segmentation algorithm is verified by this method, and the verification is mainly from two aspects: algorithm and objective index. The analysis results showed that the algorithm was effective, and the image segmentation was more accurate when the image was over segmented. The results show that the proposed algorithm had more advantages than the improved fuzzy clustering algorithm, F_stl algorithm and SCG algorithm in the running time and the accuracy of image segmentation. The results showed that the algorithm was feasible. Of course, there were also some shortcomings, such as the determination of the threshold of the distance to improve the space, which can improve the accuracy of the operation.