Deep Neural Networks Application in Models with Complex Technological Objects

Meshalkin, Valeriy; Puchkov, Andrey; Dli, Maksim; Lobaneva, Yekaterina

doi:10.1007/978-3-030-32579-4_23

Valeriy Meshalkin⁵,
Andrey Puchkov⁶,
Maksim Dli⁶ &
…
Yekaterina Lobaneva⁶

Part of the book series: Studies in Systems, Decision and Control ((SSDC,volume 259))

881 Accesses
6 Citations

Abstract

A method for creation of computer models in complex multiply connected technological objects based on the application of machine learning methods is described. For technological information processing hierarchical neural network structure integrated into cyber-physical systems of control is developed. It allows to monitor an object condition and forecast its development trends. A description for the algorithm and program, which performs the proposed method of model building, is given.

Access provided by Autonomous University of Puebla. Download chapter PDF

Identification of a Technological Process with Application of Neural Network Modeling

Simulation Modeling Using Neural Networks to Control Complex Technical Systems

Application of Artificial Neural Networks in the Process of Catalytic Cracking

Article 01 July 2018

Keywords

1 Problem Statement

The number of information channels in automated process control system (APCS) increases due to the raise of the complexity in the technological process under control, which requires the application of a new paradigm when creating complex control systems, such as a cyber-physical system (CPS) [1]. This system is characterized by the use of multidisciplinary approaches in its operation as well as Big Data methods caused by the increase in APCS complexity [2,3,4].

Therefore, a distinctive feature for a number of manufactures is their long service life which leads to the use of outdated technological solutions. Their change is impossible without a general modernization of production. Modernization in its turn involves significant financial expenditure because of downtime as well.

An alternative direction of modernization is reengineering of the information support for technological process by improving the systems for collecting and processing technological information, their duration and the volume distribution in the entire physical process, which is typical for CPS, the use of modern state diagnostics and control algorithms. This procedure has less financial and time expenditures compared with equipment modernization. Thus, the direction to improve control and measuring infrastructure of APCS, based on modern achievements of information technologies, presents and actual research problem which solution can bring tangible advantages for enterprises in a short period.

2 Background and Methods

The proposed methods are based on machine learning [5]. Deep learning using convolutional neural networks (CNN) is among its methods which fined wide application in solving real problems. High results, shown by CNN when recognizing images, lead to a great spectrum of their applications in the subject fields where there is an opportunity to reformulate the initial problem for the task of images recognition. These fields include medicine [6, 7], social engineering [8], text processing [9], gesture recognition [10], automatic identification of vehicles in coating production line based on computer vision [11], vehicles identification in a tunnel surveillance control system [12], cracks recognition in concrete [13] etc. The above mentioned list shows, that CNN can be a universal tool for problem solution of data deep analysis too.

The proposed approach for developing a model of complex technological processes is based on the implementation of CNN ensemble connected to the analysis of data in different points of technological process with subsequent processing of the obtained results of the neural classification in the analytical block. In addition to spatial partitioning of industrial zones the processing is discretized according time, it helps to monitor the processes dynamics.

There is a great diversity of buildings and CNN ensembles work interpretation [14], they are widely used in medical applications, in systems of biometric data control, people activity [15,16,17,18], but there is practically no works on generalized modeling and diagnostics of technological processes.

Control technical areas, where CNN is supposed to be used, can contain data of various format data sources. In addition to places with evident presence of video streams, for example[19, in Russian], other forms of signals can be also processed, including in-plant noise control, using various methods of sound waves conversion [20].

The enlarged structure of the proposed model of a complex technological process based on the application of deep neural networks is shown in Fig. 1.

The general algorithmic structure of the proposed model for technological information processing in CPS is as follows. It is supposed, that control and measuring information from technological zones (zone 1, …, d) is presented by multichannel sets of data for each zone. Through the commuter this information is fed to the output of local CNN ensemble which, for each zone, is formed taking into account the form of data representation in information channels and requirements of their further analysis which is carried out in the group of zone analyzers (analyzer 1, …, d). From the zone analyzers output information goes to neural network output (CNN_out) which carries out the estimation and forecast for the state of the entire technological process.

The algorithm for processing information is as follows. Denote the interval of discretization for information channel k_zd in technological zone d by Δt^zd_i = t^zd_i–t^zd_i-1. At t^zd_i and t^zd_i–1 moments channel commutator k_zd is closed and the image of the technological parameter under control is supplied to CNN input. The technological parameter under control is denoted by P^zd. The interval value of discretization Δt^zd_i is calculated by a particular analyzer with regard to Kotelnikov theorem (it is also known as Nyquist–Shannon theorem) [21].

CNN of information channel k_zd recognizes an image on the introduced time step. This procedure consists in forming output vector $V_{i}^{{k_{zd} }}$ with dimension corresponding the number of classes n_CL_kzd for each channel:

$$V_{i}^{{k_{zd} }} = \left( {V_{i,1}^{{k_{zd} }} ,V_{i,2}^{{k_{zd} }} , \ldots ,V_{{i,n\_CL\_kzd}}^{{k_{zd} }} } \right)^{\text{T}}$$

(1)

It is supposed, that the architecture of the output CNN layer is formed in the way the elements of vector (1) contain values from 0 to 1. It characterizes the degree of neural network confidence in parameter P^zd controlled according to the image belonging to a particular class at moment t^zd_i.

CNN application for information channel k_zd is carried out through the whole technological process, but all this time can be divided into fragments with Ti. duration given as initial data for the algorithm operation. Fragments with Ti. duration are identified by the requirements for the periodicity of information flow into APCS. Thus, the matrix of results classification is processed in particular counting analyzers i_T and becomes available to the moment Ti:

$$MV_{{i_{T} }}^{{k_{zd} }} = \left( {\begin{array}{*{20}c} {V_{1,1}^{{k_{zd} }} ,\quad V_{1,2}^{{k_{zd} }} ,\quad \ldots ,\quad V_{1,n\_CL\_kzd}^{{k_{zd} }} } \\ \ldots \\ {V_{{i_{T} ,1}}^{{k_{zd} }} ,\quad V_{{i_{T} ,2}}^{{k_{zd} }} ,\quad \ldots ,\quad V_{{i_{T} ,n\_CL\_kzd}}^{{k_{zd} }} } \\ \end{array} } \right)^{\text{T}}$$

(2)

The obtained set (2) contains the data about CNN confidence dynamics in classification results and can be used when making control decisions in APCS. For this purpose the relation matrix of elements increments (2) to the given discretization Δt^zd_i interval is:

$$DV_{{i_{T} }}^{{k_{zd} }} = \left( {\begin{array}{*{20}c} {\tfrac{{V_{2,1}^{{k_{zd} }} - V_{1,1}^{{k_{zd} }} }}{{\Delta t_{i}^{zd} }}} & \ldots & {\tfrac{{V_{{i_{T} ,1}}^{{k_{zd} }} - V_{Ti - 1,1}^{{k_{zd} }} }}{{\Delta t_{i}^{zd} }}} \\ {\tfrac{{V_{2,2}^{{k_{zd} }} - V_{1,2}^{{k_{zd} }} }}{{\Delta t_{i}^{zd} }}} & \ldots & {\tfrac{{V_{{i_{T} ,2}}^{{k_{zd} }} - V_{Ti - 1,2}^{{k_{zd} }} }}{{\Delta t_{i}^{zd} }}} \\ \ldots & \ldots & \ldots \\ {\tfrac{{V_{2,n\_CL\_kzd}^{{k_{zd} }} - V_{{1,_{n\_CL\_kzd} }}^{{k_{zd} }} }}{{\Delta t_{i}^{zd} }}} & \ldots & {\tfrac{{V_{{i_{T} ,n\_CL\_kzd}}^{{k_{zd} }} - V_{Ti - 1,n\_CL\_kzd}^{{k_{zd} }} }}{{\Delta t_{i}^{zd} }}} \\ \end{array} } \right)$$

(3)

For the purpose to simplify the notation denote matrix (3) elements as dv_i,j. Their sense load can be interpreted as analogue of derivatives for continuous functions because they reflect the change of neural confidence in image belonging to a particular class.

The rate of change, in this case it is element dv_i,j value, can be used for the forecast of the technological process development. For this purpose matrixes of type (3) are calculated for all measuring channels and all technological zones.

The formation of an output tensor for CNN_out, based on the fragment in i_T timing, is performed by the combination of matrix (3) for all channels and zones

$$TR_{{i_{T} }} = \left( {DV_{{i_{T} }}^{{k_{z1} }} \quad DV_{{i_{T} }}^{{k_{zd} }} \quad \ldots \quad DV_{{i_{T} }}^{{k_{z2} }} } \right)$$

(4)

It should be noted, that the number of classes for various information channels and various technological zones can be different (see Fig. 1), therefore, matrix (3) dimension in a general case is different. The number of lines in (3) for all zones is equal as it is determined by the number of counting for discrete time i = 1, 2, …, i_T. The number of columns for matrix TR_iт is determined by the number of information channels in each zone, the number of classes identified for each channel and is equal to (kz1 · n_CL_kz1) + (kz2 · n_CL_kz2) + ··· + (kzd · n_CL_kzd).

When input tensor TR_iт is formed, the multi-stage preprocessing of data ends to ensure the work of output neural network (preprocessing for local CNN was not described as it is not of special interest in this work). Then, it remains to determine the hypotheses space for CNN_out as the application of deep learning deletes the need for construction features to replace complex, contradictory and heavy conveyors with simple learning models which area usually built with the use of several tensor operations [4].

Tensor TR_iт is formed on the basis of data on the transformation dynamics of class membership for information channels parameters reflected in matrix (3). Thus, when forming the space of hypotheses good results to forecast the state of a technological process, values of any parameters for a finished product can be expected with the help CNN_out.

3 Application and Results

To test the proposed model for processing information in CPS of a complex technological process control, a simulation experiment to recognize aluminum alloy ingots images was carried out to determine their aggregate state. The aggregate state is estimated according to the image of a surface observing through a viewing window fitted on the furnace door [19, in Russian]. The received image is shown in Fig. 2 in its left part. For the application under study the general scheme is presented in Fig. 1.

The image for a working surface of aluminum ingots forming by a video camera, has a resolution of 640 × 480 (Fig. 3a). If this resolution is left and used as an example at input of CNN, then the process of learning requires more time considering that the numbers of such examples will be several thousand. Therefore, the dimension of an initial image is programmatically reduced to 90 × 90. In addition, it is taken into account, that the melting zone is lighted by an electric arc, so the image brightness can fluctuate when the supply voltage changes in a circuit. To reduce the influence of this factor brightness was normalized (Fig. 3b).

The existing methods of visual identification for aggregate state of a substance (method of triangulation, burst mode, area method) analyze the changes of the surface image area for a melting metal in a three-dimensional space, but not the area of the surface for remelting ingots.

The proposed method for control of an aggregate state based on the deep neural networks also analyzes the three-dimensional surface of ingots. It is ensured by the fact, that the analysis of the melting process in images matrix is characterized by the changes of rises heights in histogram (Fig. 3c) which is taken into account by the neural network when forming the answer. One example from the learning set is a tensor of the second order, the totality of such examples forms the tensor of the third order which is fed to the output of CNN_1,1. Temporal discretization interval of a video sequence coming from the video camera focused on the bars surfaces is taken equal to one second. Free Video to JPG Converter was used to take shots from the video. To enlarge the number of learning examples the augmentation procedure was applied. During this procedure shifts, zooming, rotations, mirror reflection were implemented for the initial image.

The time for melting of an aluminum bars lot is approximately 300 s, but it was divided into intervals corresponding different classes identified by CNN_1,1 (see Table 1).

Table 1 Melting time distribution into classes

Full size table

The software model for the technological process (aluminum bars melting) under study implementing the recognition of aggregate state transition was performed in Python 3.6 language. IDE Spyder from Anaconda (version for Linux) was chosen as the development environment. Convolutional neural networks were developed using specialized Keras library, which is a superstructure above the tensor computation framework TensorFlow [22, 23].

CNN_1,1 containing seven alternating convolutional layers and subsampling and one output fully connected layer with four outputs (according to the numbers of recognizable classes), was implemented in the software. The learning sample has a size of 2000 examples (400 examples are from the testing sample).

The analysis of the results for the software work, presented in the upper graph of Fig. 4, shows that within one run at CNN_1,1 outputs the dynamics of classification during melting process is visible.

The final results of classification can be obtained if the choice of CNN_1,1 output according to the majority principle is made at every second counting. At multiple run the number of classes, as it is shown in the graph at the bottom of Fig. 4, are correctly recognized by CNN_1,1 neural network, unstable recognition is occurred only in the zone of transition from one class to the other one. This circumstance can be explained by the complexity of detection for difference in the surfaces on the junction of classes, in addition the relative length of unstable time intervals is not long.

To forecast the dynamics of the process development matrix (3) is calculated, the number of which forms the initial data for CNN_out working. The results of the obtaining images for two melting’s are presented in Fig. 5. Figure 5 shows that for different processes of aluminum bars melting the image is different, this is the evidence of possibility to use the methods of texture recognition in CNN_out block to forecast the process development. In this study the experiment to calculate the dynamics was not carried out, but it is planned for further work.

CNN learning was performed on GeForce GTX 1060 video card installed on Asus FX502VM notebook with CPU IntelCore i7-7700HQ. The process control was exercised with the help of TensorBoard which allows to visualize the current accuracy and learning error.

4 Conclusion

As a result of the conducted study for the possibility of using deep neural networks as a part of a cyber-physical system of complex technological objects control the following results were obtained:

1.
The algorithmic structure for the model of processing information coming from different technological zones of the integrated multi-stage technological process based on the convolutional neural networks ensemble implementation is developed.
2.
The results of model experiment for recognition of aluminum aggregate state based on the proposed algorithm are given. The results show that the algorithm, based on the convolutional neural networks, solves the task for classification of metal aggregate state well, excepting some zones in the transition regions from one class to the other one.
3.
The method to follow up the dynamics in technological process based on the formation of matrix values changes at the output of the convolutional network, performing the classification for technological process conditions and its further visualization for output convolution neural network recognition is proposed.

References

Wolf, W.: Cyber-physical systems. Computer. 9(3), 88–89 (2009)
Article Google Scholar
Lee, J., et al.: Recent advances and trends in predictive manufacturing systems in big data environment. Manuf. Lett. 1(1), 38–41 (2013)
Article Google Scholar
Namiot, D.: On big data stream processing. Int. Journal of Open Info. Technol. 3(8), 48–51
Google Scholar
Sleep, S., Gooner, R., Hulland, J.: The big data hierarchy: a multi-stage perspective on implementing big data. In: Obal, M., Krey, N., Bushardt, C. (eds.) Let’s Get Engaged! Crossing the Threshold of Marketing’s Engagement Era. Developments in Marketing Science: Proceedings of the Academy of Marketing Science. Springer, Cham (2016)
Chapter Google Scholar
Scholle, F.: Deep learning in Python. SPb, Peter, 400 p (2018)
Google Scholar
Rohit, S., Chakravarthy, S.: BMC Neurosci 12(Suppl 1), 35 (2011). https://doi.org/10.1186/1471-2202-12-S1-P35
Article Google Scholar
Khryashchev, V., Lebedev, A., Stepanova, O., Srednyakova, A.: Using convolutional neural networks in the problem of cell nuclei segmentation on histological images. In: Dolinina, O., Brovko, A., Pechenkin, V., Lvov, A., Zhmud, V., Kreinovich, V. (eds.) Recent Research in Control Engineering and Decision Making. ICIT 2019. Studies in Systems, Decision and Control, vol. 199. Springer, Cham (2019)
Chapter Google Scholar
Severyn, A., Moschitti, A.: Twitter sentiment analysis with deep convolutional neural networks. In: ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 959–962. ACM Press, Santiago (2015)
Google Scholar
Wang, P., Xu, J., Xu, B., Liu, C., Zhang, H., Wang, F.: Semantic clustering and convolutional neural network for short text categorization. In: 53rd Annual Meeting of the Association for Computational Linguistics and 7th International Joint Conference on Natural Language Processing, vol. 2, pp. 352–357. ACL Press, Beijing (2015)
Google Scholar
Ahlawat, S., Batra, V., Banerjee, S., Saha, J., Garg, A.K.: Hand gesture recognition using convolutional neural network. In: Bhattacharyya, S., Hassanien, A., Gupta, D., Khanna, A., Pan, I. (eds.) International Conference on Innovative Computing and Communications. Lecture Notes in Networks and Systems, vol. 56. Springer, Singapore (2019)
Google Scholar
Xiang, L., et al.: Automatic vehicle identification in coating production line based on computer vision. In: International Conference on Computer Science and Engineering Technology, pp. 260–267. World Scientific Publication Co. Pvt. Ltd (2016)
Google Scholar
Chen, H.T., et al.: Multi-camera vehicle identification in tunnel surveillance system. In: IEEE International Conference on Multimedia & Expo Workshops, pp. 1–6. IEEE (2015)
Google Scholar
Cha, Y.J., Choi, W.: Vision-based concrete crack detection using a convolutional neural network. In: Caicedo, J., Pakzad, S. (eds.) Dynamics of Civil Structures, vol. 2. Conference Proceedings of the Society for Experimental Mechanics Series. Springer, Cham (2017)
Google Scholar
Frazão, X., Alexandre, L.A.: Weighted Convolutional Neural Network Ensemble. In: Bayro-Corrochano, E., Hancock, E. (eds.) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2014. Lecture Notes in Computer Science, vol. 8827. Springer, Cham (2014)
Chapter Google Scholar
Fan, Y., Lam, J.C.K., Li, V.O.K.: Multi-region Ensemble Convolutional Neural Network for Facial Expression Recognition. In: Kůrková V., Manolopoulos Y., Hammer B., Iliadis L., Maglogiannis I. (eds.) Artificial Neural Networks and Machine Learning—ICANN 2018. ICANN 2018. Lecture Notes in Computer Science, vol. 11139. Springer, Cham (2018)
Chapter Google Scholar
Kori, A., Soni, M., Pranjal, B., Khened, M., Alex, V., Krishnamurthi, G.: Ensemble of fully convolutional neural network for brain tumor segmentation from magnetic resonance images. In: Crimi, A., Bakas, S., Kuijf, H., Keyvan, F., Reyes, M., van Walsum, T. (eds.) Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries. BrainLes 2018. Lecture Notes in Computer Science, vol. 11384. Springer, Cham (2019)
Chapter Google Scholar
Koitka, S., Friedrich, C.M.: Optimized convolutional neural network ensembles for medical subfigure classification. In: Jones G. et al. (eds.) Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2017. Lecture Notes in Computer Science, vol. 10456. Springer, Cham (2017)
Google Scholar
Kasnesis, P., Patrikakis, C.Z., Venieris, I.S.: PerceptionNet: a deep convolutional neural network for late sensor fusion. In: Arai K., Kapoor, S., Bhatia, R. (eds.) Intelligent Systems and Applications. IntelliSys 2018. Advances in Intelligent Systems and Computing, vol. 868. Springer, Cham (2019)
Google Scholar
Shkundin, S. Z., Kolistratov, M.V., Belobokova, Y.A.: Algorithms performance testing to determe changes of a metal aggregate state. Syst. Administrator. 10(191), 90–93 (2018)
Google Scholar
Fu, G.: A novel isolated speech recognition method based on neural network. In: Zhong, Z. (ed.) Proceedings of the International Conference on Information Engineering and Applications (IEA) 2012. Lecture Notes in Electrical Engineering, vol. 220. Springer, London (2013)
Google Scholar
Ahlswede, R., Ahlswede, A., Althöfer, I., Deppe, C., Tamm, U.: Shannon’s model for continuous transmission. In: Ahlswede, A., Althöfer, I., Deppe, C., Tamm, U. (eds.) Transmitting and Gaining Data. Foundations in Signal Processing, Communications and Networking, vol. 11. Springer, Cham (2015)
Google Scholar
Ramasubramanian, K., Singh, A.: Deep Learning using Keras and TensorFlow. In: Machine Learning Using R. Apress, Berkeley, CA (2019)
Google Scholar
Srinivasa, K.G., Siddesh, G.M., Srinidhi, H.: Advanced Analytics with TensorFlow. In: Network Data Analytics. Computer Communications and Networks. Springer, Cham (2018)
Google Scholar

Download references

Acknowledgements

The reported study was funded by RFBR according to the research project № 19-01-00425

Author information

Authors and Affiliations

D. Mendeleev, University of Chemical Technology, Miusskaya square 9, 125047, Moscow, Russia
Valeriy Meshalkin
Moscow Power Engineering Institute (Branch) in Smolensk, National Research University, Energetichesky proyezd 1, Smolensk, 2014013, Russia
Andrey Puchkov, Maksim Dli & Yekaterina Lobaneva

Authors

Valeriy Meshalkin
View author publications
You can also search for this author in PubMed Google Scholar
Andrey Puchkov
View author publications
You can also search for this author in PubMed Google Scholar
Maksim Dli
View author publications
You can also search for this author in PubMed Google Scholar
Yekaterina Lobaneva
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrey Puchkov .

Editor information

Editors and Affiliations

Volgograd State Technical University, Volgograd, Russia
Alla G. Kravets
Peter the Great St. Petersburg Polytechnic University, St. Petersburg, Russia
Alexander A. Bolshakov
Volgograd State Technical University, Volgograd, Russia
Maxim V. Shcherbakov

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Meshalkin, V., Puchkov, A., Dli, M., Lobaneva, Y. (2020). Deep Neural Networks Application in Models with Complex Technological Objects. In: Kravets, A., Bolshakov, A., Shcherbakov, M. (eds) Cyber-Physical Systems: Advances in Design & Modelling. Studies in Systems, Decision and Control, vol 259. Springer, Cham. https://doi.org/10.1007/978-3-030-32579-4_23

Download citation

DOI: https://doi.org/10.1007/978-3-030-32579-4_23
Published: 26 November 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32578-7
Online ISBN: 978-3-030-32579-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Deep Neural Networks Application in Models with Complex Technological Objects

Abstract

Similar content being viewed by others

Identification of a Technological Process with Application of Neural Network Modeling

Simulation Modeling Using Neural Networks to Control Complex Technical Systems

Application of Artificial Neural Networks in the Process of Catalytic Cracking

Keywords

1 Problem Statement

2 Background and Methods

3 Application and Results

4 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Deep Neural Networks Application in Models with Complex Technological Objects

Abstract

Similar content being viewed by others

Identification of a Technological Process with Application of Neural Network Modeling

Simulation Modeling Using Neural Networks to Control Complex Technical Systems

Application of Artificial Neural Networks in the Process of Catalytic Cracking

Keywords

1 Problem Statement

2 Background and Methods

3 Application and Results

4 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation