Prediction with coastal environments and marine diesel engine data based on ship intelligent platform

Liu, Yuxue; Jia, Shuli; Yu, Yuan; Ma, Liyong

doi:10.1007/s13204-021-02042-9

Prediction with coastal environments and marine diesel engine data based on ship intelligent platform

Original Article
Published: 08 September 2021

Volume 13, pages 1437–1441, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Applied Nanoscience Aims and scope Submit manuscript

Prediction with coastal environments and marine diesel engine data based on ship intelligent platform

Download PDF

Yuxue Liu¹,
Shuli Jia¹,
Yuan Yu² &
…
Liyong Ma²

183 Accesses
1 Citation
Explore all metrics

Abstract

An intelligent platform prototype is established for a coastal environment monitoring ship. LSTM and GBDT methods are developed for pH value and fuel consumption prediction in the intelligent platform. The results of applying the general prediction algorithms to actual environments’ data and marine diesel engine data are reported. GBDT has the best predictive results with the smallest error. SVM and SVR have similar prediction effects, while FNN has the largest error. As the prediction time increases, the error of LSTM becomes large. The ship intelligence platform can provide unified data support and general intelligent algorithms for data-driven applications, and it has the potential to be widely used in coastal environmental monitoring applications.

Application Research of Ship Maritime Safety Decision System Based on Big Data and Artificial Intelligence

Applying artificial neural networks for modelling ship speed and fuel consumption

Article Open access 16 June 2020

A Novel Approach to Energy Management in Large Passenger and Cruise Ships: Integrating Simulation and Machine Learning Models

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Sensing, communication, computing, and information technology are developing rapidly, and ships can provide more and more digital information. The amount of this information data is huge, and the data format and update frequency vary significantly. The centralized data center approach cannot provide the necessary support for the increasing number of applications with very different requirements. A ship intelligence platform to support big data came into being. Ship intelligence platforms generally use distributed storage and hierarchical data architecture to provide different data support for different applications. This intelligent platform can not only meet the underlying ship equipment control requirements, but also support advanced decision applications. In recent years, researches based on ship intelligence platforms have received more and more attention, such as the big data architecture of ship intelligence platforms (Jia et al. 2018), intelligent obstacle avoidance based on data fusion (Wang et al. 2019; Ma et al. 2019a, b), intelligent navigation applications, and so on. Ship intelligence platforms are very important, but there are very few intelligent platforms and their applications that have been established.

Water quality prediction and fuel consumption prediction are two important prediction applications. Water quality monitoring is an important part of coastal environmental monitoring (Ahmed et al. 2020).Water quality prediction has always been one of the focus issues being studied (Sun et al. 2020).Chlorophyll concentration in coastal waters has been studied, water pollution is predicted with the least square support vector machine method, PCA-SVR method is employed for seawater quality prediction by Sun et al. (2020). In this paper, the pH value, which is very important in water quality, is predicted. The fuel consumption prediction of marine diesel engines has always been the focus of energy efficiency research (Manigandan et al. 2020; Pauline et al. 2021; Saravanan et al. 2020). Le et al. (2020) and Jeon et al. (2018) used neural network methods to predict the fuel consumption of the ship's main engine. Gkerekos et al. (2019) not only compared the effects of different forecasting methods, but also analyzed the factors that affect fuel consumption. In this paper, the diesel consumption of the marine engine is predicted.

Although the ship intelligence platform is very important, there are few reports about the establishment of intelligent applications on the platform. There is little research on the application of coastal environmental based on ship intelligent platform. Recently, an intelligent platform for a coastal environment monitoring ship was successfully constructed by Shanghai Marine Diesel Engine Research Institute. The intelligent platform stores many sensor data in a distributed manner. These sensors collect a large amount of data on marine power equipment including a marine diesel engine. The platform also stores the detection data of the monitoring ship on the coastal environment. To better provide support for data analysis and utilization, the ship intelligence platform provides some modular analysis algorithms. The purpose of this paper is to evaluate the effectiveness of these general analysis algorithms of the intelligent platform applied to real data. In the intelligent platform prototype system, the water measurement data of the environmental monitoring ship is stored. The prediction algorithms LSTM (long short-term memory) and GBDT (gradient boosting decision tree) are selected for the prediction of diesel engine state and water quality of environmental monitoring data.

The main contributions of this paper include an intelligent platform prototype which is established for a coastal environment monitoring ship, and the general intelligent prediction algorithms are developed for the platform. The results of applying the general prediction algorithms to actual data are reported in this paper. This paper is organized as follows. After introduction section, LSTM and GBDT algorithms are described in methods section, and data collection is also described. In results and discussion section, LSTM and GBDT algorithms are compared with other common methods, and results are reported. The conclusion is given at the end of the paper.

Methods

In the prototype of the ship intelligence platform, two important prediction algorithms, LSTM and GBDT, were implemented. Two prediction algorithms will be introduced, and the actual data applying these algorithms will be described in detail.

LSTM prediction algorithm

The main feature of RNN (recurrent neural network) design is that the current state is related to the previous state. The emergence of RNN solves the two problems of processing variable-length input and analyzing the sequence information of the sequence. Since each state of the RNN contains the output of the previous state, this architecture leads to a long-term dependency problem. LSTM is specially designed to solve this problem. To predict the time series of some future time values, in addition to the known time series, such as the current value of series $x_{t}$, the LSTM prediction algorithm also sends two pieces of information backward in the time dimension, namely the hidden layer state $h_{t - 1}$ and the cell state c_t−1.The state of $h_{t - 1}$ can be regarded as the prediction of the value of the LSTM algorithm at the future time based on the current information and a small amount of memory information, or it is called one of the LSTM outputs at time t−1.Obviously, another output of LSTM is the cell state c_t, which is a pure memory state used to tailor the memory information for the hidden layer to prevent the hidden layer from processing too long information.

Unlike RNN, which has only one tanh layer in its repeated structure module, the LSTM model uses a state and gate mechanism to preserve long-term memory. The gate in LSTM is a fully connected network layer. Its input can be multiple vectors and its output is a real number. This real number indicates the degree of the door opening. In deep learning, this gate represents the degree of influence of variables on variables. LSTM introduces three gates to control the inflow and outflow of long-term state c. The three gates are the forgotten gate, input gate, and output gate. LSTM has many successful prediction application (Shahid et al. 2021).

The forgotten gate can be expressed as

$$f_{t} = \sigma \left( {W_{f} \cdot \left[ {h_{t - 1} ,x_{t} } \right] + b_{f} } \right).$$

(1)

Let $f_{t}$ be the output of the forgotten gate, $\sigma$ be the activation function, ${\text{W}}$ be the network weight matrix, $h_{t - 1}$ be the output of the hidden layer,$x_{t}$ be the network input, and $b$ be the network bias. The forget of the input gate can be expressed as

$$i_{t} = \sigma \left( {W_{i} \cdot \left[ {h_{t - 1} ,x_{t} } \right] + b_{i} } \right).$$

(2)

The current input cell status is

$$\overline{c}_{t} = {\text{tanh}}\left( {W_{c} \cdot \left[ {h_{t - 1} ,x_{t} } \right] + b_{c} } \right).$$

(3)

The current cell status is

$$c_{t} = f_{t} \circ c_{t - 1} + i_{t} \circ \overline{c}_{t} ,$$

(4)

where the symbol $\circ$ means multiply by element. The output gate is used to control how much of the united state c_t is input to h_t. The output of the output gate is

$$o_{t} = \sigma \left( {W_{o} \cdot \left[ {h_{t - 1} ,x_{t} } \right] + b_{o} } \right).$$

(5)

Then the final output of the LSTM is

$$h_{t} = o_{t} \circ \tanh (c_{t} ).$$

(6)

LSTM uses a chain structure to connect repeated neural network modules to achieve the purpose of memorizing values of indefinite length of time.

GBDT prediction algorithm

GBDT is a machine learning model with good performance for completing regression, classification, and ranking tasks, and it is widely used in the industry. GBDT belongs to the boost algorithm cluster, and its idea is to predict the data after combining multiple weak classifiers. The core of the gradient boosting algorithm is as follows: first, the residual of the decision tree model is evaluated, then negative gradient obtained from loss function is employed for boost residual, and finally, a fitted regression tree is gradually obtained by iteration. Therefore, gradient boosting is an iterative process that continuously fits the residuals and reduces the residuals.

The GBDT model is

$$F_{M} (x) = \sum\nolimits_{m = 1}^{M} {T(x;\Theta_{m} )} ,$$

(7)

where $T(x;\Theta_{m} )$ is the decision tree, x is the input vector,$\Theta_{m}$ is decision tree parameters, and M is the number of decision trees. The squared loss is chosen as the loss function of GBDT, which is

$$L(y,T_{m} (x)) = \frac{1}{2}\left( {y - T_{m} (x)} \right)^{2} ,$$

(8)

where y is the output. Then the residual is equal to the negative gradient, and the direction of the residual is the direction of global optimization. GBDT can determine the parameters $\hat{\Theta }_{m}$ of the next decision tree by minimizing the loss function L as

$$\hat{\Theta }_{m} = \arg \min \sum\nolimits_{m = 1}^{M} {L(y_{i} ,T_{m - 1} (x_{i} ) + T(x,\Theta_{m} ))} .$$

(9)

It can be seen from the above formula that GBDT has the characteristics of a good fit to the real distribution, especially feature selection and model integration are its advantages.

Coastal environment and marine diesel engine data for prediction

Water quality parameters are measured and recorded hourly by the coastal environment monitoring ship. The pH value is one of the most important parameters reflecting water quality (Ahmed et al. 2020; Tiyasha et al. 2020; Sun et al. 2020), and it is selected as the environmental prediction data. pH value is measured with pH meter PHSJ-6L (Inesa Instrument, China). A total of 10 consecutive days of measurement data are used for prediction experiments. Seven days of data are used as training dataset, and others are used as test dataset.

To predict diesel consumption, the coastal environment, and marine engine data are used. The coastal environment data include wind speed, wind direction, air temperature, and sea wave height, and these are measured with the instrumentAM-4836C (Reall tech Instrument, China). The engine and navigation system provide diesel consumption data and ship speed. All the data are collected when the ship is stable for more than 20 min. All these data from intelligent platform and sensors are employed for data analysis. 30 pieces data of 4-h voyage on the same ship route are collected. The total of 24 pieces are regarded as the training dataset, while the other pieces are used as test datasets. When forecasting, the first hour of data are used as the starting sequence, and the remaining 3 h data are used as a multivariate time series for prediction.

Results and discussion

Other algorithms used in the literature for water quality prediction and diesel engine prediction are used to compare with algorithms in the intelligent platform. These algorithms include support vector machine (SVM) (Nie et al. 2019), regression vector machine (RVM) (Sun et al. 2020), and feedforward neural network (FNN) (Jeon et al. 2018). FNN is a typical neural network method, which is widely used in nonlinear models. SVM is the result of the application of statistical learning theory. It uses the kernel function to solve the convex quadratic programming problem to maximize the classification interval. RVM is a more robust statistical learning regression method.

The performance indicator means absolute error (MAE) is employed to evaluate these algorithms. It is defined as

$${\text{MAE}} = \frac{1}{n}\sum\nolimits_{i = 1}^{n} {\left| {\hat{y}_{i} - y_{i} } \right|} .$$

(10)

The true value of the pH value of a whole day and the prediction results of different methods are illustrated in Fig. 1, where the predicted pH value is for every hour. The prediction performance of different algorithms, the prediction capabilities of different algorithms in 2 h, 3 h, and 4 h were also predicted. The MAE indicators of the prediction results of these algorithms at different time lengths are listed in Table 1.

Table 1 MAE prediction comparison of pH with different prediction periods

Full size table

The prediction of fuel consumption is a multivariate prediction problem. First, the fuel consumption is predicted every 20 min, which is consistent with the measurement sampling interval of the main diesel engine. To illustrate the prediction result figure, the principal component analysis (PCA) method (Sun et al. 2020) is used to determine the two most important parameters affecting fuel consumption: they are wind speed and ship speed. Fuel consumption prediction results of different methods for every 20 min are illustrated in Fig. 2. In the figure, wind speed and ship speed are measured values. Second, fuel consumption is predicted separately for every 40 min and 60 min. The MAE for all these predictions with different methods is calculated and listed in Table 2.

Table 2 MAE prediction comparison of fuel consumption (kg) with different prediction periods

Full size table

For the prediction of pH, the MAE of the GBDT method is the smallest, indicating that the prediction result of this method is the best. FNN has the worst prediction effect, and MAE is 2.16 times to 3.53 times that of the GBDT method. The results of RVM is similar to SVM, and their MAEs are better than FNN. Although FNN can express nonlinear models, for prediction problems, due to the insufficient depth of its neural network, its ability to solve prediction problems is limited. SVM and RVM use statistical learning models, and the best decision position can be selected through a small amount of data, so the effect is good. GBDT uses complex network architecture and has the advantage of integrating and synthesizing the results of multiple models, so the best results are obtained. The LSTM method is quite special, and its effect is better for forecasts with a small interval of 1 h. The LSTM method is worse than GBDT and better than RVM and SVM. But for predictions with longer intervals, the error increases. This is mainly due to the structure of the cyclic network adopted by LSTM. When the interval is large, the prediction result still needs the support of small interval data, and the resultant error is accumulated. For the prediction of fuel consumption data, we can also observe results similar to the pH prediction.

The average MAE is also listed in the two tables, GBDT has the smallest average MAE, and the average error of FNN is greater than that of SVM and RVM. LSTM has the largest average error in pH prediction, and it is better than SVM and FNN in fuel consumption prediction.This is due to the different steps of multiple predictions and different cumulative errors. Its single-step forecasting effect is second only to GBDT. But for multi-step prediction, as the cumulative number of times increases, the error gradually increases.

Conclusions

An intelligent platform prototype is established for a coastal environment monitoring ship, LSTM and GBDT methods are developed for pH value and fuel consumption prediction. GBDT method has the smallest prediction error and gets the best prediction results. The intelligent platform can provide unified data support and general intelligent algorithms for data-driven applications, and it has the potential to be widely used in coastal environmental monitoring applications.

References

Ahmed U, Mumtaz R, Anwar H, Mumtaz S, Qamar AM (2020) Water quality monitoring: from conventional to emerging technologies. Water Supply 20(1):28–45
Article CAS Google Scholar
Gkerekos C, Lazakis I, Theotokatos G (2019) Machine learning models for predicting ship main engine fuel oil consumption: a comparative study. Ocean Eng 188:1–41
Article Google Scholar
Jeon M, Noh Y, Shin Y, Lim OK, Lee I, Cho D (2018) Prediction of ship fuel consumption by using an artificial neural network. J Mech Sci Technol 32(12):5785–5796
Article Google Scholar
Jia S, Ma L, Zhang S (2018). Big data prototype practice for unmanned surface vehicle. In: Proceedings of the 4th International Conference on Communication and Information Processing, pp 43–47
Le LT, Lee G, Park KS, Kim H (2020) Neural network-based fuel consumption estimation for container ships in Korea. Marit Policy Manag 47(5):615–632
Article Google Scholar
Ma L, Ma C, Liu Y, Wang X (2019a) Thyroid diagnosis from SPECT images using convolutional neural network with optimization. Comput Intell Neurosci 2019:1–12
Google Scholar
Ma LY, Xie W, Huang HB (2019b) Convolutional neural network based obstacle detection for unmanned surface vehicle. Math Biosci Eng MBE 17(1):845–861
Article Google Scholar
Manigandan S, Atabani AE, Ponnusamy VK, Pugazhendhi A, Gunasekar P, Prakash S (2020) Effect of hydrogen and multiwall carbon nanotubes blends on combustion performance and emission of diesel engine using Taguchi approach. Fuel 276:1–15
Article Google Scholar
Nie Z, Yuan Y, Xu D, Shen F (2019) Research on support vector regression model based on different kernels for short-term prediction of ship motion. In: 2019 12th International Symposium on Computational Intelligence and Design (ISCID), pp 61–64
Pauline JMN, Sivaramakrishnan R, Pugazhendhi A, Anbarasan T, Achary A (2021) Transesterification kinetics of waste cooking oil and its diesel engine performance. Fuel 285:1–11
Google Scholar
Saravanan S, Kumar CR, Pugazhendhi A, Brindhadevi K (2020) Role of thermal barrier coating and porous medium combustor for a diesel engine: an experimental study. Fuel 280:1–7
Google Scholar
Shahid F, Zameer A, Muneeb M (2021) A novel genetic LSTM model for wind power forecast. Energy 223:120069
Article Google Scholar
Sun X, Wang X, Cai D, Li Z, Gao Y, Wang X (2020) Multivariate seawater quality prediction based on PCA-RVM supported by edge computing towards smart ocean. IEEE Access 8:54506–54513
Article Google Scholar
Tiyasha, Tung TM, Yaseen ZM (2020) A survey on river water quality modelling using artificial intelligence models: 2000–2020. J Hydrol 585:124670. https://doi.org/10.1016/j.jhydrol.2020.124670
Article CAS Google Scholar
Wang C, Zhang X, Cong L, Li J, Zhang J (2019) Research on intelligent collision avoidance decision-making of unmanned ship in unknown environments. Evol Syst 10(4):649–658
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Automation Engineering Department, Shanghai Marine Diesel Engine Research Institute, Shanghai, China
Yuxue Liu & Shuli Jia
School of Information Science and Engineering, Harbin Institute of Technology, Weihai, China
Yuan Yu & Liyong Ma

Authors

Yuxue Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shuli Jia
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Yu
View author publications
You can also search for this author in PubMed Google Scholar
Liyong Ma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shuli Jia.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This research does not involve any human or animal.

Informed consent

No informed consent was necessary for the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, Y., Jia, S., Yu, Y. et al. Prediction with coastal environments and marine diesel engine data based on ship intelligent platform. Appl Nanosci 13, 1437–1441 (2023). https://doi.org/10.1007/s13204-021-02042-9

Download citation

Received: 14 May 2021
Accepted: 18 August 2021
Published: 08 September 2021
Issue Date: February 2023
DOI: https://doi.org/10.1007/s13204-021-02042-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Prediction with coastal environments and marine diesel engine data based on ship intelligent platform

Abstract

Similar content being viewed by others

Application Research of Ship Maritime Safety Decision System Based on Big Data and Artificial Intelligence

Applying artificial neural networks for modelling ship speed and fuel consumption

A Novel Approach to Energy Management in Large Passenger and Cruise Ships: Integrating Simulation and Machine Learning Models

Introduction