Keywords

1 Introduction

Facing the increase in environmental pollution caused by greenhouse gas emissions produced by combustion engines [1], electronic injection systems were implemented. Through the on-board diagnostic (OBD-II), available since 1994 [2], different variables of sensors are obtained, like longitudinal acceleration (Ax), Engine Speed (RPM), Vehicle Speed Sensor (VSS), Manifold Air Flow (MAF), Manifold Absolute Pressure (MAP) [3]. A methodology used to determine sensors behavior is carried out by driving modes study (Sport, Normal and ECO) where Lv et al. [4], highlight behaviors and interactions between the driver and the vehicle, these modes are identified according to driving styles by means of maneuvers [5], to estimate fuel consumption and polluting emissions [6]. Due to the average driver can increase efficiency by 10% simply by changing the driving mode [7], Hooker et al., in case of Eco drive mode, covers low consumption driving behaviors due to the accelerator pedal differential use [8], which influences in fuel economy and vehicle emissions to reduce its environmental impact [9]. In addition, an efficient driving mode is the main solution to reduce greenhouse gases emission produced by the vehicles [10]. According to Sivak et al., factors such as drivers decision respect to the vehicle maintenance or route selection influence in fuel economy [9], as well as making early gear changes avoid high speeds, reduce polluting emissions and fuel consumption [11, 12]. Lapuerta et al. [13], considers how it infers the altitude above sea level effect in internal combustion engines due to the air density that modifies its thermodynamic cycle and as consequence decreases the engine power, increases the polluting emissions and fuel consumption, considering Cuenca altitude, 2550 m.a.s.l. [14].

2 Materials and Methods

2.1 Experimental Overview

For the development of the present investigation, a review of previous researches was realized to propose an adequate data acquisition and analysis methodology. Zhou et al. [15], propose an optimization model of engine repair based on the classification of a Random Forest algorithm, which by adaptation and decision of more significant variables allows classifying engine real data, demonstrating that the more samples are trained the higher the classification accuracy. Considering the weight of applied variables can reach 79.1% of precision in the applied model. Damström et al. [16], propose a methodology that consists of finding the ideal number of groupings controlling similar characteristics between them by means of a cross correlation, but it has the disadvantage that it doesn’t guarantee to find a global minimum which the approximation of cluster centroid changes its results. Based in the Corcoba et al. methodology [17], a diffuse logic system was used to simulate human behavior to estimate an efficient driver behavior by input and output variables, its output is rated with a classification between 0 to 1 depending how efficient the driving is, the proposal characterized the driving style by the data acquisition through telemetry. Pereira et al. [18], present a mobile application based on diffuse logic, through OBD-II data acquisition to characterize the driving efficiency by estimating the specific consumption with an accuracy of 85% in this model.

Oñate et al. [19], propose an intelligent system of erratic driving diagnosis through OBD-II and GPS data acquisition, which proposes reduce accidents by extracting driving signals, mathematical signal analysis and driving fault detection through Neural Networks, evaluating the model they guaranteed its viability and reliability. Chen et al. [20], present a semi – supervised classification method through a Support Vector Machine (SVM) algorithm, by data acquisition of velocity and engine speed data by OBD II from gasoline and diesel passengers vehicles with the vehicle identification number (VIN), it can classify the vehicles through a specific classification rule, whose purpose is applying to vehicles with internet connection and to improve estimation of CO2 emissions. Corcoba et al. present an efficient driving model through OBD-II data acquisition by a smartphone which provides ecological driving suggestions to the driver for bad driving habits, with the aim of improve the driving style through the efficiency consumption by algorithms such as J48 [21], Naïve Bayes and Random Forest, where the last one allows a better control of random attributes for each node and was selected because its execution time is less than J48, especially when the attributes number or the training set is bigger with a classification certainty of 83.17 and 71.15% respectively.

Given the background, the following methodology is proposed (Fig. 1):

Fig. 1.
figure 1

Methodology

2.2 OBD-II Data Acquisition Though Data Logger Device

OBD-II data acquisition is realized by the data logger device, “Freematics ONE+”, which allows to extract all the Electronic Control Unit (ECU) sensors data and register the information in a “.CSV” archive, it’s saved in a micro SD for a later processing and research. Stored Variables are shown on Table 1.

Table 1. Stored variables

For the experiment development, some test were realized on the dynamometer bank MAHA LP 3000 based in the ISO 17359 2018 “Condition monitoring and diagnostics of machines” [22] to obtain engine power and specific consumption data according to quality criterions.

To characterize method based in Meseguer et al. [1,2,3], Dörr et al. propose analyze driving incidence factors, such as, input, study, noise and controllable variables. These variables are shown in Table 2.

Table 2. Incidence factors

2.3 Signal Filtering

Motor signals when acquiring data present noise due to the existence of incidence factors in the vehicle, for this reason, these signals go through a filter stage; this consists in a comparison between real signals versus those obtained from sensors, for this purpose the S-Golay filter [24] was applied in order to obtain clean signals for a correct interpretation.

3 Results

3.1 Experimentation Results

With 4 power test, the vehicle performance was characterized as the volumetric fuel consumption that a low engine speed it is low, then when speed engine is increased by the increased throttle opening, the volumetric fuel consumption gets a gradually increase due to a bigger amount of air – fuel mixture causing more engine power, (see Fig. 2).

Fig. 2.
figure 2

Volumetric consumption

Relating obtained results of volumetric fuel consumption with engine power at 25%, 50%, 75% and 100% throttle valve opening, engine specific consumption is determined by (1):

$$ {\text{Equation:}}\;C_{esp} = \frac{{\rho \dot{v}}}{\text{P}}\left[ {\frac{g}{kWh}} \right] $$
(1)

Where:

  • \( \dot{v} = Fuel\,Volumetric\,Flow\,(l/h) \)

  • \( \rho = Fuel\,Density\,(g/l) \)

  • \( {\text{P}} = Engine\,Power\,(kW). \)

The engine specific consumption depends directly of engine speed and the input air – fuel mixture regulated by the throttle valve opening angle, (see Fig. 3).

Fig. 3.
figure 3

Engine specific consumption.

3.2 Machine Learning Architectures Application

For obtained data application from dynamometer bank in real driving cycles a model that explain the relationship between engine performance and normal circulation vehicle is proposed, for later determine by means of an artificial neural network application the fuel consumption and finally to apply fuzzy logic to obtain the driving mode.

Vehicle performance depends directly of transmissions ratios of each gear, so these values are obtained by the application of K-means method based in relation analysis between speed vehicle and speed engine (VSS/RPM) taken from a normal driving cycle, (see Fig. 4).

Fig. 4.
figure 4

Driving estimation model

Where the following transmission ratios values are obtained (Table 3):

Table 3. Transmission ratios

Fuel Volumetric Consumption depends directly of engine speed and the vehicle gear selected, as shown in (2) (see Fig. 5).

Fig. 5.
figure 5

K-means classification.

$$ C_{n} \left[ {\frac{l}{h}} \right] = R _{n} *RPM $$
(2)

Where:

  • \( C_{n} = Fuel\,Volumetric\,Consumption\,for\,each\,gear\,(l/h) \)

  • \( R_{n} = Transmission\,Ration\,for\,each\,gear \)

  • \( RPM = Engine\,Speed\,(rpm) \)

Relating fuel consumption with engine power delivered, the specific consumption for each gear is found, in which can be noted minimum values for each one of the selected gear, that are ideal values in which the vehicle should circulate (see Fig. 6).

Fig. 6.
figure 6

Fuel volumetric consumption

Fig. 7.
figure 7

Specific consumption for each gear.

Given the performance, maximum engine acceleration for each selected gear is determined, as shown in (3)

$$ A_{x} = \frac{{M_{nor} *\eta _{i} *\frac{{R_{C} R_{Cn} }}{re}}}{{m_{veh} * \gamma_{m} }} $$
(3)

Where:

  • \( A_{x} = Longitudinal\,Acceleration \)

  • \( \eta_{i} = Gear\,efficiency \)

  • \( M_{nor} = Normal\,Momentum \)

  • \( \frac{{R_{c} \,R_{cn} }}{re} = R_{n} = Total\,Transmission\,Ratio \)

  • \( m_{veh} = Vehicle\,Mass \)

  • \( \gamma_{m} = Rotative\,Mass\,Coficient \)

  • \( n = Selected\,gear \)

  • \( \gamma_{m} = 1,05 + 0,0025R_{n}^{2} \)

Vehicle maximum acceleration for each gear depends on engine speed and due to transmission ratio of the vehicle velocity [25], as shown in Fig. 8.

Fig. 8.
figure 8

Vehicle maximum acceleration for each gear

For an efficient driving gear changes should be done when specific consumption is relatively low according with Fig. 7. Table 4 shows values where fuel consumption is minimized.

Table 4. Specific consumption eco drive

Random Forest algorithm is presented, which its main purpose is to find relationship between input and output study variables, to determine more important variables in fuel consumption according with driving characteristics. Through study attributes, Random Forest allows to classify real engines data, demonstrating that while more samples being trained the greater would be the classification accuracy [15], also it allows a better control of the random attributes number for each node with an execution time really efficient [21], proving that fuel consumption depends directly of MAP, TPS, VSS, Ax and the selected gear, but not from engine speed, because this factor is in close relationship with VSS and Ax, with a determination index of 0.98905, (see Fig. 9).

Fig. 9.
figure 9

Random forrest.

From this, an artificial neural network is trained to estimate the volumetric fuel consumption based on determined variables with 16661 samples to estimate the driving mode. The prediction is efficient due to the high reliability of RNA with a global index of 0.98905 (Table 5).

Table 5. ANN training

The found settings (see Fig. 10):

Fig. 10.
figure 10

Relationship between expected values and estimated values by ANN

With this model an efficient vehicle speed is estimated by VSS and acceleration is determined to evaluate change gear according to engine speed. As the engine speed increases, a gear change is required, with the aim of increase vehicle speed and avoid excessive fuel consumption so that when the gear change is made the engine revolutions fall until get the new gear, considering that if the change gear is not realized, fuel consumption gets increased.

By Predictor importance the strength of the relationship of the predictors and the PID data result is quantified by the import of the samples, the viability of the established model is determined according to its importance and the training given.

In this work a diffuse classifier such as Manami is proposed with a total of 43 rules, which has many input variables: speed, acceleration, volumetric fuel consumption of the vehicle and determines the driving mode between Sport, Normal and Eco. The classifier is applied to obtained data in real driving (see Fig. 11) during start-up the Sport driving mode is used meanwhile eco mode is used in decelerations and small displacements and for the rest of scenarios normal mode is frequently used.

Fig. 11.
figure 11

Characterization of the driving mode [23]

Through the characterization of the driving modes it is determined that the Ecodriving mode is the most efficient due to the differentiated use of the acceleration pedal which establishes the specific consumption is low compared to the other modes by optimizing fuel consumption in a random route of conduction.

4 Conclusions

In this paper it was obtained a model that explains PID´s signals behavior according to a person driving mode through thresholds values based on Machine Learning architecture by statistic method Random Forest that identifies the major importance variables on the estimation of fuel consumption in order to classify the driving modes in a random route, representing by sections the driving modes Eco drive, Normal and Sport according to driving behavior. Random Forest allows estimating sensors variables values that are unknown, with great precision through data set adjustments using characteristics of engine sensors variables, with the advantage that the model presents a minimum error for driving mode data classification of each vehicle with a determination index of 0.98905. PID’s signals like MAP, TPS, VSS, Ax and the selected gear, directly influence in the person driving mode due to the big input mixture air – fuel, where highlights specific consumption, polluting emissions and vehicle performance considering that engine speed doesn’t have a relationship with VSS and Ax, it doesn’t infer in the model behavior. The application of K-means method based on the analysis of the relationship between vehicle speed and engine speed (VSS/RPM) though obtained data from the dynamometer bank establishes a model based on the specific consumption and engine performance for driving modes characterization under normal driving conditions.

This paper aims to facilitate the study of future projects for the determination of the estimation of the driving mode through Machine Learning architectures applied to the study of the analysis of polluting emissions.