Keywords

1 Introduction

In a country like India, with more than half the population working in the agricultural or labor sector where physical injuries are inevitable, a large number of people undergo limb losses. Apart from the limb loss due to physical injuries, India is also the third largest home to diabetes patients who eventually have to amputate their limbs to the disease [1]. The discovery of an artificial limb or prosthesis provides the much needed respite to the amputees in India and across the globe.

Prosthesis is defined as a man-made device attached to the subject body to replace a part which could have been lost through trauma, disease or other congenital conditions. The statistics of the National Limb Loss Information Centre reveal that the upper limb amputations account for a much greater part of the overall trauma related amputations [2] and hence in this entire paper, we focus on the arm amputations and the prosthetic arms.

Many different kinds of robotic arms have been developed since its inception [3], ranging from grippers capable of performing basic industrial applications like lifting, moving, etc., to extremely detailed ones capable of enacting every possible human hand function.

The functioning of prosthetic arms is indeed a complex mechanism and involves deciphering the intended movement with the help of the electrical activities of the muscles, which are known as electromyogram (EMG) signals. The recording of the EMG signals is done through non-invasive methods through the surface of the skin of the amputee.

An overall process of controlling a prosthetic arm usually involves an electrode placed directly on the surface of the amputee used to record the signals, which are amplified, filtered and sampled to get a refined data set to be considered for deciphering the movement to be performed. This data is used for EMG pattern classification which includes processing of EMG signals, extraction of features and classification [4].

Continuous research in this field has brought advancements in this field with around 90% accuracy [5]. A number of factors that influence pattern recognition have been studied, for example, strength exerted by muscle [6], limb orientation [7], and electrode placement [8]. Other forms of noise may also cause problems [9]. The effect of these factors has been studied individually but [10] recently studied the effects of a combination of muscle contraction levels and forearm orientation on classification of EMG signals.

Feature extraction methods are used to get useful information out of almost meaningless and random time series EMG signals. Feature extraction can be done using methods like Time Domain based features [11], Discrete Fourier Transform [12], or Time Domain-Power Spectral Descriptors (TD-PSD) [13], etc. TD-PSD method as it has been established to be the best among all methods by previous literature [10]. TD-PSD method quantifies the angle of the EMG pattern rather than the amplitude which gives more robust results in comparison to other feature extraction methods. Feature extraction is then followed by a classifier which ultimately differentiates between actions and force levels with which it needs to be performed by giving an input to the digital controllers which controls the prosthetic arm. Support Vector Machine classifier was used in [10]. It has been proven to be equally good if not better to other classifiers such as Linear Discriminant Analysis (LDA), kNN, Random Forest and Naive Bayes [10].

This paper has tried to study the combined influence of different orientations of the forearm and varied muscular contraction levels on EMG pattern recognition. It has tried to continue and improve on the work done in this domain previously [10] and used their data set which was recorded live in Iraq as well as Australia. The data consists of ten intact limbed amputee subjects on which multiple electrodes were placed. These electrodes captured the EMG signals generated by 6 different movement classes at 3 varied contraction levels, with 3 trials given to each recording performed in 3 different orientation of the forearm. This data was used to extract the features by using the TD-PSD feature extraction method. Here, we have utilised the Neural Network Pattern Recognition toolbox available on MATLAB 2014 and subsequently compared the classification accuracy of this classification method with the previously used support vector machine classification method.

Section 2 discusses the data acquisition as well as the major methodologies and concepts used in feature extraction as well as classification involved in the study. Section 3 discusses the results obtained followed by the concluding remarks in Sect. 4.

2 Methods

2.1 Experimental Protocol

A normal computer display is placed in front of the subjects. The subjects were made to perform six actions or movements, which are, closed fist, opened fist, extension of wrist, flexion of wrist, wrist ulnar deviation, and wrist radial deviation. Three forearm orientations were considered: wrist fully supinated, at rest, and fully pronated, marked as 1st, 2nd, and 3rd orientations. Movements are performed at varied contraction levels: low, average and high in each orientation of all the six movements. Overall, each subject gave 162 trials: 6 classes of moves × 3 orientations of the arm × 3 levels of muscular contraction × 3 trials per movement.

2.2 Feature Extraction Method: Time Domain Power Spectrum Descriptors

Recent studies in Electromyogram (EMG) pattern recognition show the error when the implementation of myoelectric control system is carried out [14]. When tests are carried out on EMG patterns for the same movement at different position, the controller shows limited performance. The feature extraction method is known as TD-PSD [13] is utilized to minimize the effect of limb position on classification.

The feature vector is obtained in two steps. In the first step, using the sampled time series EMG signal and transforming them through Fourier transform and Parseval’s relations, a set of power spectrum features is extracted. Then the sampled time domain EMG signal is logarithmically scaled and the power spectrum moments are obtained from it. This is also called cepstral feature extraction method.

In the final step, the total six features are extracted which are nothing but the orientation between the power spectrum moments for original electromyogram signal and its cepstral version using a cosine similarity rule. The next section explains the feature extraction method in detail.

In Fig. 1 x[j] with j = 1, 2,… N, of length N denotes the sampled set of EMG signal. EMG trace within a certain epoch can be expressed as a function of frequency by means of Discrete Fourier transform (DFT). The feature extraction process begins by observing Parseval’s theorem which states that the sum of the square of the function is equal to the sum of the square of its transform.

Fig. 1.
figure 1

Block diagram of time domain power spectrum descriptors feature extraction method

$$ \sum\nolimits_{j = 0}^{N - 1} {|x\left[ j \right]|^{2} = \text{ }\frac{1}{N}\sum\nolimits_{k = 0}^{N - 1} {|X\left[ k \right]X \, \text{* }\left[ k \right]| = \sum\nolimits_{k = 0}^{N - 1} {|P[k]|} } } $$
(1)

Frequency index is denoted by k and P[k] is the power spectrum without phase [15]. This method will deal with the whole spectrum because the full frequency description is symmetric in nature which we obtained from Fourier Transform and we cannot obtain the power spectral density directly from the time-domain. So, all odd moments will be considered as zero. So, m is denoted as moment and n as order of the moment of the power spectrum P[k].

$$ \text{ }m_{n} = \sum\nolimits_{k = 0}^{N - 1} {\text{k}^{\text{n}} P[k]} $$
(2)

In the equation shown above, if the value of n is nonzero then the Fourier transform’s time-differentiation property is used and when n = 0 then Parseval’s theorem is used. This kind of property states that Δn is denoted as discrete time signals, which can also be written as multiplying the X[K] by k to the n th power.

$$ F\left[ {\Delta^{n} x[j]} \right] = \text{ k}^{\text{n}} X\left[ k \right] $$
(3)

The moments which will help in extraction of feature sets as shown in Fig. 1 are:

Root squared zero order moment: This feature mainly denotes the strength of muscle contraction, or the frequency-domain’s total power.

$$ \bar{m}_{0} = \text{ }\sqrt {\sum\nolimits_{j = 0}^{N - 1} {x[j]^{2} } } $$
(4)

Root squared second order moments: Power spectrum is denoted as the second order moments.

$$ \bar{m}_{2} = \text{ }\sqrt {\frac{1}{N}\sum\nolimits_{j = 0}^{N - 1} {\left( {\Delta x[j]} \right)^{2} } } $$
(5)

Root squared fourth order moments: For the fourth order moment we raise the power of frequency index by 2 in second order moment.

$$ \bar{m}_{4} = \text{ }\sqrt {\frac{1}{N}\sum\nolimits_{j = 0}^{N - 1} {(\Delta^{2} x[j])^{2} } } $$
(6)

So, the second and the fourth moment derivatives of the signals are used to minimize the signal’s full energy; hence, we normalize (λ = 0.1) to limit the influence of noise on all moments.

$$ m_{0} = \frac{{\bar{m}_{0} }}{\lambda }\text{ }m_{2} = \frac{{\bar{m}_{2} }}{\lambda }\text{ }m_{4} = \frac{{\bar{m}_{4} }}{\lambda } $$
(7)

Now, the first three features using the above moments are:

$$ f_{1} = \text{log}(m_{0} ) $$
(8)
$$ f_{2} = \text{log}(m_{0} - m_{0} ) $$
(9)
$$ f_{3} = \text{log}(m_{0} - m_{4} ) $$
(10)

The other three features are:

Sparseness: Sparseness measures the amount of energy is packed in only minor components of a vector. This feature can be expressed as:

$$ f_{4} = \log \left( {{{m_{0} } \mathord{\left/ {\vphantom {{m_{0} } {\sqrt {\left( {m_{0} - m_{2} } \right)\left( {m_{0} - m_{4} } \right)} }}} \right. \kern-0pt} {\sqrt {\left( {m_{0} - m_{2} } \right)\left( {m_{0} - m_{4} } \right)} }}} \right) $$
(11)

Such a feature describes a vector with all elements equal with a sparseness measure of zero that is m 2 and m 4 equal to zero because of differentiation and so f 4 = 0. For all other sparseness levels, value should be greater than 0 [14].

Irregularity Factor (IF): It denotes the measure of ratio of the count of upward zero crossings to the number of peaks. This feature can be expressed as in terms of spectral moments:

$$ f_{5} = { \log }(m_{0} /\sqrt {m_{0} m_{4} } ) $$
(12)

Waveform Length Ratio (WL): The summation of absolute value of the first and second derivative of EMG signal over its entire length is calculated to find the waveform length (WL) feature using the formula:

$$ f_{6} = { \log }\left( {\frac{{\mathop \sum \nolimits_{j = 0}^{N - 1} |\Delta x|}}{{\mathop \sum \nolimits_{j = 0}^{N - 1} |\Delta^{2} x|}}} \right) $$
(13)

Now on the basis of Fig. 1, we form a matrix a = [a 1a 2a 3a 4a 5a 6] by using the six extracted features. We also add another feature vector, expressed as b = [b 1b 2b 3b 4b 5b 6], which is extracted logarithmically scaled version log (\( x^{{ 2^{{x^{2} }} }} \)). For each EMG channel the final 6 features are extracted which are nothing but the orientation between the vectors obtained earlier. A cosine similarity rule is used to find these features given as

$$ f_{\text{i}} = \frac{{ - 2a_{\text{i}} b_{\text{i}} }}{{a_{\text{i}}^{2} + b_{\text{i}}^{2} }} $$
(14)

2.3 Artificial Neural Network

It makes a network of artificial neurons that map the input to the output, both of which are known to us with certainty. A backpropagation network consists of at least three layer which are one input layer, one output layer, and one or more hidden layers in between the input and output layers. The hidden layer has a number of hidden neurons. The network is trained, that is, the various connection weights and bias values are adjusted so as to generate the desired outputs for the given inputs. The error generated at the output is the difference in our desired output and the present output at the output nodes. This error is backpropagated from the output layer to the input layer through the hidden layer(s). This changes the connection weights to reduce the error. This process is called backpropagation.

For the classification process, Neural Pattern Recognition toolbox has been used, available in MATLAB 2014, where the extracted TD-PSD features from EMG signals obtained from 10 subjects are treated as input, that is, the data to be classified. Networks of pattern recognition are feed-forward networks which can be used to train and classify inputs according to their target classes.

To train the network, we use the scaled conjugate gradient backpropagation method of classification. This training function comprises of three layers in total. There is just one hidden layer. The number of neurons is selected according to our network size.

3 Experimental Results

To report our results, we combine all the data and classification results from our 10 subjects. There are a total of 1620 samples from 162 trials of our 10 subjects. We first divided our samples randomly to feed into the pattern recognition tool using scaled conjugate gradient backpropagation algorithm. We used 70% of those samples for training, 15% of those samples for validation and 15% of those samples for testing. Figure 2 shows the confusion matrix for training with TD-PSD features:

Fig. 2.
figure 2

Preliminary testing results with the entire dataset.

We now conducted a systematic study where, at each forearm orientation, we selected the data from one contraction level to train our classifier and then used the data from all contraction levels for testing our model.

For the case when we use the data of low contraction level in orientation 1 for training, Fig. 3 shows the confusion matrix and receiver operating characteristics (ROC), and Fig. 4 shows the performance curve. These plots help in the gauging the efficiency of a supervised learning algorithms (Table 1).

Fig. 3.
figure 3

(a) Confusion matrix and (b) ROC for training with low force level at orientation 1 and testing with all forces of the same orientation.

Fig. 4.
figure 4

Performance curve for training with low force level at orientation 1 and testing with all forces of the same orientation.

Table 1. Same orientation analysis. Testing was done using all forces within an orientation.

The classification accuracy was the highest when training data from medium contraction level was used, at each forearm orientation.

Next, the classifier was trained and tested with data from different orientations. Specifically, for training, each orientation was selected one by one and the TD-PSD features extracted from data of all three muscle contraction levels was used. The TD-PSD features exacted from all the contraction levels of the other two orientations was the testing data.

There was a decline in classification performance from the earlier scenario, predictably so. For the case when the network was trained using data from orientation 1, and tested using data from orientations 2 and 3, Fig. 5 shows the confusion matrix and ROC. Figure 6 shows the performance curve.

Fig. 5.
figure 5

(a) Confusion matrix and (b) ROC for training the classifier using orientation 1 data. Testing using data from orientation 2 and 3

Fig. 6.
figure 6

Performance curve for training the classifier using orientation 1 data. Testing using data from orientation 2 and 3

Training the classifier with data from orientation 2 (as opposed to orientations 1 or 3) and testing the network with the other orientations resulted in the highest classification accuracy, which is 69.1% on average.

These results can be compared to those obtained by using support vector machine (SVM) classifier with SVM parameters as C = 32 and γ = 0.0625 (Table 2).

Table 2. Different orientation analysis.

4 Concluding Remarks

Using TD-PSD as the feature extraction method, a comprehensive study of how varying muscle contraction levels and different forearm orientations together affect the EMG pattern recognition was conducted. It has been well established that the TD-PSD is a superior feature extraction method in comparison to other feature extraction methods. This paper has employed a new classifier, namely, Neural Network Classifier with scaled conjugate gradient back propagation training algorithm on MATLAB to improve the classification accuracy. Training this new classifier with data of six movement classes, each carried out with multiple muscular contraction levels and at various forearm orientations, by using TD-PSD features offers satisfactory classification accuracy and an improvement over Support Vector Machine classifier.

Maximum classification accuracy is obtained when the training data includes all forearm orientations. It is also observed that using training data from medium muscle contraction level provides the best accuracy when testing in comparison to low or high force levels.