A Systematic Review of Hidden Markov Models and Their Applications

Mor, Bhavya; Garhwal, Sunita; Kumar, Ajay

doi:10.1007/s11831-020-09422-4

Original Paper
Published: 12 May 2020

Volume 28, pages 1429–1448, (2021)
Archives of Computational Methods in Engineering

Bhavya Mor¹,
Sunita Garhwal¹ &
Ajay Kumar¹

Abstract

Hidden Markov models (HMMs) are statistical models used in many real-world applications and communities. Their use has become predominant in the last decades, as evidenced by a large number of published papers. In this survey, 146 papers (101 from journals and 45 from conferences/workshops) from 93 journals and 44 conferences/workshops are considered. The authors evaluate the literature based on the hidden Markov model variants that have been applied to various application fields. The paper presents a short but comprehensive description of research on the hidden Markov model and its variants for various applications, and shows the significant trends in this research.

1 Introduction

A series of papers in the late 1960s and early 1970s by Leonard E. Baum and other researchers introduced statistical methods for Markov sources and hidden Markov modeling [1]. Hidden Markov models (HMMs) have become popular in the last two decades owing to their flexibility. The mathematical structure of the HMM provides the theoretical basis for many real-world applications such as speech recognition, facial expression recognition, gene prediction, gesture recognition, musical composition and bioinformatics.

An HMM is a statistical model built on a Markov process with hidden states. Andrey Markov introduced the Markov model in the early 20th century. Later, in the late 1960s and early 1970s, Leonard E. Baum and other researchers introduced statistical methods for Markov sources and Markov modeling [1]. State transitions are the random changes of state of the Markov process in discrete time. A Markov model has the memoryless property, i.e. the transition from one state to another depends only on the present state [2]. In an HMM, the emitted symbols are observable, whereas the random transitions from one state to another remain unobserved. Ease of implementation and the ability to handle sequential data and variable-length inputs make HMMs applicable to many real-life problems.
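
The memoryless property described in this paragraph can be written compactly. Writing \(X_t\) for the state at discrete time t and \(s_1, \ldots , s_{t+1}\) for arbitrary states (notation assumed here for illustration), it reads:

\[P(X_{t+1}=s_{t+1} \mid X_t=s_t, X_{t-1}=s_{t-1}, \ldots , X_1=s_1) = P(X_{t+1}=s_{t+1} \mid X_t=s_t)\]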

1.1 Motivation

In the last five decades, various researchers have explored the HMM and its variants in various application domains. In the 1970s, HMMs were applied to speech recognition. Since 1980, HMMs have been extensively used in the domain of bioinformatics [3]. HMMs are further classified into First-Order HMM, Higher-Order HMM (HO-HMM), Hidden Semi-Markov Model (HSMM), Factorial HMM (FHMM), Second-Order HMM, Layered HMM (LHMM), Autoregressive HMM (AR-HMM), Non-Stationary HMM (NS-HMM) and Hierarchical HMM (HHMM), as depicted in Fig. 1. There is a need to consolidate the work done by various researchers in the area of HMMs.

1.2 Outline

This survey paper is structured as follows: Sect. 2 outlines the review process. Section 3 gives the preliminaries required for HMM. We review the work on first-order HMM, HO-HMM, HSMM and FHMM with their applications in Sects. 4.1, 4.2, 4.3 and 4.4, respectively. Applications of second-order HMM, LHMM, AR-HMM and NS-HMM in various domains are explored in Sects. 4.5, 4.6, 4.7 and 4.8, respectively. In Sect. 4.9, we lay out the various applications of HHMM and, finally, Sect. 5 summarizes the conclusions of the paper.

2 Review Process

2.1 Classification of Papers

In this review paper, we explored the applications of various types of HMM and categorized the papers based on several criteria. Table 1 presents the properties and categorization of the papers. The research questions in Table 2 helped in extracting all the essential information from the papers.

Table 1 Classification of papers

2.1.1 Distribution of Papers for HMM Variants (RQ1)

Figure 2 shows the number of papers reviewed for the nine different HMM variants. HSMM (29%) and first-order HMM (23%) are the most commonly used variants. The remaining variants are used almost equally, differing by only 1–2%. NS-HMM is used in only 3% of the reviewed papers.

Table 2 Research questions

2.1.2 Application Fields of HSMM (RQ2)

HSMM is mainly used for analyzing tool wear and in musicology. We considered eight and seven published papers in the areas of tool wear and musicology, respectively. In addition, HSMMs have been explored in the stock market, data analysis, speech recognition and network analysis, for which we considered two, three, three and three papers, respectively (Fig. 3).

2.1.3 HMMs for Speech Recognition (RQ3)

At present, HMM is the most successful and simplified approach for speech recognition. Figure 4 shows that first-order HMM is the variant most explored by researchers for speech recognition. As is evident from Fig. 4, researchers have published three papers using each of HO-HMM, FHMM, second-order HMM and AR-HMM in the area of speech recognition, whereas no paper using NS-HMM or HHMM has been published in this area.

2.1.4 Application Areas with HMMs (RQ4)

Figure 5 shows that HMMs are widely used in the areas of speech recognition (25% of papers) and human activity recognition (25% of papers). HMMs are also used in the areas of musicology (9% of papers), data processing (7% of papers) and network analysis (6% of papers).

3 Preliminaries

An HMM is a doubly stochastic finite model that defines a probability distribution over an infinite number of possible sequences [2]. It is used for studying the observed items of a discrete-time series. The states have assigned transition probabilities, and every state emits a symbol according to the emission probability of that state [5]. Figure 6 shows the underlying architecture of HMM.

Definition 1: An HMM [4] is defined by the quintuple \((S, O, A, B, \pi )\), where:

\(S=\{S_1,S_2,S_3,\ldots ,S_n\}\) is the set of hidden states.
\(O=\{o_1,o_2,\ldots ,o_m\}\) is the set of m observable symbols; \(O(t)\) denotes the symbol observed at time t.
\(A=\{a_{ij}\}\) is the state transition probability distribution, where \(a_{ij}=P(X_{t+1}=S_j \mid X_t=S_i)\), \(1 \le i, j \le n\), is the probability of moving from state i at time t to state j at time \(t+1\).
\(B=\{b_j(t)\}\) is the symbol emission probability distribution, where \(b_j(t)=P(O(t) \mid X_t=S_j)\), \(1 \le j \le n\), is the probability of emitting symbol O(t) from state j.
\(\pi =\{\pi _i\}\) is the initial state probability distribution, where \(\pi _i=P(X_1=S_i)\), \(1 \le i \le n\).
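
To make the quintuple concrete, the following is a minimal sketch of the standard forward recursion, which computes the probability of an observation sequence under a discrete HMM. The function name, the array layout (A as an n-by-n matrix, B as an n-by-m matrix indexed by state and symbol) and the toy numbers are assumptions made for illustration; they are not taken from the reviewed papers.

```python
import numpy as np

def forward_likelihood(A, B, pi, obs):
    """Return P(o_1, ..., o_T) for a discrete HMM (S, O, A, B, pi).

    A[i, j] : P(X_{t+1}=S_j | X_t=S_i), shape (n, n)
    B[j, k] : P(O(t)=o_k | X_t=S_j),    shape (n, m)
    pi[i]   : P(X_1=S_i),               shape (n,)
    obs     : observed symbol indices, each in range(m)
    """
    A = np.asarray(A, dtype=float)
    B = np.asarray(B, dtype=float)
    pi = np.asarray(pi, dtype=float)
    # alpha[j] = P(o_1..o_t, X_t=S_j); start with t = 1.
    alpha = pi * B[:, obs[0]]
    for o in obs[1:]:
        # Propagate one step through A, then weight by the emission of o.
        alpha = (alpha @ A) * B[:, o]
    return float(alpha.sum())

# Toy two-state, two-symbol model (illustrative numbers only).
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.9, 0.1], [0.2, 0.8]]
pi = [0.5, 0.5]
p = forward_likelihood(A, B, pi, [0, 1, 0])
```

The recursion costs on the order of n² operations per time step. For long sequences the unscaled product underflows, so practical implementations rescale alpha at each step or work in log space.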

4 Literature Survey

4.1 First-Order HMM

The basic HMM (discussed in Sect. 3) is referred to as the first-order HMM [6]. We summarize the use of first-order HMM in the areas of speech recognition, human action recognition and genome structure analysis.

Rabiner et al. [7] combined vector quantization with HMM to build a speaker-independent, isolated-word recognition system. Their system produced a higher accuracy rate for the word recognizer on a vocabulary of isolated digits. Levinson [8] recognized speaker-independent isolated digits using HMM and Linear Predictive Coding (LPC). Schwartz et al. [9] improved HMM for modeling phonemes in speech recognition by considering the trade-off between robustness and specificity. Rabiner [1] reviewed various aspects of HMM and applied it to speech recognition. Juang and Rabiner [10] applied HMM to speech recognition and observed an accuracy rate higher than 95% in speaker-independent tasks. Figure 7 shows various applications of first-order HMM.

Bahl et al. [11] described a method for maximum mutual information estimation of HMM parameters in speech recognition. Poritz [12] proposed a linear predictive HMM for analyzing speech signals. The method was further applied to talker verification. Rose and Paul [13] described a baseline keyword recognition system using HMM. Their system deals with the effects of linear channels and non-keyword speech. Lee and Hon [14] applied HMM to speaker-independent phone recognition. They improved the accuracy using multiple codebooks of LPC parameters and Viterbi decoding. Juang [15] used HMM and dynamic time warping techniques for speech recognition. Varga and Moore [16] improved speech recognition through signal decomposition using HMM. They recognized concurrent events simultaneously for stationary and non-stationary noises.

Sonnhammer et al. [17] predicted the location and orientation of transmembrane helices in protein sequences. Churchill [18] studied the structure of a human genome segment and explored the correlation between discrete compositional domains and genome function. Soruri et al. [19] introduced a novel gene clustering approach using HMM and optimized it using the particle swarm optimization algorithm. They described a specific HMM for each gene sequence and evaluated the probabilities for every individual sequence. Yamato et al. [5] proposed a method using HMM and a feature-based bottom-up approach for human action recognition from a set of time-sequential images. Krogh [20] introduced HMM for labeled observations and developed a maximum likelihood method for estimating the parameters of the model.

Manogaran et al. [21] used Bayesian HMM with Gaussian mixture clustering for cancer diagnosis. They proposed a machine learning approach to model DNA copy number change in genome structure. Xin et al. [22] introduced a semi-automated diagnosis method that handles fault detection, identification and extraction at the same time. Yao et al. [23] proposed an HMM-based routing method for Vehicular Ad hoc Networks (VANETs). Their hybrid scheme predicted a vehicle's future path based on the history of its mobility patterns. Petersen et al. [24] modeled sepsis progression with HMM to study patient heterogeneity. The model extracts a patient's physiological trajectory to identify patients at higher risk. Tang and Dong [25] detected malicious domain names using an improved HMM in a Spark environment. Zhuo et al. [26] used profile HMM for website fingerprinting attacks on anonymous networks. The proposed approach identified websites and webpages in the closed-world setting. Putland et al. [27] detected underwater biophonic sounds using HMM. Their approach effectively detected Bryde's whale vocalizations irrespective of their duration and of conflicting vessel passage sounds. Habayeb et al. [28] proposed HMM for identifying the time to fix bug reports. The approach gave software quality teams an early indication of forecasted bug reports.

Ullah et al. [29] designed an HMM-based algorithm for predicting energy consumption in smart buildings. They validated their model using real data collected from a few selected buildings in South Korea. Yip et al. [30] modeled HMM for predicting earthquakes and introduced a latent Markov process for explaining the underground dynamics. Their model also predicts the magnitude and arrival time of further earthquakes. Pastell and Frondelius [31] developed HMM for calculating the time spent by dairy cows at the feed bunk using an ultra-wideband indoor positioning system. They further showed that the performance of their model could be improved by using the Viterbi algorithm with logistic regression. Alshamaa et al. [32] designed an HMM-based mobility model for tracking older people, which helps in determining their trajectories in an indoor environment. Liu et al. [33] predicted driver intention for autonomous vehicles using HMM. They trained and tested their model on real data from a flyover.

Jiang et al. [34] introduced a dynamic fault prediction model based on HMM by analysing dissolved gas. Using Jiang et al.'s model, preventive action can be taken to maintain power transformers. Lu et al. [35] proposed a data mining approach based on HMM. Xu et al. [36] applied HMM with Eskin's probabilistic detection algorithm for detecting low-carbon anomalies and abuse of resources, which helps the green technology innovation ecosystem. Joo et al. [37] developed an adaptive approach for estimating batch size with HMM. The adaptive model could capture changes in the process deduced from analyzing product quality data. Coast et al. [38] detected cardiac arrhythmia using HMM with statistical knowledge of ECG signals. They calculated the parameters using the maximum likelihood re-estimation algorithm. Yang et al. [39] applied HMM with vector quantization to recognize speaker-independent lexical tones in Mandarin speech. They showed that the recognition of speaker-independent tones requires pitch-base adjustment. Table 3 presents the classification of papers related to first-order HMM.

Table 3 Classification of first-order HMM papers

4.2 Higher-Order HMM

HO-HMM generalizes the first-order HMM by extending the dependency from the single previous state to the n previous states (Fig. 8). Both the transition and the observation probability distributions depend on several previous states [40]. A kth-order HO-HMM is an HMM that considers the values of the hidden chain up to lag k [41].
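
One standard way to reason about a kth-order chain is to rewrite it as a first-order chain whose states are k-tuples of the original states. The sketch below illustrates that state-augmentation construction for the transition part only. The function name, the tensor layout and the numbers are illustrative assumptions, and the cited HO-HMM papers may use different formulations.

```python
import itertools
import numpy as np

def expand_to_first_order(T):
    """Turn a k-th order transition tensor into a first-order matrix.

    T has k + 1 axes of size n. T[s_1, ..., s_k, s_next] is the probability
    of moving to s_next given the last k states (s_k is the most recent).
    Returns (tuples, M): the list of k-tuples and an (n**k, n**k) matrix M
    with M[h, h'] > 0 only if h' is h shifted left by one with s_next appended.
    """
    T = np.asarray(T, dtype=float)
    k = T.ndim - 1
    n = T.shape[0]
    tuples = list(itertools.product(range(n), repeat=k))
    index = {h: i for i, h in enumerate(tuples)}
    M = np.zeros((len(tuples), len(tuples)))
    for h in tuples:
        for s_next in range(n):
            new_h = h[1:] + (s_next,)  # drop the oldest state, append the new one
            M[index[h], index[new_h]] = T[h + (s_next,)]
    return tuples, M

# Second-order example with two states (illustrative numbers only).
T2 = np.array([[[0.9, 0.1], [0.5, 0.5]],
               [[0.3, 0.7], [0.2, 0.8]]])
tuples, M = expand_to_first_order(T2)
```

The augmented chain has n to the power k states, which is why the cost of higher-order models grows quickly with the order.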

Xiong and Mamon [41] introduced a self-updating model for the evolution of daily average temperature using HO-HMM. They further analysed weather derivatives using their model. Zhu et al. [42] discussed the asset allocation problem using HO-HMM. They studied optimal portfolio selection using long-term memories of varying hidden economic conditions and optimal asset collection. Lee and Jean [40] modeled piece-wise linear processes with HO-HMM. Their model better approximates the behaviour of real processes and reduced the error rate in speech recognition of noisy Mandarin digits. Quan and Ren [43] recognized the most likely sequence of emotions in text using weighted HO-HMM. Seifert et al. [44] applied parsimonious HO-HMM to the analysis of array-based comparative genomic hybridization data. The model enabled interpolation between a mixture model and HO-HMM for detecting DNA polymorphism in closely related genomes.

Lee and Lee [45] applied HO-HMM to capture the dynamics and duration of speech signals. Their approach is robust against noise and carries out speech recognition with reduced error rates. Xiong et al. [46] applied HO-HMM to the behavioural analysis of car ownership. Zhang et al. [47] presented a high-accuracy, low-risk approach for predicting trends in stock market prices using HO-HMM. Chen and Qiu [48] proposed an approach for predicting the channel state of cognitive radio using HO-HMM. The approach was based on spectrum sensing slots to reduce the effect of latency between spectrum sensing. Figure 9 shows the application areas of higher-order HMM.

4.3 Hidden Semi-Markov Model

HSMMs provide a way to deal explicitly with state durations. In an HSMM, the underlying process of the hidden states is a semi-Markov chain (Fig. 10). The process remains in a hidden state for a duration d, during which that state emits d observations [49]. The probability of going from one hidden state to another depends on the time elapsed since entering the current state [50]. HSMM is also known as explicit duration HMM (DHMM) or variable-length HMM (VLHMM).
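
The explicit-duration mechanism described above can be sketched as a generative procedure: pick a state, draw a duration d for it, emit d observations, then move to a different state. This is a minimal illustration under assumed conventions (a tabulated duration distribution D, discrete emissions, no self-transitions in A); all names and numbers are hypothetical.

```python
import numpy as np

def sample_hsmm(A, B, pi, D, T, rng):
    """Sample T steps from an explicit-duration HSMM.

    A[i, j]   : probability of moving from state i to state j when the
                duration in i ends (zero diagonal assumed)
    B[i, k]   : probability that state i emits symbol k
    pi[i]     : initial state probability
    D[i, d-1] : probability that a visit to state i lasts d steps
    """
    n, m = B.shape
    states, obs = [], []
    s = rng.choice(n, p=pi)
    while len(obs) < T:
        d = 1 + rng.choice(D.shape[1], p=D[s])  # duration of this visit
        for _ in range(d):
            states.append(int(s))
            obs.append(int(rng.choice(m, p=B[s])))
        s = rng.choice(n, p=A[s])               # leave the state afterwards
    return np.array(states[:T]), np.array(obs[:T])

# Deterministic toy model: state 0 always lasts 2 steps, state 1 lasts 3.
A = np.array([[0.0, 1.0], [1.0, 0.0]])
B = np.array([[1.0, 0.0], [0.0, 1.0]])
pi = np.array([1.0, 0.0])
D = np.array([[0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
states, obs = sample_hsmm(A, B, pi, D, 10, np.random.default_rng(0))
```

With these degenerate distributions the sampled state sequence alternates between runs of two 0s and three 1s, which makes the role of the duration distribution easy to see. A first-order HMM can only produce geometrically distributed run lengths.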

Narimatsu and Kasai [52] proposed two extended models (Interval state HSMM and Interval length probability HSMM) for analysing sequential data. These models support the representation of state intervals and state durations. Zhu and Liu [53] monitored tool wear online using duration-dependent HSMM. Liu et al. [54] applied duration-dependent HSMM to diagnose the equipment degradation process. Li et al. [55] applied an optimal Bayesian control scheme based on a three-state continuous-time hidden semi-Markov process for early detection of gear shaft faults. Liu and Wang [56] decoded the time-varying distribution of Chinese stock market returns using a three-state HSMM.

Xiao et al. [57] proposed a duration-dependent HSMM for analyzing machine health online. The analysis is useful for predicting the remaining useful life of the machine. Kong et al. [58] estimated tool wear in the mining process with HSMM. Their straightforward model provides a higher accuracy rate. Wu et al. [59] presented a lightweight, real-time method for monitoring machine condition in fused deposition modeling. The method used HSMM with acoustic emission to improve product quality and the reliability of the printing process. Pertsinidou et al. [60] studied the application of HSMM to the assessment of seismic hazard in Greece. They used a simplified novel Viterbi algorithm for detecting precursory phases and provided warnings of anticipated earthquake occurrences.

Bang et al. [61] designed an HSMM-based scheme for detecting anomalies caused by network-initiated LTE signaling attacks in wireless sensor networks. The proposed scheme captured both the temporal and spatial characteristics of normal nodes. Tanwani and Calinon [62] investigated semi-tied HSMM for learning robot manipulation tasks. Cai et al. [63] applied HSMM to the analysis of application-layer network protocols. They modeled the protocol message format to maximize the likelihood of keyword selection and message segmentation. The HSMM of Galvez et al. [64] can be applied to the generation and analysis of processes. Liu et al. [65] proposed a novel method for multi-sensor monitoring of equipment health.

Xiao and Dong [66] designed an HSMM-based reputation management system for the online-to-offline e-commerce market. They demonstrated the usefulness of the model in a real-life application. Yue et al. [67] proposed a logical hierarchical HSMM for recognizing the intention of each team member, the team intention and the working mode. Altuve et al. [68] introduced an online system for detecting apnea-bradycardia along with its temporal evolution using HSMM. Votsi et al. [69] modeled HSMM for estimating the occurrence rate of earthquakes. The application of HSMM in seismology was studied to identify features of the earthquake generation process. Figure 11 shows various applications of the hidden semi-Markov model.

Du et al. [70] performed genomic segmentation using HSMM. The model was designed as a general segmentation engine with better sensitivity and specificity in genomic segmentation. Xu et al. [71] proposed a method for identifying user click patterns using HSMM. They further proposed a state selection algorithm and evaluated their results on a real data set from a state Telecom. Liu et al. [72] trained HSMM in a max-margin learning framework for the segmentation of mitosis events. The segmentation was performed on time-lapse phase-contrast microscopy image sequences of stem cell populations. Boussemart and Cummings [49] presented a methodology for learning HSMM in a human supervisory control setting. Dong and Peng [73] applied non-stationary segmental HSMM to predicting equipment health and maintenance. Liang et al. [74] introduced a noise-robust voice activity detector using HSMM. They considered issues of feature distributions, temporal dependence and speech features related to noise robustness. Xie et al. [75] proposed a forward-backward algorithm for nested HSMM and applied it to a network traffic model.

Kerk et al. [76] applied HSMM to Global Positioning System (GPS) location data to reveal the multiphasic movement of endangered Florida panthers. Duan et al. [77] used HSMM for detecting faults and predicting the remaining useful life of computer numerically controlled equipment. Chen et al. [78] developed an audio chord recognition system using DHMM. They explicitly considered chord duration during recognition. Karg et al. [79] performed clinical gait analysis with DHMMs. They modeled the time series data of a group and applied a reference-based measure to compare the observations. Benouareth et al. [80] designed a recognition system for off-line handwritten Arabic words using an explicit state duration semi-continuous HMM and a segmentation-free approach.

Benetos and Weyde [81] used pitch-wise DHMM for the transcription of multiple-instrument polyphonic music. It could be useful for modeling the tone durations and temporal evolution present in musical patterns. Yue et al. [82] presented a DHMM-based prognostics and diagnostics method for evaluating the residual life distribution in face milling. Calinon et al. [83] applied DHMM to encode information about time and position constraints in robot learning. Chordia et al. [84] modeled north Indian tabla sequences with a variable-length Markov model (VLMM) and VLHMM. The model could determine the next stroke from an audio file of tabla sequences. Senturk [85] performed computational modeling of improvised Turkish folk music with VLMM and predicted the melody of the music. Senturk and Chordia [86] designed a VLHMM for predicting melodies in musical structures. They generated melodic improvisations for Turkish folk music. Pikrakis et al. [87] classified musical patterns from raw data using variable duration HMM. Dumont [88] statistically analyzed VLHMM and proposed an algorithm for finding a consistent estimator of the context tree.

Chen et al. [89] proposed a system for recognizing off-line handwritten words. Their approach was based on continuous density VLHMM and morphological segmentation for recognition. Liang et al. [90] applied VLHMM for analyzing human behaviour. The model consists of labeling posture and learning-recognizing atomic human action modules. Cao et al. [91] proposed an approach for context-aware search using VLHMM. Various contexts of queries could be captured from the search session of log data. Bernard et al. [92] recognized Arabic isolated handwritten words using context-dependent and VLHMM.

4.4 Factorial HMM

FHMM is an extended HMM that allows the modeling of several loosely coupled random processes. It has a multi-layer state structure with improved representational capacity [93]. Each FHMM layer can be considered an HMM, and each layer works independently of the other layers. The output of the FHMM at a given time depends only on the current states of all the layers at that time [94] (Fig. 12).
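
Because the layers evolve independently, an FHMM can in principle be flattened into a single HMM over joint states whose transition matrix is the Kronecker product of the per-layer matrices. The sketch below shows that equivalence; the function name and the numbers are illustrative assumptions, and the reviewed FHMM papers generally avoid this flattening because the joint state space grows multiplicatively.

```python
import numpy as np

def joint_transition(layer_transitions):
    """Kronecker product of per-layer transition matrices.

    With two layers of sizes n1 and n2, joint state (i1, i2) is stored at
    index i1 * n2 + i2, and the joint transition probability is
    A1[i1, j1] * A2[i2, j2] because the layers move independently.
    """
    joint = np.ones((1, 1))
    for A in layer_transitions:
        joint = np.kron(joint, np.asarray(A, dtype=float))
    return joint

# Two independent two-state layers (illustrative numbers only).
A1 = [[0.9, 0.1], [0.2, 0.8]]
A2 = [[0.5, 0.5], [0.3, 0.7]]
J = joint_transition([A1, A2])
```

Here J is a 4-by-4 matrix; with L layers of n states each the flattened model has n to the power L states, which is the representational advantage of the factorial form.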

Ozerov et al. [96] designed the Factorial Scaled HMM (FSHMM) for representing polyphonic audio music files. FSHMM is a generalization of the Gaussian scaled mixture model and the Itakura-Saito Non-negative Matrix Factorization model. Bonfigli et al. [97] proposed a non-intrusive appliance monitoring algorithm using the active and reactive power in an additive Factorial HMM. Their algorithm helps users modify their habits to save electrical energy. Khorasani et al. [98] recognized amyotrophic lateral sclerosis (ALS) patients using FHMM. FHMM distinguishes ALS patients from healthy subjects by removing unwanted data from the stride interval time and extracting the useful data. Chen et al. [93] recognized gait features with FHMM and Parallel HMM (PHMM). FHMM and PHMM were introduced as a feature-level fusion scheme and a decision-level fusion scheme, respectively, for combining gait features. The applications of Factorial HMM are shown in Fig. 13.

Betkowska et al. [94] performed robust speech recognition for the home environment by applying the FHMM architecture. They recognized speech in the presence of sudden non-stationary noises. Li et al. [99] recognized faults using independent component analysis (ICA) and FHMM. ICA reduced redundancy and extracted features from multi-channel detection, and FHMM recognized the faults in the speed-up and speed-down processes of rotating machinery. Husmeier [100] detected mosaic structures in DNA sequences using a phylogenetic tree and FHMM. The model discriminated between rate heterogeneity and inter-specific recombination in the DNA sequence alignment. Durrieu and Thiran [101] proposed FHMM with a source/filter model to achieve robust pitch and formant tracking in speech processing. Kolter and Jaakkola [102] worked on the approximate inference problem in additive FHMM. Table 4 presents the major research findings on FHMM and its applications.

Table 4 Major Research findings of FHMM and its applications

4.5 Second-Order HMM

In a second-order HMM, the transition probability of a state at any time depends on the two previous states. The state sequence follows a second-order Markov chain. The state duration of these models is estimated from the probability of entering a state only once and the probability of visiting a state at least twice [103]. Figure 14 shows different applications of second-order HMM.

Hyun et al. [104] designed a log-Viterbi algorithm for recognizing human activities in smart homes with increased accuracy and decreased time complexity using second-order HMM. Kabir et al. [105] also recognized human activity in the home environment using two-layer HMM. One layer contains the location information, whereas the second layer contains the object information. Their model also mapped low-level sensor data to high-level activities based on binary sensor data. Zhou et al. [106] used a two-stage HMM for detecting biomarkers. They modeled HMM with the local false discovery rate (FDR) for detecting significant associations in microbiome research for practical analysis. Liang et al. [107] presented a system to filter and classify ECG signals in a free-living environment using two-layer HMM. Othman and Aboulnasr [108] applied second-order HMM to face recognition. The model used a non-overlap strategy to reduce the computational load. Wu et al. [109] proposed a two-layered HMM for human action recognition by decomposing the problem into two layers. The first layer modeled the actions of the two arms, whereas the second layer modeled the relation between the two arms. Zhang et al. [110] modeled the actions of individuals and groups in a meeting using two-layered HMM. The first layer mapped low-level features to individual actions, and the second layer took its input from the first layer to recognize group actions.

Mari et al. [111] showed that second-order HMM could yield high-performance forward and phoneme-based speech recognition task. Thede and Harper [112] used second-order HMM for tagging part-of-speech using lexical and contextual probabilities. Wei et al. [113] proposed a model for monitoring daily activities using body sensor network with two-layered HMM. The lower-layer HMM processed sensory data locally to decrease data transmission and the top-layer extracted the sequence of activity from locally processed data.

4.6 Layered HMM

In LHMM, several composed HMMs at each layer run in parallel with each other. Each layer provides its output to the next higher layer. To enable fast re-training, these models are trained layer by layer [114]. Each layer is connected to the next layer through its inferential results [115].
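
The layer-to-layer hand-off can be illustrated with a simplified two-layer sketch: a bank of lower-layer HMMs scores each window of raw observations, the index of the best-scoring model becomes that window's label, and the label sequence is the observation sequence of the upper layer. Passing hard labels, the fixed non-overlapping windows, and all names and numbers below are assumptions made for illustration, not the design of any cited system.

```python
import numpy as np

def log_likelihood(A, B, pi, obs):
    """Scaled forward algorithm: log P(obs) under a discrete HMM."""
    alpha = pi * B[:, obs[0]]
    log_p = 0.0
    for o in obs[1:]:
        c = alpha.sum()                      # rescale to avoid underflow
        log_p += np.log(c)
        alpha = ((alpha / c) @ A) * B[:, o]
    return log_p + np.log(alpha.sum())

def lower_layer_labels(models, obs, window):
    """Label each non-overlapping window with the best-scoring HMM index."""
    labels = []
    for start in range(0, len(obs) - window + 1, window):
        chunk = obs[start:start + window]
        scores = [log_likelihood(A, B, pi, chunk) for (A, B, pi) in models]
        labels.append(int(np.argmax(scores)))
    return labels

# Lower layer: two toy HMMs, one favouring symbol 0 and one favouring symbol 1.
A = np.array([[0.8, 0.2], [0.2, 0.8]])
pi = np.array([0.5, 0.5])
models = [
    (A, np.array([[0.9, 0.1], [0.8, 0.2]]), pi),
    (A, np.array([[0.1, 0.9], [0.2, 0.8]]), pi),
]
obs = [0, 0, 0, 1, 1, 1, 0, 0, 0]
labels = lower_layer_labels(models, obs, window=3)

# Upper layer: an HMM whose observations are the lower-layer labels.
B_upper = np.array([[0.7, 0.3], [0.3, 0.7]])
upper_score = log_likelihood(A, B_upper, pi, labels)
```

Because the upper layer sees only the label stream, the lower-layer models can be retrained without touching the upper layer, which matches the layer-by-layer training described above.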

Lee and Cho [116] applied LHMM to recognizing long-term and short-term activities with built-in mobile sensors on the Android platform. The LHMM could model temporal patterns using multi-dimensional data. Razin et al. [117] learned the characteristics of a human operator's performance from surface electromyography to predict the operator's intentions in task operations using LHMM. Glodek et al. [114] applied LHMM to recognizing human activities based on a multitude of modalities. The model detected complex activities from a stream of class assignments provided by the classifiers of the previous layer. Glodek et al. [118] improved human activity recognition by incorporating the uncertainty of the class decision. Aarno and Kragic [119] modeled human skills using LHMM with greater discriminating power. They modeled the complex task of motion intention recognition even with misclassifications present in the layers. Oliver et al. [120] represented human activity from real-time streams of video, acoustic and computer-interaction data with LHMM. The applications of Layered HMM are shown in Fig. 15.

Oliver et al. [115] recognized user activity in an office environment with a multimodal, real-time approach using LHMM. The layered representation enabled the learning of human office activity from multiple sensory channels. Barnard and Odobez [121] used LHMM with unsupervised low-level clustering to recognize events in sports videos. Zhang et al. [122] proposed a cross-layered HMM (CLHMM) for recognizing surveillance events. The cross-layer design reduced computational complexity and increased the accuracy rate. Runsewe and Samaan [123] proposed a layered multi-dimensional HMM for cloud resource scaling in big data streaming applications. Solaimanpour and Doshi [124] used LHMM with a Monte Carlo algorithm to predict the motion of a robot online. The predicted motions enabled updating a nested track that could follow other robots in a known environment. Ingels [125] recognized connected text with LHMM and token passing. A robust tokenizer was implemented to recover from segmentation and lexical errors in the text input. Perdikis et al. [126] also recognized the inherent characteristics of human actions with LHMM. The first layer of the model detected short, primitive motions, and the upper layer processed them to recognize human actions.

4.7 Autoregressive HMM

AR-HMMs can capture temporal structures in time series data. In an AR model, the current observation \(x^t\) is a linear combination of the p previous observations \(x^{t-p},\ldots ,x^{t-2},x^{t-1}\) [127]. AR-HMM can explicitly model the longer-range correlations of sequential data by adding direct stochastic dependence among observations [128] (Fig. 16).
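
The autoregressive dependence can be sketched as follows: the hidden state selects a set of AR coefficients, and the next observation is the corresponding linear combination of the p previous observations plus noise. The Gaussian noise term, the per-state coefficient table and all names and numbers are assumptions made for illustration.

```python
import numpy as np

def ar_predict(history, coeffs):
    """Return sum_{i=1..p} coeffs[i-1] * x^{t-i} for the given history."""
    p = len(coeffs)
    recent = np.asarray(history[-p:], dtype=float)[::-1]  # x^{t-1}, ..., x^{t-p}
    return float(np.dot(coeffs, recent))

def sample_ar_hmm(A, pi, coeffs_by_state, noise_std, x_init, T, rng):
    """Sample T observations; x_init must hold at least p starting values."""
    n = len(pi)
    xs = list(x_init)
    states = []
    s = rng.choice(n, p=pi)
    for _ in range(T):
        states.append(int(s))
        mean = ar_predict(xs, coeffs_by_state[s])   # state-dependent AR mean
        xs.append(mean + rng.normal(0.0, noise_std))
        s = rng.choice(n, p=A[s])
    return states, xs[len(x_init):]

# Noise-free toy model with p = 1: state 0 copies the last value,
# state 1 flips its sign, and the states alternate deterministically.
A = [[0.0, 1.0], [1.0, 0.0]]
pi = [1.0, 0.0]
coeffs_by_state = [[1.0], [-1.0]]
states, xs = sample_ar_hmm(A, pi, coeffs_by_state, 0.0, [2.0], 4,
                           np.random.default_rng(0))
```

Setting noise_std to zero makes the example deterministic so the effect of the state-dependent coefficients is visible; a fitted model would use a positive noise level.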

Stanculescu et al. [128] designed an AR-HMM for the early detection of neonatal sepsis. They modeled the distribution of the observed physiological events of patients with AR-HMM. Dang et al. [130] proposed an AR-HMM for learning effective connectivity (EC) between brain regions from fMRI signals. They modeled unobserved fMRI data and neuronal activity lost over time. Zhao et al. [131] proposed an order self-learning AR-HMM for online outlier detection in the grade analysis process of geological minerals. The model did not set any detection threshold and applied detection-before-update and detection-based update strategies to avoid the influence of outliers. Malesevic et al. [132] presented a computational technique for controlling a multifunctional artificial hand with multichannel surface electromyography (EMG). A vector AR-HMM was used to decode the movement of each individual finger from the surface EMG signals. Figure 17 shows applications of Autoregressive HMM.

Seifert et al. [133] exploited local dependencies in local chromosomes to identify tumour genes using higher-order AR-HMM. Nakamura et al. [134] modeled symbolic music performances with AR-HSMM. The model had better computational time and accuracy than HMMs. Barber et al. [135] used AR-HMM in the wind power industry to model short-horizon wind forecasting. AR-HMM with some approximate inference methods could be used in missing-data situations. Sasou et al. [136] applied AR-HMM to extract features from singing voices. The model estimated the characteristics of the articulatory system and signals from high-pitched voices. Quillen [137] used AR-HMM for synthesizing speech. The model enhanced the stability of the estimated predictor coefficients.

Ai et al. [138] investigated the use of AR-HMM to estimate occupancy in smart buildings. They calculated the total number of occupants in a research laboratory of a building using a deployed wireless sensor network. Guan et al. [127] recognized activities from time-series data with AR-HMM. They proposed a graphical model that could predict instance and bag labels using a tractable inference algorithm. Dong [139] diagnosed equipment health with an AR-HSMM that combined temporal knowledge and shape information. Bryan and Levinson [140] proposed an AR-HMM-based approach for inferring linguistic structure in the speech signal.

4.8 Non-stationary HMM

NS-HMM was introduced to capture state duration behaviour by defining a set of dynamic transition probability parameters. It can model state duration probabilities explicitly as a function of time. In the transition process, the time already spent in a state is used for estimating the probability of the next transition. NS-HMM is a generalized version of the state duration model and Baum–Welch algorithm [141]. The applications of non-stationary HMM are shown in Fig. 18.
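A duration-dependent transition rule of this kind can be sketched as follows. This is an illustration under stated assumptions, not the parameterization of [141]: the names `sample_ns_chain` and `capped_stay` are hypothetical, and the two-state rule, which forces the chain to leave a state after three consecutive steps, is invented purely for demonstration.

```python
import random

def sample_ns_chain(trans_at, n_states, length, seed=0):
    """Sample a hidden-state path whose transition row depends on
    how long the chain has already stayed in the current state."""
    rng = random.Random(seed)
    state, duration = 0, 1  # assumption: start in state 0 with duration 1
    path = []
    for _ in range(length):
        path.append(state)
        # The transition probabilities are a function of (state, duration),
        # not of the state alone as in a stationary HMM.
        weights = trans_at(state, duration)
        nxt = rng.choices(range(n_states), weights=weights)[0]
        duration = duration + 1 if nxt == state else 1
        state = nxt
    return path

def capped_stay(state, duration, max_stay=3):
    """Hypothetical two-state rule: stay with probability 0.8 until the
    duration reaches max_stay, after which the chain must leave."""
    stay = 0.8 if duration < max_stay else 0.0
    row = [0.0, 0.0]
    row[state] = stay
    row[1 - state] = 1.0 - stay
    return row

path = sample_ns_chain(capped_stay, n_states=2, length=200)
```

Because the self-transition probability drops to zero once the duration reaches `max_stay`, no run of identical states in `path` can be longer than three steps, which a stationary transition matrix with a non-zero self-transition probability cannot guarantee.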

Chen et al. [142] used NS-HMM for predicting spectrum occupancies. The model captured the time-varying stochastic behaviour of a primary user and estimated parameters by using a variant of the Baum–Welch algorithm. Chatzis and Demiris [143] worked on the modeling of sequential data with NS-HMM. Lin and Tseng [144] modeled the fading properties of mobile satellite link channels using NS-HMM and predicted the characteristics of the satellite-to-earth channel. Hui et al. [145] studied the principles of NS-HMM and applied it to POS tagging and pinyin-to-character conversion.

4.9 Hierarchical HMM

HHMM is a stochastic process with multi-level states that describe an input sequence at various levels of granularity. It is an HMM whose internal states are themselves generated from sub-HMMs. HHMM has a tree-like structure where the nodes of the tree are states of the model, and the tree's edges define their transitions. The states of HHMM emit sequences by the repeated activation of any of the sub-states of a state [146] (Fig. 19).
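The recursive activation of sub-states can be made concrete with a small sketch. It is a simplified illustration rather than the formulation of [146]: the `State` class, the `generate` function, and the use of an extra last column in each transition row to signal that control returns to the parent are all assumptions of this example.

```python
import random

class State:
    """A node of the HHMM tree: a leaf emits one symbol, an internal
    node owns a sub-HMM over its children."""
    def __init__(self, name, emissions=None, children=None, start=None, trans=None):
        self.name = name
        self.emissions = emissions or {}   # leaf only: symbol -> probability
        self.children = children or []     # internal only: list of sub-states
        self.start = start or []           # probability of entering each child
        self.trans = trans or []           # one row per child; last column = end

def generate(state, rng):
    """Emit a sequence by recursively activating sub-states."""
    if not state.children:
        symbols = list(state.emissions)
        weights = [state.emissions[s] for s in symbols]
        return [rng.choices(symbols, weights=weights)[0]]
    end = len(state.children)              # index that stands for "return to parent"
    out = []
    idx = rng.choices(range(end), weights=state.start)[0]
    while idx != end:
        out.extend(generate(state.children[idx], rng))
        idx = rng.choices(range(end + 1), weights=state.trans[idx])[0]
    return out

# Toy two-level model with deterministic probabilities (hypothetical values).
leaf_a = State("a", emissions={"a": 1.0})
leaf_b = State("b", emissions={"b": 1.0})
leaf_c = State("c", emissions={"c": 1.0})
inner = State("ab", children=[leaf_a, leaf_b], start=[1.0, 0.0],
              trans=[[0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
root = State("root", children=[inner, leaf_c], start=[1.0, 0.0],
             trans=[[0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
sequence = generate(root, random.Random(0))
```

In this toy model the root first activates the internal state `ab`, which emits through its own two leaves before handing control back, and only then moves on to the leaf `c`. Replacing the deterministic rows with genuine probabilities yields sequences of varying length at each level of the tree.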

Fine et al. [146] introduced HHMM in 1998 and modeled natural English text with HHMMs. They also applied HHMM for identifying the repeated strokes that represent letters in cursive handwriting. Kerr [148] designed HHMM for analyzing melodic structures. The analyzed structures could be used in music compositions. Weiland et al. [149] extracted musical pitch structures representing musical patterns using HHMM. Hoffman et al. [150] explored the application of Hierarchical Dirichlet Process HMM (HDP-HMM) for generating data-driven music. The models were trained with multiple songs and produced output from different hybrid inputs.

Patel et al. [151] used multi-level HHMM to deduce users' manipulative activities. The probabilistic algorithm was used to learn and grasp complex manipulation activities of humans in everyday life. Martindale et al. [152] performed smart annotation of cyclic data with HHMM and reduced the cost of labeling sensor-based data. Marco et al. [153] presented an HHMM for systematic annotation of chromatin states at different length scales. The model investigated the role of higher-order chromatin structure in gene regulation. Chen et al. [154] performed a single-molecule protein transportation experiment with HHMM. Raman and Maybank [155] used non-parametric HHMM for human activity recognition. The model enabled automatic inference of all states and facilitated information with semi-supervised learning. Figure 20 represents applications of the hierarchical hidden Markov model.

Karaman et al. [156] proposed HHMM to detect daily living activities in videos collected from a wearable camera. Patients wore the camera for the study of dementia. Table 5 represents the major findings of HHMM in various applications.

Table 5 Major findings of HHMM in various applications


5 Conclusion

HMMs were introduced in the late 1960s, but the basic theory of Markov chains had been known to mathematicians for around 80 years. HMM was first applied to the problems of speech recognition in the mid-1970s. In the 1980s, many researchers began to use HMMs in various fields like bioinformatics, musicology, gesture recognition, trend analysis, data analysis and many more. Work done by various researchers with HMM variants for different application fields is reviewed in this paper. The paper provides an overview of HMM variants and their application areas. Much work has been done with various HMMs for many application fields, but the use of HMMs in many new application fields is yet to be explored. To the best of the authors' knowledge, this is the very first attempt to compile the research performed with different types of HMMs.

References

Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77(2):257–286
Alghamdi R (2016) Hidden Markov models (HMMs) and security applications. Int J Adv Comput Sci Appl 7(2):39–47
Hidden Markov model. https://en.wikipedia.org/wiki/Hidden_Markov_model. Accessed 5 May 2019
Liu J, Zhu L, Wang Y, Liang X, Hyyppa J, Chu T, Liu K, Chen R (2015) Reciprocal estimation of pedestrian location and motion state toward a smartphone geo-context computing solution. Micromachines 6(6):699–717
Yamato J, Ohya J, Ishii K (1992) Recognizing human action in time-sequential images using hidden Markov model. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition, Champaign, Illinois, USA, 15–18 June, pp 379–385
Yanchenko AK (2017) Classical music composition using hidden Markov models. Master’s Thesis, Duke University, Durham, North Carolina, 1st edn
Rabiner LR, Levinson SE, Sondhi MM (1983) On the application of vector quantization and hidden Markov models to speaker-independent, isolated word recognition. Bell Syst Tech J 62(4):1075–1105
Levinson SE, Rabiner LR, Sondhi MM (1983) Speaker independent isolated digit recognition using hidden Markov models. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP), Boston, Massachusetts, USA, 14–16 April, pp 1049–1052
Schwartz R, Chow Y, Roucos S, Krasner M, Makhoul J (1984) Improved hidden Markov modeling of phonemes for continuous speech recognition. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP’84), San Diego, California, USA, 19–21 March, pp 21–24
Juang BH, Rabiner LR (1991) Hidden Markov models for speech recognition. Technometrics 33(3):251–272
Bahl LR, Brown PF, De Souza PV, Mercer RL (1986) Maximum mutual information estimation of hidden Markov model parameters for speech recognition. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP’86), Tokyo, Japan, 7–11 April, pp 49–52
Poritz A (1982) Linear predictive hidden Markov models and the speech signal. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP’82), Paris, France, 3–5 May, pp 1291–1294
Rose RC, Paul DB (1990) A hidden Markov model based keyword recognition system. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP), Albuquerque, New Mexico, USA, 3–6 April, pp 129–132
Lee KF, Hon HW (1989) Speaker-independent phone recognition using hidden Markov models. IEEE Trans Acoust Speech Signal Process 37(11):1641–1648
Juang BH (1984) On the hidden Markov model and dynamic time warping for speech recognition—a unified view. AT&T Bell Lab Tech J 63(7):1213–1243
Varga AP, Moore RK (1990) Hidden Markov model decomposition of speech and noise. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP), Albuquerque, New Mexico, USA, 3–6 April, pp 845–848
Sonnhammer EL, Heijne GV, Krogh A (1998) A hidden Markov model for predicting transmembrane helices in protein sequences. In: Proceedings of 6th international conference on intelligent systems for molecular biology (ISMB), Montreal, Canada, 28 June–1 July, pp 175–182
Churchill GA (1992) Hidden Markov chains and the analysis of genome structure. Comput Chem 16(2):107–115
Soruri M, Zahiri SH, Sadri J (2013) A new approach of training Hidden Markov Model by PSO algorithm for gene sequence modeling. In: Proceedings of 1st Iranian conference on pattern recognition and image analysis (PRIA), Birjand, Iran, 6–8 March, pp 1–4
Krogh AS (1994) Hidden Markov models for labeled sequences. In: Proceedings of the 12th IAPR international conference on pattern recognition (ICPR), conference C: signal processing, Jerusalem, Israel, 9–13 October, pp 140–144
Manogaran G, Vijayakumar V, Varatharajan R, Kumar PM, Sundarasekar R, Hsu CH (2018) Machine learning based big data processing framework for cancer diagnosis using hidden Markov model and GM clustering. Wirel Pers Commun 102(3):2099–2116
Xin G, Hamzaoui N, Antoni J (2018) Semi-automated diagnosis of bearing faults based on a hidden Markov model of the vibration signals. Measurement 127:141–166
Yao L, Wang J, Chen A, Wang Y (2018) V2X routing in a VANET based on the hidden Markov model. IEEE Trans Intell Transp Syst 19(3):889–899
Petersen BK, Mayhew MB, Ogbuefi KO, Greene JD, Liu VX, Ray P (2018) Modeling sepsis progression using hidden Markov models. arXiv preprint arXiv:1801.02736
Tang H, Dong C (2019) Detection of malicious domain names based on an improved hidden Markov model. Int J Wirel Mob Comput 16(1):58–65
Zhuo Z, Zhang Y, Zhang ZL, Zhang X, Zhang J (2018) Website fingerprinting attack on anonymity networks based on profile hidden Markov model. IEEE Trans Inf Forensics Secur 13(5):1081–1095
Putland RL, Ranjard L, Constantine R, Radford CA (2018) A hidden Markov model approach to indicate Bryde’s whale acoustics. Ecol Indic 84:479–487
Habayeb M, Murtaza SS, Miranskyy A, Bener AB (2018) On the use of hidden Markov model to predict the time to fix bugs. IEEE Trans Softw Eng 44(12):1224–1244
Ullah I, Ahmad R, Kim D (2018) A prediction mechanism of energy consumption in residential buildings using hidden Markov model. Energies 11(2):1–20
Yip CF, Ng WL, Yau CY (2018) A hidden Markov model for earthquake prediction. Stoch Environ Res Risk Assess 32(5):1415–1434
Pastell M, Frondelius L (2018) A hidden Markov model to estimate the time dairy cows spend in feeder based on indoor positioning data. Comput Electron Agric 152:182–185
Alshamaa D, Chkeir A, Mourad-Chehade F, Honeine P (2019) A hidden Markov model for indoor trajectory tracking of elderly people. In: Proceedings of IEEE sensors applications symposium (SAS), Sophia Antipolis, France, 11–13 March, pp 1–7
Liu S, Zheng K, Zhao L, Fan P (2019) A driving intention prediction method based on hidden Markov model for autonomous driving. arXiv preprint arXiv:1902.09068
Jiang J, Chen R, Chen M, Wang W, Zhang C (2019) Dynamic fault prediction of power transformers based on hidden Markov model of dissolved gases analysis. IEEE Trans Power Deliv 34(4):1393–1400
Lu S, Lin G, Liu H, Ye C, Que H, Ding Y (2019) A weekly load data mining approach based on hidden Markov model. IEEE Access 7:34609–34619
Xu R, Chen X, Zhang F (2019) Green technology innovation ecosystem based on hidden Markov model. Ekoloji 28(107):1729–1736
Joo T, Seo M, Shin D (2019) An adaptive approach for determining batch sizes using the hidden Markov model. J Intell Manuf 30(2):917–932
Coast DA, Stern RM, Cano CG, Briller SA (1990) An approach to cardiac arrhythmia analysis using hidden Markov models. IEEE Trans Biomed Eng 37(9):826–836
Yang WJ, Lee JC, Chang YC, Wang HC (1988) Hidden Markov model for Mandarin lexical tone recognition. IEEE Trans Acoust Speech Signal Process 36(7):988–992
Lee LM, Jean FR (2016) High-order hidden Markov model for piecewise linear processes and applications to speech recognition. J Acoust Soc Am 140(2):204–210
Xiong H, Mamon R (2016) A self-updating model driven by a higher-order hidden Markov chain for temperature dynamics. J Comput Sci 17(1):47–61
Zhu DM, Lu J, Ching WK, Siu TK (2017) Discrete-time optimal asset allocation under higher-order hidden Markov model. Econ Model 66:223–232
Quan C, Ren F (2016) Weighted high-order hidden Markov models for compound emotions recognition in text. Inf Sci 329:581–596
Seifert M, Gohr A, Strickert M, Grosse I (2012) Parsimonious higher-order hidden Markov models for improved array-CGH analysis with applications to Arabidopsis thaliana. PLOS Comput Biol 8(1):1–15
Lee LM, Lee JC (2006) A study on high-order hidden Markov models and applications to speech recognition. In: Proceedings of 19th international conference on industrial, engineering and other applications of applied intelligent systems (IEA/AIE 2006), Annecy, France, 27–30 June, Lecture Notes in Computer Science, vol 4031, pp 682–690
Xiong C, Yang D, Zhang L (2018) A high-order hidden Markov model and its applications for dynamic car ownership analysis. Transp Sci 52(6):1365–1375
Zhang M, Jiang X, Fang Z, Zeng Y, Xu K (2019) High-order Hidden Markov Model for trend prediction in financial time series. Physica A Stat Mech Appl 517:1–12
Chen Z, Qiu RC (2010) Prediction of channel state for cognitive radio using higher-order hidden Markov model. In: Proceedings of the IEEE southeast conference (SoutheastCon), Concord, North Carolina, 18–21 March, pp 276–282
Boussemart Y, Cummings ML (2011) Predictive models of human supervisory control behavioral patterns using hidden semi-Markov models. Eng Appl Artif Intell 24(7):1252–1262
Hidden Semi-Markov model. https://en.wikipedia.org/wiki/Hidden_semi-Markov_model. Accessed 5 May 2019
Groves R (2013) Automatic harmonization using a hidden semi-Markov model. In: Proceedings of 9th artificial intelligence and interactive digital entertainment conference (AIIDE), Boston, Massachusetts, USA, 14–18 October, pp 48–54
Narimatsu H, Kasai H (2017) State duration and interval modeling in hidden semi-Markov model for sequential data analysis. Ann Math Artif Intell 81(3–4):377–403
Zhu K, Liu T (2018) Online tool wear monitoring via hidden semi-Markov model with dependent durations. IEEE Trans Ind Inform 14(1):69–78
Liu T, Zhu K, Zeng L (2018) Diagnosis and prognosis of degradation process via hidden semi-Markov model. IEEE/ASME Trans Mechatron 23(3):1456–1466
Li X, Makis V, Zuo H, Cai J (2018) Optimal Bayesian control policy for gear shaft fault detection using hidden semi-Markov model. Comput Ind Eng 119:21–35
Liu Z, Wang S (2017) Decoding Chinese stock market returns: three-state hidden semi-Markov model. Pac Basin Finance J 44:127–149
Xiao Q, Fang Y, Liu Q, Zhou S (2018) Online machine health prognostics based on modified duration-dependent hidden semi-Markov model and high-order particle filtering. Int J Adv Manuf Technol 94(1–4):1283–1297
Kong D, Chen Y, Li N (2017) Hidden semi-markov model-based method for tool-wear estimation in milling process. Int J Adv Manuf Technol 92(9–12):3467–3657
Wu H, Yu Z, Wang Y (2017) Real-time FDM machine condition monitoring and diagnosis based on acoustic emission and hidden semi-Markov model. Int J Adv Manuf Technol 90(5–8):2027–2036
Pertsinidou CE, Tsaklidis G, Papadimitriou E, Limnios N (2017) Application of hidden semi-Markov models for the seismic hazard assessment of the North and South Aegean Sea, Greece. J Appl Stat 44(6):1064–1085
Bang JH, Cho YJ, Kang K (2017) Anomaly detection of network-initiated LTE signaling traffic in wireless sensor and actuator networks based on a Hidden semi-Markov Model. Comput Secur 65:108–120
Tanwani AK, Calinon S (2016) Learning robot manipulation tasks with task-parameterized semitied hidden semi-Markov model. IEEE Robot Autom Lett 1(1):235–242
Cai J, Luo JZ, Lei F (2016) Analyzing network protocols of application layer using hidden Semi-Markov model. In: Mathematical Problems in Engineering, pp 1–15
Roman-Galvez R, Roman-Roldan R, Martinez-Aroza J, Gomez-Lopera JF (2015) Semi-hidden Markov models for generation and analysis of sequences. Math Comput Simul 118:320–328
Liu Q, Dong M, Lv W, Geng X, Li Y (2015) A novel method using adaptive hidden semi-Markov model for multi-sensor monitoring equipment health prognosis. Mech Syst Signal Process 64–65:217–232
Xiao S, Dong M (2015) Hidden semi-Markov model-based reputation management system for online to offline (O2O) e-commerce markets. Decis Supp Syst 77:87–99
Yue SG, Jiao P, Zha YB, Yin QJ (2015) A logical hierarchical hidden semi-Markov model for team intention recognition. Discrete Dyn Nat Soc 2015:1–20
Altuve M, Carrault G, Beuchee A, Pladys P, Hernandez AI (2015) Online apnea-bradycardia detection based on hidden semi-Markov models. Med Biol Eng Comput 53(1):1–13
Votsi I, Limnios N, Tsaklidis G, Papadimitriou E (2014) Hidden semi-Markov modeling for the estimation of earthquake occurrence rates. Commun Stat Theory Methods 43(7):1484–1502
Du Y, Murani E, Ponsuksili S, Wimmers K (2014) BiomvRhsmm: genomic segmentation with hidden semi-Markov model. BioMed Res Int 2014:1–12
Xu C, Du C, Zhao GF, Yu S (2013) A novel model for user clicks identification based on hidden semi-Markov. J Netw Comput Appl 36(2):791–798
Liu AA, Li K, Kanade T (2012) A semi-Markov model for mitosis segmentation in time-lapse phase contrast microscopy image sequences of stem cell populations. IEEE Trans Med Imaging 31(2):359–369
Dong M, Peng Y (2011) Equipment PHM using non-stationary segmental hidden semi-Markov model. Robot Comput Integr Manuf 27(3):581–590
Liang Y, Liu X, Lou Y, Shan B (2011) An improved noise-robust voice activity detector based on hidden semi-Markov models. Pattern Recogn Lett 32(7):1044–1053
Xie Y, Hu J, Tang S, Huang X (2012) A forward-backward algorithm for nested hidden semi-Markov model and application to network traffic. Comput J 56(2):229–238
Kerk MVD, Onorato DP, Criffield MA, Bolker BM, Augustine BC, McKinley SA, Oli MK (2015) Hidden semi-Markov models reveal multiphasic movement of the endangered Florida panther. J Anim Ecol 84(2):576–585
Duan C, Makis V, Deng C (2019) Optimal Bayesian early fault detection for CNC equipment using hidden semi-Markov process. Mech Syst Signal Process 122:290–306
Chen R, Shen W, Srinivasamurthy A, Chordia P (2012) Chord recognition using Duration-explicit hidden Markov models. In: Proceedings of 13th international society for music information retrieval conference (ISMIR 2012), Porto, Portugal, 8–12 October, pp 445–450
Karg M, Seiberl W, Kreuzpointner F, Haas JP, Kulic D (2015) Clinical gait analysis: comparing explicit state duration HMMs using a reference-based index. IEEE Trans Neural Syst Rehabil Eng 23(2):319–331
Benouareth A, Ennaji A, Sellami M (2008) Semi-continuous HMMs with explicit state duration for unconstrained Arabic word modeling and recognition. Pattern Recognit Lett 29(12):1742–1752
Benetos E, Weyde T (2013) Explicit duration hidden Markov Models for multiple-instrument polyphonic music transcription. In: Proceedings of 14th international society for music information retrieval conference (ISMIR), Curitiba, Brazil, 4–8 November, pp 269–274
Yue W, Hong GS, Wong YS (2010) HMM with explicit state duration for prognostics in face milling. In: Proceedings of IEEE conference on robotics, automation and mechatronics (RAM), Singapore, 28–30 June, pp 218–223
Calinon S, Pistillo A, Caldwell DG (2011) Encoding the time and space constraints of a task in explicit-duration hidden Markov model. In: Proceedings of IEEE/RSJ international conference on intelligent robots and systems (IEEEIROS), San Francisco, CA, USA, 25–30 September, pp 3413–3418
Chordia P, Sastry A, Senturk S (2011) Predictive Tabla modelling using variable-length Markov and hidden Markov models. J New Music Res 40(2):105–118
Senturk S (2011) Computational modeling of improvisation in Turkish folk music using variable-length Markov models. Master of Science in Music Technology dissertation, Georgia Institute of Technology, Atlanta, Georgia, 1st edn
Senturk S, Chordia P (2011) Modeling melodic improvisation in Turkish folk music using variable-length Markov models. In: Proceedings of 12th international society for music information retrieval conference (ISMIR), Miami, Florida, USA, 24–28 October, pp 269–274
Pikrakis A, Theodoridis S, Kamarotos D (2006) Classification of musical patterns using variable duration hidden Markov models. IEEE Trans Audio Speech Lang Process 14(5):1795–1807
Dumont T (2014) Context tree estimation in variable length hidden Markov models. IEEE Trans Inf Theory 60(6):3196–3208
Chen MY, Kundu A, Srihari SN (1995) Variable duration hidden Markov model and morphological segmentation for handwritten word recognition. IEEE Trans Image Process 4(12):1675–1687
Liang YM, Shih SW, Shih ACC, Liao HYM, Lin CC (2009) Learning atomic human actions using variable-length Markov models. IEEE Trans Syst Man Cybern Part B (Cybern) 39(1):268–280
Cao H, Jiang D, Pei J, Chen E, Li H (2009) Towards context-aware search by learning a very large variable length hidden Markov model from search logs. In: Proceedings of the 18th international conference on world wide web (IW3C2), Madrid, Spain, 20–24 April, pp 191–200
Bianne-Bernard AL, Menasri F, Likforman-Sulem L, Mokbel C, Kermorvant C (2012) Variable length and context-dependent HMM letter form models for Arabic handwritten word recognition. In: Proceedings of document recognition and retrieval conference, international society for optics and photonics, Burlingame, California, USA, 22–26 January, pp 1–8
Chen C, Liang J, Zhao H, Hu H, Tian J (2009) Factorial HMM and parallel HMM for gait recognition. IEEE Trans Syst Man Cybern Part C (Appl Rev) 39(1):114–123
Betkowska A, Shinoda K, Furui S (2007) Robust speech recognition using factorial HMMs for home environments. EURASIP J Adv Signal Process 2007(1):1–9
Ghahramani Z, Jordan MI (1996) Factorial hidden Markov models. Adv Neural Inf Process Syst 9:472–478
Ozerov A, Fevotte C, Charbit M (2009) Factorial scaled hidden Markov model for polyphonic audio representation and source separation. In: Proceedings of IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), Mohonk, New York, United States, 18–21 October, pp 121–124
Bonfigli R, Principi E, Fagiani M, Severini M, Squartini S, Piazza F (2017) Non-intrusive load monitoring by using active and reactive power in additive Factorial Hidden Markov Models. Appl Energy 208:1590–1607
Khorasani A, Daliri MR, Pooyan M (2016) Recognition of amyotrophic lateral sclerosis disease using factorial hidden Markov model. Biomed Eng 61(1):119–126
Li Z, He Y, Chu F, Han J, Hao W (2006) Fault recognition method for speed-up and speed-down process of rotating machinery based on independent component analysis and Factorial Hidden Markov Model. J Sound Vib 291(1–2):60–71
Husmeier D (2005) Discriminating between rate heterogeneity and interspecific recombination in DNA sequence alignments with phylogenetic factorial hidden Markov models. Bioinformatics 21(2):166–172
Durrieu JL, Thiran JP (2013) Source/filter factorial hidden Markov model, with application to pitch and formant tracking. IEEE Trans Audio Speech Lang Process 21(12):2541–2553
Kolter JZ, Jaakkola T (2012) Approximate inference in additive factorial HMMs with application to energy disaggregation. In: Proceedings of the 15th international conference on artificial intelligence and statistics (AISTATS), La Palma, Canary Islands, 21–23 April, pp 1472–1482
Mari JF, Fohr D, Junqua JC (1996) A second-order HMM for high Performance word and phoneme-based continuous speech recognition. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP), Atlanta, Georgia, USA, 7–10 May, pp 435–438
Sung-Hyun Y, Thapa K, Kabir MH, Hee-Chan L (2018) Log-viterbi algorithm applied on second-order hidden Markov Model for human activity recognition. Int J Distrib Sens Netw 14(4):1–11
Kabir MH, Hoque MR, Thapa K, Yang SH (2016) Two-layer hidden Markov model for human activity recognition in home environments. Int J Distrib Sens Netw 12(1):1–12
Zhou YH, Brooks P, Wang X (2018) A two-stage hidden Markov Model design for biomarker detection, with application to microbiome research. Stat Biosci 10:1–18
Liang W, Zhang Y, Tan J, Li Y (2014) A novel approach to ECG classification based upon two-layered HMMs in body sensor networks. Sensors 14(4):5994–6011
Othman H, Aboulnasr T (2001) A simplified second-order HMM with application to face recognition. In: Proceedings of IEEE international symposium on circuits and systems (ISCAS), Sydney, Australia, 6–9 May, pp 161–164
Wu YC, Chen HS, Tsai WJ, Lee SY, Yu JY (2008) Human action recognition based on layered-HMM. In: Proceedings of IEEE international conference on multimedia and expo (ICME), Hanover, Germany, 23–26 June, pp 1453–1456
Zhang D, Gatica-Perez D, Bengio S, McCowan I (2006) Modeling individual and group actions in meetings with layered HMMs. IEEE Trans Multimed 8(3):509–520
Mari JF, Haton JP, Kriouile A (1997) Automatic word recognition based on second-order hidden Markov models. IEEE Trans Speech Audio Process 5(1):22–25
Thede SM, Harper MP (1997) A second-order hidden Markov model for part-of-speech tagging. In: Proceedings of the 37th annual meeting of the Association for Computational Linguistics, California, USA, 6–9 July, pp 175–182
Wei H, He J, Tan J (2011) Layered hidden Markov models for real-time daily activity monitoring using body sensor networks. Knowl Inf Syst 29(2):479–494
Glodek M, Layher G, Schwenker F, Palm G (2012) Recognizing human activities using a layered HMM architecture. In: Proceedings of international conference on artificial neural networks (ICANN 2012), Lausanne, Switzerland, 11–14 September, pp 677–684
Oliver N, Garg A, Horvitz E (2004) Layered representations for learning and inferring office activity from multiple sensory channels. Comput Vis Image Underst 96(2):163–180
Lee YS, Cho SB (2016) Layered hidden Markov models to recognize activity with built-in sensors on Android smartphone. Pattern Anal Appl 19(4):1181–1193
Razin YS, Pluckter K, Ueda J, Feigh K (2017) Predicting task intent from surface electromyography using layered hidden Markov models. IEEE Robot Autom Lett 2(2):1180–1185
Glodek M, Bigalke L, Schels M, Schwenker F (2011) Incorporating uncertainty in a layered HMM architecture for human activity recognition. In: Proceedings of joint ACM workshop on Human gesture and behavior understanding, Scottsdale, Arizona, USA, 1 December, pp 33–34
Aarno D, Kragic D (2006) Layered HMM for motion intention recognition. In: Proceedings of IEEE/RSJ international conference on intelligent robots and systems, Beijing, China, 9–15 October, pp 5130–5135
Oliver N, Horvitz E, Garg A (2002) Layered representations for human activity recognition. In: Proceedings of 4th IEEE international conference on multimodal interfaces, Pittsburgh, USA, 14–16 October, pp 3–8
Barnard M, Odobez JM (2005) Sports event recognition using layered HMMs. In: Proceedings of IEEE international conference on multimedia and expo (ICME), Amsterdam, Netherlands, 6–8 July, pp 1150–1153
Zhang C, Qiu J, Zheng S, Yang X (2012) Cross-layered Hidden Markov Modeling for surveillance event recognition. In: Proceedings of IEEE international conference on multimedia and expo workshop (ICMEW), Melbourne, Australia, 9–13 July, pp 175–180
Runsewe O, Samaan N (2017) Cloud resource scaling for big data streaming applications using a layered multi-dimensional hidden Markov model. In: Proceedings of 17th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGRID), Madrid, Spain, 14–17 May, pp 848–857
Solaimanpour S, Doshi P (2017) A layered HMM for predicting motion of a leader in multi-robot settings. In: Proceedings of IEEE international conference on robotics and automation (ICRA), Singapore, 29 May–3 June, pp 788–793
Ingels P (1996) Connected text recognition using layered HMMs and token passing. arXiv preprint cmp-lg/9607036
Perdikis S, Dimitrios T, Strintzis MG (2008) Recognition of humans actions using layered hidden Markov models. In: Proceedings of 1st IAPR workshop on cognitive information processing, Santorini, Greece, 9–10 June, pp 114–119
Guan X, Raich R, Wong WK (2016) Efficient multi-instance learning for activity recognition from time series data using an auto-regressive hidden Markov model. In: Proceedings of 33rd international conference on machine learning, New York, USA, 19–24 June, pp 2330–2339
Stanculescu I, Williams CKI, Freer Y (2014) Autoregressive hidden Markov models for the early detection of neonatal sepsis. IEEE J Biomed Health Inform 18(5):1560–1570
Asahara A, Maruyama K, Shibasaki R (2012) A mixed autoregressive hidden-Markov-chain model applied to people’s movements. In: Proceedings of the 20th international conference on advances in geographic information systems, Redondo Beach, California, 6–9 November, pp 414–417
Dang S, Chaudhury S, Lall B, Roy PK (2017) Learning effective connectivity from fMRI using autoregressive hidden Markov model with missing data. J Neurosci Methods 278(8):87–100
Zhao J, Zhou J, Su W, Liu F (2017) Online outlier detection for time-varying time series on improved ARHMM in geological mineral grade analysis process. Earth Sci Res J 21(3):135–139
Malesevic N, Markovic D, Kanitz G, Controzzi M, Cipriani C, Antfolk C (2018) Vector Autoregressive Hierarchical Hidden Markov Models for extracting finger movements using multichannel surface EMG signals. Complexity 2018:1–12
Seifert M, Abou-El-Ardat K, Friedrich B, Klink B, Deutsch A (2014) Autoregressive higher-order hidden Markov models: exploiting local chromosomal dependencies in the analysis of tumor expression profiles. PLOS ONE 9(6):1–15
Nakamura E, Cuvillier P, Cont A, Ono N, Sagayama S (2015) Autoregressive hidden semi-Markov model of symbolic music performance for score following. In: Proceedings of 16th international society for music information retrieval conference (ISMIR), Malaga, Spain, 26–30 October
Barber C, Bockhorst J, Roebber P (2010) Auto-regressive HMM inference with incomplete data for short-horizon wind forecasting. In: Proceedings of 24th Advances in neural information processing systems conference, Vancouver, British Columbia, Canada, 6–9 December, pp 136–144
Sasou A, Goto M, Hayamizu S, Tanaka K (2005) An auto-regressive, non-stationary excited signal parameter estimation method and an evaluation of a singing-voice recognition. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing (ICASSP), Philadelphia, Pennsylvania, USA, 18–23 March, pp 237–240
Quillen C (2012) Autoregressive HMM speech synthesis. In: Proceedings of IEEE international conference on acoustics, speech and signal processing (ICASSP), Kyoto, Japan, 25–30 March, pp 4021–4024
Ai B, Fan Z, Gao RX (2014) Occupancy estimation for smart buildings by an auto-regressive hidden Markov model. In: Proceedings of American control conference (ACC), Portland, Oregon, USA, 4–6 June, pp 2234–2239
Dong M (2008) A novel approach to equipment health management based on auto-regressive hidden semi-Markov model (AR-HSMM). Sci China Ser F Inf Sci 51(9):1291–1304
Bryan JD, Levinson SE (2015) Autoregressive hidden Markov model and the speech signal. Procedia Comput Sci 61:328–333
Sin B, Kim JH (1995) Nonstationary hidden Markov model. Signal Process 46(1):31–46
Chen X, Zhang H, MacKenzie AB, Matinmikko M (2014) Predicting spectrum occupancies using a non-stationary hidden Markov model. IEEE Wirel Commun Lett 3(4):333–336
Chatzis SP, Demiris Y (2012) A reservoir-driven non-stationary hidden Markov model. Pattern Recognit 45(11):3985–3996
Lin HP, Tseng MC (2009) Modelling fading properties for mobile satellite link channels using non-stationary hidden Markov model. IET Microw, Antennas Propag 3(1):171–180
JingHui X, BingQuan L, XiaoLong W (2005) Principles of non-stationary hidden Markov model and its applications to sequence labeling task. In: Proceedings of international conference on natural language processing, Kanpur, India, 18–20 December. Lecture notes in computer science, vol 3651. Springer, Berlin, pp 827–837
Fine S, Singer Y, Tishby N (1998) The hierarchical hidden Markov model: analysis and applications. Mach Learn 32(1):41–62
Hierarchical hidden Markov model. https://wikivisually.com/wiki/Hierarchical_hidden_Markov_model. Accessed 5 May 2019
Kerr R (2011) Melodic analysis using Hierarchical Hidden Markov models, Doctoral dissertation, School of Informatics, The University of Edinburgh, Scotland, 1st edn
Weiland M, Smaill A, Nelson P (2005) Learning musical pitch structures with Hierarchical Hidden Markov model, 1st edn. University of Edinburgh, Scotland
Hoffman MD, Cook PR, Blei DM (2008) Data-driven recomposition using the Hierarchical Dirichlet process hidden Markov model. In: Proceedings of 32nd international computer music conference (ICMC), Belfast, Ireland, UK, 24–29 August, pp 1–7
Patel M, Miro JV, Kragic D, Ek CH, Dissanayake G (2014) Learning object, grasping and manipulation activities using hierarchical HMMs. Auton Robots 37(3):317–331
Martindale CF, Hoenig F, Strohrmann C, Eskofier BM (2017) Smart annotation of cyclic data using hierarchical hidden Markov Models. Sensors 17(10):1–16
Marco E, Meuleman W, Huang J, Glass K, Pinello L, Wang J, Kellis M, Yuan GC (2017) Multi-scale chromatin state annotation using a hierarchical hidden Markov model. Nat Commun 8:1–9
Chen Y, Shen K, Shan SO, Kou SC (2016) Analyzing single-molecule protein transportation experiments via hierarchical hidden Markov models. J Am Stat Assoc 111(515):951–966
Raman N, Maybank SJ (2016) Activity recognition using a supervised non-parametric hierarchical HMM. Neurocomputing 199:163–177
Karaman S, Benois-Pineau J, Dovgalecs V, Megret R, Pinquier J, Andre-Obrecht R, Gaestel Y, Dartigues JF (2014) Hierarchical Hidden Markov Model in detecting activities of daily living in wearable videos for studies of dementia. Multimed Tools Appl 69(3):743–771

Acknowledgements

Bhavya Mor was supported under a Senior Research Fellowship (SRF) by the Human Resource Development (HRD) Group of the Council of Scientific and Industrial Research (CSIR), Ministry of Science and Technology, Government of India.

Author information

Authors and Affiliations

Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala, India
Bhavya Mor, Sunita Garhwal & Ajay Kumar

Corresponding author

Correspondence to Ajay Kumar.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Human and Animal Rights

This work does not involve any studies concerning human or animal subjects.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Mor, B., Garhwal, S. & Kumar, A. A Systematic Review of Hidden Markov Models and Their Applications. Arch Computat Methods Eng 28, 1429–1448 (2021). https://doi.org/10.1007/s11831-020-09422-4

Received: 08 July 2019
Accepted: 26 March 2020
Published: 12 May 2020
Issue Date: May 2021
DOI: https://doi.org/10.1007/s11831-020-09422-4
