A Frequency Spectral Feature Modeling for Hidden Markov Model Based Automated Speech Recognition

Patel, Ibrahim; Srinivas Rao, Y.

doi:10.1007/978-3-642-14493-6_15

Part of the book series: Communications in Computer and Information Science (CCIS, volume 90)

Abstract

This paper presents an approach to speech recognition that combines frequency spectral information with Mel-frequency features to improve speech feature representation in an HMM-based recognizer. Frequency spectral information is incorporated into the conventional Mel-spectrum-based speech recognition approach. The Mel-frequency approach observes the frequency content of the speech signal at a given resolution, which causes features to overlap across resolutions and limits recognition performance. Resolution decomposition with frequency separation is used as the mapping approach for an HMM-based speech recognition system. Simulation results show an improvement in the quality metrics of speech recognition with respect to computational time and learning accuracy.
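The abstract does not give the details of the proposed feature mapping, so the following is only a minimal sketch of the conventional Mel-frequency band layout that it builds on, not the authors' method. The function names, the 20-filter count, and the 0 to 4000 Hz range are illustrative assumptions; the Hz-to-Mel formula is the commonly used 2595 log10(1 + f/700) form. It shows why Mel bands widen with frequency, which is the fixed-resolution behaviour the abstract refers to.

```python
import math

def hz_to_mel(f_hz):
    """Convert a frequency in Hz to the Mel scale (common 2595*log10 form)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_band_edges(n_filters, f_min_hz, f_max_hz):
    """Return n_filters + 2 edge frequencies (Hz), equally spaced in Mel."""
    m_lo, m_hi = hz_to_mel(f_min_hz), hz_to_mel(f_max_hz)
    step = (m_hi - m_lo) / (n_filters + 1)
    return [mel_to_hz(m_lo + i * step) for i in range(n_filters + 2)]

def triangular_weight(f_hz, left, center, right):
    """Weight of frequency f_hz under a triangular filter (left, center, right)."""
    if f_hz <= left or f_hz >= right:
        return 0.0
    if f_hz <= center:
        return (f_hz - left) / (center - left)
    return (right - f_hz) / (right - center)

# Illustrative (assumed) configuration: 20 filters over 0-4000 Hz.
N_FILTERS = 20
edges = mel_band_edges(N_FILTERS, 0.0, 4000.0)

# Filter i spans edges[i] .. edges[i + 2]; adjacent filters share half their
# support, and the span grows with frequency.
bandwidths = [edges[i + 2] - edges[i] for i in range(N_FILTERS)]

if __name__ == "__main__":
    for i, bw in enumerate(bandwidths):
        print(f"filter {i:2d}: center {edges[i + 1]:8.1f} Hz, width {bw:7.1f} Hz")
```

Each triangular filter overlaps its neighbours, and the widths grow toward high frequencies, so a single Mel filterbank analyses the signal at one fixed, frequency-dependent resolution.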

Author information

Authors and Affiliations

Department of BME, Padmasri.Dr.B.V.Raju Institute of Technology, Narsapur Medak (Dist), A.P.
Ibrahim Patel
Department of Instrument Technology, Andhra University, Vizag, A.P.
Y. Srinivas Rao

Editor information

Editors and Affiliations

Jackson State University, Jackson, MS, USA
Natarajan Meghanathan
CNAM / CEDRIC, Paris, France
Selma Boumerdassi
University of Calcutta, Calcutta, India
Nabendu Chaki
Wireilla Net Solutions PTY Ltd, Australia
Dhinaharan Nagamalai

Cite this paper

Patel, I., Srinivas Rao, Y. (2010). A Frequency Spectral Feature Modeling for Hidden Markov Model Based Automated Speech Recognition. In: Meghanathan, N., Boumerdassi, S., Chaki, N., Nagamalai, D. (eds) Recent Trends in Networks and Communications. WeST 2010, VLSI 2010, NeCoM 2010, ASUC 2010, WiMoN 2010. Communications in Computer and Information Science, vol 90. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14493-6_15

DOI: https://doi.org/10.1007/978-3-642-14493-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14492-9
Online ISBN: 978-3-642-14493-6
eBook Packages: Computer Science, Computer Science (R0)
