Audio Segmentation for Speech Recognition Using Segment Features

Bhandari, Gayatri M.; Kawitkar, Rameshwar S.; Borawake, Madhuri P.

doi:10.1007/978-3-319-03095-1_23

Gayatri M. Bhandari⁶,
Rameshwar S. Kawitkar⁷ &
Madhuri P. Borawake⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 249))

2645 Accesses
3 Citations

Abstract

The amount of audio available in different databases on the Internet today is immense. Even systems that do allow searches for multimedia content, like AltaVista and Lycos, only allow queries based on the multimedia filename, nearby text on the web page containing the file, and metadata embedded in the file such as title and author. This might yield some useful results if the metadata provided by the distributor is extensive. Producing this data is a tedious manual task, and therefore automatic means for creating this information is needed. In this paper an algorithm to segment the given audio and extract the features such as MFCC, SF, SNR, ZCR is proposed and the experimental results shown for the given algorithm.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Automatic Speech Segmentation for Automatic Speech Translation

Segmentation Algorithm Using Temporal Features and Group Delay for Speech Signals

Automatic Speech Recognition Based on Clustering Technique

Keywords

References

Peiszer, E., Lidy, T., Rauber, A.: Automatic Audio Segmentation: Segment Boundary and Structure Detection in Popular Music (2008)
Google Scholar
Cook, G.T.P.: Multifeature Audio Segmentation for Browsing and Annotation. In: Proc.1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, pp. W99-1–W99-4 (1999)
Google Scholar
Lu, G.: Indexing and Retrieval of Audio: A Survey, pp. 269–290 (2001)
Google Scholar
Zhang, J.X., Whalley, J., Brooks, S.: A Two Phase Method for general audio segmentation (2004)
Google Scholar
Foote, J.: Automatic Audio Segmentation Using A Measure of Audio Novelty
Google Scholar
Julien, P., José, A., Régine, A.: Audio classi_cation by search of primary components, pp. 1–12
Google Scholar
Lu, L., Zhang, H.-J., Jiang, H.: Content Analysis for Audio Classification and Segmentation. IEEE Transaction on Speech and Audio Processing, 504–516 (2002)
Google Scholar
Lu, L., Li, S.Z., Zhang, H.-J.: Content based audio segmentation using Support Vector Machines (2008)
Google Scholar
Aguilo, M., Butko, T., Temko, A., Nadeu, C.: A Hierarchical Architecture for Audio Segmentation in a Broadcast News Task, pp. 17–20 (2009)
Google Scholar
Cettolo, M., Vescovi, M., Rizzi, R.: Evaluation of BIC-based algorithms for audio segmentation, pp. 147–170. Elsevier (2005)
Google Scholar
Goodwin, M.M., Laroche, J.: Audio Segmentation by feature space clustering using linear discriminant analysis and dynamic programming (2003)
Google Scholar
Haque, M.A., Kim, J.-M.: An analysis of content-based classification of audio signals using a fuzzy c-means algorithm (2012)
Google Scholar
Mesgarani, N., Slaney, M., Shamma, S.A.: Discrimination of Speech From Nonspeech Based on Multiscale Spectro-Temporal Modulations, pp. 920–930 (2006)
Google Scholar
Krishnamoorthy, P., Kumar, S.: Hierarchical audio content classification system using an optimal feature selection algorithm, pp. 415–444 (2010)
Google Scholar
Panagiotis, S., Vasileios, M., Ioannis, K., Hugo, M., Miguel, B., Isabel, T.: On the use of audio events for improving video scene segmentation
Google Scholar
Abdallah, S., Sandler, M., Rhodes, C., Casey, M.: Using duration Models to reduce fragmentation in audio segmentation 65, 485–515 (2006)
Google Scholar
Cheng, S.-S., Wang, H.-M., Fu, H.-C.: BIC-BASED Audio Segmentation by divide and conquer
Google Scholar
Yong, S.: Audio Segmentation, pp. 1–4 (2007)
Google Scholar
Matsunaga, S., Mizuno, O., Ohtsuki, K., Hayashi, Y.: Audio source segmentation using spectral correlation features for automatic indexing of broadcast news, pp. 2103–2106
Google Scholar
Sainath, T.N., Kanevsky, D., Iyengar, G.: Uusupervised audio segmentation using extended Baum-Welch Transformations, I 209-I 212 (2007)
Google Scholar
Giannakopoulos, T., Pikrakis, A., Theodoridis, S.: A Novel Efficient Approach for Audio Segmentation (2008)
Google Scholar
Zhang, Y., Zhou, J.: Audio Segmentation based on Multiscale audio classification, pp. IV-349–IV-352 (2004)
Google Scholar
Peng, Y., Ngo, C.-W., Fang, C., Chen, X., Xiao, J.: Audio Similarity Measure by Graph Modeling and Matching, pp. 603–606
Google Scholar
Harchaoui, Z., Vallet, F., Lung-Yut-Fong, A., Cap, O.: Regularized Kernel-Based ApproachToUnsupervised Audio Segmentation
Google Scholar

Download references

Author information

Authors and Affiliations

JSPM’s Bhivarabai Sawant Institute of Tech. & Research(W), J.J.T. University, Pune, India
Gayatri M. Bhandari
Sinhgad Institute of Technology, Pune, India
Rameshwar S. Kawitkar
College of Engg., J.J.T. University and PDEA, Pune, India
Madhuri P. Borawake

Authors

Gayatri M. Bhandari
View author publications
You can also search for this author in PubMed Google Scholar
Rameshwar S. Kawitkar
View author publications
You can also search for this author in PubMed Google Scholar
Madhuri P. Borawake
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gayatri M. Bhandari .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Anil Neerukonda Institute of Technology and Sciences, Vishakapatnam, India
Suresh Chandra Satapathy
College of Engineering(A), Andhra University, Vishakapatnam, India
P. S. Avadhani
University of Hyderabad, Hyderabad, India
Siba K. Udgata
CSIR-National Institute of Oceanography, Visakhapatnam, India
Sadasivuni Lakshminarayana

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bhandari, G.M., Kawitkar, R.S., Borawake, M.P. (2014). Audio Segmentation for Speech Recognition Using Segment Features. In: Satapathy, S., Avadhani, P., Udgata, S., Lakshminarayana, S. (eds) ICT and Critical Infrastructure: Proceedings of the 48th Annual Convention of Computer Society of India- Vol II. Advances in Intelligent Systems and Computing, vol 249. Springer, Cham. https://doi.org/10.1007/978-3-319-03095-1_23

Download citation

DOI: https://doi.org/10.1007/978-3-319-03095-1_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03094-4
Online ISBN: 978-3-319-03095-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Audio Segmentation for Speech Recognition Using Segment Features

Abstract

Chapter PDF

Similar content being viewed by others

Automatic Speech Segmentation for Automatic Speech Translation

Segmentation Algorithm Using Temporal Features and Group Delay for Speech Signals

Automatic Speech Recognition Based on Clustering Technique

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Audio Segmentation for Speech Recognition Using Segment Features

Abstract

Chapter PDF

Similar content being viewed by others

Automatic Speech Segmentation for Automatic Speech Translation

Segmentation Algorithm Using Temporal Features and Group Delay for Speech Signals

Automatic Speech Recognition Based on Clustering Technique

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation