Skip to main content

Timbre-Vibrato Model for Singer Identification

  • Conference paper
  • First Online:
Information and Communication Technology for Intelligent Systems

Abstract

Timbre and vibrato describe the vocal characteristics of the singer, but with different attributes. Timbre describes the spectral characteristics which depend on shape and size of the vocal cavities. Vibrato is the periodic variations of the pitch, where pitch is associated with vibration of vocal folds. Singers can be defined uniquely using the combination of these acoustic features. Considering this, the paper presents a technique of identifying the singers based on fusion of timbre and vibrato features. We start with discussion on selection of appropriate spectral timbre feature suitable for singer identification (SID). An accuracy of 80.5% is achieved in identifying the singers using a cappella database of 23 singers. The proposed SID system is robust to variations in the singing style of singer called as ‘Album effect’. Performance comparison of the proposed system with other SID approaches validates the superiority of the proposed work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Nwe, T.L., Li, H.: On fusion of timbre-motivated features for singing voice detection and singer identification. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2225–2228 (2008)

    Google Scholar 

  2. Jehan, T., DesRoches, D.: The Echo Nest Analyzer documentation (2014)

    Google Scholar 

  3. Winckell, F.: Music, Sound and Sensation. Dover, NY (1967)

    Google Scholar 

  4. Watts, C., Barnes-Burroughs, K., Estis, J., Blanton, D.: The singing power ratio as an objective measure of singing voice quality in untrained talented and nontalented singers. J. Voice 20(1), 82–88 (2006)

    Article  Google Scholar 

  5. Erickson, M.L.: Dissimilarity and the classification of female singing voices: a preliminary study. J. Voice 17(2), 195–206 (2003)

    Article  Google Scholar 

  6. Masataka, G., Takeshi, S., Tomoyasu, N., Hiromasa, F.: Singing information processing based on singing voice modeling. In: Proceedings of the ICASSP, pp. 5506–5509 (2010)

    Google Scholar 

  7. Soichi, Y., Kazuhiro, K., Tomoki, T., Tomoyasu, N., Masataka, G., Satoshi, N.: An estimation method of voice timbre evaluation values using feature extraction with gaussian mixture model based on reference singer. Acoust. Speech Signal Proces (2016). https://doi.org/10.1109/ICASSP.2016.7472682

    Article  Google Scholar 

  8. Fujihara, H., Goto, M.: A music information retrieval system based on singing voice timbre. In: ISMIR, pp. 467–470 (2007)

    Google Scholar 

  9. Aucouturier, J.-J., Pachet, F., Sandler, M.: “The way it sounds”: timbre models for analysis and retrieval of music signals. IEEE Trans. Multimed. 7(6), 1028–1035 (2005)

    Article  Google Scholar 

  10. Jensen, J.H., Christensen, M.G., Ellis, D.P.W., Jensen, S.H.: Quantitative analysis of a common audio similarity measure. IEEE Trans. Audio Speech Lang. Process. 17, 693–703 (2009)

    Article  Google Scholar 

  11. Shen, J., Shepherd, J., Cui, B., Tan, K.-L.: A novel framework for efficient automated singer identification in large music databases. ACM Trans. Inf. Syst. 27(3), 1–31 (2009)

    Article  Google Scholar 

  12. Fujihara, H., Goto, M., Kitahara, T., Okuno, H.: A modeling of singing voice robust to accompaniment sounds and its application to singer identification and vocal-timbre-similarity-based music information retrieval. IEEE Trans. Audio Speech Lang. Process 8(3) 638–648 (2010)

    Article  Google Scholar 

  13. Poli, G.D., Prandoni, P.: Sonological models for timber characterization. J. New Music Res. 26, 170–197 (1997)

    Article  Google Scholar 

  14. Zhang, T., Kuo, C.C.J.: Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing. Kluwer Academic, USA (2001)

    Book  Google Scholar 

  15. Mauch, M., Fujihara, H., Yoshii, K., Goto, M.: Timbre and melody features for the recognition of vocal activity and instrumental solos in polyphonic music. In: 12th International Proceedings on the Music Information Retrieval, pp. 233–238 (2011)

    Google Scholar 

  16. Andersen, J.S.: Using the echo nest’s automatically extracted music features for a musicological purpose. In: 4th International Workshop on Cognitive Information Processing (CIP) pp. 1–6 (2014)

    Google Scholar 

  17. Tsai, W.-H., Lee, H.-C.: Automatic evaluation of karaoke singing based on pitch, volume, and rhythm features. IEEE Trans. Audio Speech Lang. Process. 20(4), 1233–1243 (2012)

    Article  Google Scholar 

  18. Ventura, J., Sousa, R., Ferreira, A.: Accurate analysis and visual feedback of vibrato in singing. In: Proceedings of the IEEE 5th International Symposium on Communications, Control and Signal Processing, pp. 1–6 (2012)

    Google Scholar 

  19. Prame, E.: Measurements of the vibrato rate of ten singers. J. Acoust. Soc. Am. 96, 1979–1984 (1994)

    Article  Google Scholar 

  20. Saitou, T., Goto, M.: Acoustic and perceptual effects of vocal training in amateur male singing. In: Proceedings of the International Speech Communication Association, pp. 832–835 (2009)

    Google Scholar 

  21. Desain, P., Honing, H., Aarts, R., Timmers, R.: Rhythmic aspects of vibrato. In: Proceedings of 1998 Rhythm Perception and Production Workshop, vol. 34, pp. 203–216 (1999)

    Google Scholar 

  22. Benn-Hur, A., Weston, J.: A User’s Guide to Support Vector Machines. Methods Mol. Biol. 609, 223–239 (2010)

    Article  Google Scholar 

  23. Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1331–1334 (1997)

    Google Scholar 

  24. Peeters, G., Giordano, B.L., Susini, P., Misdariis, N., McAdams, S.: The timbre toolbox: extracting audio descriptors from musical signals. J. Acoust. Soc. Am. 130(5), 2902–2916 (2011)

    Article  Google Scholar 

  25. Rao, V., Ramakrishnan, S., Rao, P.: Singing voice detection in north indian classical music. In: Proceedings of the National Conference on Communications (2008)

    Google Scholar 

  26. Regnier, L.: Localization, characterization and recognition of singing voices. Ph.D. Thesis, Universite Pierre et Marie Curie, Paris VI (2012)

    Google Scholar 

  27. Mesaros, A.: Singing Voice Recognition for Music Information Retrieval, vol. 1064. Tampere University of Technology Publication (2012)

    Google Scholar 

  28. Mesaros, A., Virtanen, T., Klapuri, A.: Singer identification in polyphonic music using vocal separation and pattern recognition methods. In: Proceedings of the 8th International Conference on Music Information Retrieval, pp. 375–378 (2007)

    Google Scholar 

  29. Holzapfel, A., Stylianou, Y.: Singer Identification in Rembetiko Music, Sound and Music Computing Conference (SMC), pp. 326–329 (2007)

    Google Scholar 

  30. Zhang.T.: Automatic singer identification. In: Proceedings of the ICME, Baltimore (2003)

    Google Scholar 

  31. Ratanpara, T., Patel, N.: Singer identification using perceptual features and cepstral coefficients of an audio signal from Indian video songs. EURASIP J. Audio Speech Music Process. 1, 2015 (2015)

    Google Scholar 

  32. Nwe, T.L., Li, H.: Exploring vibrato-motivated acoustic features for singer identification. IEEE Trans. Audio Speech Lang. Process. 15(2), 519–530 (2007)

    Article  Google Scholar 

  33. Loni, D., Subbaraman, S.: Singing voice analysis for singer identification using vibrato features. In: Second International Joint Colloquiums on Computer Electronics Electrical Mechanical and Civil, pp. 209–216 (2016)

    Google Scholar 

  34. Tsai, W.-H., Wang, H.-M.: Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals. IEEE Trans. Audio Speech Lang. Process. 14(1), 330–341 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepali Y. Loni .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Loni, D.Y., Subbaraman, S. (2019). Timbre-Vibrato Model for Singer Identification. In: Satapathy, S., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems . Smart Innovation, Systems and Technologies, vol 107. Springer, Singapore. https://doi.org/10.1007/978-981-13-1747-7_27

Download citation

Publish with us

Policies and ethics