Skip to main content

Sound Source Separation Based on Multichannel Non-negative Matrix Factorization with Weighted Averaging

  • Conference paper
  • First Online:
Complex, Intelligent and Software Intensive Systems (CISIS 2020)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1194))

Included in the following conference series:

  • 1443 Accesses

Abstract

Herein, we propose a sound source separation method using multi-channel non-negative matrix factorization (MNMF). MNMF uses an iterative update algorithm for decomposing observed signals into sound source components. However, the separation accuracy of MNMF considerably depends on the initial value of the iterative update algorithm. In the proposed method, cluster analysis and multidimensional scaling were conducted using the features of the matrix decomposed by multiple initial values. A plurality of separated signals was obtained using the initial values included in the largest cluster, which were weighted and averaged. The distance between the matrices obtained by the multidimensional scaling method was used as the weight. As a result of the experiment, we found that the separation signal obtained using the proposed method is less dependent on the initial value and that the separation accuracy is improved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    This satisfies the distance axiom.

  2. 2.

    In a graph that resembles a tree structure, the places labelled are called leaves. Individuals are more similar as the height of the line extending from the leaf until the line is connected becomes shorter.

References

  1. Lee, T.-W.: Independent Component Analysis-Theory and Applications. Kluwer, Norwell (1998)

    Book  Google Scholar 

  2. Lee, I., et al.: Fast fixedpoint independent vector analysis algorithms for convolutive blind source separation. Signal Process. 87(8), 1859–1871 (2007)

    Article  Google Scholar 

  3. Lee, D.D., et al.: Learning the parts of objects with nonnegative matrix factorization. Nature 401, 788–791 (1999)

    Article  Google Scholar 

  4. Kitamura, D., et al.: Dtermined blind source separation unifying independent vector analysis and nennegative matrix factorization. IEEE/ACM Trans. Audio Speech Lang. Process. 24(9), 1626–1641 (2016)

    Article  Google Scholar 

  5. Sawada, H.: Blind signal separation by synchronized joint diagonalization. In: 32nd SIP SYMPOSIUM, pp. 332–337 (2017)

    Google Scholar 

  6. Sawada, H., et al.: Multichannel extensions of non-negative matrix factorization with complex-valued data. IEEE Trans. ASLP 21(5), 971–982 (2013)

    Google Scholar 

  7. Miura, I., et al.: Behavior analysis of initial value dependency in sound source separation using multi-channel NMF and evaluation in speech recognition. IEICE J. J100-D, 376–384 (2017)

    Google Scholar 

  8. Uramoto, T., et al.: Improvement of sound source separation performance using hierarchical cluster analysis in multi-channel nonnegative matrix factorization. IEICE J. J102-D(3), 118–129 (2019)

    Google Scholar 

  9. Fvotte, C., Bertin, N., et al.: Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis. Neural Comput. 21(3), 793–830 (2009)

    Article  Google Scholar 

  10. Shinnou, H.: Cluster analysis learning with R. Ohmsha, Ltd. (2007)

    Google Scholar 

  11. Hino, M.: Spectrum Analysis. Asakura Publishing Co., Ltd., Shinjuku (1977)

    Google Scholar 

  12. Vincent, E., et al.: First stereo audio source separation evaluation campaigh: data algprithm and results. In: Independent Component Analysis and Signal Separation, pp. 552–559. Springer, Berlin (2007)

    Google Scholar 

  13. Saitou, T., Yadohisa, H.: Analyzing Relevanc Data: Multidimensional Scaling and Cluster Analysis (2005)

    Google Scholar 

  14. RWCP: Sound Scene Database in Real Acoustic Enviroment (RWCP-SSD). Speech Resources Consortium. http://research.nii.ac.jp/src/RWCP-SSD.html. Accessed 21 Aug 2018

  15. Araki, S.,et al.: The 2011 Signal Separation Evaluation Campaign (SiSEC2011): -Audio Source Separation. In: Latent Variable Analysis and Signal Separation, pp. 414–422. Springer, Berlin (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tsuyoshi Yamamoto .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yamamoto, T., Uenohara, S., Nishijima, K., Furuya, K. (2021). Sound Source Separation Based on Multichannel Non-negative Matrix Factorization with Weighted Averaging. In: Barolli, L., Poniszewska-Maranda, A., Enokido, T. (eds) Complex, Intelligent and Software Intensive Systems. CISIS 2020. Advances in Intelligent Systems and Computing, vol 1194. Springer, Cham. https://doi.org/10.1007/978-3-030-50454-0_17

Download citation

Publish with us

Policies and ethics