1 Introduction

Music is often referred to as the language of emotions [32]. Music is generally classified into emotional categories, as it evokes different emotions in the listener. Mood identification from music is more challenging than in domains such as movie reviews and e-mails, because the emotions in music are highly subjective. The wide availability and easy accessibility of online music libraries over the past few years have led Music Information Retrieval (MIR) researchers to accelerate the development and maintenance of automated MIR systems. The Music Information Retrieval Evaluation eXchange (MIREX) has been a community-based framework for formally evaluating Music-IR systems and algorithms since 2004 [7], and it introduced audio-based music mood classification as a task in 2007 [8]. The main modalities considered in the literature for music mood classification are audio, lyrics, and a combination of both. Very few works have focused on using metadata such as social tags and user reviews [4, 15].

The relevance of each dimension of music depends on the musical style, e.g., audio for dance music and lyrics for poetic music [26]. Early systems for music mood classification were audio based [17, 23, 24]. Later, bi-modal research (combining audio and lyrics) gained importance and demonstrated increased accuracy [13, 33, 44, 47]. Recently, lyric-based MIR has been receiving growing attention, since lyrics explicitly express the semantic content of part or all of a song and play a key role in determining its mood from the perspective of both reader and listener [26].

Generally, any music mood classification system follows the architecture shown in Fig. 1.

Fig. 1. General architecture of a music mood categorization system

The remainder of the paper is organized as follows. Section 2 briefly discusses data collection and standardization techniques. Mood taxonomies are described in Sect. 3. Section 4 discusses the mood categorization techniques implemented to date, and Sect. 5 describes the frameworks developed. Section 6 presents open challenges, and Sect. 7 concludes the paper.

2 Data Collection and Standardization

Choosing an appropriate data set that fits the chosen mood categories is important for any mood classification task, and data standardization helps maintain the quality of the data, which is crucial for any classification task. One of the challenges faced by the MIR research community is the development of standard data sets annotated for mood. To date, gold-standard mood-annotated data sets are available only for the AMC task at MIREX (and only to its participants), and no such data is available for Indian languages [30]. Because of the wide availability of online music sources and the lack of a standard data set, researchers have reported their work on data sets they developed themselves.

Ground truth data can be collected through one of the following means:

  1. Employing human annotators.

  2. Crawling websites.

  3. Creating and playing annotation games.

A dataset comprising 4578 English song lyrics, in which every song has an associated social tag from last.fm, was developed in [44]. To filter out noisy social tags, WordNet-Affect was used to assign mood labels; a total of 18 mood categories were identified and experimented with. A large-scale data set of 5296 songs, comprising both audio and lyrics for every song and representing 18 mood categories for which social tags are available on last.fm, was developed in [13], and the same data set was used by the authors for further experiments reported in [43]. [26] used a manually annotated data set of 180 song lyrics. A survey was conducted [41] to collect the CAL500 data set, in which songs by 500 unique artists are annotated using a vocabulary of 174 words.
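As a rough illustration of this kind of tag filtering (not the exact WordNet-Affect procedure of [44]), the sketch below resolves raw last.fm-style tags to a small set of mood labels through a hand-built affect dictionary and drops everything else as noise; the dictionary entries and labels are assumptions for illustration.

```python
# Tiny hand-built affect dictionary standing in for WordNet-Affect lookups.
AFFECT_MAP = {
    "upbeat": "happy", "cheerful": "happy",
    "melancholy": "sad", "gloomy": "sad", "mellow": "calm",
}

def tags_to_moods(tags):
    """Keep only tags that resolve to a known mood label; anything else
    (e.g. "seen live") is treated as noise and discarded."""
    return sorted({AFFECT_MAP[t] for t in tags if t in AFFECT_MAP})

print(tags_to_moods(["upbeat", "seen live", "gloomy"]))  # -> ['happy', 'sad']
```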

In the context of Indian languages, [30] developed a mood-annotated data set of audio and lyrics for both Hindi and Western songs, collecting the songs from music CDs and the lyrics by crawling the web. They annotated the data with the help of human annotators belonging to various age groups. [10] created their own data set of 300 Telugu songs and lyrics collected from YouTube.

All the works discussed above annotated their data manually, which involves considerable human labour and is at the same time an expensive and time-consuming task. To overcome these issues, researchers have developed online games to collect ground truth data [27]. [42] designed the Listen Game, an online multi-player game for associating semantic labels with music pieces. An audio-based two-player game, in which a human plays with a simulated player as a partner, was implemented to annotate sounds and music [21]. An arousal-valence based two-player online annotation game called MoodSwings, which records dynamic labels of music mood, was developed by [16]. The game consists of five rounds, with scores calculated in each round based on the overlap of the two players’ cursor positions, which indicates maximum mood agreement between the players.

To the best of the authors’ knowledge, no such gaming strategy has been developed for Indian-language music mood annotation. However, [6] proposed an interactive game called “Dr. Sentiment” that creates and validates SentiWordNet(s) for three Indian languages (Bengali, Hindi and Telugu) with the help of the Internet population, and it is helpful for general sentiment analysis tasks in Indian languages.

The authors feel that, although the data sets collected using the above approaches are claimed to be highly accurate, their quality is still compromised, as the contributors belong to varied communities with different psychological moods.

3 Mood Taxonomy

A taxonomy assigns descriptor labels to multiple levels of music content description, ranging from low-level acoustic features to high-level structural descriptions, and acts as a bridge between system development and user interfaces [28]. The primary focus of any music mood classification system is the design and selection of an appropriate mood model. The literature supports two types of mood representation: categorical and dimensional. Apart from these two, researchers have also reported work based on social tagging. However, there is no universally accepted model that describes music mood [44].

3.1 Categorical Mood Representation

This approach denotes mood as a set of categories, represented by a list of distinct adjectives or tags assigned according to their relevance to a music piece. The study conducted by Hevner in 1936 was one of the earliest of this type. She proposed a taxonomy of 8 related clusters comprising a total of 66 adjectives [11], arranged in a circular fashion so that adjacent clusters differ only by a small factor. The adjectives within a cluster are close in meaning, while those in opposite clusters differ by a larger factor. Hevner’s mood model is shown in Fig. 2.

Fig. 2. Hevner’s mood model

Categorizing moods into 66 different categories is difficult, especially for automatic systems. Some of the adjectives used in Hevner’s circle may no longer describe present-day moods, as language evolves across generations [12]. Moreover, a data set collected using any of the approaches discussed in Sect. 2 might not be annotated for all the categories defined by the model.

It is also believed that a taxonomy with fewer mood categories may help automatic systems achieve better performance. A five-cluster categorical mood taxonomy, shown in Table 1, was proposed by the MIREX community for the Audio Mood Classification (AMC) task.

Table 1. MIREX mood taxonomy

The categorization of Indian art, drama and music is generally based on Navarasa (meaning nine rasas), a Sanskrit term introduced by the Indian musicologist Bharata [19]. This model serves as the base for Indian mood categorization tasks; Fig. 3 shows the Navarasa mood model. In Indian music, a rasa is a combination of a few emotional states. However, the number of rasas is a subject of debate [18].

Fig. 3. Navarasa mood model

3.2 Dimensional Mood Representation

This approach categorizes mood with respect to a specific number of dimensions, or axes, that represent human emotions, denoting a mood as a point in a dimensional space of emotions.

Two well-known models of this type are Russell’s circumplex model and Thayer’s model. Russell’s model positions mood adjectives on a two-dimensional plane, with the horizontal axis indicating valence and the vertical axis indicating arousal [35]. He stated that moods are not independent or unique but are connected to each other in an orderly manner, and he proposed a taxonomy of 8 related groups consisting of a total of 28 affect words, arranged meaningfully along the circumference of a circle in the two-dimensional space. Figure 4 shows Russell’s mood taxonomy.

Fig. 4. Russell’s mood model

Thayer’s model, a variant of Russell’s, is based on two dimensions: energy along the vertical axis and stress along the horizontal axis [38]. It arranges moods into four clusters, namely Contentment, Depression, Exuberance and Anxious. Figure 5 shows Thayer’s taxonomy. The adjectives of Thayer’s model can be mapped to the unique quadrants of Russell’s model [9].

Fig. 5. Thayer’s mood model
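To make the dimensional view concrete, the following minimal sketch maps a (valence, arousal) point onto the four Thayer-style clusters; treating the centre of the plane as the origin, with zero as the threshold, is an assumption for illustration.

```python
def thayer_quadrant(valence: float, arousal: float) -> str:
    """Map a point in the valence-arousal plane to one of Thayer's four
    clusters; using (0, 0) as the centre is an illustrative assumption."""
    if valence >= 0:
        return "Exuberance" if arousal >= 0 else "Contentment"
    return "Anxious" if arousal >= 0 else "Depression"

# A calm, positive song falls in the Contentment quadrant.
print(thayer_quadrant(0.6, -0.4))  # -> Contentment
```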

Most of the models discussed above have been criticized for lacking the social context of music listening, since they were laboratory based [14].

To avoid the confusion caused by a large number of mood categories, [30] proposed an extended mood taxonomy consisting of five mood classes, each with three subclasses, obtained by grouping closely related mood adjectives of Russell’s circumplex model. [5] derived a folksonomy representation consisting of four clusters by applying an unsupervised clustering method. With respect to Indian music, [18] considered a taxonomy consisting of ten rasas.

3.3 Social Tagging

A tag is a phrase or label assigned to an item by a non-expert and carries relevant information. Social tagging of music helps create better ground truth [5, 22] and is a good source of human-generated contextual knowledge [20]. To date, no work related to social tagging has been reported in the context of Indian languages [30]. Since this approach involves laymen, the quality of the data collected may be compromised.

4 Mood Categorization

Mood categorization has been investigated in the literature based on three modalities: audio, lyrics and multimodal (audio + lyrics). Some researchers have also considered metadata for mood classification. Early music mood classification research was purely audio based. [24] proposed a framework to track moods across the four principal valence-arousal quadrants by dividing a music piece into independent segments; the authors extended their mood detection approach to mood tracking because the mood changes over the entire duration of a musical piece. [31] proposed an unsupervised classifier for classifying Hindi music by mood, using a manually annotated dataset of 250 Hindi music clips, and reported an accuracy of 48%.
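As a rough sketch of segment-wise mood tracking of this kind (not the specific method of [24]), the code below splits an audio file into fixed-length windows, extracts mean-MFCC features with librosa, and predicts one label per window with a pre-fitted scikit-learn classifier; the window length, the feature choice and the `clf` model are all assumptions.

```python
import librosa

def track_mood(path: str, clf, window_s: float = 5.0):
    """Predict a coarse mood label for each fixed-length window of an
    audio file; `clf` is any fitted scikit-learn classifier trained on
    mean-MFCC vectors (an assumed stand-in for richer audio features)."""
    y, sr = librosa.load(path, sr=22050)
    hop = int(window_s * sr)
    labels = []
    for start in range(0, len(y) - hop + 1, hop):
        segment = y[start:start + hop]
        mfcc = librosa.feature.mfcc(y=segment, sr=sr, n_mfcc=13)
        labels.append(clf.predict(mfcc.mean(axis=1).reshape(1, -1))[0])
    return labels  # one mood label per window, i.e. a mood track
```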

Since spectral features alone fail to capture several high-level musical properties, the music research community started combining audio and lyrics, showing improved performance; [13] and [47] are considered among the earliest works of this kind. Later, [34] applied several natural language processing techniques to extract features and performed a bi-modal analysis on a dataset of 764 samples, combining the best audio and lyric features and attaining an F-measure of 63.9. They ran SVM, k-NN, C4.5 and NB algorithms. Among the lyrical features, bag-of-words gave better results, reinforcing the importance of content-based features. A fine-grained classification of emotions was addressed using a novel corpus of 100 songs whose music and lyrics are annotated for emotion at the line level [33]. The authors considered the six Ekman emotions and ran three sets of experiments using linear regression: with textual features, with musical features, and with both combined. This work is considered the first of its kind for mood classification at the line level, and it showed that bi-modal classification improves the results.
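A minimal sketch of the bag-of-words lyric classification recurring in these studies, using scikit-learn; the toy lyrics, the mood labels and the choice of a linear SVM (one of the algorithms named above) are assumptions for illustration.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy corpus: lyric snippets paired with mood labels (illustrative only).
lyrics = ["sunshine and dancing all night long", "tears fall in the cold rain"]
moods = ["happy", "sad"]

# Bag-of-words features feeding a linear SVM.
model = make_pipeline(CountVectorizer(), LinearSVC())
model.fit(lyrics, moods)
print(model.predict(["dancing in the sunshine"]))  # expected: ['happy']
```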

In the Indian-language context, [30] reported a study on multimodal mood classification of Hindi and Western songs using LibSVM and a feed-forward neural network (FFNN), with the FFNN emerging as the best performing system, achieving F-measures of 0.751 for Hindi and 0.835 for Western songs. A correlation-based supervised feature selection technique was used to identify the important audio and lyric features. [10] devised an approach to combine the audio and text features of 100 Telugu songs. Audio features were extracted from the beginning, the end and the whole of each song, and lyrical features were extracted as BOW; the authors reported that using the beginning of a song gave better results than using the whole song or its end. They ran SVM, NB and GMM algorithms to classify mood.
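A common way to realize such multimodal classification is early fusion, i.e. concatenating the per-song audio and lyric feature vectors before training a single classifier. The sketch below illustrates this with randomly generated placeholder features; the feature dimensions, labels and SVM choice are assumptions.

```python
import numpy as np
from sklearn.svm import SVC

# Placeholder features: one row per song (shapes are illustrative only).
rng = np.random.default_rng(0)
audio_feats = rng.random((100, 20))    # e.g. intensity/rhythm descriptors
lyric_feats = rng.random((100, 300))   # e.g. bag-of-words counts
labels = rng.choice(["happy", "sad", "calm"], size=100)

# Early fusion: concatenate the audio and lyric vectors of each song.
fused = np.hstack([audio_feats, lyric_feats])
clf = SVC(kernel="linear").fit(fused, labels)
```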

Lyric-based music emotion recognition has gained momentum over the last decade. Although the lyrics of a song play a prominent role in determining its mood, the task is considered challenging because lyrics are more abstract and much shorter than texts such as reviews. An unsupervised fuzzy clustering technique that works effectively on small devices was proposed by [45] for detecting emotions in 500 Chinese song lyrics using an affective lexicon, the Affective Norms for English Words (ANEW). [1] performed automatic mood classification of Chinese lyrics using a Naive Bayes approach and reported a final accuracy of 68%.
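The lexicon-based core of such approaches can be sketched as follows: each lyric word is looked up in an ANEW-style table of valence/arousal norms and the hits are averaged into a song-level score. The tiny lexicon below is a made-up stand-in, not the real ANEW data.

```python
# Made-up stand-in for ANEW-style (valence, arousal) norms on a 1-9 scale.
ANEW_LIKE = {"love": (8.7, 6.4), "rain": (5.0, 3.7), "alone": (2.4, 4.0)}

def va_score(lyric: str):
    """Average the valence/arousal norms of the lyric words found in the
    lexicon; words missing from the lexicon are simply skipped."""
    hits = [ANEW_LIKE[w] for w in lyric.lower().split() if w in ANEW_LIKE]
    if not hits:
        return None  # no affective evidence in this line
    valence = sum(v for v, _ in hits) / len(hits)
    arousal = sum(a for _, a in hits) / len(hits)
    return valence, arousal

print(va_score("Alone in the rain"))  # -> (3.7, 3.85)
```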

A novel psychological emotion model covering 23 specific mood categories was developed in [46] using a training set of 1032 randomly chosen songs; the approach explored 182 psychological features of each song lyric. The challenges behind lyric-based mood classification were examined in [2]. The authors employed NLP techniques to identify the polarity of a song and concluded that a corpus-specific lexicon improves accuracy more than a generic subjectivity lexicon does. The work of [44] reported that lyric features outperform audio features when the mood categories are semantically bound to the lyrics. They also reported that combined features improve performance for most of the categories, but not all of them. A later feature analysis study by the same authors [43] showed that certain lyric features outperformed audio features in seven of the 18 mood categories used in the study, over a dataset of 5296 songs. While this study showed that lyric-based mood classification can work well, every single lyric feature underperformed audio features in the negative-valence, negative-arousal quadrant.

Apart from the modalities listed above, some studies have explored the usefulness of metadata in music mood classification. User-generated interpretations of lyrics collected from songmeaning.com were used to develop a system that classifies music subjects automatically [15]. The authors ran four classifiers (linear SVM, RBF SVM, NB and k-NN) on a dataset of 800 songs across 8 categories and reported that user-generated interpretations outperformed lyrics. They also reported that interpretation terms are more semantically related to the subject categories than lyric terms are.

A different approach from all of the above is presented in [37]. The authors attempted to predict the decade to which a song belongs using lyric-based features and observed meaningful general changes in lyrics over time.

5 Various Frameworks

Another line of work in the music research community that helps users browse music collections efficiently is the development of applications and interfaces that allow users to listen to and retrieve music. A technique for building a search engine over a large music collection that responds to natural language queries was proposed in [17]: relevant web pages for each song in the dataset are retrieved and represented as term vectors in order to index the content for retrieval. A web service called Lyric Jumper, developed by [40], allows users to explore music by browsing lyrics according to their topics, and [36] proposed a lyric retrieval system called LyricsRadar that analyses lyric topics using a text analysis method called latent Dirichlet allocation (LDA). The system automatically generates a two-dimensional space from the LDA analysis of topics common to the lyrics of several music pieces. A music playback interface called LyricListPlayer lets the user view lyrics while listening and also view word sequences of other songs that are similar to the currently playing lyrics, based on local lyric similarity [29].
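The topic analysis underlying such systems can be sketched with scikit-learn's LDA implementation; the toy corpus and the number of topics are assumptions for illustration, not details of LyricsRadar itself.

```python
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

# Toy lyric corpus (illustrative only).
docs = [
    "love heart forever love",
    "dance floor beat night dance",
    "rain tears alone night",
    "heart beat love night",
]

# Term counts feeding LDA; systems like LyricsRadar project the resulting
# per-song topic mixtures onto a 2-D space for browsing.
X = CountVectorizer().fit_transform(docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)
print(lda.transform(X))  # one topic-mixture row per song
```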

A music visualization technique called Lyricon, presented in [25], automatically selects multiple icons for tunes using musical and lyrical features and helps users choose songs of interest from the visual representation of the mood icons. [3] proposed an application named Songwords that allows users to explore music collections through song lyrics, and [39] proposed a system named SLAVE (Score Lyrics Audio Video Explorer) that allows users to explore multimedia music collections using different varieties of music documents.

6 Open Challenges

Music has become part of the lifestyle of all groups of people, and various researchers have contributed to automating mood identification from music. A comparison of various music mood categorization systems is given in Table 2. It can be observed from the table that most of the reported works derived their own mood taxonomies, and that the most commonly used audio features are intensity and rhythm, whereas the most common lyric features are lexicon-based and text-stylistic. Of the various approaches used, SVM is the most common for audio, lyric and multimodal classification alike. Very few works have been reported for Indian languages, among which the best performance is an F-measure of 0.751 for multimodal classification.

Table 2. Comparison of different music mood categorization systems

Based on the works covered in this survey, the following open challenges can be identified.

  1. The scarcity of standard mood-annotated data has forced many researchers in the music research community to prepare and standardize their own data, which is expensive and time consuming and requires considerable human annotation effort. A few works have proposed data collection and standardization for Western music in the form of online games, but no such data exists for Indian languages in the music domain.

  2. Work on sentiment analysis for Indian languages (Telugu) and a few works on music mood categorization (especially for Hindi and Bengali) have been reported. However, the step from polarity classification to mood categorization (especially for Telugu) has yet to be addressed.

  3. Mood taxonomies refined by combining theoretical models with social tags may better match present-day music moods.

  4. A data set collected for Indian music might not include all nine rasas of the Navarasa model, since that model is also used generally for Indian art and drama. A mood taxonomy specific to music in Indian languages can be developed.

  5. A considerable amount of work has been reported on audio and on combining audio and lyrics, whereas lyric-based music information retrieval is still at a budding stage.

  6. Most lyrical features considered to date (BOW, PoS, content words, lexicon-based and text-stylistic features) are those generally used in regular text mining tasks. But lyrics are generally considered different from ordinary text because of their abstract nature, and lyric mining therefore differs from text mining. Hence, introducing features dedicated to lyrics, with due consideration of lyric dimensionality, may improve classification accuracy.

7 Conclusion

Humans experience different kinds of emotions while listening to music, as the mood expressed by the same music piece is often ambiguous, changes over its duration, and also depends on the psychological condition of the listener. This paper has presented a survey of recent developments in automatic music mood classification, summarizing contributions to mood taxonomies, data creation and standardization, approaches, and systems. The analysis reveals that most previous works have concentrated on combining audio and lyrics, whereas purely lyric-based MIR is an emerging area of interest. The most frequently used algorithms for the mood classification task are Naive Bayes, SVMs and k-NN. Although the techniques and methods for automatic music mood classification are advancing, open challenges remain, and there is good scope for research contributions on Indian languages and Indian music.