
1 Introduction

Automatic music generation has become increasingly popular with the development of deep learning techniques. At the same time, new challenges have emerged in juridical practice regarding copyright protection: the source of a piece of music leads to different juridical models. Although the discussion of legal issues is beyond the scope of this paper and is not among the aims of the proposed data challenge, a new task is considered helpful in future juridical practice. Identifying whether a piece of melody is computer-generated or human-composed could help in recognising cases of music use where legal intervention or further scrutiny is necessary. As a result, the Conference on Sound and Music Technology (CSMT) proposes a data challenge that requires participants to identify human-composed melodies among computer-generated ones.

Existing automatic music generation methods have certain drawbacks, such as the lack of a clear long-term structure or the presence of unusual harmonisation, which make the melody identification task less challenging. For example, a Self-Similarity Matrix (SSM) can be used to identify repetitions in music [4]; such repetitions are commonly introduced by composers as part of the musical structure but are seldom present in pieces produced by music generation algorithms [8]. Moreover, in juridical practice, copyright infringement can be detected using the similarity of the pitch variation in melodies, regardless of musical structure and accompaniment. In this paper and the proposed challenge, the term “melody” therefore refers to a sequence of pitches with dedicated durations, excluding the concepts of musical structure and accompaniment.

The proposed data challenge follows a possible scenario of melody source identification in juridical practice. There are two datasets used in the challenge. The development dataset consists of computer-generated melodies that are produced by a set of exemplar music generation systems. The evaluation dataset contains both computer-generated and human-composed melodies. Participants are required to submit a system that identifies human-composed melodies among the computer-generated melodies.

The authors and organisers of the data challenge reviewed existing computer music generation systems, as outlined in this paper. Three exemplar methodologies were selected, namely Generative Adversarial Network (GAN), Variational Auto-Encoder (VAE) and transformer systems, because these architectures are commonly used and represent the state-of-the-art in music generation as of early 2020. All three systems were used to produce computer-generated melodies in both the development and the evaluation datasets. The models used to generate melodies for the development and evaluation datasets differ as the result of different initial values and different batch formation during training. For the human-composed melodies in the evaluation dataset, the majority (95%) overlap with human-composed melodies used as training data for the automatic generation systems. The remaining human-composed melodies were composed by university students majoring in music composition and have not been published.

The proposed data challenge can be approached in two different ways. If human-composed melodies are collected by the participants, data may be labelled as “human” vs. “computer”, hence the proposed task can be considered a binary classification problem. The human-composed melodies can also be considered outliers among computer-generated melodies. In this case, the proposed task can also be viewed as an unsupervised outlier detection problem.
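The outlier-detection framing can be illustrated with a minimal, hedged sketch: a detector is fitted on computer-generated melodies only (the development dataset) and evaluation melodies that look unusual are flagged as candidate human-composed pieces. The feature extraction and the names `dev_rolls`, `eval_rolls` and `simple_features` below are purely illustrative assumptions, not part of the challenge protocol.

```python
# Hedged sketch of the outlier-detection framing; features are deliberately naive.
import numpy as np
from sklearn.ensemble import IsolationForest

def simple_features(piano_roll):
    """piano_roll: (128 pitches, 128 quarter-beat columns) binary matrix."""
    active = piano_roll.sum(axis=0)                  # notes active per column
    pitches = np.where(piano_roll.any(axis=1))[0]    # pitches ever used
    return [active.mean(), active.std(),
            pitches.mean() if len(pitches) else 0.0,
            pitches.ptp() if len(pitches) else 0.0]  # pitch range

# Hypothetical stand-ins for the development and evaluation melodies.
dev_rolls = [np.random.randint(0, 2, (128, 128)) for _ in range(100)]
eval_rolls = [np.random.randint(0, 2, (128, 128)) for _ in range(10)]

detector = IsolationForest(random_state=0).fit([simple_features(r) for r in dev_rolls])
predictions = detector.predict([simple_features(r) for r in eval_rolls])
# -1 marks outliers, i.e. candidate human-composed melodies in this framing.
print(predictions)
```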

The rest of the paper is organised as follows. A brief overview of automatic music generation is presented in Sect. 2 in order to justify the choice of melody generation systems. In Sect. 3, the dataset creation process is explained in detail together with the data representation proposed for the challenge. This is followed by a brief conclusion in Sect. 4.

2 Melody Generation Systems

This section provides an overview of automatic music generation systems. The majority of music generation systems can be divided into three types [10]: rule-based systems, methods that utilise mathematical models, and machine learning systems. Machine learning systems, especially deep learning systems, are considered the state-of-the-art in automatic music generation [2]. As a result, the data challenge uses deep learning systems to generate the melodies that are labelled as computer-generated.

The most important factor affecting the performance of automatic music generation systems is the modelling of temporal dependencies. Rule-based systems usually propose a set of rules to generate sequences such as chord progressions [16]. Systems that use mathematical models aim to describe the temporal dependencies in music mathematically; the generation process can then be considered a sampling process from the mathematical model. For modelling temporal dependencies, Markov models have been a first choice since the very early stages of music generation [14]. One of the more recent works using this principle is the ALYSIA system [1], which creates both lyrics and melodies.
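To make the "sampling from a mathematical model" idea concrete, the following minimal sketch (not taken from any of the cited systems) estimates a first-order Markov chain over MIDI pitches from example melodies and samples a new melody from it; the toy training melodies are hypothetical.

```python
# First-order Markov chain over MIDI pitches: estimate transition counts,
# then generate a melody by sampling each next pitch from those counts.
import random
from collections import defaultdict

def train_markov(melodies):
    """Estimate pitch transition counts from melodies given as lists of MIDI pitches."""
    transitions = defaultdict(lambda: defaultdict(int))
    for melody in melodies:
        for prev, nxt in zip(melody, melody[1:]):
            transitions[prev][nxt] += 1
    return transitions

def sample_melody(transitions, start_pitch, length):
    """Sample each next pitch conditioned only on the previous pitch."""
    melody = [start_pitch]
    for _ in range(length - 1):
        options = transitions.get(melody[-1])
        if not options:  # dead end: no observed continuation
            break
        pitches, counts = zip(*options.items())
        melody.append(random.choices(pitches, weights=counts)[0])
    return melody

# Toy usage with two hypothetical training melodies (C major fragments).
model = train_markov([[60, 62, 64, 65, 67], [60, 64, 67, 72, 67, 64, 60]])
print(sample_melody(model, start_pitch=60, length=8))
```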

As music usually exhibits long-term dependencies, it is very difficult for rule-based and mathematical modelling systems to capture such dependencies accurately. Machine learning systems, especially deep learning systems, are better suited for music generation, as the long-term dependencies can be modelled as a joint probability distribution akin to a language model [6].

One exemplar architecture is the Recurrent Neural Network (RNN). Makris [13] used RNNs to generate rhythm in drum patterns. The Microsoft team [20] used RNNs to encode the pitch, rhythm and chords of music. Transformer systems are better at modelling longer temporal dependencies: Vaswani et al. [17] proposed the transformer structure to capture longer-range dependencies, and it was adopted by Huang et al. [9] for music generation. In the proposed data challenge, the MusicTransformer [9] system is used as one of the candidate systems for producing computer-generated melodies, as its authors claim that it models long-term dependencies in music [9].

Besides using a language model to model long-time dependency in music, music generation can also be performed by a generative model such as a Variational Auto-Encoder (VAE) or a Generative Adversarial Network (GAN).

A VAE is a variant of the autoencoder and a generative deep learning model. Brunner [3] proposed a VAE-based automatic composition model, MIDI-VAE, which processes polyphonic music with multiple instrument tracks and models the duration and speed of the notes in the generated music. Wang [18] proposed a new variant of the VAE, which uses a modular approach to designing the model structure for music generation. Luo [12] used a VAE to generate different styles of Chinese folk music. MusicVAE [15] adapts the structure of the VAE to the hierarchical characteristics of music, aiming to address the lack of coherence in music generated with a vanilla VAE. As the MusicVAE system is better at generating music of extended duration, the proposed data challenge selects MusicVAE as the representative of VAE-based music generation systems in the development and evaluation datasets.

A Generative Adversarial Network (GAN) [7] is a generative model that contains a generator and a discriminator. In a GAN, the generator produces pseudo-samples and the discriminator judges whether a sample was produced by the generator. GANs are commonly used for music generation, for example by Liu and Yang [11] and Dong et al. [5]. MidiNet [19] is one of the few GAN systems that use the piano roll as the representation of music and can generate melodies without also generating accompaniment. As a result, the proposed data challenge selects MidiNet as the GAN-based system for music generation.

To summarise, deep learning based computer music generation systems outperform conventional rule-based and mathematical modelling systems. Among deep learning systems, there are three types of systems that are considered state-of-the-art: transformer systems, VAE-based systems and GANs. The proposed data challenge selects an exemplar system to represent each of these types: MusicTransformer, MusicVAE and MidiNet (GAN). The computer-generated melodies in the development and evaluation datasets are a combination of melodies generated by all three selected systems.

3 Dataset

3.1 Training Data

To investigate whether different music styles affect the identification of computer-generated melodies, two datasets are used for training the selected models: Bach Chorales in Music21 and pop music from hooktheory. These two training datasets are used to train two separate models for melody generation in this data challenge.

The raw melodies in the datasets are subject to a pre-processing stage. The Bach Chorales dataset contains several voices; each voice is treated as a separate melody. With regard to the pop music in hooktheory, only the melody part is used for training. All melodies are truncated to 32 beats to disregard musical structure.
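As a hedged illustration (not the organisers' actual pre-processing code), a chorale from music21 could be split into voices and truncated to 32 beats as follows; the work identifier and the tuple format are assumptions for the sketch.

```python
# Split a Bach chorale into one melody per voice, truncated to 32 beats, as
# (midi_pitch, offset_in_beats, duration_in_beats) tuples.
from music21 import corpus

MAX_BEATS = 32  # truncation length used in the data challenge

def chorale_to_melodies(work_id='bach/bwv66.6'):
    """Return one truncated melody per voice of the given chorale."""
    score = corpus.parse(work_id)
    melodies = []
    for part in score.parts:              # each voice becomes its own melody
        notes = []
        for note in part.flatten().notes:
            offset = float(note.offset)   # position in quarter-note beats
            if offset >= MAX_BEATS:       # drop everything after 32 beats
                break
            duration = float(note.duration.quarterLength)
            pitch = note.pitches[0].midi  # take the first pitch if a chord occurs
            notes.append((pitch, offset, duration))
        melodies.append(notes)
    return melodies

melodies = chorale_to_melodies()
print(len(melodies), 'voices;', len(melodies[0]), 'notes in the first voice')
```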

Following the representation used by all selected systems [9, 15, 19], all pre-processed melodies for training are converted into a binarised piano roll, as demonstrated in Fig. 1. The binarised piano roll represents a melody as a matrix in which each column represents a quarter beat and each row represents a note (such as A4). As each melody has a length of 32 beats and each column represents a quarter beat, the binarised piano roll has \(32\times 4 = 128\) columns. Moreover, as MIDI defines pitch numbers between 0 and 127, there are 128 rows in the binarised piano roll. As a result, the music representation used in this paper has a shape of \(128\times 128\).

Fig. 1. Binarised piano roll representation
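A minimal sketch of this layout (assumed note-tuple input, matching the description above): rows are MIDI pitches 0 to 127 and columns are quarter-beat steps over 32 beats.

```python
# Convert a melody of (midi_pitch, offset_in_beats, duration_in_beats) tuples
# into a 128x128 binarised piano roll.
import numpy as np

STEPS_PER_BEAT = 4                 # each column is a quarter beat
NUM_STEPS = 32 * STEPS_PER_BEAT    # 128 columns for a 32-beat melody
NUM_PITCHES = 128                  # MIDI pitch numbers 0-127

def melody_to_piano_roll(notes):
    roll = np.zeros((NUM_PITCHES, NUM_STEPS), dtype=np.uint8)
    for pitch, offset, duration in notes:
        start = int(round(offset * STEPS_PER_BEAT))
        end = int(round((offset + duration) * STEPS_PER_BEAT))
        roll[pitch, start:min(end, NUM_STEPS)] = 1
    return roll

# Toy usage: C4 for one beat, then E4 for half a beat.
roll = melody_to_piano_roll([(60, 0.0, 1.0), (64, 1.0, 0.5)])
print(roll.shape)   # (128, 128)
```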

3.2 Computer-Generated Melodies

As discussed in our brief overview of music generation methods, the selected systems for melody generation are MusicTransformer [9], MusicVAE [15] and MidiNet [19]. In this section, the working principles of these systems are outlined briefly. For more details, the reader is kindly asked to refer to the original papers.

Each selected system is trained twice under the exact same configuration on each of the two datasets (Music21 and hooktheory), hence two models are obtained for each style: Bach and pop. For each style, one of the resulting models is used to generate melodies in the development dataset and the remaining model is used to generate melodies in the evaluation dataset.

MusicTransformer. MusicTransformer [9] uses a Neural Network Language Model (NNLM) to generate music, where the pitch and duration of the notes at a given time step can be considered a word and motives or phrases can be considered sentences. This work is among the first to use a Transformer to generate music.

Given a sentence S containing N words \(w_i\), that is, \(S=\,{<}w_1, w_2,\ldots ,w_N{>}\ \in V^N\), where V is the overall vocabulary, the language model aims to find the probability distribution of the sentence, which can be formalised using Eq. (1): the probability of the entire word sequence decomposes into the product of the conditional probabilities of each word given the words that precede it. The results reported for the system show that longer temporal dependencies are well modelled, since repeated or similar phrases can be found in the music generated by the system.

$$\begin{aligned} P(S)=P(w_1, w_2,\ldots ,w_N)=P(w_1)P(w_2|w_1)\cdots P(w_N|w_1, w_2,\ldots ,w_{N-1}) \end{aligned}$$
(1)

The initialisation process of the system depends on the joint probability distribution of the initial sequence, hence a randomly selected melody of a dedicated length is usually used for initialisation. In this data challenge, the effects of the initialisation process of the MusicTransformer are also investigated by examining whether melodies generated from different initialisation seeds can be identified.
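The following hedged sketch illustrates autoregressive generation under Eq. (1): starting from a seed melody, each next token is sampled from \(P(w_t|w_1,\ldots ,w_{t-1})\). The function `next_token_distribution` is a placeholder for a trained model and is not the actual MusicTransformer interface.

```python
# Autoregressive sampling: each token is conditioned on all previous tokens.
import numpy as np

VOCAB_SIZE = 128  # e.g. one token per MIDI pitch in a simplified setting

def next_token_distribution(prefix):
    """Stand-in model returning a probability vector over the vocabulary.
    A real system would run its neural network on the prefix here."""
    rng = np.random.default_rng(hash(tuple(prefix)) % (2**32))
    logits = rng.normal(size=VOCAB_SIZE)
    probs = np.exp(logits - logits.max())
    return probs / probs.sum()

def generate(seed_tokens, total_length, rng_seed=0):
    """Sample tokens one at a time, each conditioned on the full prefix."""
    rng = np.random.default_rng(rng_seed)
    tokens = list(seed_tokens)
    while len(tokens) < total_length:
        probs = next_token_distribution(tokens)
        tokens.append(int(rng.choice(VOCAB_SIZE, p=probs)))
    return tokens

# Different seed melodies lead to different continuations.
print(generate(seed_tokens=[60, 62, 64], total_length=16))
```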

MusicVAE. MusicVAE [15] adapts the structure of the VAE to the hierarchical characteristics of music, aiming to address the lack of coherence in music generated with a vanilla VAE. The music is first encoded with a recurrent neural network to obtain a low-dimensional latent vector. The resulting vector is then decoded with a multi-level decoder, which first reconstructs the vector into a 16-bar unit, after which lower-level decoders continue the decoding process to generate finer units of the melody.
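A heavily simplified, hedged sketch of the two ideas above follows: the reparameterisation step that produces the latent vector, and a two-level ("conductor" then note-level) decoder. All dimensions and module choices are illustrative assumptions, not MusicVAE's actual configuration.

```python
import torch
import torch.nn as nn

class TinyHierarchicalVAE(nn.Module):
    def __init__(self, input_dim=128, latent_dim=32, bars=16, steps_per_bar=8):
        super().__init__()
        self.bars, self.steps_per_bar, self.input_dim = bars, steps_per_bar, input_dim
        self.encoder = nn.GRU(input_dim, 64, batch_first=True)
        self.to_mu = nn.Linear(64, latent_dim)
        self.to_logvar = nn.Linear(64, latent_dim)
        # High-level "conductor": one embedding per bar from the latent vector.
        self.conductor = nn.Linear(latent_dim, bars * 32)
        # Low-level decoder: expands each bar embedding into note-step logits.
        self.note_decoder = nn.Linear(32, steps_per_bar * input_dim)

    def forward(self, x):                       # x: (batch, time, input_dim)
        _, h = self.encoder(x)                  # h: (1, batch, 64)
        mu, logvar = self.to_mu(h[-1]), self.to_logvar(h[-1])
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterise
        bar_embeddings = self.conductor(z).view(-1, self.bars, 32)
        logits = self.note_decoder(bar_embeddings)
        return logits.view(-1, self.bars * self.steps_per_bar, self.input_dim), mu, logvar

model = TinyHierarchicalVAE()
dummy = torch.zeros(2, 16 * 8, 128)             # 2 melodies as piano-roll slices
recon, mu, logvar = model(dummy)
print(recon.shape)                               # torch.Size([2, 128, 128])
```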

MidiNet. MidiNet [19] converts music into a binarised piano roll, which is akin to a two-dimensional image. The generator and discriminator of the GAN then use convolutional neural networks to decode and encode the resulting binarised piano rolls. Besides the binarised piano rolls produced by the generator, music composed by humans is also presented to the discriminator for training. At the same time, to maintain coherent connections between music segments, MidiNet adds information about the preceding music segment to each layer of the generator. This system is one of the earliest works to approach automatic composition as image generation, and it demonstrates the feasibility of using CNNs to generate piano rolls.
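The general GAN-on-piano-roll idea can be sketched as below; this is a hedged illustration of the shape of such a system, not MidiNet's actual architecture (in particular it omits the conditioning on the preceding segment).

```python
# A convolutional generator maps noise to a 128x128 piano roll; a convolutional
# discriminator scores whether a piano roll looks human-composed or generated.
import torch
import torch.nn as nn

class Generator(nn.Module):
    def __init__(self, noise_dim=100):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(noise_dim, 256 * 8 * 8), nn.ReLU(),
            nn.Unflatten(1, (256, 8, 8)),
            nn.ConvTranspose2d(256, 128, 4, stride=2, padding=1), nn.ReLU(),  # 16x16
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),   # 32x32
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),    # 64x64
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1), nn.Sigmoid(),  # 128x128
        )

    def forward(self, z):
        return self.net(z)

class Discriminator(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),      # 64x64
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),     # 32x32
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.LeakyReLU(0.2),    # 16x16
            nn.Flatten(), nn.Linear(128 * 16 * 16, 1),                        # real/fake logit
        )

    def forward(self, piano_roll):
        return self.net(piano_roll)

gen, disc = Generator(), Discriminator()
fake_rolls = gen(torch.randn(4, 100))            # 4 generated 1x128x128 piano rolls
print(fake_rolls.shape, disc(fake_rolls).shape)  # (4, 1, 128, 128) (4, 1)
```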

3.3 Human-Composed Melodies

Human-composed melodies in this challenge have two sources: published melodies that were used to train the selected music generation systems, and unpublished melodies that were composed specifically for this data challenge by university students majoring in music composition.

Published melodies are randomly selected from the datasets used to train the selected music generation systems. The selected melodies are then truncated to 32-beat-long segments.

The unpublished melodies in the evaluation dataset are used to test the ability to recognise unknown human-composed music. The data challenge invited professionally trained composers from the China Conservatory of Music to compose a number of melodies. The students were asked to compose melodies in two styles: the Baroque style, as composed by J. S. Bach, and the common pop style. The structure of the composed melodies is likewise removed by truncating them to 32 beats.

3.4 Data Representation

The paper uses pretty_midi to convert the generated piano rolls into MIDI files, which requires the MIDI number and the duration of each note. The MIDI number can be indexed directly from the row of the note in the piano roll. The duration requires a simple calculation: as each column in the binarised piano roll represents a quarter beat, given a tempo value the duration of each column can be easily calculated. For example, at 120 beats per minute (bpm) one beat lasts 0.5 s, hence each column corresponds to 0.125 s.

The instrument selected in the MIDI files is “Bright_Piano”, with the velocity set to 127. The tempo of each MIDI file is randomly selected from 68 bpm, 78 bpm, 88 bpm, 98 bpm, 108 bpm and 118 bpm to avoid the situation where the columns occupied by an individual note would always correspond to the same integer duration.
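A hedged sketch of this conversion with pretty_midi follows (not the organisers' release script); the merging of consecutive active columns into single notes is an assumption of the sketch.

```python
# Convert a binarised piano roll into a MIDI file: each column is a quarter
# beat, the instrument is a bright piano, velocity is 127, and the tempo is
# drawn from the values listed above.
import random
import numpy as np
import pretty_midi

TEMPI = [68, 78, 88, 98, 108, 118]  # bpm values listed above

def piano_roll_to_midi(roll, out_path):
    """roll: (128 pitches, 128 quarter-beat columns) binary numpy array."""
    tempo = random.choice(TEMPI)
    seconds_per_column = 60.0 / tempo / 4.0   # a column is a quarter beat
    midi = pretty_midi.PrettyMIDI(initial_tempo=tempo)
    program = pretty_midi.instrument_name_to_program('Bright Acoustic Piano')
    piano = pretty_midi.Instrument(program=program)
    for pitch in range(roll.shape[0]):
        column = 0
        while column < roll.shape[1]:
            if roll[pitch, column]:
                start = column
                while column < roll.shape[1] and roll[pitch, column]:
                    column += 1                # merge consecutive active columns
                piano.notes.append(pretty_midi.Note(
                    velocity=127, pitch=pitch,
                    start=start * seconds_per_column,
                    end=column * seconds_per_column))
            else:
                column += 1
    midi.instruments.append(piano)
    midi.write(out_path)

# Toy usage: a single C4 note lasting one beat (four quarter-beat columns).
roll = np.zeros((128, 128), dtype=np.uint8)
roll[60, 0:4] = 1
piano_roll_to_midi(roll, 'example.mid')
```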

3.5 Dataset Formation

Once converted to MIDI files, the computer-generated and human-composed melodies are divided into two datasets: the development dataset and the evaluation dataset. Neither dataset contains labels, and both consist of an equal number of Bach-style and pop-style melodies.

The development dataset contains 6,000 computer-generated melodies produced by the three selected systems. The specific composition of the development dataset is shown in Table 1.

For each type of music generation system, two different datasets were used to train two individual melody generation models: melodies from Bach Chorales in Music21 (labelled as “Bach” in Table 1) and the hooktheory dataset (labelled as “Pop” in Table 1).

Table 1. The composition of the development dataset of the data challenge, where the number in brackets indicates the number of melodies. “MTrans”, “MVAE” and “MNet” stand for MusicTransformer, MusicVAE and MidiNet respectively.

In the evaluation dataset, there are 4,000 melodies coming from two sources: computer models and human composition.

Among the human-composed melodies, those truncated from melodies originally used for training the music generation systems (labelled as “Training” in Table 2) and those composed specifically for this data challenge (labelled as “Unpublished”) are distinguished, each in two styles: from the Bach Chorales or in a similar Bach style (labelled as “Bach” in Table 2), and from the hooktheory dataset or in a common pop style (labelled as “Pop” in Table 2).

The composition of the computer-generated melodies is more complex. As a general principle, it should be emphasised that the models used to generate melodies for the evaluation dataset are always different from those used for the development dataset, although the system architectures are shared. As in the development dataset, each selected system is trained on two different datasets (labelled as “Bach” and “Pop” in Table 2), hence two separate melody generation models, one per style, are obtained.

Table 2 summarises the composition of the evaluation dataset. It is worth mentioning that the number of melodies generated by MusicTransformer is larger than for the other systems in order to investigate the effects of different initialisation configurations. Unlike in the development dataset, where only one configuration is used to initialise the MusicTransformer, the melodies in the evaluation dataset generated by MusicTransformer result from three different initialisation configurations, one of which is the scheme used in the training process.

Table 2. The composition of the evaluation dataset of the data challenge, where the number in brackets indicates the number of melodies. “MTrans”, “MVAE” and “MNet” stand for MusicTransformer, MusicVAE and MidiNet respectively. The title of each column is explained in the text.

4 Conclusions

The CSMT data challenge requires participants to distinguish computer-generated melodies from human-composed melodies. The challenge aims to facilitate solutions for determining the source of melodies in possible copyright infringement cases in juridical practice. The term “melody” is used in a limited sense in this data challenge: melodies were truncated to remove musical structure and were used without accompaniment. This paper provided an in-depth discussion of the composition and design of the datasets.

The challenge comprises two components: the development dataset and the evaluation dataset. The development dataset contains only computer-generated melodies, whereas the evaluation dataset combines both computer-generated and human-composed melodies. The computer-generated melodies in the development and evaluation datasets are obtained from the same types of systems with slightly different settings. The human-composed melodies include existing melodies that were used for system training as well as melodies composed specifically for the CSMT data challenge.

With the presented setup of the challenge, the identification of computer-generated melodies can be considered either an unsupervised outlier detection problem or a supervised classification problem. Both methodologies may end up learning the inherent limitations of the selected music generation systems. As a result, the systems proposed by participants in the data challenge may not provide a universally valid approach to identifying computer-generated melodies, but may instead rely on the data distributions that characterise state-of-the-art music generation systems. Nevertheless, this approach can still prove valuable for practical purposes, as in the legal context introduced earlier, provided the models are kept up to date. Moreover, as the melody complexity in this data challenge is reduced artificially, the algorithms developed by participants may have limited generalisability.