In this paper, we present two methods based on neural networks for the automatic transcription of polyphonic piano music. The input to these methods consists in piano music recordings stored in WAV files, while the pitch of all the notes in the corresponding score forms the output. The aim of this work is to compare the accuracy achieved using a feed-forward neural network, such as the MLP (MultiLayer Perceptron), with that supplied by a recurrent neural network, such as the ENN (Elman Neural Network). Signal processing techniques based on the CQT (Constant-Q Transform) are used in order to create a time-frequency representation of the input signals. Since large scale tests were required, the whole process (synthesis of audio data generated starting from MIDI files, comparison of the results with the original score) has been automated. Test, validation and training sets have been generated with reference to three different musical styles respectively represented by J.S Bach's inventions, F. Chopin's nocturnes and C. Debussy's preludes.

Costantini, G., Todisco, M., Carota, M., Casali, D. (2008). Static and Dynamic Classification Methods for Polyphonic Transcription of Piano Pieces in Different Musical Styles. In PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON CIRCUITS - NEW ASPECTS OF CIRCUITS (pp.158-162). ATHENS : WORLD SCIENTIFIC AND ENGINEERING ACAD AND SOC.

Static and Dynamic Classification Methods for Polyphonic Transcription of Piano Pieces in Different Musical Styles

COSTANTINI, GIOVANNI;
2008-01-01

Abstract

In this paper, we present two methods based on neural networks for the automatic transcription of polyphonic piano music. The input to these methods consists in piano music recordings stored in WAV files, while the pitch of all the notes in the corresponding score forms the output. The aim of this work is to compare the accuracy achieved using a feed-forward neural network, such as the MLP (MultiLayer Perceptron), with that supplied by a recurrent neural network, such as the ENN (Elman Neural Network). Signal processing techniques based on the CQT (Constant-Q Transform) are used in order to create a time-frequency representation of the input signals. Since large scale tests were required, the whole process (synthesis of audio data generated starting from MIDI files, comparison of the results with the original score) has been automated. Test, validation and training sets have been generated with reference to three different musical styles respectively represented by J.S Bach's inventions, F. Chopin's nocturnes and C. Debussy's preludes.
12th WSEAS International Conference on Circuits
Heraklion, GREECE
JUL 22-25, 2008
Rilevanza internazionale
2008
Settore ING-IND/31 - ELETTROTECNICA
English
Automatic piano music transcription; MultiLayer Perceptron; Elman Neural Network; Constant-Q Transform
5
Intervento a convegno
Costantini, G., Todisco, M., Carota, M., Casali, D. (2008). Static and Dynamic Classification Methods for Polyphonic Transcription of Piano Pieces in Different Musical Styles. In PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON CIRCUITS - NEW ASPECTS OF CIRCUITS (pp.158-162). ATHENS : WORLD SCIENTIFIC AND ENGINEERING ACAD AND SOC.
Costantini, G; Todisco, M; Carota, M; Casali, D
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/52647
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact