The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant Frequency Transform) and cepstrum coefficients is employed. We will adopt Min-Max Neurofuzzy Networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.
Costantini, G., Rizzi, A., Casali, D. (2003). Recognition of musical instruments by generalized MIN-MAX classifiers. In 2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03.
Recognition of musical instruments by generalized MIN-MAX classifiers
COSTANTINI, GIOVANNI;
2003-01-01
Abstract
The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant Frequency Transform) and cepstrum coefficients is employed. We will adopt Min-Max Neurofuzzy Networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.