Classifying musical instruments through neural approaches: an empirical study

dc.contributor.authorPark, Juhyoung
dc.contributor.authorUniversity of Lethbridge. Faculty of Arts and Science
dc.contributor.supervisorZhang, John Z.
dc.date.accessioned2026-05-21T15:43:22Z
dc.date.issued2026
dc.degree.levelMasters
dc.description.abstractMusical instrument classification is one of the important tasks in Music Information Retrieval (MIR), yet achieving robust performance in real-world music is still challenging. In this thesis, we investigate the problem of multi-class, multi-label musical instrument classification through neural approaches. Our study utilizes multi-genre instrument mixtures derived from the MUSDB18 and the MedleyDB, two popular datasets in MIR, and uses Mel-spectrogram, Mel-frequency cepstral coefficients (MFCCs), Constant-Q transform (CQT), Chroma Energy Normalized Statistics (CENS), and zero-crossing rate as audio features. Principal Component Analysis (PCA), Incremental PCA, and upsampling techniques are also employed to facilitate our experiments. In our investigation, we have found that the simple models using Artificial Neural Networks (ANNs) show lower performance in classifying mixed-instrument classes, and the hierarchical models using multiple simple ANN-based models show slightly improved performance. The models using Convolutional Neural Networks (CNNs) outperformed the models using ANNs, and employing combined audio feature images as input to the CNN-based models improves the performance on the mixed-instruments classes. We have designed and conducted a series of empirical experiments using our proposed neural architectures on the two datasets. The results are evaluated and discussed. We expect that our approach would achieve better performance in the real-world situation.
dc.embargoNo
dc.identifier.urihttps://hdl.handle.net/10133/7412
dc.language.isoen
dc.publisherLethbridge, Alta. : University of Lethbridge, Dept. of Mathematics and Computer Science
dc.publisher.departmentDepartment of Mathematics and Computer Science
dc.publisher.facultyArts and Science
dc.relation.ispartofseriesThesis (University of Lethbridge. Faculty of Arts and Science)
dc.subjectmusical instrument classification
dc.subjectneural network
dc.subjectConvolutional Neural Network
dc.subjectArtificial Neural Network
dc.subjectdeep learning
dc.subjectmusical instrument
dc.subjectclassification
dc.subjectaudio feature
dc.subjectdb-MFCCs
dc.subjectcombined audio feature images
dc.subject.lcshDissertations, Academic
dc.subject.lcshNeural networks (Computer science)
dc.subject.lcshDeep learning (Machine learning)
dc.subject.lcshMusical instruments--Data processing
dc.subject.lcshMusical instruments--Classification--Data processing
dc.subject.lcshInformation storage and retrieval systems--Music
dc.subject.lcshMusic--Data processing
dc.subject.lcshMusic--Acoustics and physics
dc.titleClassifying musical instruments through neural approaches: an empirical study
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
PARK_JUHYOUNG_MSC_2026.pdf
Size:
6.79 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.33 KB
Format:
Item-specific license agreed upon to submission
Description: