Classifying musical instruments through neural approaches: an empirical study
| dc.contributor.author | Park, Juhyoung | |
| dc.contributor.author | University of Lethbridge. Faculty of Arts and Science | |
| dc.contributor.supervisor | Zhang, John Z. | |
| dc.date.accessioned | 2026-05-21T15:43:22Z | |
| dc.date.issued | 2026 | |
| dc.degree.level | Masters | |
| dc.description.abstract | Musical instrument classification is an important task in Music Information Retrieval (MIR), yet achieving robust performance on real-world music remains challenging. In this thesis, we investigate multi-class, multi-label musical instrument classification through neural approaches. Our study uses multi-genre instrument mixtures derived from MUSDB18 and MedleyDB, two popular MIR datasets, with Mel-spectrograms, Mel-frequency cepstral coefficients (MFCCs), the Constant-Q transform (CQT), Chroma Energy Normalized Statistics (CENS), and zero-crossing rate as audio features. Principal Component Analysis (PCA), Incremental PCA, and upsampling techniques are also employed to facilitate our experiments. We design and conduct a series of empirical experiments using our proposed neural architectures on the two datasets, and we evaluate and discuss the results. We find that simple models based on Artificial Neural Networks (ANNs) show lower performance when classifying mixed-instrument classes, and that hierarchical models built from multiple simple ANN-based models show slightly improved performance. Models based on Convolutional Neural Networks (CNNs) outperform the ANN-based models, and using combined audio feature images as input to the CNN-based models improves performance on the mixed-instrument classes. We expect our approach to achieve better performance in real-world settings. | |
| dc.embargo | No | |
| dc.identifier.uri | https://hdl.handle.net/10133/7412 | |
| dc.language.iso | en | |
| dc.publisher | Lethbridge, Alta. : University of Lethbridge, Dept. of Mathematics and Computer Science | |
| dc.publisher.department | Department of Mathematics and Computer Science | |
| dc.publisher.faculty | Arts and Science | |
| dc.relation.ispartofseries | Thesis (University of Lethbridge. Faculty of Arts and Science) | |
| dc.subject | musical instrument classification | |
| dc.subject | neural network | |
| dc.subject | Convolutional Neural Network | |
| dc.subject | Artificial Neural Network | |
| dc.subject | deep learning | |
| dc.subject | musical instrument | |
| dc.subject | classification | |
| dc.subject | audio feature | |
| dc.subject | db-MFCCs | |
| dc.subject | combined audio feature images | |
| dc.subject.lcsh | Dissertations, Academic | |
| dc.subject.lcsh | Neural networks (Computer science) | |
| dc.subject.lcsh | Deep learning (Machine learning) | |
| dc.subject.lcsh | Musical instruments--Data processing | |
| dc.subject.lcsh | Musical instruments--Classification--Data processing | |
| dc.subject.lcsh | Information storage and retrieval systems--Music | |
| dc.subject.lcsh | Music--Data processing | |
| dc.subject.lcsh | Music--Acoustics and physics | |
| dc.title | Classifying musical instruments through neural approaches: an empirical study | |
| dc.type | Thesis |