Abstract: This paper presents a comprehensive study on automatic music genre classification using machine learning techniques applied to the widely-used GTZAN dataset. We extracted 13 audio features including spectral and temporal characteristics to classify music into 10 distinct genres. Our Random Forest classifier achieved 78.2% accuracy, outperforming baseline models including Support Vector Machines (72.4%) and K-Nearest Neighbors (69.1%). Feature importance analysis revealed that Mel-Frequency Cepstral Coefficients (MFCCs) and spectral centroid are the most discriminative features.........
Key Word: Music Information Retrieval, Genre Classification, Audio Feature Extraction, Random Forest, GTZAN Dataset.
[1]
Bergstra, J., Casagrande, N., Erhan, D., Eck, D., & Kégl, B. (2006). Aggregate Features And Adaboost For Music Classification. Machine Learning, 65(2-3), 473-484.
[2]
Bertin-Mahieux, T., Ellis, D. P., Whitman, B., & Lamere, P. (2011). The Million Song Dataset. Proceedings Of The 12th International Society For Music Information Retrieval Conference, 591-596.
[3]
Caruana, R., Karampatziakis, N., & Yessenalina, A. (2008). An Empirical Evaluation Of Supervised Learning In High Dimensions. Proceedings Of The 25th International Conference On Machine Learning, 96-103.
[4]
Choi, K., Fazekas, G., Sandler, M., & Cho, K. (2017). Convolutional Recurrent Neural Networks For Music Classification. Proceedings Of The IEEE International Conference On Acoustics, Speech And Signal Processing, 2392-2396.
[5]
Costa, P. T., & Mccrae, R. R. (1992). Revised NEO Personality Inventory (NEO-PI-R) And NEO Five-Factor Inventory (NEO-FFI) Professional Manual. Psychological Assessment Resources.