Abstract: Background: Voice Similarity Analysis, is useful to identify and compare voices and is crucial in various applications, including speaker identification, music analysis, and surveillance. Traditional methods rely on features like pitch and formants, but they can be susceptible to variations in speaking style and environment. Spectrogram analysis offers a robust alternative by capturing the overall frequency distribution of the voice signal. Proposed a GUI application that utilizes cosine similarity on Mel-spectrograms to compare voice samples. User upload audio files, and the application extracts, resizes, and compares their spectrograms. A predefined similarity threshold determines whether......
Keywords: Mel-spectogram; Frequency; Libraries; Cosine Similarity, matplot lib; librosa; tkinter; Scikit-learn
[1].
Librosa: Mcfee, B., Raffel, C., Liang, D., Ellis, D. P., Mcvicar, M., & Battenberg, E. (2015). Librosa: Audio And Music Signal Analysis In Python. In Proceedings Of The 14th Python In Science Conference (Pp. 18-25).
[2].
Tkinter:Python Software Foundation. (2023). Tkinter Documentation. Retrieved From
Https://Docs.Python.Org/3/Library/Tkinter.Html
[3].
Matplotlib:Hunter, J. D. (2007). Matplotlib: A 2D Graphics Environment. Computing In Science & Engineering, 9(3), 90-95.
[4].
Numpy: Harris, C. R., Millman, K. J., Van Der Walt, S. J., Gommers, R., Virtanen, P., Cournapeau, D., ... & Oliphant, T. E. (2020). Array Programming With Numpy. Nature, 585(7825), 357-362.
[5].
Scikit-Learn (For Cosine_Similarity): Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., ... & Vanderplas, J. (2011). Scikit-Learn: Machine Learning In Python. Journal Of Machine Learning Research, 12(Oct), 2825-2830