Recognition of Pathological Voices by Human Factor Cepstral Coefficients (HFCC)

Rabeh Hamdi; Salah Hajji; Adnene Cherif

doi:10.3844/jcssp.2020.1085.1099

Research Article Open Access

Recognition of Pathological Voices by Human Factor Cepstral Coefficients (HFCC)

Rabeh Hamdi¹, Salah Hajji¹ and Adnene Cherif¹

¹ University of Tunis El Manar, Tunisia

Abstract

Human speech is a means of communication that is very important in our daily lives. It is characterized by its great ability to transmit our ideas, our emotions, our personality etc. So, any alteration of the voice can prevent the person from exercising his professional and daily life naturally. It is for these reasons that it is very necessary to implement systems for detecting and classifying vocal pathologies. These automatic systems can help clinicians customize and detect the existence of any vocal pathology. In this context, several tools have been introduced to achieve early detection of voice disorders. Among these tools are the Human Factor Cepstral Coefficients (HFCC) combined with prosodic parameters, the Noise-Harmonic Ratio (NHR), the Harmonic-Noise Ratio (HNR), analysis of trend Fluctuations (DFA) and Fundamental frequency (F0). These parameters are introduced and calculated in every frame. In this study, we used a variation of HFCC called Equivalent Rectangular Bandwidth (ERB) to study the effects of HFCC on the classification of pathological voices. Using the HTK classifiers, the classification is carried out on two pathological databases, Massachusetts Eye and Ear Infirmary (MEEI) and Saarbruecken Voice Database (SVD). To assess the performance of the system, we used sensitivity and specificity.

Journal of Computer Science

Volume 16 No. 8, 2020, 1085-1099

DOI: https://doi.org/10.3844/jcssp.2020.1085.1099

Submitted On: 11 May 2020 Published On: 7 August 2020

How to Cite: Hamdi, R., Hajji, S. & Cherif, A. (2020). Recognition of Pathological Voices by Human Factor Cepstral Coefficients (HFCC). Journal of Computer Science, 16(8), 1085-1099. https://doi.org/10.3844/jcssp.2020.1085.1099

Copyright: © 2020 Rabeh Hamdi, Salah Hajji and Adnene Cherif. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

5,937 Views
2,862 Downloads
6 Citations

Download

Keywords

Pathological Voices
Sensibility
Specificity
ERB
HFCC
HTK
MEEI
SVD