SciELO - Scientific Electronic Library Online

 
 número79Modelo de segmentación de campos aleatorios de Markov para imágenes de manchas de lagartoImpacto de la probabilidad de sensado erróneo en redes de sensores inalámbricas basadas en clusters con amplias áreas de cobertura índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • En proceso de indezaciónCitado por Google
  • No hay articulos similaresSimilares en SciELO
  • En proceso de indezaciónSimilares en Google

Compartir


Revista Facultad de Ingeniería Universidad de Antioquia

versión impresa ISSN 0120-6230

Resumen

GOMEZ-GARCIA, Jorge Andrés; MORO-VELAZQUEZ, Laureano; GODINO-LLORENTE, Juan Ignacio  y  CASTELLANOS-DOMINGUEZ, César Germán. An insight to the automatic categorization of speakers according to sex and its application to the detection of voice pathologies: A comparative study. Rev.fac.ing.univ. Antioquia [online]. 2016, n.79, pp.50-62. ISSN 0120-6230.  https://doi.org/10.17533/udea.redin.n79a06.

An automatic categorization of the speakers according to their sex improves the performance of an automatic detector of voice pathologies. This is grounded on findings demonstrating perceptual, acoustical and anatomical differences in males' and females' voices. In particular, this paper follows two objectives: 1) to design a system which automatically discriminates the sex of a speaker when using normophonic and pathological speech, 2) to study the influence that this sex detector has on the accuracy of a further voice pathology detector. The parameterization of the automatic sex detector relies on MFCC applied to speech; and MFCC applied to glottal waveforms plus parameters modeling the vocal tract. The glottal waveforms are extracted from speech via iterative lattice inverse filters. Regarding the pathology detector, a MFCC parameterization is applied to speech signals. Classification, in both sex and pathology detectors, is carried out using state of the art techniques based on universal background models. Experiments are performed in the Saarbrücken database, employing the sustained phonation of vowel /a/. Results indicate that the sex of the speaker may be discriminated automatically using normophonic and pathological speech, obtaining accuracy up to 95%. Moreover, including the a-priori information about the sex of the speaker produces an absolute performance improvement in EER of about 2% on pathology detection tasks.

Palabras clave : Voice pathology detection; inverse filtering; GMM; UBM.

        · resumen en Español     · texto en Inglés     · Inglés ( pdf )