GMM-based speaker age and gender classification in Czech and Slovak
Jiří Přibil, Anna Přibilová, Jindřich Matoušek
The paper describes an experiment with using the Gaussian mixture models (GMM) for automatic classification of the speaker age and gender. It analyses and compares the influence of different number of mixtures and different types of speech features used for GMM gender/age classification. Dependence of the computational complexity on the number of used mixtures is also analysed. Finally, the GMM classification accuracy is compared with the output of the conventional listening tests. The results of these objective and subjective evaluations are in correspondence.
Keywords: GMM classifier, spectral and prosodic features of speech, speaker gender and age classification
|