Discriminative classifiers for speaker recognition

Katz, Marcel

Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: http://dx.doi.org/10.25673/4890

Langanzeige der Metadaten

DC Element	Wert	Sprache
dc.contributor.author	Katz, Marcel	-
dc.date.accessioned	2018-09-24T16:20:22Z	-
dc.date.available	2018-09-24T16:20:22Z	-
dc.date.issued	2008	-
dc.identifier.uri	https://opendata.uni-halle.de//handle/1981185920/10932	-
dc.identifier.uri	http://dx.doi.org/10.25673/4890	-
dc.description.abstract	Due to the growing need for security applications speaker recognition as the biometric task of authenticating a claimant by voice has currently become a focus of interest. Traditionally approaches in the area of speaker recognition were mainly based on generative classifiers like Gaussian Mixture Models (GMMs). However, more recently other classifiers like Support Vector Machines (SVMs) have been successfully applied to several fields of pattern recognition. These discriminative classifiers which are theoretically derived from statistical learning theory obtain a high generalization ability. Therefore these so called discriminative methods have also been discussed as a promising approach to specifically improve performance of speaker recognition systems. Following this train of thought, this work focuses on the development and integration of different discriminative classifiers into the field of speaker recognition. As an alternative to the SVM we present the Sparse Kernel Logistic Regression (SKLR), a sparse non-linear expansion of the well known Logistic Regression. In contrast to Support Vector Machines the SKLR directly models the posterior probability of class membership and therefore naturally provides a probability output. For this reason a new speaker recognition environment is designed and implemented which includes two different recognition approaches, one for limited and one for large (the so called extended) training data. In the first recognition approach the discriminative classifiers are applied directly on feature vectors from parameterized speech frames and it is shown that both, SVM as well as SKLR outperform traditional GMM methods. In the second approach a state-of-the-art speaker recognition system for large amount of training data is designed that combines Gaussian Mixture Models (GMM) with discriminative classifiers and integrates the SKLR into this system. Furthermore, we investigate different feature extraction methods for speaker recognition on large amount of training data. It is shown that the application of fusion schemes which combine these subsystems yield a significant improvement of the recognition performance in comparison to the application of single subsystems. All presented approaches are evaluated on internationally recognized corpora and were published in appropriate international media. The comparison of our speaker recognition systems with other state-of-the-art systems revealed equal or significantly better recognition performance.	eng
dc.description.statementofresponsibility	von Marcel Katz	-
dc.format.extent	Online-Ressource (PDF-Datei: 149 S., 1282 KB)	-
dc.language.iso	eng	-
dc.publisher	Universitätsbibliothek	-
dc.publisher	Otto von Guericke University Library, Magdeburg, Germany	-
dc.subject	Hochschulschrift	-
dc.subject	Online-Publikation	-
dc.subject.ddc	006	-
dc.title	Discriminative classifiers for speaker recognition	-
dc.title.alternative	Diskriminative Klassifizierer zur Sprechererkennung	-
dcterms.type	Hochschulschrift	-
dc.type	PhDThesis	-
dc.identifier.urn	urn:nbn:de:101:1-201010182599	-
local.publisher.universityOrInstitution	Otto-von-Guericke-Universität Magdeburg	-
local.subject.keywords	Speaker Recognition, Speaker Verification, Sparse Kernel Logistic Regression, Support Vector Machine	-
local.openaccess	true	-
Enthalten in den Sammlungen:	Fakultät für Elektrotechnik und Informationstechnik

Dateien zu dieser Ressource:

Datei	Beschreibung	Größe	Format
markatz.pdf		1.41 MB	Adobe PDF	Öffnen/Anzeigen

Zur Kurzanzeige BibTeX EndNote