Speaker clustering using dominant sets

Hibraj, Feliks; Vascon, Sebastiano; Stadelmann, Thilo; Pelillo, Marcello

doi:10.1109/ICPR.2018.8546067

Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: https://doi.org/10.21256/zhaw-4254

Publikationstyp:	Konferenz: Paper
Art der Begutachtung:	Peer review (Publikation)
Titel:	Speaker clustering using dominant sets
Autor/-in:	Hibraj, Feliks Vascon, Sebastiano Stadelmann, Thilo Pelillo, Marcello
DOI:	10.1109/ICPR.2018.8546067 10.21256/zhaw-4254
Tagungsband:	2018 24th International Conference on Pattern Recognition (ICPR)
Seite(n):	3549
Seiten bis:	3554
Angaben zur Konferenz:	24th International Conference on Pattern Recognition (ICPR 2018), Beijing, China, 20-28 August 2018
Erscheinungsdatum:	2018
Verlag / Hrsg. Institution:	IEEE
ISBN:	978-1-5386-3788-3
Sprache:	Englisch
Schlagwörter:	Speaker recognition; Speaker embeddings
Fachgebiet (DDC):	006: Spezielle Computerverfahren
Zusammenfassung:	Speaker clustering is the task of forming speaker-specific groups based on a set of utterances. In this paper, we address this task by using Dominant Sets (DS). DS is a graphbased clustering algorithm with interesting properties that fits well to our problem and has never been applied before to speaker clustering. We report on a comprehensive set of experiments on the TIMIT dataset against standard clustering techniques and specific speaker clustering methods. Moreover, we compare performances under different features by using ones learned via deep neural network directly on TIMIT and other ones extracted from a pre-trained VGGVox net. To asses the stability, we perform a sensitivity analysis on the free parameters of our method, showing that performance is stable under parameter changes. The extensive experimentation carried out confirms the validity of the proposed method, reporting state-of-the-art results under three different standard metrics. We also report reference baseline results for speaker clustering on the entire TIMIT dataset for the first time.
URI:	https://digitalcollection.zhaw.ch/handle/11475/6081
Volltext Version:	Eingereichte Version
Lizenz (gemäss Verlagsvertrag):	Keine Angabe
Departement:	School of Engineering
Organisationseinheit:	Institut für Informatik (InIT)
Enthalten in den Sammlungen:	Publikationen School of Engineering

Dateien zu dieser Ressource:

Datei	Beschreibung	Größe	Format
ICPR18b.pdf		1.1 MB	Adobe PDF	Öffnen/Anzeigen

Zur Langanzeige

Hibraj, F., Vascon, S., Stadelmann, T., & Pelillo, M. (2018). Speaker clustering using dominant sets [Conference paper]. 2018 24th International Conference on Pattern Recognition (ICPR), 3549–3554. https://doi.org/10.1109/ICPR.2018.8546067

Hibraj, F. et al. (2018) ‘Speaker clustering using dominant sets’, in 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, pp. 3549–3554. Available at: https://doi.org/10.1109/ICPR.2018.8546067.

F. Hibraj, S. Vascon, T. Stadelmann, and M. Pelillo, “Speaker clustering using dominant sets,” in 2018 24th International Conference on Pattern Recognition (ICPR), 2018, pp. 3549–3554. doi: 10.1109/ICPR.2018.8546067.

HIBRAJ, Feliks, Sebastiano VASCON, Thilo STADELMANN und Marcello PELILLO, 2018. Speaker clustering using dominant sets. In: 2018 24th International Conference on Pattern Recognition (ICPR). Conference paper. IEEE. 2018. S. 3549–3554. ISBN 978-1-5386-3788-3

Hibraj, Feliks, Sebastiano Vascon, Thilo Stadelmann, and Marcello Pelillo. 2018. “Speaker Clustering Using Dominant Sets.” Conference paper. In 2018 24th International Conference on Pattern Recognition (ICPR), 3549–54. IEEE. https://doi.org/10.1109/ICPR.2018.8546067.

Hibraj, Feliks, et al. “Speaker Clustering Using Dominant Sets.” 2018 24th International Conference on Pattern Recognition (ICPR), IEEE, 2018, pp. 3549–54, https://doi.org/10.1109/ICPR.2018.8546067.

Alle Ressourcen in diesem Repository sind urheberrechtlich geschützt, soweit nicht anderweitig angezeigt.