Please use this identifier to cite or link to this resource:
https://doi.org/10.21256/zhaw-4254
Publication type: | Conference: Paper |
Type of review: | Peer review (publication) |
Title: | Speaker clustering using dominant sets |
Authors: | Hibraj, Feliks; Vascon, Sebastiano; Stadelmann, Thilo; Pelillo, Marcello |
DOI: | 10.1109/ICPR.2018.8546067; 10.21256/zhaw-4254 |
Proceedings: | 2018 24th International Conference on Pattern Recognition (ICPR) |
Page(s): | 3549 |
Pages to: | 3554 |
Conference details: | 24th International Conference on Pattern Recognition (ICPR 2018), Beijing, China, 20-28 August 2018 |
Issue date: | 2018 |
Publisher / Ed. Institution: | IEEE |
ISBN: | 978-1-5386-3788-3 |
Language: | English |
Keywords: | Speaker recognition; Speaker embeddings |
Subject (DDC): | 006: Special computer methods |
Abstract: | Speaker clustering is the task of forming speaker-specific groups based on a set of utterances. In this paper, we address this task by using Dominant Sets (DS). DS is a graph-based clustering algorithm with interesting properties that suits our problem well and has never before been applied to speaker clustering. We report on a comprehensive set of experiments on the TIMIT dataset against standard clustering techniques and specific speaker clustering methods. Moreover, we compare performance across different features, using features learned by a deep neural network directly on TIMIT and features extracted from a pre-trained VGGVox network. To assess stability, we perform a sensitivity analysis on the free parameters of our method, showing that performance is stable under parameter changes. The extensive experimentation carried out confirms the validity of the proposed method, reporting state-of-the-art results under three different standard metrics. We also report reference baseline results for speaker clustering on the entire TIMIT dataset for the first time. |
URI: | https://digitalcollection.zhaw.ch/handle/11475/6081 |
Fulltext version: | Submitted version |
License (according to publishing contract): | Not specified |
Department: | School of Engineering |
Organisational unit: | Institut für Informatik (InIT) |
Appears in collections: | Publikationen School of Engineering |
Files in this resource:
File | Description | Size | Format |
---|---|---|---|
ICPR18b.pdf | | 1.1 MB | Adobe PDF |
Hibraj, F., Vascon, S., Stadelmann, T., & Pelillo, M. (2018). Speaker clustering using dominant sets [Conference paper]. 2018 24th International Conference on Pattern Recognition (ICPR), 3549–3554. https://doi.org/10.1109/ICPR.2018.8546067
Hibraj, F. et al. (2018) ‘Speaker clustering using dominant sets’, in 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, pp. 3549–3554. Available at: https://doi.org/10.1109/ICPR.2018.8546067.
F. Hibraj, S. Vascon, T. Stadelmann, and M. Pelillo, “Speaker clustering using dominant sets,” in 2018 24th International Conference on Pattern Recognition (ICPR), 2018, pp. 3549–3554. doi: 10.1109/ICPR.2018.8546067.
HIBRAJ, Feliks, Sebastiano VASCON, Thilo STADELMANN and Marcello PELILLO, 2018. Speaker clustering using dominant sets. In: 2018 24th International Conference on Pattern Recognition (ICPR). Conference paper. IEEE. 2018. pp. 3549–3554. ISBN 978-1-5386-3788-3
Hibraj, Feliks, Sebastiano Vascon, Thilo Stadelmann, and Marcello Pelillo. 2018. “Speaker Clustering Using Dominant Sets.” Conference paper. In 2018 24th International Conference on Pattern Recognition (ICPR), 3549–54. IEEE. https://doi.org/10.1109/ICPR.2018.8546067.
Hibraj, Feliks, et al. “Speaker Clustering Using Dominant Sets.” 2018 24th International Conference on Pattern Recognition (ICPR), IEEE, 2018, pp. 3549–54, https://doi.org/10.1109/ICPR.2018.8546067.
All resources in this repository are protected by copyright unless otherwise indicated.