Audiomate : a Python package for working with audio datasets

Büchi, Matthias; Ahlenstorf, Andreas

doi:10.21105/joss.02135

Please use this identifier to cite or link to this resource: https://doi.org/10.21256/zhaw-22925

Publication type:	Article in scientific journal
Type of review:	Peer review (publication)
Title:	Audiomate : a Python package for working with audio datasets
Authors:	Büchi, Matthias; Ahlenstorf, Andreas
et al.:	No
DOI:	10.21105/joss.02135; 10.21256/zhaw-22925
Published in:	Journal of Open Source Software
Volume:	5
Issue:	52
Page(s):	2135
Issue date:	2020
Publisher / Ed. institution:	Open Journals
ISSN:	2475-9066
Language:	English
Subject (DDC):	005: Computer programming, programs and data
Abstract:	Machine learning tasks in the audio domain frequently require large datasets of training data. In recent years, numerous datasets have been made available for various purposes, for example, (Snyder, Chen, & Povey, 2015) and (Ardila et al., 2019). Unfortunately, most of these datasets are stored in widely differing formats. As a consequence, machine learning practitioners have to convert datasets into other formats before they can be used or combined. Furthermore, common tasks like reading, partitioning, or shuffling datasets have to be implemented over and over again for each format and require intimate knowledge of the formats. We propose Audiomate, a Python toolkit, to solve this problem. Audiomate provides a uniform programming interface for working with numerous datasets. Knowledge about the structure or on-disk format of the datasets is not necessary. Audiomate facilitates and simplifies a wide range of tasks:
• Reading and writing of numerous dataset formats using a uniform programming interface, for example (Snyder et al., 2015), (Panayotov, Chen, Povey, & Khudanpur, 2015) and (Ardila et al., 2019).
• Accessing metadata, like speaker information and labels.
• Reading audio data (single files, batches of files).
• Retrieval of information about the data (e.g., number of speakers, total duration).
• Merging of multiple datasets (e.g., combine two speech datasets).
• Splitting data into smaller subsets (e.g., create training, validation, and test sets with a reasonable distribution of classes).
• Validation of data for specific requirements (e.g., check whether all samples were assigned a label).
URI:	https://digitalcollection.zhaw.ch/handle/11475/22925
Fulltext version:	Published version
License (according to publishing contract):	CC BY 4.0: Attribution 4.0 International
Department:	School of Engineering
Organisational unit:	Institute of Computer Science (InIT)
Appears in collections:	Publications School of Engineering
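
The tasks listed in the abstract (retrieving statistics, merging, validating labels, and splitting with a reasonable class distribution) can be illustrated with a small, self-contained sketch. This is not Audiomate's API: the names `Utterance`, `ToyCorpus`, `merge`, `unlabeled`, and `split` are hypothetical and chosen only for illustration, and the per-label split strategy is an assumption about what "a reasonable distribution of classes" might mean in practice.

```python
import random
from collections import defaultdict
from dataclasses import dataclass, field, replace
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Utterance:
    """One audio sample with its metadata (hypothetical structure)."""
    idx: str
    speaker: str
    duration: float  # seconds
    label: Optional[str] = None


@dataclass
class ToyCorpus:
    """In-memory stand-in for a dataset, independent of any on-disk format."""
    utterances: Dict[str, Utterance] = field(default_factory=dict)

    def add(self, utt: Utterance) -> None:
        self.utterances[utt.idx] = utt

    @property
    def num_speakers(self) -> int:
        return len({u.speaker for u in self.utterances.values()})

    @property
    def total_duration(self) -> float:
        return sum(u.duration for u in self.utterances.values())

    def unlabeled(self) -> List[str]:
        """Validation: ids of all utterances that have no label."""
        return sorted(u.idx for u in self.utterances.values() if u.label is None)

    def merge(self, other: "ToyCorpus", prefix: str) -> "ToyCorpus":
        """Return a new corpus; ids from `other` are prefixed to avoid clashes."""
        merged = ToyCorpus(dict(self.utterances))
        for utt in other.utterances.values():
            merged.add(replace(utt, idx=f"{prefix}-{utt.idx}"))
        return merged

    def split(self, proportions: Dict[str, float], seed: int = 0) -> Dict[str, "ToyCorpus"]:
        """Split each label group separately so every subset keeps the class mix."""
        rng = random.Random(seed)
        by_label = defaultdict(list)
        for utt in self.utterances.values():
            by_label[utt.label].append(utt)

        names = list(proportions)
        total = sum(proportions.values())
        subsets = {name: ToyCorpus() for name in names}
        for label in sorted(by_label, key=str):
            group = by_label[label]
            rng.shuffle(group)
            start, acc = 0, 0.0
            for i, name in enumerate(names):
                acc += proportions[name] / total
                # The last subset takes the remainder so no utterance is dropped.
                end = len(group) if i == len(names) - 1 else round(acc * len(group))
                for utt in group[start:end]:
                    subsets[name].add(utt)
                start = end
        return subsets


# Toy data: 10 "speech" and 10 "music" utterances from 3 speakers.
corpus = ToyCorpus()
for i in range(10):
    corpus.add(Utterance(f"s{i}", f"spk{i % 3}", 2.0, "speech"))
    corpus.add(Utterance(f"m{i}", f"spk{i % 3}", 3.0, "music"))

parts = corpus.split({"train": 0.8, "test": 0.2})
print(len(parts["train"].utterances), len(parts["test"].utterances))  # 16 4
```

Splitting per label group rather than over the whole corpus is one simple way to keep the class proportions of each subset close to those of the full dataset; the package itself may use a different strategy.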

Files in this item:

File	Description	Size	Format
2021_Buechi-Ahlenstorf_audiomate-Python-package.pdf		165.98 kB	Adobe PDF


Büchi, M., & Ahlenstorf, A. (2020). Audiomate : a Python package for working with audio datasets. Journal of Open Source Software, 5(52), 2135. https://doi.org/10.21105/joss.02135

Büchi, M. and Ahlenstorf, A. (2020) ‘Audiomate : a Python package for working with audio datasets’, Journal of Open Source Software, 5(52), p. 2135. Available at: https://doi.org/10.21105/joss.02135.

M. Büchi and A. Ahlenstorf, “Audiomate : a Python package for working with audio datasets,” Journal of Open Source Software, vol. 5, no. 52, p. 2135, 2020, doi: 10.21105/joss.02135.

BÜCHI, Matthias and Andreas AHLENSTORF, 2020. Audiomate : a Python package for working with audio datasets. Journal of Open Source Software. 2020. Vol. 5, no. 52, p. 2135. DOI 10.21105/joss.02135

Büchi, Matthias, and Andreas Ahlenstorf. 2020. “Audiomate : A Python Package for Working with Audio Datasets.” Journal of Open Source Software 5 (52): 2135. https://doi.org/10.21105/joss.02135.

Büchi, Matthias, and Andreas Ahlenstorf. “Audiomate : A Python Package for Working with Audio Datasets.” Journal of Open Source Software, vol. 5, no. 52, 2020, p. 2135, https://doi.org/10.21105/joss.02135.

All items in this repository are protected by copyright, unless otherwise indicated.