Full metadata record
DC FieldValueLanguage
dc.contributor.authorInan, Berkay-
dc.contributor.authorCernak, Milos-
dc.contributor.authorGrabner, Helmut-
dc.contributor.authorTukuljac, Helena Peic-
dc.contributor.authorPena, Rodrigo C. G.-
dc.contributor.authorRicaud, Benjamin-
dc.date.accessioned2019-10-18T08:54:17Z-
dc.date.available2019-10-18T08:54:17Z-
dc.date.issued2019-
dc.identifier.urihttps://digitalcollection.zhaw.ch/handle/11475/18478-
dc.description.abstractSource separation involving mono-channel audio is a challenging problem, in particular for speech separation where source contributions overlap both in time and frequency. This task is of high interest for applications such as video conferencing. Recent progress in machine learning has shown that the combination of visual cues, coming from the video, can increase the source separation performance. Starting from a recently designed deep neural network, we assess its ability and robustness to separate the visible speakers’ speech from other interfering speeches or signals. We test it for different configuration of video recordings where the speaker’s face may not be fully visible. We also asses the performance of the network with respect to different sets of visual features from the speakers’ faces.de_CH
dc.language.isoende_CH
dc.publisherInternational Speech Communication Association (ISCA)de_CH
dc.rightsLicence according to publishing contractde_CH
dc.subjectSpeech enhancementde_CH
dc.subjectSource separationde_CH
dc.subjectMulti-modalde_CH
dc.subjectAaudiovisualde_CH
dc.subject.ddc621.3: Elektro-, Kommunikations-, Steuerungs- und Regelungstechnikde_CH
dc.titleEvaluating audiovisual source separation in the context of video conferencingde_CH
dc.typeKonferenz: Paperde_CH
dcterms.typeTextde_CH
zhaw.departementSchool of Engineeringde_CH
zhaw.organisationalunitInstitut für Datenanalyse und Prozessdesign (IDP)de_CH
dc.identifier.doi10.21437/Interspeech.2019-2671de_CH
zhaw.conference.detailsInterspeech 2019, Graz, Austria, 15-19 September 2019de_CH
zhaw.funding.euNode_CH
zhaw.originated.zhawYesde_CH
zhaw.pages.end4583de_CH
zhaw.pages.start4579de_CH
zhaw.publication.statuspublishedVersionde_CH
zhaw.publication.reviewPeer review (Publikation)de_CH
zhaw.title.proceedingsProceedings Interspeech 2019de_CH
zhaw.author.additionalNode_CH
Appears in collections:Publikationen School of Engineering

Files in This Item:
There are no files associated with this item.
Show simple item record
Inan, B., Cernak, M., Grabner, H., Tukuljac, H. P., Pena, R. C. G., & Ricaud, B. (2019). Evaluating audiovisual source separation in the context of video conferencing [Conference paper]. Proceedings Interspeech 2019, 4579–4583. https://doi.org/10.21437/Interspeech.2019-2671
Inan, B. et al. (2019) ‘Evaluating audiovisual source separation in the context of video conferencing’, in Proceedings Interspeech 2019. International Speech Communication Association (ISCA), pp. 4579–4583. Available at: https://doi.org/10.21437/Interspeech.2019-2671.
B. Inan, M. Cernak, H. Grabner, H. P. Tukuljac, R. C. G. Pena, and B. Ricaud, “Evaluating audiovisual source separation in the context of video conferencing,” in Proceedings Interspeech 2019, 2019, pp. 4579–4583. doi: 10.21437/Interspeech.2019-2671.
INAN, Berkay, Milos CERNAK, Helmut GRABNER, Helena Peic TUKULJAC, Rodrigo C. G. PENA und Benjamin RICAUD, 2019. Evaluating audiovisual source separation in the context of video conferencing. In: Proceedings Interspeech 2019. Conference paper. International Speech Communication Association (ISCA). 2019. S. 4579–4583
Inan, Berkay, Milos Cernak, Helmut Grabner, Helena Peic Tukuljac, Rodrigo C. G. Pena, and Benjamin Ricaud. 2019. “Evaluating Audiovisual Source Separation in the Context of Video Conferencing.” Conference paper. In Proceedings Interspeech 2019, 4579–83. International Speech Communication Association (ISCA). https://doi.org/10.21437/Interspeech.2019-2671.
Inan, Berkay, et al. “Evaluating Audiovisual Source Separation in the Context of Video Conferencing.” Proceedings Interspeech 2019, International Speech Communication Association (ISCA), 2019, pp. 4579–83, https://doi.org/10.21437/Interspeech.2019-2671.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.