Evaluating audiovisual source separation in the context of video conferencing

Inan, Berkay; Cernak, Milos; Grabner, Helmut; Tukuljac, Helena Peic; Pena, Rodrigo C. G.; Ricaud, Benjamin

doi:10.21437/Interspeech.2019-2671

Full metadata record

DC Field	Value	Language
dc.contributor.author	Inan, Berkay	-
dc.contributor.author	Cernak, Milos	-
dc.contributor.author	Grabner, Helmut	-
dc.contributor.author	Tukuljac, Helena Peic	-
dc.contributor.author	Pena, Rodrigo C. G.	-
dc.contributor.author	Ricaud, Benjamin	-
dc.date.accessioned	2019-10-18T08:54:17Z	-
dc.date.available	2019-10-18T08:54:17Z	-
dc.date.issued	2019	-
dc.identifier.uri	https://digitalcollection.zhaw.ch/handle/11475/18478	-
dc.description.abstract	Source separation involving mono-channel audio is a challenging problem, in particular for speech separation, where source contributions overlap in both time and frequency. This task is of high interest for applications such as video conferencing. Recent progress in machine learning has shown that incorporating visual cues from the video can improve source separation performance. Starting from a recently designed deep neural network, we assess its ability and robustness to separate the visible speakers’ speech from other interfering speech or signals. We test it on different configurations of video recordings in which the speaker’s face may not be fully visible. We also assess the performance of the network with respect to different sets of visual features from the speakers’ faces.	de_CH
dc.language.iso	en	de_CH
dc.publisher	International Speech Communication Association (ISCA)	de_CH
dc.rights	Licence according to publishing contract	de_CH
dc.subject	Speech enhancement	de_CH
dc.subject	Source separation	de_CH
dc.subject	Multi-modal	de_CH
dc.subject	Audiovisual	de_CH
dc.subject.ddc	621.3: Electrical, communications, and control engineering	de_CH
dc.title	Evaluating audiovisual source separation in the context of video conferencing	de_CH
dc.type	Conference: Paper	de_CH
dcterms.type	Text	de_CH
zhaw.departement	School of Engineering	de_CH
zhaw.organisationalunit	Institut für Datenanalyse und Prozessdesign (IDP)	de_CH
dc.identifier.doi	10.21437/Interspeech.2019-2671	de_CH
zhaw.conference.details	Interspeech 2019, Graz, Austria, 15-19 September 2019	de_CH
zhaw.funding.eu	No	de_CH
zhaw.originated.zhaw	Yes	de_CH
zhaw.pages.end	4583	de_CH
zhaw.pages.start	4579	de_CH
zhaw.publication.status	publishedVersion	de_CH
zhaw.publication.review	Peer review (publication)	de_CH
zhaw.title.proceedings	Proceedings Interspeech 2019	de_CH
zhaw.author.additional	No	de_CH
Appears in collections:	Publications School of Engineering

Inan, B., Cernak, M., Grabner, H., Tukuljac, H. P., Pena, R. C. G., & Ricaud, B. (2019). Evaluating audiovisual source separation in the context of video conferencing [Conference paper]. Proceedings Interspeech 2019, 4579–4583. https://doi.org/10.21437/Interspeech.2019-2671

Inan, B. et al. (2019) ‘Evaluating audiovisual source separation in the context of video conferencing’, in Proceedings Interspeech 2019. International Speech Communication Association (ISCA), pp. 4579–4583. Available at: https://doi.org/10.21437/Interspeech.2019-2671.

B. Inan, M. Cernak, H. Grabner, H. P. Tukuljac, R. C. G. Pena, and B. Ricaud, “Evaluating audiovisual source separation in the context of video conferencing,” in Proceedings Interspeech 2019, 2019, pp. 4579–4583. doi: 10.21437/Interspeech.2019-2671.

INAN, Berkay, Milos CERNAK, Helmut GRABNER, Helena Peic TUKULJAC, Rodrigo C. G. PENA and Benjamin RICAUD, 2019. Evaluating audiovisual source separation in the context of video conferencing. In: Proceedings Interspeech 2019. Conference paper. International Speech Communication Association (ISCA). 2019. pp. 4579–4583.

Inan, Berkay, Milos Cernak, Helmut Grabner, Helena Peic Tukuljac, Rodrigo C. G. Pena, and Benjamin Ricaud. 2019. “Evaluating Audiovisual Source Separation in the Context of Video Conferencing.” Conference paper. In Proceedings Interspeech 2019, 4579–83. International Speech Communication Association (ISCA). https://doi.org/10.21437/Interspeech.2019-2671.

Inan, Berkay, et al. “Evaluating Audiovisual Source Separation in the Context of Video Conferencing.” Proceedings Interspeech 2019, International Speech Communication Association (ISCA), 2019, pp. 4579–83, https://doi.org/10.21437/Interspeech.2019-2671.