
Multi-Source Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals

  • Nils Poschadel
  • Stephan Preihs
  • Jürgen Peissig

Publication: Contribution to book/report/anthology/conference proceedings, Conference paper, Research, Peer-reviewed

Abstract

Convolutional recurrent neural networks provide state-of-the-art results in direction of arrival estimation based on first-order Ambisonics signals, especially in the presence of noise and/or interfering sound sources. In this work, we investigate whether increasing the Ambisonics order up to the fourth order further improves the estimation results in a challenging multi-speaker setting with two or three simultaneously active speakers. Our results show that each additional order of the Ambisonics representation further improves the localization performance for speech signals based on both simulated and measured spatial room impulse responses. The greatest gains in accuracy are observed in the particularly demanding scenarios with three speakers and a poor signal-to-interference ratio.
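
For context on what "increasing the order" means for the network input: a full-sphere Ambisonics signal of order N carries (N + 1)^2 channels. The short Python sketch below only illustrates this standard relation for the orders named in the abstract. The function name is hypothetical, and nothing here is taken from the authors' implementation.

```python
def ambisonics_channel_count(order: int) -> int:
    """Channel count of a full-sphere (3D) Ambisonics signal of the given order.

    Uses the standard relation (order + 1) ** 2. The function name is
    hypothetical and chosen for illustration only.
    """
    if order < 0:
        raise ValueError("order must be non-negative")
    return (order + 1) ** 2


# Orders one through four, the range investigated in the abstract.
channel_counts = {order: ambisonics_channel_count(order) for order in range(1, 5)}

for order, channels in channel_counts.items():
    print(f"order {order}: {channels} channels")
```

Going from first to fourth order therefore raises the input from 4 to 25 channels, which is the extra spatial information the abstract's comparison is about.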

Original language: English
Title of host publication: 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings
Publisher: IEEE
Pages: 1015-1019
Number of pages: 5
ISBN (electronic): 9789082797060
ISBN (print): 978-1-6654-0900-1
DOIs
Publication status: Published - 2021
Event: 29th European Signal Processing Conference, EUSIPCO 2021 - Dublin, Ireland
Duration: 23 Aug. 2021 to 27 Aug. 2021

Publication series

Name: European Signal Processing Conference
Volume: 2021-August
ISSN (print): 2219-5491
ISSN (electronic): 2076-1465

Conference

Conference: 29th European Signal Processing Conference, EUSIPCO 2021
Country/Territory: Ireland
City: Dublin
Period: 23 Aug. 2021 to 27 Aug. 2021

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
