Tampere University of Technology

TUTCRIS Research Portal

Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review


Original languageEnglish
Title of host publication2018 26th European Signal Processing Conference (EUSIPCO)
Number of pages5
ISBN (Electronic)978-9-0827-9701-5
ISBN (Print)978-1-5386-3736-4
Publication statusPublished - Sep 2018
Publication typeA4 Article in a conference publication
EventEuropean Signal Processing Conference -
Duration: 1 Jan 1900 → …

Publication series

ISSN (Electronic)2076-1465


ConferenceEuropean Signal Processing Conference
Period1/01/00 → …


This paper proposes a deep neural network for estimating the directions of arrival (DOA) of multiple sound sources. The proposed stacked convolutional and recurrent neural network (DOAnet) generates a spatial pseudo-spectrum (SPS) along with the DOA estimates in both azimuth and elevation. We avoid any explicit feature extraction step by using the magnitudes and phases of the spectrograms of all the channels as input to the network. The proposed DOAnet is evaluated by estimating the DOAs of multiple concurrently present sources in anechoic, matched and unmatched reverberant conditions. The results show that the proposed DOAnet is capable of estimating the number of sources and their respective DOAs with good precision and generate SPS with high signal-to-noise ratio.


  • array signal processing, direction-of-arrival estimation, feature extraction, feedforward neural nets, recurrent neural nets, signal classification, spatial pseudospectrum, SPS, DOA estimates, explicit feature extraction step, DOAnet, multiple concurrently present sources, anechoic unmatched reverberant conditions, matched unmatched reverberant conditions, arrival estimation, multiple sound sources, convolutional recurrent neural network, deep neural network, Direction-of-arrival estimation, Estimation, Azimuth, Feature extraction, Spectrogram, Multiple signal classification, Two dimensional displays

Publication forum classification