TY - JOUR
T1 - Multi-Source Direction-of-Arrival Estimation Using Steered Response Power and Group-Sparse Optimization
AU - Tengan, Elisa
AU - Dietzen, Thomas
AU - Elvander, Filip
AU - Waterschoot, Toon van
N1 - Publisher Copyright:
IEEE
PY - 2024
Y1 - 2024
N2 - In this paper, a method is proposed for estimating the direction of arrival (DOA) of multiple broadband sound sources. This is achieved through the solution of a group-sparse optimization problem, which models an observed broadband steered response power (SRP) map as a linear function of power spectral densities (PSDs), corresponding to a set of candidate DOAs, and forming a PSD vector. Given the assumption of spatial sparsity, the estimation of the source DOAs is then accomplished by identifying peaks in the resulting spatial power density, i.e., the estimated direction-specific PSDs integrated over frequency. The motivation behind the proposed method lies in its potential to reveal more distinct peaks in the estimated spatial power density than those directly observed in the broadband SRP map, which can be beneficial to the robustness in DOA estimation performance when multiple sources need to be distinguished under varying acoustic conditions. An implementation of the proposed method using the alternating direction method of multipliers (ADMM) is presented, and the DOA estimation performance is evaluated with both simulated and experimental data. Results show that, especially in reverberant scenarios, the proposed method presents an advantage in locating closely spaced sources when compared to the conventional SRP-PHAT, the group-sparse iterative covariance-based estimation (GSPICE) method, and the wideband MUSIC method with geometric averaging. Furthermore, it is observed that for a compact microphone array, the proposed method overall maintained its performance even when using SRP maps computed with grid resolutions that are lower than the sampling requirements of the broadband SRP function. Finally, results obtained with experimental data showed the validity and applicability of the proposed method in a practical meeting room environment.
AB - In this paper, a method is proposed for estimating the direction of arrival (DOA) of multiple broadband sound sources. This is achieved through the solution of a group-sparse optimization problem, which models an observed broadband steered response power (SRP) map as a linear function of power spectral densities (PSDs), corresponding to a set of candidate DOAs, and forming a PSD vector. Given the assumption of spatial sparsity, the estimation of the source DOAs is then accomplished by identifying peaks in the resulting spatial power density, i.e., the estimated direction-specific PSDs integrated over frequency. The motivation behind the proposed method lies in its potential to reveal more distinct peaks in the estimated spatial power density than those directly observed in the broadband SRP map, which can be beneficial to the robustness in DOA estimation performance when multiple sources need to be distinguished under varying acoustic conditions. An implementation of the proposed method using the alternating direction method of multipliers (ADMM) is presented, and the DOA estimation performance is evaluated with both simulated and experimental data. Results show that, especially in reverberant scenarios, the proposed method presents an advantage in locating closely spaced sources when compared to the conventional SRP-PHAT, the group-sparse iterative covariance-based estimation (GSPICE) method, and the wideband MUSIC method with geometric averaging. Furthermore, it is observed that for a compact microphone array, the proposed method overall maintained its performance even when using SRP maps computed with grid resolutions that are lower than the sampling requirements of the broadband SRP function. Finally, results obtained with experimental data showed the validity and applicability of the proposed method in a practical meeting room environment.
KW - Acoustics
KW - Broadband communication
KW - Direction-of-arrival estimation
KW - Estimation
KW - Location awareness
KW - Spatial resolution
KW - Vectors
KW - direction-of-arrival estimation
KW - group sparsity
KW - source localization
KW - steered response power
KW - Source localization
UR - http://www.scopus.com/inward/record.url?scp=85197641830&partnerID=8YFLogxK
U2 - 10.1109/TASLP.2024.3419417
DO - 10.1109/TASLP.2024.3419417
M3 - Article
AN - SCOPUS:85197641830
SN - 2329-9290
VL - 32
SP - 3517
EP - 3531
JO - IEEE/ACM Transactions on Audio Speech and Language Processing
JF - IEEE/ACM Transactions on Audio Speech and Language Processing
ER -