TY - JOUR
T1 - Optimal Transport Based Impulse Response Interpolation in the Presence of Calibration Errors
AU - Sundstrom, David
AU - Elvander, Filip
AU - Jakobsson, Andreas
N1 - Publisher Copyright:
IEEE
PY - 2024
Y1 - 2024
N2 - Acoustic impulse responses (IRs) are widely used to model sound propagation between two points in space. Being a point-to-point description, IRs are generally estimated based on input-output pairs for source and sensor positions of interest. Alternatively, the IR at an arbitrary location in space may be constructed based on interpolation techniques, thus alleviating the need of densely sampling the space. The resulting IR interpolation problem is of general interest, e.g., for imaging of subsurface structures based on seismic waves, rendering of audio and radar IRs, as well as for numerous spatial audio applications. A commonly used model represents the acoustic reflections as image sources, often being determined using a sparse reconstruction framework employing spatial dictionaries. However, in the presence of calibration errors, such spatial dictionaries tend to inaccurately represent the actual propagation, limiting the use of these methods in practical applications. Instead of explicitly assuming an image source model, we here introduce a trade-off between minimizing the distance to an image source model and fitting the data by means of a multi-marginal optimal transport problem. The proposed method is evaluated on the early part of real acoustic IRs from the MeshRIR data set, illustrating its preferable performance as compared to state-of-the-art spatial dictionary-based IR interpolation approaches.
AB - Acoustic impulse responses (IRs) are widely used to model sound propagation between two points in space. Being a point-to-point description, IRs are generally estimated based on input-output pairs for source and sensor positions of interest. Alternatively, the IR at an arbitrary location in space may be constructed based on interpolation techniques, thus alleviating the need of densely sampling the space. The resulting IR interpolation problem is of general interest, e.g., for imaging of subsurface structures based on seismic waves, rendering of audio and radar IRs, as well as for numerous spatial audio applications. A commonly used model represents the acoustic reflections as image sources, often being determined using a sparse reconstruction framework employing spatial dictionaries. However, in the presence of calibration errors, such spatial dictionaries tend to inaccurately represent the actual propagation, limiting the use of these methods in practical applications. Instead of explicitly assuming an image source model, we here introduce a trade-off between minimizing the distance to an image source model and fitting the data by means of a multi-marginal optimal transport problem. The proposed method is evaluated on the early part of real acoustic IRs from the MeshRIR data set, illustrating its preferable performance as compared to state-of-the-art spatial dictionary-based IR interpolation approaches.
KW - Calibration
KW - Delays
KW - Dictionaries
KW - Geometry
KW - Interpolation
KW - Optimal mass transport
KW - Radar imaging
KW - Reflection
KW - Robust time-delay estimation
KW - impulse response interpolation
UR - http://www.scopus.com/inward/record.url?scp=85186988523&partnerID=8YFLogxK
U2 - 10.1109/TSP.2024.3372249
DO - 10.1109/TSP.2024.3372249
M3 - Article
AN - SCOPUS:85186988523
SN - 1053-587X
VL - 72
SP - 1548
EP - 1559
JO - IEEE Transactions on Signal Processing
JF - IEEE Transactions on Signal Processing
ER -