You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
In many situations, measuring the amount and type of reverberation in a room assumes that the room impulse response is available for the computation. When that impulse response is not available, a nonintrusive room acoustic (NIRA) method must be used. In this report, the authors use the C50 clarity index to characterize reverberation in the signal because it has been shown to be more highly correlated with the speech recognition performance then other measures of reverberation. Multiple features are extracted from a reverberant speech signal and they are then used to train a bidirectional long short-term memory model that maps from the feature space into the target C50 value. Prediction intervals, which provide an upper and lower bound of the estimate, can be derived from the standard deviation of the per frame estimations. Confidence measures are then obtained by normalizing these prediction intervals. These measures are highly correlated with the absolute C50 estimation errors. The performance of the prediction intervals and confidence measure are shown to be consistent in many different noisy reverberant environments. The procedure proposed in this paper for deriving C50 prediction intervals and confidence measures could as well be applied to other room acoustic parameter estimation, for example, T60 (reverberation decay time to 60 dB) or DRR (direct to reverberation ratio).
Author (s): Parada, Pablo Peso; Sharma, Dushyant; van Waterschoot, Toon; Naylor, Patrick A.
Affiliation:
Nuance Communications Inc., Marlow, UK; Dept. of Electrical Engineering (ESAT-STADIUS / ETC), KU Leuven, Leuven, Belgium; Dept. of Electrical and Electronic Engineering, Imperial College London, UK
(See document for exact affiliation information.)
Publication Date:
2017-01-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=18546
(706KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Parada, Pablo Peso; Sharma, Dushyant; van Waterschoot, Toon; Naylor, Patrick A.; 2017; Confidence Measures for Nonintrusive Estimation of Speech Clarity Index [PDF]; Nuance Communications Inc., Marlow, UK; Dept. of Electrical Engineering (ESAT-STADIUS / ETC), KU Leuven, Leuven, Belgium; Dept. of Electrical and Electronic Engineering, Imperial College London, UK; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=18546
Parada, Pablo Peso; Sharma, Dushyant; van Waterschoot, Toon; Naylor, Patrick A.; Confidence Measures for Nonintrusive Estimation of Speech Clarity Index [PDF]; Nuance Communications Inc., Marlow, UK; Dept. of Electrical Engineering (ESAT-STADIUS / ETC), KU Leuven, Leuven, Belgium; Dept. of Electrical and Electronic Engineering, Imperial College London, UK; Paper ; 2017 Available: https://aes2.org/publications/elibrary-page/?id=18546
@article{parada2017confidence,
author={parada pablo peso and sharma dushyant and van waterschoot toon and naylor patrick a.},
journal={journal of the audio engineering society},
title={confidence measures for nonintrusive estimation of speech clarity index},
year={2017},
volume={65},
issue={1/2},
pages={90-99},
month={january},}