The correspondence of various spectral difference error metrics to human discrimination data was investigated. Time-varying harmonic amplitude data were obtained from the spectral analysis of eight musical instrument sounds (bassoon, clarinet, flute, horn, oboe, saxophone, trumpet, and violin). Sounds were resynthesized with various levels of random spectral alteration, ranging from 1 to 50%. Listeners were asked to discriminate the randomly altered sounds from reference sounds resynthesized from the original data. Then several formulas designed to predict discrimination performance were evaluated by calculating the correspondence between the discrimination data and the associated spectral difference measurements. Averaged over the eight instruments, the best correspondence was achieved using a spectral error metric based on linear harmonic amplitude differences normalized by rms amplitude and raised to a power a. While an optimum correspondence of 91% was achieved for a = 0.64, good correspondence occurred over a wide range of a. For linear harmonic amplitudes without rms normalization, good correspondence occurred within a narrower range of a, with a maximum correspondence of 88%. Correspondence was approximately 80% for decibel-amplitude differences, over an even narrower range. Other error metrics, such as those based on critical-band grouping of components, worked well but did not improve on the method based on harmonic amplitudes, and in some cases yielded worse results. Spectral differences computed from a small number of representative frames emphasizing attack and decay transients yielded results slightly better than those computed from all frames.
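To make the best-performing metric concrete, here is a minimal Python sketch of one plausible reading of "linear harmonic amplitude differences normalized by rms amplitude and raised to a power a". The abstract does not give the formula, so this form is an assumption and the paper's exact definition may differ. Also assumed: normalizing each frame by the rms of the reference frame, averaging over frames, and the hypothetical names `frame_rms` and `spectral_error`. Only the default exponent 0.64 comes from the abstract.

```python
import math


def frame_rms(amps):
    """Root-mean-square of one frame's linear harmonic amplitudes."""
    return math.sqrt(sum(x * x for x in amps) / len(amps))


def spectral_error(reference, altered, a=0.64):
    """Average, over frames, of the summed rms-normalized harmonic
    amplitude differences raised to the power `a`.

    `reference` and `altered` are lists of frames; each frame is a list
    of linear harmonic amplitudes. This is an illustrative assumption,
    not the formula from the paper.
    """
    if len(reference) != len(altered):
        raise ValueError("both sounds need the same number of frames")
    total = 0.0
    counted = 0
    for ref_frame, alt_frame in zip(reference, altered):
        if len(ref_frame) != len(alt_frame):
            raise ValueError("frames need the same number of harmonics")
        rms = frame_rms(ref_frame)
        if rms == 0.0:
            continue  # silent reference frame: nothing to normalize by
        total += sum((abs(r - x) / rms) ** a
                     for r, x in zip(ref_frame, alt_frame))
        counted += 1
    return total / counted if counted else 0.0


if __name__ == "__main__":
    ref = [[1.0, 0.5, 0.25], [0.8, 0.4, 0.2]]
    alt = [[1.1, 0.45, 0.25], [0.8, 0.5, 0.1]]
    print(spectral_error(ref, alt))
```

Because each frame is divided by its own rms amplitude, scaling both sounds by the same gain leaves the error unchanged. That may be part of why normalization helps, though the abstract does not say so.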
Author(s): Horner, Andrew B.; Beauchamp, James W.; So, Richard H. Y.
Affiliation:
Hong Kong University of Science and Technology, Kowloon, Hong Kong; University of Illinois at Urbana-Champaign, Urbana, IL, USA
(See document for exact affiliation information.)
Publication Date:
2006-03-06
Permalink: https://aes2.org/publications/elibrary-page/?id=13671
Horner, Andrew B.; Beauchamp, James W.; So, Richard H. Y.; 2006; A Search for Best Error Metrics to Predict Discrimination of Original and Spectrally Altered Musical Instrument Sounds [PDF]; Hong Kong University of Science and Technology, Kowloon, Hong Kong; University of Illinois at Urbana-Champaign, Urbana, IL, USA; Paper; Available from: https://aes2.org/publications/elibrary-page/?id=13671
Horner, Andrew B.; Beauchamp, James W.; So, Richard H. Y.; A Search for Best Error Metrics to Predict Discrimination of Original and Spectrally Altered Musical Instrument Sounds [PDF]; Hong Kong University of Science and Technology, Kowloon, Hong Kong; University of Illinois at Urbana-Champaign, Urbana, IL, USA; Paper; 2006; Available: https://aes2.org/publications/elibrary-page/?id=13671
@article{horner2006a,
author={Horner, Andrew B. and Beauchamp, James W. and So, Richard H. Y.},
journal={Journal of the Audio Engineering Society},
title={A Search for Best Error Metrics to Predict Discrimination of Original and Spectrally Altered Musical Instrument Sounds},
year={2006},
volume={54},
number={3},
pages={140--156},
month={march},
}
TY  - paper
TI  - A Search for Best Error Metrics to Predict Discrimination of Original and Spectrally Altered Musical Instrument Sounds
SP  - 140
EP  - 156
AU  - Horner, Andrew B.
AU  - Beauchamp, James W.
AU  - So, Richard H. Y.
PY  - 2006
JO  - Journal of the Audio Engineering Society
VO  - 54
IS  - 3
Y1  - March 2006