You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
The spatial speech reproduction capabilities of a KEMAR mouth simulator, a loudspeaker, the piston on the sphere model, and a circular harmonic fitting are evaluated in the near-field. The speech directivity of 24 human subjects, both male and female, is measured using a semicircular microphone array with a radius of 36.5 cm in the horizontal plane. Impulse responses are captured for the two devices, and filters are generated for the two numerical models to emulate their directional effect on speech reproduction. The four repeatable speech sources are evaluated through comparison to the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. Results show that the repeatable sources perform relatively well under the metric of directivity, but irregularities in their directivity patterns introduce audible coloration for off-axis directions.
Author (s): Gonzalez, Raimundo; Mckenzie, Thomas; Politis, Archontis; Lokki, Tapio
Affiliation:
Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Audio & Speech Processing Group, Tampere University of Technology, Tampere, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland.
(See document for exact affiliation information.)
Publication Date:
2022-07-06
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=21828
(1190KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Gonzalez, Raimundo; Mckenzie, Thomas; Politis, Archontis; Lokki, Tapio; 2022; Near-Field Evaluation of Reproducible Speech Sources [PDF]; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Audio & Speech Processing Group, Tampere University of Technology, Tampere, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland.; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=21828
Gonzalez, Raimundo; Mckenzie, Thomas; Politis, Archontis; Lokki, Tapio; Near-Field Evaluation of Reproducible Speech Sources [PDF]; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland; Audio & Speech Processing Group, Tampere University of Technology, Tampere, Finland; Acoustics Lab, Department of Signal Processing & Acoustics, Aalto University, Espoo, Finland.; Paper ; 2022 Available: https://aes2.org/publications/elibrary-page/?id=21828
@article{gonzalez2022near-field,
author={gonzalez raimundo and mckenzie thomas and politis archontis and lokki tapio},
journal={journal of the audio engineering society},
title={near-field evaluation of reproducible speech sources},
year={2022},
volume={70},
issue={7/8},
pages={621-633},
month={july},}