ERIC Number: EJ1396795
Record Type: Journal
Publication Date: 2023
Pages: 24
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1092-4388
EISSN: EISSN-1558-9102
Reproducible Speech Research with the Artificial Intelligence–Ready PERCEPT Corpora
Benway, Nina R.; Preston, Jonathan L.; Hitchcock, Elaine; Rose, Yvan; Salekin, Asif; Liang, Wendy; McAllister, Tara
Journal of Speech, Language, and Hearing Research, v66 n6 p1986-2009 2023
Background: Publicly available speech corpora facilitate reproducible research by providing open-access data for participants who have consented/assented to data sharing among different research teams. Such corpora can also support clinical education, including perceptual training and training in the use of speech analysis tools. Purpose: In this research note, we introduce the PERCEPT (Perceptual Error Rating for the Clinical Evaluation of Phonetic Targets) corpora, PERCEPT-R (Rhotics) and PERCEPT-GFTA (Goldman-Fristoe Test of Articulation), which together contain over 36 hr of speech audio (> 125,000 syllable, word, and phrase utterances) from children, adolescents, and young adults aged 6-24 years with speech sound disorder (primarily residual speech sound disorders impacting /ɹ/) and age-matched peers. We highlight PhonBank as the repository for the corpora and demonstrate use of the associated speech analysis software, Phon, to query PERCEPT-R. A worked example of research with PERCEPT-R, suitable for clinical education and research training, is included as an appendix. Support for end users and information/descriptive statistics for future releases of the PERCEPT corpora can be found in a dedicated Slack channel. Finally, we discuss the potential for PERCEPT corpora to support the training of artificial intelligence clinical speech technology appropriate for use with children with speech sound disorders, the development of which has historically been constrained by the limited representation of either children or individuals with speech impairments in publicly available training corpora. Conclusions: We demonstrate the use of PERCEPT corpora, PhonBank, and Phon for clinical training and research questions appropriate to child citation speech. Increased use of these tools has the potential to enhance reproducibility in the study of speech development and disorders.
Descriptors: Artificial Intelligence, Technology Uses in Education, Speech Language Pathology, Allied Health Personnel, Clinical Diagnosis, Phonetics, Articulation (Speech), Training, Audio Equipment
American Speech-Language-Hearing Association. 2200 Research Blvd #250, Rockville, MD 20850. Tel: 301-296-5700; Fax: 301-296-8580; e-mail: slhr@asha.org; Web site: http://jslhr.pubs.asha.org
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: National Institute on Deafness and Other Communication Disorders (NIDCD) (DHHS/NIH); National Science Foundation (NSF), Office of Advanced Cyberinfrastructure (OAC)
Authoring Institution: N/A
Grant or Contract Numbers: R01DC017476S2; 1341006; 1541396