ERIC Number: EJ1317425
Record Type: Journal
Publication Date: 2021-Oct
Pages: 24
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1092-4388
EISSN: N/A
Efficient Estimation of Children's Language Exposure in Two Bilingual Communities
Cychosz, Margaret; Villanueva, Anele; Weisleder, Adriana
Journal of Speech, Language, and Hearing Research, v64 n10 p3843-3866 Oct 2021
Purpose: The language that children hear early in life is associated with their speech-language outcomes. This line of research relies on naturalistic observations of children's language input, often captured with daylong audio recordings. However, the large quantity of data that daylong recordings generate requires novel analytical tools to feasibly parse thousands of hours of naturalistic speech. This study outlines a new approach to efficiently process and sample from daylong audio recordings made in two bilingual communities, Spanish-English in the United States and Quechua-Spanish in Bolivia, to derive estimates of children's language exposure. Method: We employed a general sampling with replacement technique to efficiently estimate two key elements of children's early language environments: (a) proportion of child-directed speech (CDS) and (b) dual language exposure. Proportions estimated from random sampling of 30-s segments were compared to those from annotations over the entire daylong recording (every other segment), as well as parental report of dual language exposure. Results: Results showed that approximately 49 min from each recording or just 7% of the overall recording was required to reach a stable proportion of CDS and bilingual exposure. In both speech communities, strong correlations were found between bilingual language estimates made using random sampling and all-day annotation techniques. A strong association was additionally found for CDS estimates in the United States, but this was weaker at the Bolivian site, where CDS was less frequent. Dual language estimates from the audio recordings did not correspond well to estimates derived from parental report collected months apart. Conclusions: Daylong recordings offer tremendous insight into children's daily language experiences, but they will not become widely used in developmental research until data processing and annotation time substantially decrease. We show that annotation based on random sampling is a promising approach to efficiently estimate ambient characteristics from daylong recordings that cannot currently be estimated via automated methods.
Descriptors: Bilingualism, Linguistic Input, Native Language, Second Language Learning, Speech Communication, English (Second Language), Spanish, American Indian Languages, Cross Cultural Studies, Parent Attitudes, Preschool Children, Audio Equipment, Parent Child Relationship, Comparative Analysis, Language Acquisition, Correlation, Foreign Countries, Prediction, Computational Linguistics
American Speech-Language-Hearing Association. 2200 Research Blvd #250, Rockville, MD 20850. Tel: 301-296-5700; Fax: 301-296-8580; e-mail: slhr@asha.org; Web site: http://jslhr.pubs.asha.org
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: National Institute on Deafness and Other Communication Disorders (NIDCD) (DHHS/NIH)
Authoring Institution: N/A
Identifiers - Location: Bolivia; United States
Grant or Contract Numbers: T32DC000046; R21DC018357