ERIC Number: EJ1453674
Record Type: Journal
Publication Date: 2024-Dec
Pages: 25
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1560-4292
EISSN: EISSN-1560-4306
Available Date: N/A
Multilingual Age of Exposure 2.0
International Journal of Artificial Intelligence in Education, v34 n4 p1353-1377 2024
Age of Acquisition (AoA) scores approximate the age at which a language speaker fully understands a word's semantic meaning and represent a quantitative measure of the relative difficulty of words in a language. AoA word lists exist across various languages, with English having the most complete lists that capture the largest percentage of the vocabulary. In contrast, other languages have smaller lists making large-scale analyses difficult. Given the usefulness of AoA scores, methods have been developed to leverage the use of Machine Learning models to estimate AoA scores automatically through Age of Exposure (AoE) scores for the entire vocabulary of a language. These generated AoE scores use simulated learning trajectories to evaluate properties similar to AoA. In this work, we propose a method that leverages the greater size of existing English AoA lists to improve the performance of AoE prediction models for other languages. Our main contributions are threefold. First, we introduce a novel AoE regression architecture that uses a Recurrent Neural Network applied to the simulated word exposure trajectories. Second, we consider word embeddings projected into a unified multilingual space. Third, we apply transfer learning on the English AoE regressor to improve the performance of non-English AoE regressors. We show that AoA lists across languages share inherent similarities that enable Machine Learning models to transfer insights from one language to another, thus diminishing the effect of the smaller sample sizes for non-English languages.
Descriptors: Multilingualism, English (Second Language), Second Language Learning, Second Language Instruction, Age Differences, Semantics, Word Lists, Vocabulary Development, Artificial Intelligence, Scores, Simulation, Networks, Prediction, Models, Computational Linguistics, Natural Language Processing, Spanish, French, German
Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://bibliotheek.ehb.be:2123/
Related Records: ED660607
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: Institute of Education Sciences (ED); Office of Naval Research (ONR) (DOD)
Authoring Institution: N/A
IES Funded: Yes
Grant or Contract Numbers: R305A180261; R305A190050; N000142012623
Author Affiliations: N/A