Caregiver Talk and Medical Risk as Predictors of Language Outcomes in Full Term and Preterm Toddlers
Abstract
This study examined associations between caregiver talk and language skills in full term (FT) and preterm (PT) children (n = 97). All-day recordings of caregiver–child interactions revealed striking similarities in amount of caregiver talk heard by FT and PT children. Children who heard more caregiver talk at 16 months demonstrated better knowledge- and processing-based language skills at 18 months. The unique contributions of caregiver talk were tempered by medical risk in PT children, especially for processing speed. However, there was no evidence that birth status or medical risk moderated the effects of caregiver talk. These findings highlight the role of caregiver talk in shaping language outcomes in FT and PT children and offer insights into links between neurodevelopmental risk and caregiver–child engagement.
Identifying the causes of variation in children's early language development is a critical public health concern because early oral language skills form the foundation for later cognitive and language outcomes (Marchman & Fernald, 2008). One powerful correlate of language outcomes is the amount of talk directed to children during interactions with caregivers. Amount of caregiver talk is an index of a child's language nutrition (Zauche, Thul, Mahoney, & Stapel-Wax, 2016), shown to predict individual variation in vocabulary knowledge and early processing skills that support vocabulary learning in children born full term (FT; e.g., Hoff, 2003; Hurtado, Marchman & Fernald, 2008; Weisleder & Fernald, 2013). Children born preterm (PT) are at particular risk for poor developmental outcomes, yet also demonstrate considerable variability in their language skills (Marchman, Adams, Loi, Fernald, & Feldman, 2016; van Noort-van der Spek, Franken, & Weisglas-Kuperus, 2012). Such variation has been attributed in part to the neurobiological risks associated with PT birth (Allen, 2008), as well as to variation in features of children's learning environments (e.g., Landry, Smith, Miller-Loncar, & Swank, 1997; Smith et al., 1996). Using measures of number of adult words heard during naturalistic day-long home recordings in FT and PT children, the goals of this study were to determine if PT children are exposed to different amounts of caregiver talk compared to their FT peers, to examine relations between the quantity of caregiver talk and both knowledge- and processing-based language outcomes in FT children, and to explore whether variation in caregiver talk predicted PT language outcomes in a similar way as in the FT group. Finally, we evaluated whether medical risk is a moderator of relations between quantity of caregiver talk and language skills of PT children.
Caregiver Talk to FT and PT Children
In a seminal observational study, Hart and Risley (1995) explored the quantity of caregiver talk across a diverse sample of families. They found that children from families low in socioeconomic status (SES) heard, on average, thousands fewer words each day than their higher-SES peers, and this gap contributed to meaningful differences in long-term language outcomes. In studies using home observation and laboratory-based paradigms, children who heard more talk from their caregivers showed greater gains in both vocabulary (Huttenlocher, Haight, Bryk, Seltzer, & Lyons, 1991; Rowe, 2012) and language processing efficiency, compared to children who heard less talk from caregivers (Hurtado, Marchman, & Fernald, 2008; Weisleder & Fernald, 2013). One study with children ages 2–48 months assessed over a 6-month time period found that each 1,000 additional words spoken to a child was associated with a 0.44 point increase in average scores on the Preschool Language Scale 4th ed., a measure of receptive and expressive vocabulary (Zimmerman et al., 2009). On the basis of these and other findings, intervention efforts targeting quantity of caregiver talk as a means to improve language skills in children growing up in poverty are now increasing (e.g., Suskind et al., 2016).
Additional research is required to determine if quantity of caregiver talk is an important factor in language development for other populations at risk for language delay, such as children born PT. Children born PT are generally slower than their full term peers to reach early language milestones (Landry, Smith, & Swank, 2002), and language delays can persist into the school years, even in the absence of general cognitive delays (Lee, Yeatman, Luna, & Feldman, 2011). Studies suggest that there is great variability in parents’ responsiveness during interactions with PT children (Landry et al., 1997; Smith et al., 1996). Parents of children born PT may need to modulate their interactions to the appropriate level of stimulation to avoid over- or underwhelming the PT child's information processing or arousal regulation capacities (e.g., Eckerman, Hsu, Molitor, Leung, & Goldstein, 1999). This narrower “activation band” for optimal parent–child interaction (Field, 1982) may increase the likelihood of low-quality interactions because PT children may show greater negative social responses to parental bids, and parents may engage in more controlling and overstimulating behaviors compared to dyads with FT children (Forcada-Guex, Pierrehumbert, Borghini, Moessinger, & Muller-Nix, 2006). PT children and their caregivers may be less successful as social partners (De Schuymer, De Groote, Beyers, Striano, & Roeyers, 2011), possibly resulting in a reduction in the amount of caregiver talk heard by PT compared to FT children. Other studies find no differences between FT and PT groups in measures of maternal sensitivity (for a review, see Bilgin & Wolke, 2015) or the amount of caregiver talk heard during play (Salerni, Suttora, & D'Odorico, 2007). Given these inconsistent findings, additional research is needed to explore group differences in the language environment of FT and PT children.
In this study, we collected day-long naturalistic interactions using the LENA™ audio-recording and analysis system (Xu, Yapanel, & Gray, 2009) which provides estimates of the adult word count (AWC) heard by children over the course of a day. Most previous research on caregiver talk with PT children has relied on short recordings that sample caregivers’ speech during activities associated with peak language use, such as playing with toys or book reading. While such samples provide useful snapshots of many features of caregiver–child interactions, these data do not reflect the natural ebb and flow of caregiver–child engagement throughout a full day. Moreover, these samples often involve goal-directed tasks that may necessitate higher levels of directive or overstimulating behavior which could alter the natural dynamics of caregiver–child interactions (Field, 1982). LENA-recorded samples of caregiver talk are collected over a much longer time period (10–16 h), typically in the home environment, and thus may have fewer observer artifacts (Zauche et al., 2016) than samples collected in shorter, laboratory-based samples. Using full-day naturalistic recordings may be more sensitive than laboratory observations in capturing meaningful group differences in caregiver–child language interactions.
Previous studies using LENA™ with PT children suggest that higher amounts of caregiver talk support language function in this at-risk population. For example, PT children who heard more words from caregivers during Neonatal Intensive Care Unit (NICU) visits had higher rates of child vocalizations (CVC; Caskey, Stephens, Tucker, & Vohr, 2011) and larger vocabularies when the children were 18 months (Caskey, Stephens, Tucker, & Vohr, 2014). However, estimates of amount of caregiver talk in the NICU may not necessarily reflect the amount of talk that children experience during daily interactions with caregivers outside of the NICU environment. Here, we directly compare caregiver talk captured in the naturalistic language environments of FT and PT children when they were 16 months old.
Predictors of Language Delay in PT Children
Preterm birth is associated with increased risk for delay in knowledge-based domains of language function, including vocabulary and syntactic structures (Adams-Chapman, Bann, Carter, & Stoll, 2015; Foster-Cohen, Friesen, Champion, & Woodward, 2010; Sansavini et al., 2011). Children born PT are also at high risk for delays in tasks that involve real-time processing of linguistic stimuli or verbal working memory (Jansson-Verkasalo et al., 2004; Lee et al., 2011). Some studies suggest that PT children show greater difficulty in complex language processing tasks than in knowledge-based assessments (Bosch, 2011). Tasks requiring efficient language processing may reveal differences that are not captured as effectively in untimed tasks (Lee et al., 2011; Venker & Kover, 2015). In young children, individual differences in language processing speed predict later language outcomes in FT (Fernald, Perfors & Marchman, 2006; Fernald & Marchman, 2012) and PT populations (Marchman et al., 2016). In this study, we use both knowledge- and processing-based measures of language proficiency.
Higher risk of language delay in PT children has been attributed in part to perinatal factors, such as gestational age, birth weight (BW), and medical complications. Foster-Cohen and colleagues found that younger gestational age was associated with smaller parent-reported vocabulary (Foster-Cohen, Edgin, Champion, & Woodward, 2007). The Nursery Neurobiologic Risk Score (Brazy, Eckerman, Oehler, Goldstein, & O'Rand, 1991), capturing conditions such as intraventricular hemorrhage, periventricular leukomalacia, and hypoglycemia, is a robust predictor of cognitive and language skills at 24 months (Wickremasinghe et al., 2012). Similarly, neonatal illnesses associated with PT birth, such as bronchopulmonary dysplasia and necrotizing enterocolitis, increase risk for neurodevelopmental impairments related to language delay (Behrman & Butler, 2007; Landry, Chapieski, Fletcher, & Denson, 1988; Rees, Pierro, & Eaton, 2007; Siegel et al., 2001). Other studies suggest that variability in language skills may be attributable to a combination of neurobiological, perinatal, and environmental factors that together shape language development (Cusson, 2003; Foster-Cohen et al., 2010), such as length of hospital stay, BW, Apgar scores, infant irritability, and features of parenting quality.
Here, we examined associations between variability in the amount of talk heard from caregivers and young children's knowledge- and processing-based language outcomes, with particular emphasis on how medical factors might contribute to and interact with this relation in children born PT. One possibility is that there may be a weaker relation between caregiver talk and child outcomes in PT compared to FT infants (Field, 1982). If increased sensitivity to stimulation in PT children, particularly those of high medical risk, leads to more variability in responsiveness to caregiver engagement, then increases in quantity of caregiver talk might not be linked to the same level of improvements in PT as FT children. A second possibility is that optimal parent–child interactions might play an especially large role in supporting PT language development, following a model of differential susceptibility (Gueron-Sela, Atzaba-Poria, Meiri, & Marks, 2015; Madigan, Wade, Plamondon, Browne, & Jenkins, 2015). That is, stronger associations between caregiver talk and language outcomes would be predicted for higher-risk PT compared to lower-risk PT or FT children (Landry et al., 1997; Smith et al., 1996). A third possibility is that caregiver talk might be an equally strong predictor of outcomes in FT and PT children, with parallel effects across groups regardless of medical risk status.
Research Questions
- Group differences will be observed between FT and PT children in the number of adult words (AWC) heard during naturalistic recordings over the course of a typical day.
- For children born FT, variation in AWC will account for variance in children's language knowledge and processing outcomes, after controlling for SES, gender, and the child's earlier language skill.
- For children born PT, variation in AWC will be associated with outcomes in ways that are different to those seen in the FT group.
- Medical risk will account for additional variance in outcomes in PT children, beyond caregiver talk, and will moderate the associations between AWC and child outcomes.
Method
Participants
Participants were 97 children (48 males, 49 females) born either full term (FT, n = 56, 27 males, 29 females), with a gestational age ≥ 37 weeks, or preterm (PT, n = 41, 21 males, 20 females), with a gestational age ≤ 32 weeks and BW < 1,800 g. Most families with PT children were recruited between the summer of 2011 and fall of 2014 from the NICU, intermediate care nursery, and the High-Risk Infant Follow-up Clinic of the local children's hospital. Others were recruited during that time period through a research registry, local parent groups, an early intervention service provider, or birth records. Children born FT were recruited between the summer of 2012 and winter of 2015 through birth records, staff or sibling referral, and parent registration on the laboratory's web site. Exclusionary criteria for all children were known developmental disorders and medical conditions that would prevent child participants from engaging in our research assessments, such as a history of meningitis, a seizure disorder requiring medication, a ventriculoperitoneal shunt, a genetic disorder, and visual or hearing impairments. All children were primarily English speaking and heard a non-English language ≤ 25% of the time, as reported by parents. The protocol was approved by a university IRB and parents gave signed consent at each visit.
All participants were drawn from a larger longitudinal study in which children born FT were individually matched (on the basis of SES, age, and gender) to a child born PT. This one-to-one matching procedure allowed for more reliable testing of the effect of birth status on children's outcomes. Children in both the FT and PT groups completed visits at approximately 16 months (FT chronological age: M = 16.3, range = 15.1–17.4 months; PT-corrected age: M = 16.1, range = 15.2–17.6 months) and 18 months (FT chronological age: M = 18.8, range = 17.7–20.4 months; PT-corrected age: M = 18.6, range = 17.7–20.3 months). The sample reported here includes unequal numbers of FT and PT children because some children were missing data or otherwise did not meet inclusion criteria for the current analyses. Table 1 shows the sociodemographic background of both groups of children. Participants were from mid- to high-SES backgrounds, with no difference across groups, t(95) = 0.02, p = .99, d = 0.01, although there was substantial variability in SES backgrounds in both groups.
FT | PT | |||
---|---|---|---|---|
n (%) or M (SD) | Range | n (%) or M (SD) | Range | |
% Male | 27 (48.2) | – | 21 (51.2) | – |
Maternal education (years) | 16.6 (1.5) | 12–18 | 16.9 (1.6) | 12–18 |
SES (HI)a | 58.7 (8.8) | 22–66 | 58.7 (8.5) | 30–66 |
Birth weight (g) | 3,566 (444) | 2,438–4,649 | 1,251 (281) | 670–1,750 |
Gestational age (weeks) | 40.2 (1.1) | 37.6–43.4 | 29.9 (2.0) | 26.9–32.7 |
Twins | 0 (0) | – | 14 (34.1%) | – |
First born | 34 (60.7) | – | 25 (61.0) | – |
Medical riskb | – | – | 3.5 (1.8) | 1–7 |
Note
- aHollingshead Four-Factor Index of Socioeconomic Status (HI, Hollingshead, 1975, possible range=8–66). bComposite of 13 medical risk factors (possible range = 0–13).
Birth History and Medical Risk
Information about birth history is also presented in Table 1. Similar to Marchman et al. (2016), we catalogued the factors of extreme preterm birth (gestational age < 28 weeks), extended length of hospital stay (> 51 days), and 11 medical conditions (small for gestational age, respiratory distress syndrome, chronic lung disease/bronchopulmonary dysplasia, patent ductus arteriosus, necrotizing enterocolitis, intraventricular hemorrhage, periventricular leukomalacia, retinopathy of prematurity, seizures, hearing loss, and hyperbilirubinemia/jaundice). For each child, the presence or absence of each factor was coded by research assistants in consultation with the last author using information in the daily progress notes, NICU discharge summaries, and summary letters from outpatient follow-up visits, and then summed to yield a medical risk score (max = 13, Mdn = 3.0). In this relatively healthy group of PT children, none had a hearing or visual impairment that precluded participation in the study. Based on either an MRI or a head ultrasound conducted during hospitalization, two children were identified as having white matter injury associated with periventricular leukomalacia. Seven children were identified as having an intraventricular hemorrhage, six of which were mild grades. For some analyses, the PT children were categorized as lower (n = 22) versus higher (n = 19) medical risk based on median medical risk score.
Naturalistic Day-Long Recordings
At 16 months, audio recordings were made using the LENA™ recording and speech analysis system. A digital language recorder worn by the child in the chest pocket of specialized clothing contains a directional microphone that enables unobtrusive recording of all adult-like speech that is produced “near and clear” to the target child during daily activities. Parents were asked to record on a typical weekday, following their regular routine. The recorder could be turned off whenever the caregiver chose to do so (e.g., when visitors requested not to be recorded), and all families were given the option to delete the entire recording. Families generally recorded when they were at home; however, they frequently also recorded in other locations, including outside or in other peoples’ homes, which reflected their normal routines. In the case of twins, each target child wore separate recorders on different days and, thus, each recording captured the speech occurring during the interactions of that particular child with others. Raw recording lengths ranged from 8.4 to 16.0 h (M = 14.7 h), with variability due in part to the fact that some families turned off the recorder at different points in the day, whereas other families did not. Trained listeners “cleaned” all of the recordings, deleting segments when the child was sleeping or not wearing the recorder, or when the recording quality was poor. The lengths of the cleaned recordings ranged from 5.4 to 13.4 h (M = 9.5 h) over a single day, with no group differences (PT M = 9.6 h, FT M = 9.5 h), t(95) = 0.10, p = .92, d = 0.02. An additional 38 families were eligible for participation but not included in the final sample because they chose not to take the recorder (FT: n = 3; PT: n = 9), they returned it with less than 3.5 h of recording (FT: n = 2; PT: n = 2), or they did not have complete data at both testing points within the target age window (FT: n = 6; PT: n = 16). These additional families came from similar SES backgrounds as the families in the final sample (M = 56.7, SD = 9.9), t(133) = 1.4, p = .18. While the PT children who did not meet eligibility criteria were slightly less healthy than the PT children in the final sample, the group difference on the index of medical risk did not achieve statistical significance (M = 4.3, SD = 2.1), t(65) = 1.9, p = .07.
The home recordings were analyzed using the LENA™ analysis software which automatically derives the following measures for each 5 min segment: (a) adult word count (AWC): the number of words spoken “near and clear” to the child, excluding overlapping adult and child speech, TV, and music; and (b) CVC: the number of speech-like sounds produced by the target child, including words, babbling, and prespeech communicative sounds or “protophones” such as squeals, growls, or raspberries, but excluding crying, whining, and vegetative sounds (e.g., breathing, burping). This CVC measure captures the frequency with which children are vocalizing and thereby may influence the frequency of caregiver vocalizations. This measure is not intended to capture the complexity or developmental sophistication of children's vocalizations. The accuracy of the AWC estimates for English language recordings has been established in previous studies as highly reliable (Oller et al., 2010; Xu et al., 2009). Accuracy for CVC estimates is reported to be somewhat weaker (75%; Oller et al., 2010; Xu et al., 2009), but are nevertheless within an acceptable range. To control for differences in recording length, AWC and CVC counts for each 5-min segment were summed to create a total count over each child's cleaned recording, and then divided by the number of hours in the cleaned recording length. This procedure produced the final measures of AWC/h and CVC/h.
Parent Report and Laboratory Measures of Child Outcomes
Vocabulary Size
Expressive vocabulary was assessed at 16 and 18 months with the MacArthur-Bates Communicative Development Inventory (CDI): Words and Gestures (CDI: W&G), a reliable and valid parent report instrument appropriate for children 8–18 months (Fenson et al., 2007). Parents marked words that their child “understands and says” and a total expressive vocabulary score was derived (396 max).
Receptive and Expressive Language
When the children were 18 months, trained examiners administered the BSID-III (Bayley, 2006), a standardized assessment for children aged 16 days to 42 months. The composite language scale is commonly applied in clinical protocols and offers a comprehensive evaluation of a broad range of children's language skills. Scaled scores were summed and converted into a standard score, based on the child's age at test, corrected for degree of prematurity in the PT group.
Real-Time Language Comprehension
Children's efficiency in comprehending words in real time was assessed at 16 and 18 months using the looking-while-listening (LWL) procedure (Fernald, Zangl, Portillo, & Marchman, 2008). At each age, children were tested in two visits, typically 1 week apart. Pairs of pictures of familiar objects appeared on a screen and a prerecorded voice named one of the pictures. The child sat on their caregiver's lap and the caregiver wore opaque sunglasses to block their view of the images. Looking patterns were video recorded and later coded offline. Each session lasted approximately 5 min.
Visual stimuli were color pictures of familiar objects, matched for animacy and visual salience. Target images familiar to children in this age range were presented in yoked pairs: baby-doggy, birdie-kitty, ball-shoe, and book-car. Target images were presented four times each as target and distracter, interspersed between four filler trials, yielding 32 experimental trials at each session for a total of 64 experimental trials. Trials were presented in two pseudorandom orders, such that target order and picture position were counterbalanced across participants and across sessions. Pictures were displayed on the screen for 2 s prior to speech onset and remained onscreen for 1 s after sound offset. In the auditory stimuli, target nouns were presented in sentence-final position followed by an attention getter (e.g., Where's the doggy? Do you like it?). Mean length of target noun was 639 ms (range = 565–769).
To ensure that children were tested only on words with which they were familiar, parents were asked to indicate whether their child understood each of the target words at each visit. Trials on which the parent reported that the child did not understand the target word were excluded on a child-by-child basis. At 16 months, FT children were reported to understand significantly more target words than PT children (FT M = 7.4, PT M = 6.9 words), t(95) = 2.0, p < .05, d = 0.41. However, all children were reported to know at least four of the target words, and just over half (55 of 97, 56.7%) were reported to know all eight words. At 18 months, all children were reported to know at least five words and about 85% (82 of 97, 84.5%) knew all eight target words, with no group difference (FT M = 7.8; PT M = 7.7), t(95) = 0.97, p = .34, d = 0.20.
All LWL sessions were prescreened by trained research assistants blind to target side. Trials on which the child was judged to be inattentive or there was parent interference were not coded. Children were excluded from analyses if they did not have at least 16 coded trials, or 25% of all experimental trials. Children's gaze was identified for each 33-ms interval as either fixed on one of the images (left or right), or shifting between pictures or away (off). Based on fixations at target noun onset, trials were later designated as target initial (T-initial) or distracter initial (D-initial).
Language processing efficiency is reflected in two measures: First, accuracy was the mean proportion looking to the target image from 300–1,800 ms after noun onset, divided by the total looking time to either image. At both ages, mean accuracy scores were based on significantly more coded trials for children in the FT than in the PT group (16 months: FT M = 46.3, PT M = 40.2 trials, t(95) = 2.5, p < .02, d = 0.52; 18 months: FT M = 51.1, PT M = 44.1; t(95) = 3.8, p < .001, d = 0.76). Second, reaction time (RT) was the mean latency (ms) to initiate a shift from distracter to target image only on D-initial trials. Shifts prior to 300 ms and later than 1,800 ms were excluded because they were less likely to be in response to the target word. Mean RT scores were based on a minimum of two valid shifts with the FT children contributing more valid shifts, on average, than children in the PT group at the younger, but not at the older, age (16 months: FT M = 17.7, PT M = 13.1 trials, t(95) = 2.9, p < .004, d = 0.60; 18 months: FT M = 20.2, PT M = 18.3, t(95) = 1.4, p = .14, d = 0.31). One child in the PT group had only one valid shift at 16 months and, therefore, analyses of RT at this age were based on 40 children in the PT group.
To evaluate coding reliability, 25% of the sessions were randomly selected and fully recoded. At 16 months, intercoder agreement was 98% for trial-level accuracy within .05, and the proportion of trials on which first-shift latency agreed within one 33-ms frame was 99%. At 18 months, accuracy reliability was 96% and latency reliability was 98%.
Results
Overview
We first present descriptive statistics of AWC from the LENA™ recordings at 16 months, asking whether there are group differences between FT and PT children. We next explore FT versus PT group differences in child outcomes. Using a series of multiple regression models, we then examine the shared and unique contributions of AWC to FT and PT children's language knowledge on the BSID-III and lexical processing efficiency in the LWL task. We then explore whether the relation between caregiver talk and children's outcomes is parallel in the two groups, that is, whether birth status moderates these relations. Finally, we explore the unique contribution of medical risk in the PT group, asking whether medical risk moderates the relations of caregiver talk to child outcomes in children born PT. Prior to all analyses, it was determined that all predictor variables had no outliers. We confirmed that the predictors met the assumption for normality based on visual inspection of the residuals from the regression models (SW = .946–.99; p = .05–.92).
Caregiver Talk During Naturalistic Recordings
As shown in Table 2, AWC/h was quite similar in the FT and PT groups. At the same time, there was substantial variation in both groups, with some children hearing as few as 500 words/h, on average, and other children hearing nearly seven times more, on average, close to 3,500 words/h. AWC/h did not differ as a function of birth order or gender in either group (all p > .17). To determine if AWC/h was influenced by the inclusion of twins, we constructed two subsamples of PT children which included all singletons plus one randomly selected child from each twin pair. The AWC/h values were nearly identical in both subsamples (sample 1: M = 1,740.2; sample 2: M = 1,735.0), suggesting that including both members of each twin pair did not artificially inflate the AWCs for PT children.
FT | PT | t(95) | p | Cohen's d | |
---|---|---|---|---|---|
M (SD) | M (SD) | ||||
Naturalistic recordings (LENA™)a | |||||
AWC/h | 1,642.8 (592.0) | 1,781.7 (594.2) | 1.1 | .26 | .23 |
CVC/h | 206.9 (83.6) | 190.1 (80.6) | 0.9 | .32 | .21 |
Vocabulary (CDI)b | |||||
Words produced: 16 months | 39.5 (43.7) | 22.8 (28.1) | 2.2 | .03 * | .45 |
Words produced: 18 months | 88.8 (86.7) | 65.0 (67.4) | 1.5 | .15 | .31 |
Language (BSID-III)c | |||||
Language scale: 18 months | 105.7 (17.7) | 100.0 (17.7) | 1.6 | .13 | .32 |
Processing efficiency (LWL)d | |||||
Accuracy: 16 months | 0.63 (0.09) | 0.59 (0.09) | 2.0 | .05 * | .44 |
Accuracy: 18 months | 0.66 (0.09) | 0.66 (0.09) | 0.1 | .91 | .01 |
RT: 16 months | 814 (144) | 873 (209) | 1.7 | .10 | .33 |
RT: 18 months | 722 (143) | 775 (164) | 1.7 | .09 | .34 |
Note
- All two-tailed independent sample t tests use df = 95, except RT at 16 months (df = 94). aAdult Word Count (AWC) and Child Vocalizations (CVC) per hour based on naturalistic recordings using LENA™. bTotal number of words reported as “understands and says” on the MacArthur-Bates CDI: Words & Gestures (Fenson et al., 2007). cStandard scores (corrected for prematurity) on the Language Scale of the Bayley Scales of Infant and Toddler Development 3rd ed. (BSID-III; Bayley, 2006) at 18 months. dMean proportion looking to target (accuracy) and mean latency to shift from distracter to target picture (RT) in the looking-while-listening (LWL) task. *p < .05. Significant effects also indicated in bold face.
Outcomes in Children Born FT and PT
Table 2 also reports on several child outcomes. At 16 months, there were no group differences in CVC/h, a measure of all vocalizations, including nonvegetative sounds, such as growls and squeals, as well as babbling and words, produced by the target child during the naturalistic recording. This suggests that children in the FT and PT groups were similar in their overall frequency of vocalizations which could serve as bids for adult communicative responses. Given the lower reliability of the CVC measure reported in earlier studies (e.g., Oller et al., 2010), this finding should be interpreted with caution. On a parent reported measure of children's production of recognizable vocabulary words, children born FT were reported to produce significantly more different words at 16 months than their PT counterparts. This group difference was no longer reliable at 18 months. On the BSID-III, there were no reliable group differences, yet substantial variation in both groups. On the LWL task, both FT and PT children looked to the named picture at above chance levels at 16, FT: t(55) = 10.9, p < .001, d = 1.5; PT: t(40) = 6.4, p < .001, d = 0.10, and 18 months, FT: t(55) = 12.6, p < .001, d = 1.7, PT: t(40) = 11.8, p < .001, d = 1.8. As a group, FT children were significantly more accurate than PT children at 16 months, a difference that was no longer reliable at 18 months. Finally, processing speed (RT) was comparable at both 16 and 18 months. Accuracy and RT on the LWL task were initially computed following standard practice (Fernald et al., 2008) of including only those trials on which the parents indicated that the child understood the target word. Because parents may have underestimated children's familiarity with the target words, we also ran the analyses including all target words used in the study. The patterns of findings were unchanged.
Predicting Child Outcomes from Caregiver Talk in Children Born FT
We now ask whether variation in the amount of caregiver talk (AWC/h) at 16 months accounted for significant variance in FT children's outcomes at 18 months using linear regression models. In each analysis, control factors were SES and child gender, factors that are associated with variation in the amount of caregiver talk. We also controlled for the number of the child's own vocalizations on the recording (CVC/h) because frequency of child bids for caregiver attention can influence the frequency of adult communicative utterances (e.g., as measured by AWC/h). We also controlled for the child's prior language performance at 16 months because earlier scores are likely to be associated with later outcomes.
In Table 3, Models 1–3 examined predictors of FT children's scores on the BSID-III. Model 1 shows that both SES and child gender were significant predictors, indicating that girls and children from higher-SES groups tended to score more highly. Model 2 shows that CVC and an earlier measure of child language, the CDI, contributed an R2 change of approximately 19% above gender and SES, F(2, 51) = 9.71, p < .001. Finally, Model 3 reveals that AWC/h contributed an R2 change of nearly 8% additional variance, with all factors except SES remaining significant predictors, F(1, 50) = 9.29, p = .004. The coefficient for AWC in Model 3 can be interpreted as every 100-word increase in AWC/h is associated with a 0.9 point increase in BSID-III total language score, controlling for SES, gender, and earlier child language skills. The overall model accounted for nearly 60% of the variance in BSID-III scores.
BSID-III languagea | Model 1 | Model 2 | Model 3 |
---|---|---|---|
HI: SESb | 0.52 (0.23) * | 0.36 (0.20) | 0.28 (0.19) |
Child gender | 16.70 (4.02) ** | 11.06 (3.72) ** | 10.02 (3.46) ** |
CVC/h at 16 monthsc | – | 0.05 (0.02) * | 0.05 (0.02) * |
Vocabulary at 16 monthsd | – | 0.13 (0.05) ** | 0.12 (0.04) ** |
AWC/h at 16 monthse | – | – | 0.01 (0.01) ** |
R2 change | – | 18.8% ** | 7.7% ** |
Total R2 | 31.8% ** | 50.6% *** | 58.3% *** |
LWL accuracyf | Model 4 | Model 5 | Model 6 |
HI: SESb | 0.01 (0.01) | 0.01 (0.01) | 0.01 (0.01) |
Child gender | 0.02 (0.03) | −0.01 (0.03) | −0.01 (0.02) |
CVC/h at 16 monthsc | – | 0.01 (0.01) | 0.01 (0.01) |
Accuracy at 16 monthsf | – | 0.38 (0.14) ** | 0.30 (0.13) * |
AWC/h at 16 monthse | – | – | 0.01 (0.01) ** |
R2 change | – | 14.5% * | 15.5% ** |
Total R2 | 3.8% | 18.4% ** | 33.9% ** |
LWL RTg | Model 7 | Model 8 | Model 9 |
HI: SESb | 0.24 (2.06) | 1.15 (1.79) | 1.67 (1.70) |
Child gender | −112.0 (36.06)** | −53.53 (33.77) | −50.39 (31.85) |
CVC/h at 16 monthsc | – | −0.01 (.19) | 0.05 (0.18) |
RT at 16 monthsg | – | 0.56 (0.12) *** | 0.50 (0.11) *** |
AWC/h at 16 monthse | – | – | −0.07 (0.03)** |
R2 change | – | 26.9% *** | 7.5% ** |
Total R2 | 15.5% * | 42.4% *** | 49.8% *** |
Note
- Unstandardized B (SE). aStandard score on the Language Scale of the BSID-III (Bayley, 2006). bHollingshead Four-factor Index of Socioeconomic Status (Hollingshead, 1975). cCVC/hour based on LENA™ recordings. dReported words produced on the MacArthur-Bates CDI (Fenson et al., 2007). eAWC/h based on LENA™ recordings. fMean proportion looking to target (accuracy) in the LWL task at 16 or 18 months. gMean latency to shift from distracter to target picture (RT) in the LWL task at 16 or 18 months. *p < .05. **p < .01. ***p < .001. Significant effects also indicated in bold face.
Models 4–6 examined the contribution of our predictors to accuracy in the LWL task. There was a nonsignificant contribution of SES and child gender (Model 4), and CVC and child accuracy at the younger age together contributed about 18% additional variance, F(2, 51) = 4.54, p = .015 (Model 5). In Model 6, the addition of AWC/h nearly doubled the variance accounted for, adding more than 15% additional variance in predicting accuracy in lexical processing, F(1, 50) = 11.70, p = .001. This model indicates that every 100-word increase in AWC/h is associated with a nearly 1% increase in accuracy in LWL for FT children, controlling for SES, gender, and earlier child language skills.
Finally, child gender, but not SES, was a significant predictor of speed of lexical processing (RT; Model 7), indicating girls were faster in lexical processing than boys. However, this effect was reduced when CVC and RT at 16 months were added, F(2, 51) = 11.92, p < .001 (Model 8). Critically, AWC/h added about 7% additional variance beyond these control variables, F(1, 50) = 7.43, p = .009, with all factors accounting for nearly 50% of the variance (Model 9). Model 9 indicates that every 100-word increase in AWC/h is associated with a 6.9 ms decrease in RT, controlling for SES, gender, and earlier child language skills. In sum, these results both replicate and extend earlier findings which demonstrate relations between FT children's early learning environments and later language outcomes (Weisleder & Fernald, 2013).
Predicting Child Outcomes from Caregiver Talk in Children Born PT
To examine the relations between caregiver talk and child outcomes in children born PT, three parallel series of models are presented in Table 4. On the BSID-III, neither SES nor child gender contributed (Model 10), but CVC and child expressive vocabulary reported on the CDI at 16 months together added significant additional variance, F(2, 36) = 7.20, p = .002 (Model 11). Importantly, Model 12 indicated that AWC/h accounted for approximately 20% additional variance, F(1, 35) = 13.94, p = .001, and that every 100-word increase in AWC/h is associated with a 1.5 point increase in BSID-III total language scores, controlling for SES, gender, and earlier child language skills.
BSID-III languagea | Model 10 | Model 11 | Model 12 | Model 13 |
---|---|---|---|---|
HI: SESb | 0.02 (0.34) | 0.01 (0.31) | −0.04 (0.27) | −0.02 (0.28) |
Child gender | 6.07 (5.60) | 2.72 (4.97) | 0.71 (4.29) | 0.36 (4.43) |
CVC/h at 16 monthsc | – | −.02 (.04) | −.06 (.03) | −.06 (.03) |
Vocabulary at 16 monthsd | – | 0.36 (0.10) ** | 0.34 (0.08) ** | 0.35 (0.09) ** |
AWC/h at 16 monthse | – | – | 0.02 (0.01) ** | 0.02 (0.01) ** |
Medical riskf | – | – | – | 0.58 (1.35) |
R2 change | – | 27.7% ** | 19.7% ** | 0.3% |
Total R2 | 3.0% | 30.7% ** | 50.5% ** | 50.7% ** |
LWL accuracyg | Model 14 | Model 15 | Model 16 | Model 17 |
HI: SESb | 0.01 (0.01) | −0.01 (0.01) | −0.01 (0.01) | −0.01 (0.01) |
Child gender | 0.02 (0.03) | 0.02 (0.03) | 0.01 (0.03) | 0.02 (0.02) |
CVC/h at 16 monthsc | – | 0.01 (0.01) | 0.01 (0.01) | 0.01 (0.01) |
Accuracy at 16 monthsg | – | 0.16 (0.16) | 0.15 (0.15) | −0.07 (0.15) |
AWC/h at 16 monthse | 0.01 (0.01) * | 0.01 (0.01) * | ||
Medical riskf | – | – | – | −0.02 (0.01)** |
R2 change | – | 2.8% | 15.9% * | 17.6% ** |
Total R2 | 1.2% | 4.0% | 19.9% | 37.5% ** |
LWL RTh | Model 18 | Model 19 | Model 20 | Model 21 |
HI: SESb | −0.81 (3.03) | 1.27 (3.38) | 1.63 (3.27) | 2.03 (3.11) |
Child gender | −46.04 (50.72) | −39.65 (50.85) | −27.0 (49.63) | −39.03 (47.34) |
CVC/h at 16 monthsc | – | −0.10 (0.34) | 0.13 (0.35) | 0.13 (0.33) |
Child RT at 16 monthsh | – | 0.20 (0.13) | 0.21 (0.13) | 0.16 (0.12) |
AWC/h at 16 monthse | – | – | −0.08 (0.05)# | −0.07 (0.04) |
Medical riskf | – | – | – | 30.67 (13.9) * |
R2 change | – | 6.4% | 8.4% # | 10.6% * |
Total R2 | 2.3% | 8.7% | 17.1% | 27.8% # |
Note
- Unstandardized B (SE). aStandard score for the Language Scale of the BSID-III (Bayley, 2006). bHollingshead Four-factor Index of Socioeconomic Status (Hollingshead, 1975). cCVC/hour from LENA™ recordings. dReported words produced on the CDI: Words & Gestures (Fenson et al., 2007). eAWC/h from LENA™ recordings. fScores on a medical risk composite (max = 13). gMean proportion looking to target (accuracy) in the LWL task at 16 or 18 months. hMean latency to shift from distracter to target picture (RT) in the LWL task at 16 or 18 months (n = 40). #p < .08 *p < .05. **p < .01.
To directly test whether the patterns of relations between AWC and outcomes on the BSID-III differed by FT/PT group status, we assessed a moderation effect following Hayes (2013) using the PROCESS macro in ver. 23. All tests of estimation used bias-corrected 95% confidence intervals with 10,000 bootstrap samples. The results indicated that the coefficient for the interaction between caregiver talk and group did not achieve statistical significance, b = .002 (.004) [CI: −.007, .010]. Thus, as shown in Figure 1a, the relations between caregiver talk and child outcomes on the BSID-III, a standardized test of language, were parallel in the FT and PT groups.

Turning now to accuracy, neither demographic (Model 14) nor child variables at 16 months (Model 15) were significant predictors; however, AWC/h accounted for nearly 16% additional variance, F(1, 35) = 6.93, p = .013 (Model 16). Model 16 suggests that every 100-word increase in AWC/h is associated with nearly a 1% increase in accuracy, controlling for SES, gender, and earlier child language skills. As shown in Figure 1b, the relations between AWC/h and processing accuracy were comparable in FT and PT children. A test of moderation indicated that the increase in variance accounted for by the interaction term did not reach statistical significance, b = .001 (.001), [CI: −.001, .001], suggesting parallel relations in both the FT and PT children.
Finally, none of the control variables accounted for significant variance in RT in the PT group (Models 18 and 19). AWC/h added a marginally significant 8% additional variance, F(1, 34) = 3.45, p = .072 (Model 20), however, the overall model was not significant. The regression coefficient for AWC indicates that every 100-word increase in AWC/h is associated with an 8.4 ms decrease in RT, controlling for SES, gender, and earlier child language skills. The parallel effects of AWC/h on residual variance in RT in FT and PT children are illustrated in Figure 1c, again reflected in a nonsignificant interaction term in a test of moderation, b = .004 (.04), [CI: −.084, .092].
Outcomes as a Function of Medical Risk in PT Children
We now examine group differences between PT children with higher versus lower medical risk. Notably, AWC/h was not different in PT children from higher (n = 19; M = 1,753.3) versus lower (n = 22; M = 1,806.3) medical risk groups, F(1, 37) = 0.05, p = .82, , controlling for SES and child gender. Thus, healthier PT children were not necessarily exposed to more caregiver talk, on average, than PT children who were experiencing more medical complications.
To determine if PT children higher in medical risk produced fewer words during the day-long recordings or scored more poorly on tests of language knowledge and processing than their healthier PT peers, controlling for SES and gender, we conducted a series of comparisons of means. CVC/h did not differ in children with higher (M = 185.2) versus lower (M = 194.3) medical risk, F(1, 37) = 0.02, p = .89, . Differences in reported vocabulary size were not significant at either the 16- (higher: M = 15.5; lower: M = 29.1), F(1, 37) = 2.6, p = .12,
, or 18-month time points (higher: M = 45.3; lower: M = 82.1), F(1, 37) = 3.1, p = .09,
. There was also no reliable group difference in BSID-III (higher: M = 98.2, SD = 17.4; lower: M = 101.6, SD = 18.3), F(1, 37) = 0.5, p = .50,
.
In contrast, PT children with higher medical risk were significantly less accurate at 18 months, on average, in identifying a familiar referent during real-time language comprehension (M = .63, SD = .09), compared to children with lower medical risk (M = .68, SD = 0.07), F(1, 37) = 5.9, p < .02, . Moreover, PT children from the higher medical risk group demonstrated slower RTs (M = 838, SD = 152) than those from the lower medical risk group (M = 720, SD = 157), F(1, 37) = 6.1, p < .02,
, controlling for SES and child gender. Thus, child outcomes that reflect information processing skills during real-time language comprehension appear to be more susceptible than knowledge-based assessments to health-related factors in this PT sample.
To further illustrate the effect of medical risk on child performance in the LWL task, Figure 2 plots the time course of the proportion looking to target over the course of the stimulus sentence at 16 and 18 months for PT children from lower and higher medical risk groups. For all children at both ages, looking to target is at chance levels at the onset of the target noun and increases as the sentences unfolds in time. However, PT children from the lower medical risk group tended to increase their proportion target looking earlier in the sentence and reach a higher asymptote than PT children from the higher medical risk group. Interestingly, the curve for the higher medical risk children at 18 months essentially overlaps with the curve for the lower medical risk children at 16 months, indicating that 18-month-old PT children at increased risk were performing at the same level overall as lower-risk children who were 2 months younger.

We now ask whether the contribution of medical risk to child outcomes is unique or overlapping in relation to caregiver talk and then test for moderation effects. In Table 4, the final model in each series explored the contribution of medical risk after the control variables and caregiver talk were taken into account. In Model 13, we see little role for medical risk in predicting additional variance in BSID-III, F(1, 34) = .184, p = .671, whereas earlier child vocabulary and AWC/h remained significant predictors. To assess the moderating effect of medical risk, we again used the PROCESS macro (Hayes, 2013) to evaluate the additional contribution of the interaction between AWC/h and medical risk to the prediction of BSID-III scores. The interaction term was not significant, b = .001 (.003), [CI: −.007, .007]. As illustrated in Figure 3a, AWC/h was associated with BSID-III scores to the same degree at lower (−1 SD), average, and higher (+1 SD) values of medical risk.

For accuracy, Model 17 shows that medical risk added nearly 18% additional variance beyond AWC/h, F(1, 34) = 9.58, p = .004, nearly doubling the prediction of the entire model, and both remaining significant. Thus, the contributions of medical risk and AWC/h were additive in predicting to processing accuracy. In the test of moderation, the interaction term between AWC/h and medical risk was again not significant, b = .001 (.001), [CI: −.001, .001]. Figure 3b shows that higher-risk PT children had lower accuracy scores overall, but there were comparable relations between AWC/h and accuracy at lower, average, and higher values of medical risk.
Finally, medical risk significantly contributed to variation in PT children's RT scores (Model 21), adding more than 10% additional variance, F(1, 33) = 4.87, p = .034. Unlike BSID-III and accuracy, AWC/h did not account for significant unique variance in RT after taking medical risk into account, with medical risk effectively eliminating the modest effect of AWC/h on child RT. The test of moderation again revealed a nonsignificant interaction between AWC/h and medical risk, b = −.02 (.035), [CI: −.089, .054]. Figure 3c shows that medical risk affects RT scores, but that links between AWC/h and RT are roughly parallel at lower, average, and higher values of medical risk.
Discussion
This study examined differences in the amount of caregiver talk heard by FT and PT toddlers and assessed the contribution of caregiver talk to later knowledge- and processing-based language skills. This research revealed four main findings. First, there was remarkable similarity in the number of adult words that FT and PT children heard, on average, but there was also substantial variability in both groups. Second, amount of caregiver talk was significantly related to both knowledge-based and processing-based outcomes in FT children. Third, for PT children, caregiver talk was a significant predictor of language outcomes, accounting for substantial variance in BSID-III and in language processing accuracy. However, the relation of caregiver talk to processing speed did not achieve statistical significance. Tests of moderation revealed that relations between amount of caregiver talk and child outcomes were parallel across the FT and PT groups. Fourth, medical risk added significant variance to child outcomes in the PT group for accuracy and processing speed, but not for knowledge-based outcomes. Importantly, medical risk did not moderate the relation between caregiver talk and outcomes in the PT group. Thus, caregiver talk remains an important predictor of language outcomes in children born FT and PT.
FT and PT Children Hear Similar Amounts of Caregiver Talk
In this study, we first hypothesized that we would find a difference in the amount of caregiver talk as a function of PT status. In contrast, our results suggested that the language learning environments of SES-matched children born FT and PT are quite similar, on average. On this index of “language nutrition,” many parents of PT children were highly engaged in caregiver–child interactions despite the increased cognitive and social risks associated with PT birth. The LENA™ technology enabled unobtrusive naturalistic recordings of the child's engagement with one or more caregivers during activities over the course of a full day. These estimates captured not just the number of words heard when a caregiver and child were engaging in one particular activity, but also reflected the frequency with which those interactions occurred during daily activities. As in earlier studies, there was considerable variation among families, with some children hearing nearly seven times more words on average than other children over the day. Importantly, there was much greater within-group than between-group variation in these FT and PT children.
Caregiver Talk Predicts Outcomes in Both FT and PT Toddlers
Our main focus was the extent to which amount of caregiver talk predicted children's language outcomes. Looking first in the FT children, we replicated previous findings by Weisleder and Fernald (2013); those FT children who heard more talk were also those who were faster and more accurate in directing their gaze to the object named by a familiar noun. We also documented similar relations on a widely used standardized test of language knowledge, the BSID-III. Taken together, these results suggest that those FT children who experience more language-rich interactions from caregivers are better positioned to develop both stronger information processing skills and a greater body of language knowledge, compared to peers who experience fewer interactions.
We asked if quantity of caregiver talk differentially predicted outcomes in FT and PT children, with increased, decreased, or parallel effects. Moderation analyses revealed largely parallel effects of the relations between caregiver talk and outcomes in FT and PT children. Thus, we find no support for the hypothesis that the development of language skills by children with increased risk due to PT birth is more or less sensitive to the quantity of caregiver talk than children born FT at lower risk. Our results also suggested some similarities and differences in the patterns of relations between caregiver talk and our three outcomes for FT and PT children. Regardless of birth status, those children who heard more adult words tended to score higher on a test of language knowledge and to comprehend familiar words more accurately in real time, even after controlling for SES, gender, and earlier language skills. However, this relation was only marginally significant for processing speed in the PT group. This weaker overall predictive validity in PT than FT children in this particular measure may be due to weaker stability of RT in the PT group. In general, the models were weaker in predicting to processing speed and accuracy than to scores on the knowledge-based BSID-III.
What is the Role of Medical Risk in PT Children?
The next question of interest was the nature of the contribution of medical risk to outcomes in the PT group and whether medical risk moderated the relations between caregiver talk and outcomes. We found that PT children with higher medical risk did not hear less talk from caregivers, on average, than did children with lower medical risk. The regression analyses revealed that medical risk did not account for additional variance in BSID-III scores, caregiver talk remained a significant predictor after medical risk was taken into account, and medical risk did not moderate the relation between AWC/h and children's scores. These findings suggest that caregivers of children born PT who talk more may support the acquisition of their child's knowledge-based language skills regardless of the child's level of medical risk. It is possible that untimed, face-to-face measures like the BSID-III are administered in a manner that partially accommodates for behaviors associated with medical complications—for example, by allowing repetitions, breaks, and pauses to reorient the child to the task. Thus, knowledge-based measures may be particularly good at capturing underlying language skills, although they may also attenuate the effects of behavioral or attentional factors that influence performance during timed processing-based language tasks.
In contrast, medical risk accounted for significant variance in language processing efficiency. For accuracy, knowing a child's medical risk in addition to caregiver talk nearly doubled the model's prediction. For RT, medical risk added considerable variance, and eliminated the contribution of caregiver talk. Thus, efficiency of language processing during timed tasks may be particularly influenced by factors included in the medical risk composite used here. At the same time, the relations between caregiver talk and child outcomes were parallel at all levels of medical risk, indicating that medical risk did not moderate these effects.
These results provide support for intervention efforts that increase caregiver talk as a means of facilitating language development in FT and PT children. For PT children, medical risk should be considered an important additional determinant of outcomes in this early stage of development, especially in relation to processing-based measures. Note that medical factors have been shown to decrease in their effect, while environmental factors, such as quality of caregiver–child interaction, increase in their role in shaping outcomes over the first few years of life (Miceli et al., 2000). It is possible that our results would have revealed a stronger link between caregiver talk and language processing speed in PT children at older ages. Critically, the effects of caregiver talk were not moderated by medical risk, suggesting that improving language environments is beneficial regardless of birth status or health history.
Limitations of This Study
One limitation is that this sample was relatively small, and it is possible that moderation effects would have been identified if we had a larger sample and increased statistical power. A second limitation was that the sample was predominantly higher SES. Because PT birth is disproportionally overrepresented in low-SES compared to higher-SES groups in the United States (Kramer, Séguin, Lydon, & Goulet, 2000), our findings may not be characteristic of those seen in a more nationally representative sample. Moreover, SES is strongly related to the amount of caregiver talk that children hear, and thus the full range of caregiver talk for PT children may not be represented in this sample as would be found in a more diverse sample. Fourth, without additional coding, we did not have access to features beyond AWC, such as diversity of vocabulary and syntactic complexity, that are known to support child language outcomes (Weizman & Snow, 2001). Such additional coding might also have revealed differences in style of caregiver talk. For example, caregivers may use more directive talk with those PT children who may require more support for daily routines and be less capable of engaging in reciprocal interactions. More directive talk may result in caregivers devoting less of their interactions to providing names for objects, descriptions, or elaborations on the activities in which the child and caregiver are engaged (Rowe, 2012). A fifth limitation is that the audio recording alone could not provide access to nonverbal features of caregiver–child interactions, such as eye gaze or tone of speech, which might have revealed group differences (Landry et al., 1997). Future studies should continue to explore both quantitative and qualitative features of caregiver talk using multiple methods that could impact language outcomes in children born PT.
Conclusions
This study led to several new discoveries regarding the effect of caregiver speech on children's language outcomes. We found that children heard similar amounts of caregiver talk over the course of a full-day home recording, regardless of birth status or medical risk. Moreover, variability in the amount of speech that children heard had strikingly similar predictive relations to language function in FT and PT children. Nevertheless, we found both similarities and differences in the patterns of relations between caregiver talk and our three outcomes in FT and PT children. In FT children, caregiver talk consistently predicted knowledge- and processing-based language functions. For PT children, caregiver talk predicted knowledge-based language functioning and processing accuracy, but caregiver talk was less consistently related to processing speed, especially when medical risk was taken into account. Critically, the potential benefits of caregiver talk to PT children's outcomes were equally evident in those children who were more or less vulnerable to the adverse consequences of preterm birth, suggesting that interventions that focus on improving caregiver talk may be beneficial regardless of children's birth status or medical risk. These findings highlight the critical role of caregiver talk in shaping language outcomes in both FT and PT children and offer new insights into the links between neurodevelopmental risk factors and caregiver–child engagement.