The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely. As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsement of, or agreement with, the contents by NLM or the National Institutes of Health. Learn more about our disclaimer.
PLoS One. 2015; 10(11): e0142566.
Published online 2015 Nov 12. doi: 10.1371/journal.pone.0142566
PMCID: PMC4642969

How Visuo-Spatial Mental Imagery Develops: Image Generation and Maintenance

Marina C. Wimmer

University of Plymouth, School of Psychology, Cognition Centre, Plymouth, United Kingdom

Find articles by Marina C. Wimmer

Katie L. Maras

University of Bath, Department of Psychology, Claverton Down, Bath, United Kingdom

Find articles by Katie L. Maras

Elizabeth J Robinson

University of Warwick, Department of Psychology, Coventry, United Kingdom

Find articles by Elizabeth J Robinson

Martin J Doherty

University of East Anglia, School of Psychology, Norwich, United Kingdom

Find articles by Martin J Doherty

Nicolas Pugeault

University of Surrey, Centre of Vision, Speech and Signal Processing, Guildford, United Kingdom University of Plymouth, School of Psychology, Cognition Centre, Plymouth, United Kingdom University of Bath, Department of Psychology, Claverton Down, Bath, United Kingdom University of Warwick, Department of Psychology, Coventry, United Kingdom University of East Anglia, School of Psychology, Norwich, United Kingdom University of Surrey, Centre of Vision, Speech and Signal Processing, Guildford, United Kingdom Catholic University of Sacro Cuore, ITALY Study : Participants viewed a stimulus (e.g., vase) in a specific location on the screen (x/y coordinates: 26.5mm/60mm from the closest boarders of the screen) for 20 seconds and were asked to label it. Each stimulus appeared in a different location on the screen (x/y coordinates: Range(x) = 3.9 to 51.5 mm/ Range(y) = 3.9 to 102.3 mm). To enhance visuo-spatial encoding, a 3 mm wide frame appeared on the border of the screen, highlighting the border. Stimuli and frame appeared in flashing rotating rainbow colours to enhance perceptual encoding and avoid after image effects.

  • Distractor/maintenance period : The stimulus disappeared from the screen. In the image generation task , to eliminate visual short-term memory residuals and ensure that images had to be regenerated on the next phase, the participant was given a 30-second age-appropriate distractor task (counting for the 4-year-olds and mathematical calculations of increasing complexity for the older children and adults). This method is commonly applied in memory research to tap into long-term memory processes [ 28 ]. In the image maintenance task no distractor was given, and instead participants were simply required to retain the image for 500ms (light load) or 3000ms (heavy load) during which they were presented with a blank screen.
  • Test (image generation or image maintenance) : A new image (e.g., flower) appeared on the screen and participants were asked whether it fell on the original image (e.g., “Is the flower in the vase?”) that was no longer visible and thus had to be imaged.

    Phases (1)–(3) were repeated with the remaining stimuli pairs of that set, with the order of presentation randomised. Stimuli-pairs appeared in three different trial types counterbalanced between participants: i) in far trials, images appeared far from the study image ( Mdistance = 55.3 mm), ii) in near trials images appeared close to the study image ( Mdistance = 12.4 mm) and iii) in superimposed trials images fell on the study image. The nearer the image appeared to the study image (superimposed versus near versus far) the finer grained the generation of the shape and resolution of the image required to answer correctly. In total, there were 6 test trials for image generation (2 far, 2 near, and 2 superimposed), and 12 trials for image maintenance (6 500ms trials comprising: 2 far, 2 near, and 2 superimposed; 6 3000ms trials comprising: 2 far, 2 near, and 2 superimposed). Type of trial (near, far, superimposed) appeared in random order with each task. The participants’ answer required was “yes” or “no”. Correct answers for superimposed trials were “yes” (image generation: N = 2, image maintenance: N = 4 (2 500ms/2 3000ms)), and “no” for both near and far trials (image generation: N = 4, image maintenance N = 8 (4 500ms/4 3000ms).

  • Memory assessment : After all image generation or maintenance test trials had been completed participants were asked to recognize the locations of each object, assessing the memory for each original study image. Participants were presented with 4 images of the same kind (e.g., vases) next to each other on the screen (in a 2 by 2 square configuration, subtending between 3°-5° by 3°-5° of visual angle), one of which corresponded to the original studied image. The task was to select the one in the correct position. Images were presented in the same order in which they were shown earlier.
  • Perception control : As a final check on whether children had understood the task, perceptual control trials were administered. This phase followed exactly the same pattern as the test phase (3), except that both images were visible on the screen when the participant judged whether the second image (e.g., flower) “fell” onto the original image (i.e., vase). Results are not presented as accuracy was at ceiling for all ages.
  • Results and Discussion Experiment 1

    Outliers in response time, twice the mean of the age group in a particular trial (far, near, superimposed), were removed (6-year-olds: 1 data point; 10-year-olds: 3 data points). Bonferroni confidence interval adjustments and post-hoc Bonferroni analysis were used throughout.

    Preliminary analysis

    The order in which image maintenance and generation tasks were completed did not affect performance (all p s > .05). Because there were no effects due to gender, neither within each age group nor overall ( t < 1), gender was eliminated from subsequent analyses.

    Image generation

    Fig 2 shows the mean proportion of errors and the mean response times in the image generation trials. Image generation error rate and response time were analysed using a 5(age: 4-, vs. 6-, vs. 8-, vs. 10-year-olds, vs. adults) x 3 trial type: far vs. near vs. superimposed) analyses of variance (ANOVAs) where the first variable was between subjects and the latter within.

    Image generation proportional error rate (bar chart) and mean response time in milliseconds (line chart) and mean standard error across all ages as a function of trial type.

    Error rate

    Image generation accuracy increased with age, F (4, 97) = 3.35, p = .013, ηp 2 = .12, where both 4- ( M = .69, p = .06 marginally significant) and 8-year-olds ( M = .68, p = .02) differed from adults ( M = .82). The remaining age groups did not differ (all p s > .05) (6-year-olds: M = .76; 10-year-olds: M = .77). Even 4-year-olds performed above chance (chance = .50), collapsed across superimposed, near and far trials, t (19) = 6.61, p < .001. Performance remained above chance when including only superimposed and near trials, excluding far trials, t (19) = 3.78, p = .001. This finding indicates that with minimised task demands generation abilities can be present at 4 years. The effect of trial type, F (2, 194) = 51.71, p < .001, ηp 2 = .35, indicated that accuracy decreased when more precision was required. Fewer errors ( p < .001) occurred in far trials ( M = .07) than both near ( M = .35) and superimposed trials ( M = .34), which did not differ. There was no trial type x age interaction, F (8, 194) = 1.31, p = .24, ηp 2 = .05, also indicating that even 4-year-olds understood the task instructions and their poorer performance was not due to the nature of task.

    Response time

    The time taken to generate a mental image decreased with age, F (4, 96) = 32.50, p < .001, ηp 2 = .58, between all adjacent age groups (all p s < .001) until 8 years, where the oldest three groups did not differ ( p > .05) (4-year-olds: M = 5413ms; 6-year-olds: M = 4071ms; 8-year-olds: M = 3340ms; 10-year-olds: M = 2906ms; adults: M = 3167ms). Four-year-olds’ response time was about half that of 5-year-olds in previous research (approximately 10 seconds) [ 23 ]. Thus, verbally reporting the product of imagery significantly reduces response time and highlights that Kosslyn et al’s [ 23 ] response time results may not directly reflect the time it took to generate an image but additional task factors such as remembering which button to press interfering with imagery [ 30 ]. The closer the image appeared to the visible stimulus the longer it took to respond, F (2, 192) = 37.70, p < .001, ηp 2 = .28, and this was the case for all ages as there was no trial x age interaction, F (8, 192) = 1.53, p = .15, ηp 2 = .06. Post-hoc analyses show that superimposed trials took the longest in each age group, however, differences between trial types are not exactly the same across age groups. Specifically, 4-year-olds took longer to respond in superimposed than both near and far trials ( p s < .036), 6-year-olds took longer in superimposed than far trials ( p = .003) and near than far trials ( p = .02), 8-year-olds took longer in superimposed than far trials ( p = .028) and marginally longer in near than far trials ( p = .063), 10-year-olds took longer in superimposed than far trials ( p = .016) and adults took longer in superimposed than far trials ( p < .001) and near than far trials ( p < .001). There were no further differences. The comparable effects of complexity on response time and error rate rule out a speed-accuracy trade off.

    Visuo-spatial memory versus image generation

    How well was the original image remembered? All age groups performed above chance ( p < .001) in selecting the correct study image out of 4 presented images (one-sample t-tests, chance = .25). Although performance increased with age, F (4, 97) = 7.41, p < .001, ηp 2 = .23, this effect was due to significantly poorer performance (all p s < .01) by 4-year-olds ( M = .55) than older participants (ranging from M = .77 at 6 years to M = .87 for adults) who did not differ (all p s > .05). This contrasts with image generation performance, which continued to increase with age, suggesting that processes beyond visuo-spatial memory are involved in the generation of a mental image.

    To directly examine the link between memory and image generation, correlations were performed for the youngest 3 age groups only, a10-year-olds and adults performed at ceiling. No association emerged between image generation accuracy and corresponding memory accuracy overall for the youngest 3 age groups, r = .18, p = .17 and when examining associations for each age group individually: 4-year-olds: r = .04; 6-year-olds: r = .35, 8-year-olds: r = -.24, all p s > .05. Again, this suggests that poor image generation is not a result of poor visual memory.

    Image Maintenance

    Fig 3 shows the mean proportion of errors and mean response times in the image maintenance trials. Image maintenance proportional error rate and response time were analysed in two 5(age: 4-, vs. 6-, vs. 8-, vs. 10-year-olds, vs. adults) x 3(trial type: far vs. near vs. superimposed) x 2(load: 500ms vs. 3000ms) ANOVAs. As there were no effects of maintenance load (500ms vs. 3000ms) for accuracy, F (1, 194) = .01 p = .98, ηp 2 = .00, and for response time, F (1, 194) = 2.11, p = .15, ηp 2 = .02, this variable was eliminated from subsequent analyses. The means of both imagery loads’ error rates and response times were used for further analyses.

    Image maintenance proportional error rate (bar chart) and mean response time in milliseconds (line chart) and mean standard error across all ages as a function of trial type.

    Error rate

    Image maintenance accuracy increased with age F (4, 97) = 5.63, p < .001, ηp 2 = .19, particularly between 4-year-olds ( M = .74) and all older age groups (8-year-olds: M = .84, p = .04; 10-year-olds: M = .83, p = .06 marginally significant; adults: M = .90, p < .001) except 6-year-olds ( M = .81, p > .05). There were no other age group differences ( p s > .05). However, even 4-year-olds performed significantly above chance, t (19) = 7.75, p < .001. Performance remained above chance when including only superimposed and near trials, excluding far trials, t (19) = 4.48, p < .001. Further, accuracy was better for far versus near or superimposed trials, F (2, 194) = 27.79, p < .001, ηp 2 = .22. There was no interaction between trial type and age ( p > .05).

    Response time

    Response time decreased with age, F (4, 97) = 27.81, p < .001, ηp 2 = .53, where 4-year-olds ( M = 4578ms) differed (all p s < .001) from all older age groups (6-year-olds: M = 3455ms; 8-year-olds: M = 3009ms; 10-year-olds: M = 2799ms; adults: M = 2525ms) and 6-year-olds took longer than both 10-year-olds ( p = .03) and adults ( p < .001). There were no further differences. Importantly, response times were affected by trial type; F (2, 194) = 17.57, p = .001, ηp 2 = .15, in that participants responded faster in far trials compared to near and superimposed trials which did not differ ( M = 2848ms versus M = 3386ms M = 3586ms, p < .001). Again, the superior performance on far trials suggests that mental imagery processes go beyond the time it takes to visually inspect a stimulus. The comparable effects of trial complexity on response time and error rate also rule out speed-accuracy trade off.

    Visuo-spatial memory versus image maintenance

    One-sample t-tests indicated that each age group performed above chance ( p s < .001) in selecting each study image out of 4 presented images (chance = .25). Four-year-olds ( M = .62) memorized fewer study images (ranging from M = .77 for 6-year-olds to M = .81 for adults, all p s < .05), F (4, 96) = 6.08, p < .001, ηp 2 = .20, and there were no further differences between age groups ( p s > .05). As above, there were no correlations between memory accuracy and the corresponding maintenance accuracy overall for the youngest 3 age groups, r = .23, p = .07 (after partialling out age, r partial = .10, p = .45) and when examining associations for each age group individually (4-year-olds: r = .01, 6-year-olds: r = .03, and 8-year-olds: r = .31, all p s > .05).

    Image generation versus image maintenance performance

    To directly compare error rates and response times in image generation and maintenance two 5(age: 4-, vs. 6-, vs. 8-, vs. 10-year-olds, vs. adults) x 2(task: image generation vs. image maintenance) repeated measures ANOVAs were conducted. For error rates, there was a main effect of age, F (4, 97) = 6.21, p < .001, ηp 2 = .20, where both 4-year-olds ( M = .72, p < .001) and 8-year-olds ( M = .76, p = .01) differed from adults ( M = .86) but not from 6-year-olds ( M = .78) and 10-year-olds ( M = .79). There were no further age group differences ( p s > .05). There was also a main effect of task, F (1, 97) = 21.81, p < .001, ηp 2 = .18, indicating that image generation ( M = .74) was harder than image maintenance ( M = .82). There was no age x task interaction, F (4, 97) = 1.49, p = .21, ηp 2 = .06, but when examining differences for each age group separately, then only the oldest three age groups performed significantly worse in image generation than maintenance ( p s < .05).

    For response times there was also a main effect of age, F (4, 96) = 42.63, p < .001, ηp 2 = .64, where 4-year-olds ( M = 4995ms) took longer to respond than all older age groups (all p s < .001). Six-year-olds ( M = 3763ms) took also longer than all older age groups (all p s < .025). The oldest 3 age groups did not differ (all p s > .05) (8-year-olds: M = 3175ms; 10-year-olds: M = 2864ms; adults: M = 2846ms). Additionally, image generation took longer ( M = 3779ms) than image maintenance ( M = 3278ms), F (1, 96) = 37.64, p < .001, ηp 2 = .28. However, these effects were qualified by an age x task interaction, F (4, 96) = 2.52, p = .046, ηp 2 = .09. All age groups (all p s < .001; except 8-year-olds: p = .07, marginally significant), except 10-year-olds ( p > .05) took longer in image generation than image maintenance trials; the lack of difference for 10-year-olds alone may not be a reliable phenomenon.

    The overall task performance differences suggest that additional processes are required for generation compared to maintenance. Both processes require activating stored visual memories, but image generation may additionally require accessing stored visual memories for information about what to image [ 23 ]. Our findings of higher error rate in the three oldest age groups and longer response times in image generation support this claim.

    In sum, the results of Experiment 1 suggest that basic image generation and maintenance abilities can be present at 4 years of age. That is, in addition to research finding that 2- to 4-year-old children use imagery when prompted to do so (e.g., “make a picture in your head”) [ 16 18 ] the current research reveals early spontaneous imagery abilities. Additionally, the precision with which images are generated and maintained improves particularly between 4 and 8 years of age. Moreover, mental imagery processes go beyond the time it takes to visually inspect a stimulus for all ages, shown by the finding that response time was shorter in far trials compared to both near and superimposed ones. Finally, the lack of a correlation between imagery and according visuo-spatial memory may suggest that visuo-spatial memory is not required for both image generation and maintenance tasks. Alternatively, it may indicate that visuo-spatial recognition may lack sensitivity and not be the most suitable comparator to image generation and maintenance, see [ 31 , 32 ] for associating recall with recognition. Therefore, in Experiment 2 a measure of visuo-spatial recall was introduced.

    Experiment 2

    Experiment 1 examined the precision with which images are generated and maintained. The aim of Experiment 2 was to compare the ability to coordinate mental and real images. Comparing coordinating a mental image and a visual image with children’s ability to coordinate two visual images should provide a very accurate assessment of their mental imagery abilities, holding other task factors constant. Image generation and maintenance were measured by the precision with which a second visible object was manipulated within the context of the first imagined object (imagery trials), compared to how the second object was manipulated when the first image was also visible (perception control trials). As in Experiment 1, the crucial difference between image generation and maintenance was that once the image had been memorized, in generation trials participants received a 30 seconds distractor task in order to eliminate any short-term memory residuals [ 28 ]. In image maintenance trials there was no distractor task and the image had to be maintained for 3000ms.

    Additionally, as before visuo-spatial memory was assessed. Instead of recognising the location of each object (Experiment 1), in the current experiment participants recalled the location of each object by dragging and dropping the stimulus onto its original place. Examination of an objects’ dropped location compared to its original actual location provided precise measures of visuo-spatial location memory. Further, comparison between imagery and visuo-spatial memory performance allowed a check that the image generation and maintenance tasks required mental imagery and not just visuo-spatial memory.

    Method

    Participants

    A total of 88 children [23 (12 males) 4-year-olds ( M = 4.8, SD = 4 months), 21 (10 males) 6-year-olds ( M = 6.7, SD = 3 months), 22 (11 males) 8-year-olds ( M = 8.8, SD = 3 months), 22 (11 males) 10-year-olds ( M = 10.9, SD = 3 months) and 21 (10 males) adults ( M = 23.6, SD = 84 months)] were recruited. All children were tested following written parental consent and their own oral assent on the day of testing. Adults were tested following written consent on the day of testing. Ethical approval for this study was obtained from the Research Ethics Committee at Warwick and Plymouth University.

    Design

    All participants completed both the image generation and the image maintenance tasks in the same session, with task order counterbalanced within each age group.

    Materials and Procedure

    The stimuli comprised two sets of 6 pairs of stimuli taken from those used in Experiment 1 and each participant completed 12 trials. As in Experiment 1 the program recorded response times (reaction time from stimulus onset to pressing the return key) and imagery-perception overlap (as described below) and memory scores.

    The procedure followed the same pattern as Experiment 1 with two modifications ( Fig 4 ). First, instead of judging whether a presented stimulus fell on an imaged one, participants were required to drag and drop a visible stimulus onto an imaged one (imagery trials), before later dragging and dropping the same stimulus, but this time with both stimuli being visible (perception control trials). Because some of the ‘correct’ positions for the stimuli were subjective (e.g., the flower could be placed on top of the vase, or it could be placed further inside the vase; both positions would be correct), overlap scores were calculated, using Pythagoras’ theorem (h 2 = a 2 + b 2 ), as the difference (in pixels) between the centre of the object (defined as the centre of a non-visible box drawn around it) being moved on the imagery trials and its centre point on the perception control trials. Hence, participants determined their own individual baseline positions for the stimuli in the perception control trials. Second, in the memory trials, rather than recognizing the imaged objects’ correct location, participants recalled spatial location by dragging and dropping the object back in its original place. Memory accuracy for each image was calculated as the distance (in pixels) from the centre point of the original image location to the centre point of the dropped image location (again using Pythagoras theorem).

    Image Generation and Maintenance task Experiment 2.

    Example of an image generation trial. Image maintenance followed the same pattern except that a blank screen was presented for 3000ms instead of the distractor task.

    Results and Discussion Experiment 2

    Outliers in response time and overlap (i.e. two standard deviations above the mean) of the according age group were removed (only one 4-year-old was removed as a whole participant as more than half of his data points were outliers and there were no further outlier data points). Bonferroni confidence interval adjustments and post-hoc Bonferroni analysis were used throughout. ‘Overlap’ refers to the difference in pixels between the location a picture was placed on a real image (in the perception control trials) and on an imagined image (in the image generation/maintenance trials). Lower scores indicate greater similarity between accuracy using mental images and real pictures.

    Preliminary analyses

    There was a main effect of gender for image maintenance overlap, F (1, 99) = 4.53, p < .05, ηp 2 = .04. For females, accurate placement of a picture on a real image and on a mental image was more similar than for males (M = 133.13 versus M = 143.45). However, there was a marginally significant age x gender interaction for image maintenance overlap, F (4, 99) = 2.15, p = .08, ηp 2 = .08, due to the fact this gender difference was significant only for 4-year-olds ( p < .05), and not for any of the older age groups (all p s > .12).

    Image generation

    Figs Figs5 5 and and6 6 display age-related differences in image generation abilities.

    Fig 5
    Image generation and image maintenance overlap scores and mean standard error for each age group.
    Mean response times for image generation and image maintenance tasks and mean standard error for each age group.

    Overlap generation-perception

    There was a significant effect of age for image generation overlap, F (4, 103) = 3.97, p < .005, ηp 2 = .13. Adults’ placement of a picture on a real image and on a mental image overlapped more ( M = 126.25) than 4-year-olds ( M = 166.81, p < .005). However there were no significant differences between adults and children older than 4 years (all p s > .52), indicating that children could generate adult-like mental images in this task by age 6.

    Response time generation-perception

    To examine age-related differences in processes underlying image generation compared to those involved in visual inspection and motor coordination (dragging and dropping of a stimulus using the computer mouse), image generation and perception control response times were compared. A 2(trial: generation vs. perception) x 5(age: 4- vs. 6- vs. 8- vs. 10-year-olds vs. adults) ANOVA showed main effects of age, F (4, 103) = 16.09, p < .001, ηp 2 = .39, where 4-year-olds ( M = 14428m) took equally long as 6-year-olds ( M = 13394ms, p > .05) and were slower than all older age groups (all p s < .001). Additionally, 6-year-olds took longer than 8-year-olds ( M = 10063ms, p = .007), 10-year-olds ( M = 8249ms, p < .001) and adults ( M = 9204, p < .001). Children from age 8 onwards did not differ (p > .05). Further, image generation trials took markedly longer ( M = 14187ms) than perception control trials ( M = 7945ms), F (1, 103) = 362.83, p < .001, ηp 2 = .78, suggesting image generation processes go beyond visual inspection of a stimulus and the motoric components for all age groups. There was no age x trial interaction, F (4, 103) = 1.44, p = .23, ηp 2 = .05.

    Visuo-spatial memory versus generation overlap

    Memory scores were calculated in terms of overlap as before. Lower scores reflect more accurate memory. A one-way ANOVA revealed a significant main effect of age, F (4, 103) = 11.60, p < .001, ηp 2 = .31. Four-year-olds remembered the image locations significantly (all p s < .001) less accurately ( M = 157.14) than each of the other age groups (overlap scores ranged from M = 100.00 for 6-year-olds to M = 78.19 for adults). There were no further significant differences (all p s > .05).

    To explore whether image generation overlap depends on how well the visuo-spatial locations of the images are remembered, we conducted a correlational analysis between memory accuracy and image generation overlap. There was a strong relation between memory accuracy and generation overlap, r = .38, p < .001, that held when controlling for age, r = .32, p = .001, suggesting that spatial memory is required for this visuo-spatial image generation task and vice versa .

    Image maintenance

    Figs Figs5 5 and and6 6 display developmental differences in image maintenance abilities.

    Overlap maintenance-perception

    There was a main effect of age for image maintenance overlap, F (4, 104) = 17.90, p < .001, ηp 2 = .41. Four-year-olds ( M = 173.19) maintained images had significantly less overlap than those of older children (overlaps ranged from M = 138.26 for six-year-olds to M = 117.96 for adults; the difference between 6-year-olds and adults was marginally significant, p = .06). Thus, at least by 8-years-of-age children maintained images as precisely as adults in this task ( Fig 5 ).

    Response time maintenance-perception

    To examine developmental processes underlying image maintenance beyond processes underlying visual inspection and motor coordination, image maintenance and perception control response times were compared. A 2 (trial: maintenance vs. perception) x 5(age: 4- vs. 6- vs. 8- vs. 10-year-olds vs. adults) ANOVA showed decreasing response time with increasing age, F (4, 103) = 23.29, p < .001, ηp 2 = .47, where 4-year-ods ( M = 12446ms) took equally long as 6-year-olds ( M = 10435ms, p > .05) but longer than all older age groups (all p s < .001). Six-year-olds took also longer than 8- ( M = 7511ms, p = .004), 10-year-olds ( M = 6158ms, p < .001) and adults ( M = 6721ms, p < .001), who did not differ from each other ( p > .05). Further, image maintenance trials ( M = 9668ms) took longer than perception trials ( M = 7640ms), F (1, 104) = 65.88, p < .001, ηp 2 = .39. There was no interaction, F (4, 104) = .51, p = .73, ηp 2 = .02. Findings suggest that image maintenance processes go beyond visual inspection of a stimulus and the motoric components of the task for all ages.

    Visuo-spatial memory versus maintenance overlap

    There was a main effect of age group for memory accuracy, F (4, 104) = 12.09, ηp 2 = .32. Four-year-olds had significantly less accurate memory for the images ( M = 156.69) than did older participants (ranging from M = 93.20 for 6-year-olds to M = 74.60 for adults, all p s < .001). There were no differences in memory accuracy between any of the older age groups within whom there were no significant differences ( p > .05). Moreover, there was a significant relation between image maintenance performance and memory accuracy, r = .55, p < .001, that held when controlling for age, r partial = .49, p < .001, suggesting that spatial memory is associated with image maintenance processes and vice versa .

    Image generation versus image maintenance performance

    We compared precision on the image generation and image maintenance tasks with a 5(age group: 4-, vs. 6-, vs. 8-, vs. 10-year-olds, vs. adults) x 2(task: image generation vs. image maintenance) repeated measures ANOVA. There was a main effect of age for overlap, F (4, 103) = 16.51, p < .001, ηp 2 = .39, where 4-year-olds ( M = 169.78) performed less accurately (all p s < .001) than all older age groups (6-year-olds: M = 142.84; 8-year-olds: M = 133.74; 10-year-olds: M = 135.29; adults: M = 122.11) and 6-year-olds performed less accurately ( p = .01) than adults. There were no further age group performance differences ( p s > .05). Additionally, overlap did not differ between image generation ( M = 143.98) and image maintenance ( M = 137.83), F (1, 103) = 2.20, ηp 2 = .02 and there was no significant age x task interaction, F (4, 103) = .55, p = .70, ηp 2 = .02 ( Fig 5 ).

    For response times a 5(age group) x 2(task) ANOVA yielded significant main effects of age, F (4, 103) = 14.16, ηp 2 = .36, where 4-year-olds ( M = 15000ms) performed as fast as 6-year-olds ( M = 14053ms, p > .05) but slower than all older age groups (all p s < .001). Six-year-olds took also longer than all older age groups (8-year-olds: M = 10906ms, p = 0.11; 10-year-olds: M = 9322ms, p < .001; and adults: M = 10196ms, p = .001). The oldest three age groups did not differ (all p s > .05). Moreover, participants were faster to respond on the image maintenance task ( M = 9605ms) than they were on the image generation task ( M = 14177ms), F (1, 103) = 236.37, p < .001, ηp 2 = .70. There was no interaction, F (4, 103) = 1.44, p = .23, ηp 2 = .05, indicating that all age groups were slower to respond on the image generation task compared to the maintenance task ( Fig 6 ). Thus, as in Experiment 1, response time differences may reflect the time taken accessing long-term memory for information about what to image (image generation) in addition to holding an image online (image maintenance).

    General Discussion

    This research aimed to examine the precision with which children from 4 years up to adulthood generate and maintain mental images (Experiment 1) and their accuracy coordinating mental images compared to visually perceived objects (Experiment 2). Findings from Experiment 1 suggest that the precision with which images are generated and maintained undergoes significant development over the preschool period. Moreover, although 4-year-olds’ accuracy was above chance in both image generation and maintenance in Experiment 1, from 4- to 8-years-of-age, children’s ability to coordinate mental and real images becomes increasingly similar to adults’ (Experiment 2).

    By modifying methods from Kosslyn et al. [ 23 ], who reported that 5- and 8-year-old children were significantly worse at generating and maintaining mental images than 14-year-old children, we have shown that, children as young as 4 can generate and maintain mental images. Hence our findings are more in-line with research that reported even 3-year-olds are capable of using mental imagery for reasoning and problem solving, although they tend not to do so spontaneously and require instructions and support in order to use their imagination [ 17 , 18 , 33 ]. Importantly, what the current findings add is that not only can young children use mental imagery when instructed to do so but both their accuracy in a task requiring image generation and maintenance and the ability to coordinate real with imaged images increases with age, with 4-year-olds showing basic imagery abilities.

    Age-related advances in image maintenance abilities that we observed occur at an earlier age than reported improvements in visuo-spatial working memory span. Image maintenance may be regarded as a critical function of visuo-spatial working memory [ 34 ]. Thus, improvements on image maintenance may precede processes required for visuo-spatial working memory span tasks. Studies that ask children to remember the locations of filled in squares or dots within an image or a square, for example, report that 5-year-olds perform at approximately 25% of adult-level [ 35 ], and that visual and spatial working memory span continue to improve from age 5 through to 11 years [ 36 , 37 , 38 ]. Kail [ 8 ] argues that mental imagery actually predicts spatial memory span, and thus improvements in mental imagery may facilitate improvements in spatial memory abilities. The current finding of developments in image maintenance over preschool prior to reported later developments in visuo-spatial working memory span [ 38 ] and the relation between imagery performance and memory performance (Experiment 2) may support this claim.

    In contrast to image maintenance which relies on short-term memory, image generation involves generating a previously seen image from long-term memory. Kosslyn et al.’s [ 23 ] measure of image generation was the time taken to retrieve an image from long-term memory. Their task required children to generate an upper-case version when cued with a lower-case letter, while in our study participants were cued verbally to imagine a previously presented image. Both of these measures of image generation require that participants generate an image from memory. Both also require generating reproductive images, a term first used by [ 20 ], that is, evoking images for objects or events that are already known, which is possible at a younger age than transformed images. Future work should extend the present methodologies to examine the developmental trajectory of transformed image abilities, and at which age children can produce these types of images with adult-like precision.

    Can we rule out the possibility that the present tasks are measures of visuo-spatial memory rather than visuo-spatial mental imagery? To explore the relation between imagery and spatial memory for the image, after the imagery trials had been completed we asked participants to recognize and recall the spatial locations of each of the images. Memory trials were examined after all imagery trials, yet even after this relatively long period, performance on the visuo-spatial recognition and recall tasks remained high. In Experiment 1 there was no relation between imagery accuracy and visuo-spatial recognition. However, the comparison of recognition (visuo-spatial memory) and recall (image generation and maintenance) may not have been appropriate [ 31 ]. Our finding that visuo-spatial recall (Experiment 2) correlated with the ability to coordinate a visible image with an imaged one supports this and suggests that the difficulty of the image generation and maintenance tasks lies in visuo-spatial memory. Weakly stored visuo-spatial locations may have contributed to poorer imagery performance. That is, for our visuo-spatial imagery tasks a lack of spatial memory may make it difficult to impose an image onto the imagined object, even if the shape and resolution of the imagined object were generated precisely. However, the findings of different developmental trajectories for each imagery task and visuo-spatial memory task suggest that the imagery tasks demand more than visuo-spatial memory. Specifically, in both visuo-spatial recognition and recall tasks only 4-year-olds differed from each of the age groups whereas image generation and maintenance performance continued to improve with age, particularly between 4- and 8 years. Thus, memory of the spatial location is necessary but not sufficient to generate and maintain the correct size and resolution of an image.

    The current research also adds to our understanding of development of spatial cognition. Research on visuo-spatial cognition indicates that basic spatial coding abilities are present in infancy [ 39 ]. However, location memory undergoes significant changes from toddlerhood into adulthood. Geometric biases in how children categorise space occur when younger children form large geometric categories and are biased towards the centre of the space (e.g., a midline) while older children subdivide space into smaller regions, showing biases away from a category boundary at midline [ 40 , 41 ]. For example, 7-year-old children replace objects into their original locations with significantly less accuracy than older children and adults, suggesting increasing precision in metric location estimation [ 42 ]. The current findings suggest that additionally, on a local level, the precision with which the shape and resolution of remembered visual stimuli are generated and maintained in their visuo-spatial location undergoes significant changes over the primary school period. Moreover, the extent of the accuracy of coordinating mental images compared to visual images increases particularly over the preschool period.

    Research on encoding spatial object relations, suggests that even infants are able to encode basic spatial relations between objects [ 43 ] and by 4 years children encode extent in absence of frame of reference [ 44 , 45 ]. The current findings add that encoding spatial relations is also possible between mental and visual images.

    In sum, this is the first investigation to demonstrate basic image generation and maintenance abilities in 4-year-olds. Importantly, the precision with which they generate and maintain visuo-spatial mental images undergoes significant developments over the early primary school period. Moreover, children’s mental images can be accurately coordinated with visually perceived objects. Thus, in addition to previous research that has shown that mental imagery aids cognitive functioning, the current research suggests that children can accurately generate and maintain their remembered visuo-spatial mental images and coordinate mental and visually perceived images in preschool age.

    Acknowledgments

    We are very grateful to the staff and children of Canon Evans Infant School, Race Leys Infant School, Canon Maggs Junior School, Wembury Primary School, St Rumon’s C of E Infants School and Peter’s Junior School for their invaluable help with the research.

    Funding Statement

    This research was supported by a grant from the Economic and Social Research Council, UK (RES-000-22-4158). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

    Data Availability

    All data files are available from figshare: http://dx.doi.org/10.6084/m9.figshare.125463 .

    References

    1. Kosslyn SM. Image and brain: The resolution of the imagery debate Cambridge,MA, US: MIT Press; 1994. [ Google Scholar ]
    2. Kosslyn SM. Mental image In: Jones CA. editor. Sensorium: Embodied experience,technology, and contemporary art . Cambridge, MA: MIT Press; 2006. [ Google Scholar ]
    3. Ganis G, Shendan HE. Visual imagery . Wiley Interdisciplinary Reviews: Cognitive Science . 2010. [ Google Scholar ]
    4. Jastrow J. Fact and fable in psychology Oxford, England: Houghton, Mifflin; 1900. [ Google Scholar ]
    5. Gale AG, Findlay JM. Eye movement patterns in viewing ambiguous figures In Groner R, Menz C, Fisher DF, Monty RA, editors. Eye movements and psychological functions: International views . Hillsdale, NJ: Erlbaum; 1983. pp. 145–168. [ Google Scholar ]
    6. Rock I, Hall S, Davis J. Why do ambiguous figures reverse? Acta Psychologica . 1994; 87 : 33–59. [ PubMed ] [ Google Scholar ]
    7. Wimmer MC, Doherty M J. The development of ambiguous figure perception . Monographs of the Society for Research in Child Development . 2011; 76 : 1–130. [ PubMed ] [ Google Scholar ]
    8. Kail R. Processing time, imagery, and spatial memory . Journal of Experimental Child Psychology . 1997; 64 : 67–78. [ PubMed ] [ Google Scholar ]
    9. Keogh R, Pearson J. The sensory strength of voluntary visual imagery predictsvisual working memory capacity . Journal of Vision . 2014; 14 : 1–13. [ PubMed ] [ Google Scholar ]
    10. Perner J, Kloo D, Rohwer M. Retro- and prospection for mental time travel: Emergence of episodic remembering and mental rotation in 5- to 8-year-old children . Consciousness and Cognition . 2010; 19 : 802–815. 10.1016/j.concog.2010.06.022 [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
    11. Paivio A. Imagery and verbal processes New York: Holt, Rinehart & Winston; 1971. [ Google Scholar ]
    12. Saunders J, Fernandes M, Kosnes L. Retrieval-induced forgetting and mentalimagery . Memory & Cognition . 2009; 37 : 819–828. 10.3758/MC.37.6.819 [ PubMed ] [ CrossRef ] [ Google Scholar ]
    13. Wilkinson C, Hyman JR IE. Individual differences related to two types of memory errors: Word lists may not generalize to autobiographical memory . Applied Cognitive Psychology . 1998; 12 : S29–S46. [ Google Scholar ]
    14. Winograd E, Peluso JP, Glover TA. Individual differences in susceptibility to memory illusions . Applied Cognitive Psychology . 1998; 12 : S5–S27. [ Google Scholar ]
    15. Nash RA, Wade KA, Lindsay SD. Digitally manipulating memory: Effects of doctored videos and imagination in distorting beliefs and memories . Memory & Cognition . 2009; 37 : 414–424. 10.3758/MC.37.4.414 [ PubMed ] [ CrossRef ] [ Google Scholar ]
    16. Dias MG, Harris PL. The influence of the imagination on reasoning by young children . British Journal of Developmental Psychology . 1990; 8 : 305–318. [ Google Scholar ]
    17. Joh AS, Jaswal VK, Keen R. Imagining a way out of the gravity bias: Preschoolers can visualize the solution to a spatial problem . Child Development . 2011; 82 : 744–750. 10.1111/j.1467-8624.2011.01584.x [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
    18. Richards CA, Sanderson JA. The role of imagination in facilitating deductive reasoning in 2-, 3-, and 4-year-olds . Cognition . 1999; 72 :,B1–B9. [ PubMed ] [ Google Scholar ]
    19. Beck SR, McColgan KLT, Robinson EJ, Rowley MG. Imagining what might be: Why children underestimate uncertainty . Journal of Experimental Child Psychology . 2011; 110 : 603–610. 10.1016/j.jecp.2011.06.010 [ PubMed ] [ CrossRef ] [ Google Scholar ]
    20. Piaget J, Inhelder B. Mental imagery in the child New York: Basic; 1971. [ Google Scholar ]
    21. Brandimonte MA, Hitch GJ, Bishop DVM. Manipulation of visual mental images in children and adults . Journal of Experimental Child Psychology . 1992; 53 : 300–312. [ PubMed ] [ Google Scholar ]
    22. Finke RA, Pinker S, Farah MJ. Reinterpreting visual patterns in mental imagery . Cognitive Science . 1989; 13 : 51–78. [ Google Scholar ]
    23. Kosslyn SM, Margolis JA, Barrett AM, Goldknopf EJ, Daly PF. Age differences in imagery abilities . Child Development . 1990; 61 : 995–1010. [ PubMed ] [ Google Scholar ]
    24. Ghetti S. Rejection of false events in childhood. A metamemory account . Current Directions in Psychological Sciences . 2008; 17 : 16–20. [ Google Scholar ]
    25. Pearson J, Clifford CWG, Tong F. The functional impact of mental imagery on conscious perception . Current Biology . 2008; 18 : 982–986. 10.1016/j.cub.2008.05.048 [ PMC free article ] [ PubMed ] [ CrossRef ] [ Google Scholar ]
    26. Kail R, Pellegrino J, Carter P. Developmental changes in mental rotation . Journal of Experimental Child Psychology . 1980; 29 : 102–116. [ PubMed ] [ Google Scholar ]
    27. Walker P, Bremner JG, Smart L, Pitt T, Apsey D. Object categorisation, object naming, and viewpoint independence in visual remembering: Evidence from young children’s drawings of a novel object . Memory . 2008; 16 : 626–636. 10.1080/09658210802135369 [ PubMed ] [ CrossRef ] [ Google Scholar ]
    28. Howe ML, Wimmer MC, Blease K. The role of associative strength in children's false memory illusions . Memory . 2009; 17 : 8–16. 10.1080/09658210802438474 [ PubMed ] [ CrossRef ] [ Google Scholar ]
    29. Snodgrass JG, Vanderwart M. A standardized set of 260 pictures: Norms for name agreement, image agreement, familiarity, and visual complexity . Journal of Experimental Psychology: Human Learning and Memory . 1980; 6 : 174–215. [ PubMed ] [ Google Scholar ]
    30. Kail R. Developmental change in speed of processing during childhood and adolescence . Psychological Bulletin . 1991; 109 : 490–501. [ PubMed ] [ Google Scholar ]
    31. Ackerman BP. Item-specific and relational encoding effects in children’s recall and recognition memory for words . Journal of Experimental Child Psychology . 1984; 37 : 426–450. [ Google Scholar ]
    32. von Wright JM. Relation between verbal recall and visual recognition of the same stimuli in young children . Journal of Experimental Child Psychology . 1973; 15 : 481–487. [ PubMed ] [ Google Scholar ]
    33. Rieser JJ, Garing AE, Young MF. Imagery, action, and young children’s spatial observation: It’s not being there that counts, it’s what one has in mind . Child Development . 1994; 65 : 1262–1278. [ PubMed ] [ Google Scholar ]
    34. Cattaneo Z, Fastame MC, Vecchi T, Cornoldi C. Working memory, imagery and visuo-spatial mechanisms In: Vecchi T, Bottini G, editors. Imagery and spatial cognition . John Benjamins Publishing Co, Amsterdam, Netherlands; 2006. pp. 101–136. [ Google Scholar ]
    35. Wilson LJT, Scott JH, Power KG. Developmental differences in the span of visual memory for pattern . British Journal of Developmental Psychology . 1987; 5 : 249–255. [ Google Scholar ]
    36. Hamilton CJ, Coates RO, Heffernan T. What develops in visuo-spatial working memory development? European Journal of Cognitive Psychology . 2003; 15 : 43–69. [ Google Scholar ]
    37. Logie RH, Pearson DG. The inner eye and the inner scribe of visuo-spatial working memory: Evidence from developmental fractionation . European Journal of Cognitive Psychology . 1997; 9 : 241–257. [ Google Scholar ]
    38. Riggs KJ, McTaggart J, Simpson A, Freeman RPJ. Changes in the capacity of visual working memory in 5- to 10-year-olds . Journal of Experimental Child Psychology . 2006; 95 : 18–26. [ PubMed ] [ Google Scholar ]
    39. Newcombe NS, Huttenlocher J, Learmonth A. Infants’ coding of location in continuous space . Infant Behavior & Development . 1999; 22 : 483–510. [ Google Scholar ]
    40. Huttenlocher J, Newcombe N, Sandberg EH. The coding of spatial location in young children . Cognitive Psychology . 1994; 27 : 115–147. [ PubMed ] [ Google Scholar ]
    41. Plumert JM, Spencer J. The emerging spatial mind New York: Oxford University Press; 2007. [ Google Scholar ]
    42. Hund AM, Plumert JA. Delay-induced bias in children’s memory for location . Child Development . 2002; 73 : 829–840. [ PubMed ] [ Google Scholar ]
    43. Duffy S, Huttenlocher J, Levine S, Duffy R. How infants encode spatial extent . Infancy . 2005; 8 : 81–90. [ Google Scholar ]
    44. Duffy S, Huttenlocher J, Levine S. It is all relative: How young children encode,extent . Journal of Cognition and Development . 2005; 6 : 51–63. [ Google Scholar ]
    45. Huttenlocher J, Duffy S, Levine S. Infants and toddlers discriminate amount: Are they measuring? Psychological Science . 2002; 13 : 244–249. [ PubMed ] [ Google Scholar ]

    Articles from PLOS ONE are provided here courtesy of PLOS