Reading has immense value for present and future generations, and the foundation of national literacy is built on children’s reading skills. Children’s ability to read is crucial not only for the accumulation of knowledge and the improvement of education, but also as a support system for the improvement of national quality and social progress. However, reading ability is not innate and word recognition serves as a fundamental aspect of the acquisition process necessary for the optimal development of children’s reading skills. To gain insight into the developmental patterns of Chinese children’s reading skills and to improve their literacy, it is essential to understand the processes and factors that influence character and word processing. Although previous studies have investigated the effects of cognitive skills and word recognition on reading ability, few have examined the mechanisms underlying children’s word recognition or the role of lexical variables in their word processing. Recent mega-studies have provided psycholinguistic databases for many languages, indicating the importance of lexical and psycholinguistic variables in word processing. However, their role in Chinese children’s character and word processing remains unclear. Therefore, this study aims to clarify the roles of these variables through four series of studies (comprising seven sub-studies), to establish a systematic framework for Chinese reading, and to provide a better understanding of the developmental process of Chinese children’s character and word cognition.
Study 1 consisted of two sub-studies that constructed psycholinguistic databases for Chinese characters and compound words, respectively, and investigated the role of psycholinguistic variables in character and word processing. Study 1A created a database of 3,600 Chinese characters. Four psycholinguistic variables, namely age of acquisition, familiarity, imageability, and concreteness, were evaluated by 441 participants, while 12 lexical variables were also obtained. The latent response durations (RT) and accuracy (ACC) of college students’ character naming were used to assess the influence of lexical and psycholinguistic variables on character processing. Regression analyses showed that lexical variables significantly predicted both RT and ACC of character naming. Moreover, after controlling for lexical variables, psycholinguistic variables accounted for additional significant variance. These results suggest that psycholinguistic variables are essential for character processing. Study 1B developed a psycholinguistic database of 10,426 two-character compound words. A total of 1,317 participants rated these words on age of acquisition, familiarity, imageability and concreteness. In addition, 14 lexical variables were collected to examine the influence of lexical and psycholinguistic variables on word processing through RT and ACC of lexical decisions using the MELD-SCH. Regression analyses revealed that, after accounting for lexical variables, age of acquisition, familiarity, and imageability significantly increased the additional variance. These results show that psycholinguistic variables play an important role in word processing.
Study 2 consisted of two sub-studies that investigated the role of lexical and psycholinguistic variables in Chinese children’s character and word processing, respectively, utilizing the Chinese Psycholinguistic Database. Study 2A involved 422 Grade 3 and 485 Grade 5 children and examined their character naming ACC and RT through a character naming task. The results indicated that both Grade 3 and Grade 5 children showed improvements in accuracy and automation of character processing, with Grade 5 children approaching adult levels. Regression analyses highlighted the critical roles played by both lexical and psycholinguistic variables in character processing for both age groups. Relative importance analyses suggested that psycholinguistic variables were similar in importance to lexical variables in character processing, with all variables contributing more significantly to Grade 3 character processing according to the linear mixed effects model. Study 2B examined the ACC and RT of lexical processing in 630 Grade 3 and 593 Grade 5 children through the use of a lexical decision task based on 10,426 two-character words. The results indicated that both age groups showed increased accuracy and automation in word processing, although some gaps remained compared to adult readers. Regression analyses showed that both lexical and psycholinguistic variables played an important role in word processing for both age groups. Relative importance analyses indicated that all variables were more significant for accuracy, with psycholinguistic variables also playing a significant role in word processing. Overall, the linear mixed effects model suggested that all variables had a greater impact on the children’s word processing in Grade 3.
Study 3 included two sub-studies that employed an eye-movement sentence-reading experiment to investigate the role of lexical and psycholinguistic variables of Chinese characters and words in children’s natural reading. Study 3A assessed eye-movement processes during sentence reading in 42 Grade 3 and 26 Grade 5 children. The results showed that children’s character processing was more efficient during sentence reading, with both lexical and psycholinguistic variables of characters playing important roles in children’s character reading, consistent with isolated character recognition. However, children’s character reading was more influenced by sentence variables, and imageability played different roles in character naming and reading. Interaction effects indicated that lexical and psycholinguistic variables had similar effects on character reading. Study 3B, derived from Study 3A, extracted features of all words in sentences using two-character areas of interest and assessed children’s eye movement performance during word reading. The results showed that children’s word processing was also more efficient during sentence reading, with both lexical and psycholinguistic variables playing a crucial role in children’s word reading, consistent with isolated word recognition. In addition, children’s word reading was more influenced by sentence variables, and imageability played different roles in lexical decision and reading. Interaction effects suggested that lexical and psycholinguistic variables influenced word reading similarly for both age groups.
Study 4 integrated the datasets from the previous three studies and evaluated the relative importance of psycholinguistic variables in character and word processing by machine learning. First, 43 undergraduate students were administered the character and word reading performance. Twelve datasets for character and lexical were created by merging individual datasets. Using random forest, a total of 48 models were generated with the character and word processing performance of adults, Grade 3 and Grade 5 as dependent variables and lexical, psycholinguistic, sentence and reader variables as independent variables to determine the relative importance of all variables in character and word processing. The results indicate that psycholinguistic variables play a crucial role in both character and word processing, and their importance is comparable to that of lexical variables. The important variables identified by adults and children showed significant overlap, further confirming the consistency of character and word processing patterns among children and adults. However, compared to isolated character and word recognition, sentence and reader variables had a greater impact on character and word reading.
The present study constructed a systematic framework for Chinese reading with a series of mega-studies based on a psycholinguistic database to examine the role of psycholinguistic variables in children’s character and word processing at the macro level, as well as the importance of various variables from an ecological perspective. The results indicate that, among psycholinguistic variables, age of acquisition and familiarity with the character and word are extremely important. Imageability and concreteness play different roles in different character and word processing tasks and are essential for studying the reading development process of Chinese children in depth. In addition, lexical variables also play a significant role in character and word processing for Grade 3 and Grade 5, clarifying the factors that influence the difficulty of characters and words. While both lexical and psycholinguistic variables consistently affect both children’s and adults’ word processing, the results suggest that children’s character and word processing patterns are similar to those of adults. However, the results for character and word recognition and sentence reading suggest that the role of lexical variables in children’s natural reading has diminished and is influenced by sentence and reader variables. From a developmental perspective, this study provides scientific evidence for understanding the patterns of Chinese children’s character and word processing and reading development, with important implications for future interventions in Chinese literacy and reading instruction.