Cookies?
Library Header Image
LSE Research Online LSE Library Services

Chunk-based incremental processing and learning: an integrated theory of word discovery, implicit statistical learning, and speed of lexical processing

Jessop, Andrew, Pine, Julian M. and Gobet, Fernand ORCID: 0000-0002-9317-6886 (2025) Chunk-based incremental processing and learning: an integrated theory of word discovery, implicit statistical learning, and speed of lexical processing. Psychological Review. ISSN 0033-295X (In Press)

[img] Text (Jessop et al (2025)-Chunk-based Incremental Processing and Learning-Accepted version) - Accepted Version
Pending embargo until 1 January 2100.
Available under License Creative Commons Attribution.

Download (2MB)

Abstract

According to chunking theories, children discover their first words by extracting sub-sequences embedded in their continuous input. However, the mechanisms proposed in these accounts are often incompatible with data from other areas of language development. We present a new theory to connect the chunking accounts of word discovery with the broader developmental literature. We argue that (a) children build a diverse collection of chunks, including words, multi-word phrases, and sublexical units; (b) these chunks have different processing times determined by how often each chunk is used to recode the input; and (c) these processing times interact with short-term memory limitations and incremental processing to constrain learning. We implemented this theory as a computational modelling architecture called CIPAL (Chunk-based Incremental Processing and Learning). Across nine studies, we demonstrate that CIPAL can model word discovery in different contexts. First, we trained the model with 70 child-directed speech corpora from 15 languages. CIPAL gradually discovered words in each language, with cross-linguistic variation in performance. The model’s average processing time also improved with experience, resembling the developmental changes observed in children’s speed of processing. Second, we showed that CIPAL could simulate seven influential effects reported in statistical learning experiments with artificial languages. This included a preference for words over nonwords, part words, frequency-matched part words, phantom words, and sub-lexical units. On this basis, we argue that incremental chunking is an effective implicit statistical learning mechanism that may be central to children’s vocabulary development.

Item Type: Article
Additional Information: © 2025 The Author
Divisions: CPNSS
Subjects: P Language and Literature
B Philosophy. Psychology. Religion > BF Psychology
Date Deposited: 31 Mar 2025 14:15
Last Modified: 31 Mar 2025 14:15
URI: http://eprints.lse.ac.uk/id/eprint/127743

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics