Record Detail

Advanced Search

Text

Comparison of Structural Parsers and Neural Language Models as Surprisal Estimators

Byung-Doh Oh - Personal Name
Christian Clark - Personal Name
William Schuler - Personal Name
Sebastian Padó - Personal Name

Expectation-based theories of sentence processing posit that processing difficulty is determined by predictability in context. While predictability quantified via surprisal has gained empirical support, this representation-agnostic measure leaves open the question of how to best approximate the human comprehender’s latent probability model. This article first describes an incremental left-corner parser that incorporates information about common linguistic abstractions such as syntactic categories, predicate-argument structure, and morphological rules as a computational-level model of sentence processing. The article then evaluates a variety of structural parsers and deep neural language models as cognitive models of sentence processing by comparing thepredictive power of their surprisal estimates on self-paced reading, eye-tracking, and fMRI data collected during real-time language processing. The results show that surprisal estimates from the proposed left-corner processing model deliver comparable and often superior fits to self-paced reading and eye-tracking data when compared to those from neural language models trained on much more data. This may suggest that the strong linguistic generalizations made by the proposed processing model may help predict humanlike processing costs that manifest in latency-based measures, even when the amount of training data is limited. Additionally, experiments using Transformer-based language models sharing the same primary architecture and training data show asurprising negative correlation between parameter count and fit to self-paced reading and eye-tracking data. These findings suggest that large-scale neural language models are making weaker generalizations based on patterns of lexical items rather than stronger, more humanlike generalizations based on linguistic structure.

Availability

No copy data

Detail Information

Series Title	-
Call Number	-
Publisher	Frontiers in Artificial Intelligence : Switzerland., 2022
Collation	006
Language	English
ISBN/ISSN	2624-8212
Classification	NONE
Content Type	-

Media Type	-
Carrier Type	-
Edition	-
Subject(s)	language models eye-tracking sentence processing incremental parsers surprisal theory self-paced reading fMRI
Specific Detail Info	-
Statement of Responsibility	-

Other Information

Accreditation	Scopus Q3

Other version/related

No other version available

File Attachment

Comparison of Structural Parsers and Neural Language Models as Surprisal Estimators

Information

Web Online Public Access Catalog - Use the search options to find documents quickly