UCL#

University College London corpus

UCL dataset [Frank et al., 2013].

UCL is a dataset of word-by-word reading times collected through self-paced reading and eye-tracking experiments to evaluate computational psycholinguistic models of English sentence comprehension. 361 sentences from narrative sources, ensuring they were understandable without context, and recorded reading times from participants using both methods.

For more details check out the original paper [Frank et al., 2013].

How to Download#

import pymovements as pm

# Initialize the dataset object with its name
# Specify your local directory for saving and loading data
dataset = pm.Dataset(name='UCL', path='path/to/your/data/directory')

# Download the dataset and extract all archives.
dataset.download()

# Load the dataset into memory for processing
dataset.load()