The data set contains all research data of the project “Causality and prediction in linguistic discourse”: 1. All data from the data mining process for designing the stimuli for the study. 2. Stimuli in the import formats for Experiment Builder (SR Research) and the SoSci Survey platform. 3. All raw data from the eye-tracking experiment, the subject questionnaire, the SoSci Survey platform and the comprehension questions during the experiment. 4. Completely cleaned eye tracking data with the final interest areas and the export configuration for Data Viewer (SR Research). 5. All data for calculating the test statistics and visualizations, including the corresponding R script.